Detail výsledku

Estimating Extreme 3D Image Rotations using Cascaded Attention

DEKEL, S.; KELLER, Y.; ČADÍK, M. Estimating Extreme 3D Image Rotations using Cascaded Attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024. p. 2588-2598. ISBN: 979-8-3503-5301-3.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Dekel Shay
Keller Yosi, prof., M.Sc., Ph.D.
Čadík Martin, doc. Ing., Ph.D., UPGM (FIT)
Abstrakt

Estimating large, extreme inter-image rotations is critical for numerous computer vision domains involving images related by limited or non-overlapping fields of view. In this work, we propose an attention-based approach with a pipeline of novel algorithmic components. First, as rotation estimation pertains to image pairs, we introduce an inter-image distillation scheme using Decoders to improve embeddings. Second, whereas contemporary methods compute a 4D correlation volume (4DCV) encoding inter-image relationships, we propose an Encoder-based cross-attention approach between activation maps to compute an enhanced equivalent of the 4DCV. Finally, we present a cascaded Decoder-based technique for alternately refining the cross-attention and the rotation query. Our approach outperforms current state-of-the-art methods on extreme rotation estimation. We make our code publicly available.

Klíčová slova

camera orientation estimation, extreme rotation, 3D rotation, cascaded attention

URL
Rok
2024
Strany
2588–2598
Sborník
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Konference
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
ISBN
979-8-3503-5301-3
Vydavatel
IEEE Computer Society
Místo
Seattle
DOI
UT WoS
001322555902090
EID Scopus
BibTeX
@inproceedings{BUT188275,
  author="Shay {Dekel} and Yosi {Keller} and Martin {Čadík}",
  title="Estimating Extreme 3D Image Rotations using Cascaded Attention",
  booktitle="Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
  year="2024",
  pages="2588--2598",
  publisher="IEEE Computer Society",
  address="Seattle",
  doi="10.1109/CVPR52733.2024.00250",
  isbn="979-8-3503-5301-3",
  url="https://cadik.posvete.cz/"
}
Projekty
Topografická analýza obrazu s využitím metod hlubokého učení, MŠMT, INTER-EXCELLENCE - Podprogram INTER-ACTION, LTAIZ19004, zahájení: 2019-07-01, ukončení: 2022-06-30, ukončen
Výzkumné skupiny
Pracoviště
Nahoru