Semantic Scholar Open Access 2021 120 sitasi

Cross Modal Retrieval with Querybank Normalisation

Simion-Vlad Bogolin Ioana Croitoru Hailin Jin Yang Liu Samuel Albanie

Abstrak

Profiting from large-scale training datasets, advances in neural architecture design and efficient inference, joint embeddings have become the dominant approach for tackling cross-modal retrieval. In this work we first show that, despite their effectiveness, state-of-the-art joint embeddings suffer significantly from the longstanding “hubness problem” in which a small number of gallery embeddings form the nearest neighbours of many queries. Drawing inspiration from the NLP literature, we formulate a simple but effective framework called Querybank Normalisation (QB-NORM) that re-normalises query similarities to account for hubs in the embedding space. QB-NORM improves retrieval performance without requiring retraining. Differently from prior work, we show that QB-NORM works effectively without concurrent access to any test set queries. Within the QB-NORM framework, we also propose a novel similarity normalisation method, the Dynamic Inverted Softmax, that is significantly more robust than existing approaches. We showcase QB-NORM across a range of cross modal retrieval models and benchmarks where it consistently enhances strong baselines beyond the state of the art. Code is available at https://vladbogo.github.io/QB-Norm/.

Topik & Kata Kunci

Penulis (5)

S

Simion-Vlad Bogolin

I

Ioana Croitoru

H

Hailin Jin

Y

Yang Liu

S

Samuel Albanie

Format Sitasi

Bogolin, S., Croitoru, I., Jin, H., Liu, Y., Albanie, S. (2021). Cross Modal Retrieval with Querybank Normalisation. https://doi.org/10.1109/CVPR52688.2022.00513

Akses Cepat

Informasi Jurnal
Tahun Terbit
2021
Bahasa
en
Total Sitasi
120×
Sumber Database
Semantic Scholar
DOI
10.1109/CVPR52688.2022.00513
Akses
Open Access ✓