Image retrieval using multi-scale CNN features pooling
Uricchio T.;
2020-01-01
Abstract
In this paper, we address the problem of image retrieval by learning image representations based on the activations of a Convolutional Neural Network. We present an end-to-end trainable network architecture that exploits a novel multi-scale local pooling based on NetVLAD and a triplet mining procedure based on sample difficulty to obtain an effective image representation. Extensive experiments show that our approach reaches state-of-the-art results on three standard datasets.
Files in this item:
File | Size | Format | Access |
---|---|---|---|
2004.09695.pdf | 7.15 MB | Adobe PDF | Authorized users only |
Type: Pre-print document (manuscript submitted to the publisher, prior to peer review)
License: DRM not defined
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.