Deep neural networks (DNNs) for social image classification are prone to performance reduction and overfitting when trained on datasets plagued by noisy or imbalanced labels. Weight loss methods tend to ignore the influence of noisy or frequent category examples during the training, resulting in a reduction of final accuracy and, in presence of extreme noise, even a failure of the learning process. A new advisor network is introduced to address both imbalance and noise problems, able to pilot learning of a main network by adjusting the visual features and the gradient with a meta-learning strategy. In a curriculum learning fashion, impact of redundant data is reduced while recognizable noisy label images are downplayed or redirected. A Meta Feature Re-Weighting (MFRW) and a Meta Equalization Softmax (MES) methods are introduced to let the main network focus only on the information in an image deemed relevant by the advisor network and to adjust the training gradient to reduce the adverse effects of frequent or noisy categories. The proposed method is first tested on synthetic versions of CIFAR10 and CIFAR100, and then on the more realistic Imagenet-LT, Places-LT, and Clothing1M datasets, reporting state-of-the-art results.
Meta-learning advisor networks for long-tail and noisy labels in social image classification
Uricchio, Tiberio;
2023-01-01
Abstract
Deep neural networks (DNNs) for social image classification are prone to performance reduction and overfitting when trained on datasets plagued by noisy or imbalanced labels. Weight loss methods tend to ignore the influence of noisy or frequent category examples during the training, resulting in a reduction of final accuracy and, in presence of extreme noise, even a failure of the learning process. A new advisor network is introduced to address both imbalance and noise problems, able to pilot learning of a main network by adjusting the visual features and the gradient with a meta-learning strategy. In a curriculum learning fashion, impact of redundant data is reduced while recognizable noisy label images are downplayed or redirected. A Meta Feature Re-Weighting (MFRW) and a Meta Equalization Softmax (MES) methods are introduced to let the main network focus only on the information in an image deemed relevant by the advisor network and to adjust the training gradient to reduce the adverse effects of frequent or noisy categories. The proposed method is first tested on synthetic versions of CIFAR10 and CIFAR100, and then on the more realistic Imagenet-LT, Places-LT, and Clothing1M datasets, reporting state-of-the-art results.File | Dimensione | Formato | |
---|---|---|---|
3584360.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
8.92 MB
Formato
Adobe PDF
|
8.92 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.