MultiGrain: a unified image embedding for classes and instances

M. Berman, H. Jégou, A. Vedaldi, I. Kokkinos, и M. Douze.
(2019)cite arxiv:1902.05509.

Аннотация

MultiGrain is a network architecture producing compact vector representations that are suited both for image classification and particular object retrieval. It builds on a standard classification trunk. The top of the network produces an embedding containing coarse and fine-grained information, so that images can be recognized based on the object class, particular object, or if they are distorted copies. Our joint training is simple: we minimize a cross-entropy loss for classification and a ranking loss that determines if two images are identical up to data augmentation, with no need for additional labels. A key component of MultiGrain is a pooling layer that allow us to take advantage of high-resolution images with a network trained at a lower resolution. When fed to a linear classifier, the learned embeddings provide state-of-the-art classification accuracy. For instance, we obtain 79.3% top-1 accuracy with a ResNet-50 learned on Imagenet, which is a +1.7% absolute improvement over the AutoAugment method. When compared with the cosine similarity, the same embeddings perform on par with the state-of-the-art for image retrieval at moderate resolutions.

ключ BibTeX: berman2019multigrain
тип записи: misc
год: 2019
url: http://arxiv.org/abs/1902.05509
Примечание: cite arxiv:1902.05509

тэги

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

@nmatsuk 5 лет назад
Ссылки
Закладки
Рецензия удаляется

Пожалуйста, войдите в систему, чтобы принять участие в дискуссии (добавить собственные рецензию, или комментарий)

BibSonomy