Abstract

Recognition of grocery products in store shelves poses peculiar challenges. Firstly, the task mandates the recognition of an extremely high number of different items, in the order of several thousands for medium-small shops, with many of them featuring small inter and intra class variability. Then, available product databases usually include just one or a few studio-quality images per product (referred to herein as reference images), whilst at test time recognition is performed on pictures displaying a portion of a shelf containing several products and taken in the store by cheap cameras (referred to herein as query images). Moreover, as the items on sale in a store as well as their appearance change frequently overtime, a practical recognition system should handle seamlessly new products/packages. We developed a deep learning based pipeline to solve this task. First we deploy state of the art object detectors to obtain an initial product-agnostic item detection, then, we pursue product recognition through a similarity search between global descriptors computed on reference and cropped query images. To maximize performance, we learn an ad-hoc global descriptor by a CNN trained on reference images based on an image embedding loss. We have tested our pipeline on the standard grocery product [1] dataset and improved the currents state of the art. While computationally expensive at training time our system turn out not only accurate but also quite fast at test time.


Original document

The different versions of the original document can be found in:

http://dx.doi.org/10.1109/ipas.2018.8708890
https://cris.unibo.it/handle/11585/661460,
https://academic.microsoft.com/#/detail/2963686633
Back to Top

Document information

Published on 01/01/2018

Volume 2018, 2018
DOI: 10.1109/ipas.2018.8708890
Licence: CC BY-NC-SA license

Document Score

0

Views 3
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?