Loading [MathJax]/extensions/MathMenu.js
UVM++: A Large-Scale Benchmark for Beverage Recognition in Intelligent Vending Machine | IEEE Journals & Magazine | IEEE Xplore

UVM++: A Large-Scale Benchmark for Beverage Recognition in Intelligent Vending Machine


Abstract:

In recent years, artificial intelligence (AI)-driven computer technology has become popular in many practical applications in the retail industry. In particular, the deve...Show More

Abstract:

In recent years, artificial intelligence (AI)-driven computer technology has become popular in many practical applications in the retail industry. In particular, the development of face recognition, mobile payment, cloud computing and other technologies has given birth to three new forms of consumption related to “unmanned retail”, i.e., unmanned convenience stores, unmanned vending machines and unmanned vending racks. Due to the sudden outbreak of COVID-19, the characteristics of unmanned retail such as intelligent operation, convenience and a lack of contact have allowed the public to gradually become used to this approach. An unmanned vending machine (UVM) can be reconfigured from a refrigerated cabinet for selling commodities such as beverages. It can take photos of the goods purchased by customers, and can recognize these commodities based on information from the collected images. However, some commercial UVMs are subject to strong restrictions on the types and placement of goods, making them difficult to achieve data collection and convenient operation. In this research, we therefore design a system for automated data collection and annotation. The mechanical structure of our system is designed to mimic the placement of drinks and the scene of each drink with other different types of drinks around it. We construct a large-scale benchmark dataset called UVM++, which is composed of several sub-datasets collected from different scenes. Extensive experimental results are presented and the performance of popular object detection methods on this task is examined. In particular, we perform an empirical study of cross-scene detection by various object detectors. Our findings suggest that the proposed data annotation system and the constructed benchmark dataset, UVM++, can be useful for performing various tasks in the environment of a UVM, thus driving development in the field of new retail.
Published in: IEEE Transactions on Consumer Electronics ( Volume: 70, Issue: 1, February 2024)
Page(s): 249 - 262
Date of Publication: 13 October 2023

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.