Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3447993.3483244acmconferencesArticle/Chapter ViewAbstractPublication PagesmobicomConference Proceedingsconference-collections
research-article

RFID and camera fusion for recognition of human-object interactions

Published: 25 October 2021 Publication History

Abstract

Recognition of human-object interactions is practically important in various human-centric sensing scenarios such as smart supermarket, factory, and home. This paper proposes an RF-Camera system by fusing RFID and Computer Vision (CV) techniques, which is the first work to recognize the human gestural interactions with physical objects in multi-subject and multi-object scenarios. In RF-Camera, we first propose a dimension reduction method to transform the subject's 3D hand trajectory captured by depth camera to a 2D image, using which the subject's gesture can be recognized. We also propose a method to extract the facial image of target subject from an image that may contain irrelevant subjects, thereby further recognizing his/her identity. Finally, we model the physical movements of the held object's tag and further predict the tag phase data, by comparing which with real phase data of each tag human-object matching can be discovered. When implementing RF-Camera, three technical challenges need to be addressed. (i) To remove noisy data corresponding to irrelevant actions from raw sensing data, we propose a state transition diagram to determine the boundary of effective data. (ii) To predict phase data of the held target tag with unknown hand-tag offset, we quantify target tag trajectory by adding a variable hand-tag vector to captured hand trajectory. (iii) To ensure high reading rates of target tags in tag-dense scenarios, we propose a CV-assisted RFID scheduling method, in which analytics on CV data can help schedule RFID readings. We conduct extensive experiments to evaluate the performance of RF-Camera. Experimental results demonstrate that RF-Camera can recognize the gestural actions, human identity and human-object matching with an average accuracy higher than 90% in most cases.

References

[1]
Yuchen Zhou, F Richard Yu, Jian Chen, and Yonghong Kuo. Cyber-Physical-Social Systems: A State-of-the-Art Survey, Challenges and Opportunities. IEEE Communications Surveys & Tutorials, 22(1):389--425, 2019.
[2]
Ji Zhou, Yanhong Zhou, Baicun Wang, and Jiyuan Zang. Human-Cyber-Physical Systems (HCPSs) in the Context of New-Generation Intelligent Manufacturing. Engineering, 5(4):624--636, 2019.
[3]
Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, and Jan Kautz. Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks. In Proc. of IEEE CVPR 2016, pages 4207--4215.
[4]
Pradyumna Narayana, Ross Beveridge, and Bruce A Draper. Gesture Recognition: Focus on the hands. In Proc. of IEEE CVPR 2018, pages 5235--5244.
[5]
Xuan Son Nguyen, Luc Brun, Olivier Lézoray, and Sébastien Bougleux. A Neural Network based on SPD Manifold Learning for Skeleton-based Hand Gesture Rruan2016audiogestecognition. In Proc. of IEEE CVPR 2019, pages 12036--12045.
[6]
Wenjie Ruan, Quan Z Sheng, Lei Yang, Tao Gu, Peipei Xu, and Longfei Shangguan. AudioGest: Enabling Fine-Grained Hand Gesture Detection by Decoding Echo Signal. In Proc. of ACM UbiComp 2016, pages 474--485.
[7]
Yanwen Wang, Jiaxing Shen, and Yuanqing Zheng. Push the Limit of Acoustic Gesture Recognition. In Proc. of IEEE INFOCOM 2020, pages 566--575.
[8]
Qifan Pu, Sidhant Gupta, Shyamnath Gollakota, and Shwetak Patel. Whole-home Gesture Recognition Using Wireless Signals. In Proc. of ACM Mobicom 2013, pages 15--18.
[9]
Yue Zheng, Yi Zhang, Kun Qian, Guidong Zhang, Yunhao Liu, Chenshu Wu, and Zheng Yang. Zero-Effort Cross-Domain Gesture Recognition with Wi-Fi. In Proc. of ACM MobiSys 2019, pages 313--325.
[10]
Chenning Li, Manni Liu, and Zhichao Cao. WiHF: Enable User Identified Gesture Recognition with WiFi. In Proc. of IEEE INFOCOM 2020, pages 586--595.
[11]
Zimu Zhou, Longfei Shangguan, Xiaolong Zheng, Lei Yang, and Yunhao Liu. Design and Implementation of an RFID-based Customer Shopping Behavior Mining System. IEEE/ACM transactions on networking, 25(4):2405--2418, 2017.
[12]
Longfei Shangguan, Zimu Zhou, and Kyle Jamieson. Enabling Gesture-based Interactions with Objects. In Proc. of ACM MobiSys 2017, pages 239--251.
[13]
Chuyu Wang, Jian Liu, Yingying Chen, Hongbo Liu, Lei Xie, Wei Wang, Bingbing He, and Sanglu Lu. Multi-Touch in the Air: Device-Free Finger Tracking and Gesture Recognition via COTS RFID. In Proc. of IEEE INFOCOM 2018, pages 1691--1699.
[14]
Azure kinect. https://docs.microsoft.com/en-us/azure/kinect-dk/sensor-sdk-download. Accessed July 09, 2021.
[15]
D. F. Silva and G. E. A. P. A. Batista and E. Keogh. Prefix and Suffix Invariant Dynamic Time Warping. In Proc. of IEEE ICDM 2016, pages 1209--1214.
[16]
Su-Ryun Lee, Sung-Don Joo, and Chae-Woo Lee. An Enhanced Dynamic Framed Slotted ALOHA Algorithm for RFID Tag Identification. In Proc. of ACM MobiQuitous 2005, pages 166--172.
[17]
Muhammad Shahzad and Alex X. Liu. Probabilistic Optimal Tree Hopping for RFID Identification. IEEE/ACM Transactions on Networking, 23(3):796--809, 2015.
[18]
Longfei Shangguan, Zheng Yang, Alex X. Liu, Zimu Zhou, and Yunhao Liu. Relative Localization of RFID Tags using Spatial-Temporal Phase Profiling. In Proc. of USENIX NSDI 2015, pages 251--263.
[19]
E. A. Wan and R. Van Der Merwe. The Unscented Kalman Filter for Nonlinear Estimation. In Proc. of IEEE Adaptive Systems for Signal Processing, Communications, and Control Symposium, pages 153--158, 2000.
[20]
Konstantinos G. Derpanis. Overview of the RANSAC Algorithm. 2005.
[21]
Xin Liu, Kan Meina, Wanglong Wu, Shiguang Shan, and Xilin Chen. VIPLFaceNet: An Open Source Deep Face Recognition SDK. Frontiers of Computer Science, 11(2):208--218, 2017.
[22]
Description of Unwrap. https://www.mathworks.com/help/matlab/ref/unwrap.html.
[23]
Ming Tao, Shuqiang Huang, Yang Li, Min Yan, and Yuyu Zhou. SA-PSO based Optimizing Reader Deployment in Large-scale RFID Systems. Journal of Network and Computer Applications, 52:90--100, 2015.
[24]
Bin Cao, Yu Gu, Zhihan Lv, Shan Yang, Jianwei Zhao, and Yujie Li. RFID Reader Anticollision based on Distributed Parallel Particle Swarm Optimization. IEEE Internet of Things Journal, 8(5):3099--3107, 2021.
[25]
Abdoul Aziz Mbacke, Nathalie Mitton, and Herve Rivano. A Survey of RFID Readers Anticollision Protocols. IEEE Journal of Radio Frequency Identification, 2(1):38--48, 2018.
[26]
Mike Lenehan. Octane sdk. https://support.impinj.com/hc/en-us/articles/202755268-Octane-SDK. Accessed January 09, 2021.
[27]
Wei Wang, Alex X Liu, and Ke Sun. Device-Free Gesture Tracking Using Acoustic Signals. In Proc. of ACM MobiCom 2016, pages 82--94.
[28]
Rajalakshmi Nandakumar, Vikram Iyer, Desney Tan, and Shyamnath Gollakota. FingerIO: Using Active Sonar for Fine-Grained Finger Tracking. In Proc. of ACM CHI 2016, pages 1515--1525.
[29]
Vincent Becker, Linus Fessler, and Gábor Sörös. GestEar: Combining Audio and Motion Sensing for Gesture Recognition on Smartwatches. In Proc. of ACM MobiCom 2016, pages 10--19.
[30]
Yuki Kubo, Yuto Koguchi, Buntarou Shizuki, Shin Takahashi, and Otmar Hilliges. AudioTouch: Minimally Invasive Sensing of Micro-Gestures via Active Bio-Acoustic Sensing. In Proc. of ACM MobileHCI 2019, pages 1--13.
[31]
Wei Wang, Alex X Liu, Muhammad Shahzad, Kang Ling, and Sanglu Lu. Understanding and Modeling of WiFi Signal based Human Activity Recognition. In Proc. of ACM MobiCom 2015, pages 65--76.
[32]
Sheng Tan and Jie Yang. WiFinger: Leveraging Commodity WiFi for Fine-grained Finger Gesture Recognition. In Proc. of ACM MobiHoc 2016, pages 201--210.
[33]
Yongpan Zou, Jiang Xiao, Jinsong Han, Kaishun Wu, Yun Li, and Lionel M Ni. GRfid: A Device-Free RFID-based Gesture Recognition System. IEEE Transactions on Mobile Computing, 16(2):381--393, 2016.
[34]
Chuyu Wang, Lei Xie, Keyan Zhang, Wei Wang, Yanling Bu, and Sanglu Lu. Spin-Antenna: 3D Motion Tracking for Tag Array Labeled Objects via Spinning Antenna. In Proc. of IEEE INFOCOM 2019, pages 1--9.
[35]
Zhongqin Wang, Min Xu, Ning Ye, Ruchuan Wang, and Haiping Huang. RF-Focus: Computer Vision-assisted Region-of-interest RFID Tag Recognition and Localization in Multipath-prevalent Environments. Proc. of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 3(1):1--30, 2019.
[36]
Jianxin Wu, Adebola Osuntogun, Tanzeem Choudhury, Matthai Philipose, and James M. Rehg. A Scalable Approach to Activity Recognition based on Object Use. In Proc. of IEEE ICCV 2007, pages 1--8.
[37]
Zijuan Liu, Xiulong Liu, and Keqiu Li. Deeper Exercise Monitoring for Smart Gym using Fused RFID and CV Data. In Proc. of IEEE INFOCOM 2020, pages 11--19.
[38]
Tara Boroushaki, Junshan Leng, Ian Clester, Alberto Rodriguez, and Fadel Adib. Robotic Grasping of Fully-occluded Objects using RF Perception. In Proc. of IEEE ICRA 2021.

Cited By

View all
  • (2025)Efficient Missing Key Tag Identification in Large-Scale RFID Systems: An Iterative Verification and Selection MethodIEEE Transactions on Mobile Computing10.1109/TMC.2024.349359724:3(2253-2269)Online publication date: Mar-2025
  • (2025)Review on Systems Combining Computer Vision and Radio Frequency IdentificationIEEE Internet of Things Journal10.1109/JIOT.2024.348475512:2(1291-1319)Online publication date: 15-Jan-2025
  • (2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MobiCom '21: Proceedings of the 27th Annual International Conference on Mobile Computing and Networking
October 2021
887 pages
ISBN:9781450383424
DOI:10.1145/3447993
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. RFID
  2. camera
  3. human-object interactions
  4. multi-modal fusion

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China

Conference

ACM MobiCom '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 440 of 2,972 submissions, 15%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)225
  • Downloads (Last 6 weeks)21
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Efficient Missing Key Tag Identification in Large-Scale RFID Systems: An Iterative Verification and Selection MethodIEEE Transactions on Mobile Computing10.1109/TMC.2024.349359724:3(2253-2269)Online publication date: Mar-2025
  • (2025)Review on Systems Combining Computer Vision and Radio Frequency IdentificationIEEE Internet of Things Journal10.1109/JIOT.2024.348475512:2(1291-1319)Online publication date: 15-Jan-2025
  • (2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
  • (2024)Enabling 6D Pose Tracking on Your Acoustic DevicesProceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services10.1145/3643832.3661875(15-28)Online publication date: 3-Jun-2024
  • (2024)PmTrackProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36314337:4(1-30)Online publication date: 12-Jan-2024
  • (2024)Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-AlignmentIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.334051835:2(280-296)Online publication date: 1-Feb-2024
  • (2024)Fine-Grained Recognition of Manipulation Activities on Objects via Multi-Modal SensingIEEE Transactions on Mobile Computing10.1109/TMC.2024.336452223:10(9614-9628)Online publication date: Oct-2024
  • (2024)SlpRoF: Improving the Temporal Coverage and Robustness of RF-Based Vital Sign Monitoring During SleepIEEE Transactions on Mobile Computing10.1109/TMC.2023.334092523:7(7848-7864)Online publication date: Jul-2024
  • (2024)On Batch Writing in COTS RFID SystemsIEEE Transactions on Mobile Computing10.1109/TMC.2023.328323823:5(3846-3857)Online publication date: May-2024
  • (2024)Double Polling-Based Tag Information Collection for Sensor-Augmented RFID SystemsIEEE Transactions on Mobile Computing10.1109/TMC.2023.327792523:5(3496-3509)Online publication date: May-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media