research-article

RFID and camera fusion for recognition of human-object interactions

Authors:

Keqiu LiAuthors Info & Claims

MobiCom '21: Proceedings of the 27th Annual International Conference on Mobile Computing and Networking

Pages 296 - 308

https://doi.org/10.1145/3447993.3483244

Published: 25 October 2021 Publication History

Abstract

Recognition of human-object interactions is practically important in various human-centric sensing scenarios such as smart supermarket, factory, and home. This paper proposes an RF-Camera system by fusing RFID and Computer Vision (CV) techniques, which is the first work to recognize the human gestural interactions with physical objects in multi-subject and multi-object scenarios. In RF-Camera, we first propose a dimension reduction method to transform the subject's 3D hand trajectory captured by depth camera to a 2D image, using which the subject's gesture can be recognized. We also propose a method to extract the facial image of target subject from an image that may contain irrelevant subjects, thereby further recognizing his/her identity. Finally, we model the physical movements of the held object's tag and further predict the tag phase data, by comparing which with real phase data of each tag human-object matching can be discovered. When implementing RF-Camera, three technical challenges need to be addressed. (i) To remove noisy data corresponding to irrelevant actions from raw sensing data, we propose a state transition diagram to determine the boundary of effective data. (ii) To predict phase data of the held target tag with unknown hand-tag offset, we quantify target tag trajectory by adding a variable hand-tag vector to captured hand trajectory. (iii) To ensure high reading rates of target tags in tag-dense scenarios, we propose a CV-assisted RFID scheduling method, in which analytics on CV data can help schedule RFID readings. We conduct extensive experiments to evaluate the performance of RF-Camera. Experimental results demonstrate that RF-Camera can recognize the gestural actions, human identity and human-object matching with an average accuracy higher than 90% in most cases.

References

[1]

Yuchen Zhou, F Richard Yu, Jian Chen, and Yonghong Kuo. Cyber-Physical-Social Systems: A State-of-the-Art Survey, Challenges and Opportunities. IEEE Communications Surveys & Tutorials, 22(1):389--425, 2019.

Digital Library

[2]

Ji Zhou, Yanhong Zhou, Baicun Wang, and Jiyuan Zang. Human-Cyber-Physical Systems (HCPSs) in the Context of New-Generation Intelligent Manufacturing. Engineering, 5(4):624--636, 2019.

[3]

Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, and Jan Kautz. Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks. In Proc. of IEEE CVPR 2016, pages 4207--4215.

[4]

Pradyumna Narayana, Ross Beveridge, and Bruce A Draper. Gesture Recognition: Focus on the hands. In Proc. of IEEE CVPR 2018, pages 5235--5244.

[5]

Xuan Son Nguyen, Luc Brun, Olivier Lézoray, and Sébastien Bougleux. A Neural Network based on SPD Manifold Learning for Skeleton-based Hand Gesture Rruan2016audiogestecognition. In Proc. of IEEE CVPR 2019, pages 12036--12045.

[6]

Wenjie Ruan, Quan Z Sheng, Lei Yang, Tao Gu, Peipei Xu, and Longfei Shangguan. AudioGest: Enabling Fine-Grained Hand Gesture Detection by Decoding Echo Signal. In Proc. of ACM UbiComp 2016, pages 474--485.

[7]

Yanwen Wang, Jiaxing Shen, and Yuanqing Zheng. Push the Limit of Acoustic Gesture Recognition. In Proc. of IEEE INFOCOM 2020, pages 566--575.

[8]

Qifan Pu, Sidhant Gupta, Shyamnath Gollakota, and Shwetak Patel. Whole-home Gesture Recognition Using Wireless Signals. In Proc. of ACM Mobicom 2013, pages 15--18.

[9]

Yue Zheng, Yi Zhang, Kun Qian, Guidong Zhang, Yunhao Liu, Chenshu Wu, and Zheng Yang. Zero-Effort Cross-Domain Gesture Recognition with Wi-Fi. In Proc. of ACM MobiSys 2019, pages 313--325.

[10]

Chenning Li, Manni Liu, and Zhichao Cao. WiHF: Enable User Identified Gesture Recognition with WiFi. In Proc. of IEEE INFOCOM 2020, pages 586--595.

[11]

Zimu Zhou, Longfei Shangguan, Xiaolong Zheng, Lei Yang, and Yunhao Liu. Design and Implementation of an RFID-based Customer Shopping Behavior Mining System. IEEE/ACM transactions on networking, 25(4):2405--2418, 2017.

Digital Library

[12]

Longfei Shangguan, Zimu Zhou, and Kyle Jamieson. Enabling Gesture-based Interactions with Objects. In Proc. of ACM MobiSys 2017, pages 239--251.

[13]

Chuyu Wang, Jian Liu, Yingying Chen, Hongbo Liu, Lei Xie, Wei Wang, Bingbing He, and Sanglu Lu. Multi-Touch in the Air: Device-Free Finger Tracking and Gesture Recognition via COTS RFID. In Proc. of IEEE INFOCOM 2018, pages 1691--1699.

Digital Library

[14]

Azure kinect. https://docs.microsoft.com/en-us/azure/kinect-dk/sensor-sdk-download. Accessed July 09, 2021.

[15]

D. F. Silva and G. E. A. P. A. Batista and E. Keogh. Prefix and Suffix Invariant Dynamic Time Warping. In Proc. of IEEE ICDM 2016, pages 1209--1214.

[16]

Su-Ryun Lee, Sung-Don Joo, and Chae-Woo Lee. An Enhanced Dynamic Framed Slotted ALOHA Algorithm for RFID Tag Identification. In Proc. of ACM MobiQuitous 2005, pages 166--172.

[17]

Muhammad Shahzad and Alex X. Liu. Probabilistic Optimal Tree Hopping for RFID Identification. IEEE/ACM Transactions on Networking, 23(3):796--809, 2015.

Digital Library

[18]

Longfei Shangguan, Zheng Yang, Alex X. Liu, Zimu Zhou, and Yunhao Liu. Relative Localization of RFID Tags using Spatial-Temporal Phase Profiling. In Proc. of USENIX NSDI 2015, pages 251--263.

[19]

E. A. Wan and R. Van Der Merwe. The Unscented Kalman Filter for Nonlinear Estimation. In Proc. of IEEE Adaptive Systems for Signal Processing, Communications, and Control Symposium, pages 153--158, 2000.

[20]

Konstantinos G. Derpanis. Overview of the RANSAC Algorithm. 2005.

[21]

Xin Liu, Kan Meina, Wanglong Wu, Shiguang Shan, and Xilin Chen. VIPLFaceNet: An Open Source Deep Face Recognition SDK. Frontiers of Computer Science, 11(2):208--218, 2017.

Digital Library

[22]

Description of Unwrap. https://www.mathworks.com/help/matlab/ref/unwrap.html.

[23]

Ming Tao, Shuqiang Huang, Yang Li, Min Yan, and Yuyu Zhou. SA-PSO based Optimizing Reader Deployment in Large-scale RFID Systems. Journal of Network and Computer Applications, 52:90--100, 2015.

Digital Library

[24]

Bin Cao, Yu Gu, Zhihan Lv, Shan Yang, Jianwei Zhao, and Yujie Li. RFID Reader Anticollision based on Distributed Parallel Particle Swarm Optimization. IEEE Internet of Things Journal, 8(5):3099--3107, 2021.

[25]

Abdoul Aziz Mbacke, Nathalie Mitton, and Herve Rivano. A Survey of RFID Readers Anticollision Protocols. IEEE Journal of Radio Frequency Identification, 2(1):38--48, 2018.

[26]

Mike Lenehan. Octane sdk. https://support.impinj.com/hc/en-us/articles/202755268-Octane-SDK. Accessed January 09, 2021.

[27]

Wei Wang, Alex X Liu, and Ke Sun. Device-Free Gesture Tracking Using Acoustic Signals. In Proc. of ACM MobiCom 2016, pages 82--94.

[28]

Rajalakshmi Nandakumar, Vikram Iyer, Desney Tan, and Shyamnath Gollakota. FingerIO: Using Active Sonar for Fine-Grained Finger Tracking. In Proc. of ACM CHI 2016, pages 1515--1525.

[29]

Vincent Becker, Linus Fessler, and Gábor Sörös. GestEar: Combining Audio and Motion Sensing for Gesture Recognition on Smartwatches. In Proc. of ACM MobiCom 2016, pages 10--19.

[30]

Yuki Kubo, Yuto Koguchi, Buntarou Shizuki, Shin Takahashi, and Otmar Hilliges. AudioTouch: Minimally Invasive Sensing of Micro-Gestures via Active Bio-Acoustic Sensing. In Proc. of ACM MobileHCI 2019, pages 1--13.

[31]

Wei Wang, Alex X Liu, Muhammad Shahzad, Kang Ling, and Sanglu Lu. Understanding and Modeling of WiFi Signal based Human Activity Recognition. In Proc. of ACM MobiCom 2015, pages 65--76.

Digital Library

[32]

Sheng Tan and Jie Yang. WiFinger: Leveraging Commodity WiFi for Fine-grained Finger Gesture Recognition. In Proc. of ACM MobiHoc 2016, pages 201--210.

[33]

Yongpan Zou, Jiang Xiao, Jinsong Han, Kaishun Wu, Yun Li, and Lionel M Ni. GRfid: A Device-Free RFID-based Gesture Recognition System. IEEE Transactions on Mobile Computing, 16(2):381--393, 2016.

Digital Library

[34]

Chuyu Wang, Lei Xie, Keyan Zhang, Wei Wang, Yanling Bu, and Sanglu Lu. Spin-Antenna: 3D Motion Tracking for Tag Array Labeled Objects via Spinning Antenna. In Proc. of IEEE INFOCOM 2019, pages 1--9.

Digital Library

[35]

Zhongqin Wang, Min Xu, Ning Ye, Ruchuan Wang, and Haiping Huang. RF-Focus: Computer Vision-assisted Region-of-interest RFID Tag Recognition and Localization in Multipath-prevalent Environments. Proc. of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 3(1):1--30, 2019.

Digital Library

[36]

Jianxin Wu, Adebola Osuntogun, Tanzeem Choudhury, Matthai Philipose, and James M. Rehg. A Scalable Approach to Activity Recognition based on Object Use. In Proc. of IEEE ICCV 2007, pages 1--8.

[37]

Zijuan Liu, Xiulong Liu, and Keqiu Li. Deeper Exercise Monitoring for Smart Gym using Fused RFID and CV Data. In Proc. of IEEE INFOCOM 2020, pages 11--19.

Digital Library

[38]

Tara Boroushaki, Junshan Leng, Ian Clester, Alberto Rodriguez, and Fadel Adib. Robotic Grasping of Fully-occluded Objects using RF Perception. In Proc. of IEEE ICRA 2021.

Digital Library

Cited By

Yin JXie XMao HGuo S(2025)Efficient Missing Key Tag Identification in Large-Scale RFID Systems: An Iterative Verification and Selection MethodIEEE Transactions on Mobile Computing10.1109/TMC.2024.349359724:3(2253-2269)Online publication date: Mar-2025
https://doi.org/10.1109/TMC.2024.3493597
Tavanti ENepa PGabbrielli RPirozzi M(2025)Review on Systems Combining Computer Vision and Radio Frequency IdentificationIEEE Internet of Things Journal10.1109/JIOT.2024.348475512:2(1291-1319)Online publication date: 15-Jan-2025
https://doi.org/10.1109/JIOT.2024.3484755
Siam SAhn HLiu LAlam SShen HCao ZShroff NKrishnamachari BSrivastava MZhang M(2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
https://dl.acm.org/doi/10.1145/3690639
Show More Cited By

Index Terms

RFID and camera fusion for recognition of human-object interactions
1. Human-centered computing
  1. Human computer interaction (HCI)
  2. Ubiquitous and mobile computing
    1. Ubiquitous and mobile computing systems and tools

Recommendations

Pairwise Body-Part Attention for Recognizing Human-Object Interactions
Computer Vision – ECCV 2018
Abstract
In human-object interactions (HOI) recognition, conventional methods consider the human body as a whole and pay a uniform attention to the entire body region. They ignore the fact that normally, human interacts with an object by using some parts ...
Indoor mobile object tracking using RFID

Indoor tracking has been a challenging task compared to outdoor cases provided by GPS and a variety of ranging-based solutions. In this work, we propose a promising approach using RFID for indoor mobile object tracking. A moving object equipped with an ...
RFID: The Next Serious Threat to Privacy

Radio Frequency Identification, or RFID, is a technology which has been receiving considerable attention as of late. It is a fairly simple technology involving radio wave communication between a microchip and an electronic reader, in which an ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MobiCom '21: Proceedings of the 27th Annual International Conference on Mobile Computing and Networking

October 2021

887 pages

ISBN:9781450383424

DOI:10.1145/3447993

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMOBILE: ACM Special Interest Group on Mobility of Systems, Users, Data and Computing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

ACM MobiCom '21

Sponsor:

SIGMOBILE

ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking

October 25 - 29, 2021

Louisiana, New Orleans

Acceptance Rates

Overall Acceptance Rate 440 of 2,972 submissions, 15%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

24
Total Citations
View Citations
1,580
Total Downloads

Downloads (Last 12 months)225
Downloads (Last 6 weeks)21

Reflects downloads up to 18 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Yin JXie XMao HGuo S(2025)Efficient Missing Key Tag Identification in Large-Scale RFID Systems: An Iterative Verification and Selection MethodIEEE Transactions on Mobile Computing10.1109/TMC.2024.349359724:3(2253-2269)Online publication date: Mar-2025
https://doi.org/10.1109/TMC.2024.3493597
Tavanti ENepa PGabbrielli RPirozzi M(2025)Review on Systems Combining Computer Vision and Radio Frequency IdentificationIEEE Internet of Things Journal10.1109/JIOT.2024.348475512:2(1291-1319)Online publication date: 15-Jan-2025
https://doi.org/10.1109/JIOT.2024.3484755
Siam SAhn HLiu LAlam SShen HCao ZShroff NKrishnamachari BSrivastava MZhang M(2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
https://dl.acm.org/doi/10.1145/3690639
Wu YChen SMeng XTong XLiu XXie XQu WOkoshi TKo JLiKamWa R(2024)Enabling 6D Pose Tracking on Your Acoustic DevicesProceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services10.1145/3643832.3661875(15-28)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3643832.3661875
Liu HLiu XXie XTong XLi K(2024)PmTrackProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36314337:4(1-30)Online publication date: 12-Jan-2024
https://dl.acm.org/doi/10.1145/3631433
Wu JWang LJin QLiu F(2024)Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-AlignmentIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.334051835:2(280-296)Online publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1109/TPDS.2023.3340518
Liu XZhang BWang LChen SXie XTong XGu TLi K(2024)Fine-Grained Recognition of Manipulation Activities on Objects via Multi-Modal SensingIEEE Transactions on Mobile Computing10.1109/TMC.2024.336452223:10(9614-9628)Online publication date: Oct-2024
https://doi.org/10.1109/TMC.2024.3364522
Wang PMa XZheng RChen LZhang XZeghlache DZhang D(2024)SlpRoF: Improving the Temporal Coverage and Robustness of RF-Based Vital Sign Monitoring During SleepIEEE Transactions on Mobile Computing10.1109/TMC.2023.334092523:7(7848-7864)Online publication date: Jul-2024
https://doi.org/10.1109/TMC.2023.3340925
Liu KChen LYu JCui H(2024)On Batch Writing in COTS RFID SystemsIEEE Transactions on Mobile Computing10.1109/TMC.2023.328323823:5(3846-3857)Online publication date: May-2024
https://doi.org/10.1109/TMC.2023.3283238
Lin KChen HYan NNi ZWang ZYu J(2024)Double Polling-Based Tag Information Collection for Sensor-Augmented RFID SystemsIEEE Transactions on Mobile Computing10.1109/TMC.2023.327792523:5(3496-3509)Online publication date: May-2024
https://doi.org/10.1109/TMC.2023.3277925
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten