research-article

Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network

Authors:

Xu Sun,

Maria Sinziiana Astefanoaei,

Andreas LeibetsederAuthors Info & Claims

ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

Pages 681 - 684

https://doi.org/10.1145/3512527.3531438

Published: 27 June 2022 Publication History

Get Access

Abstract

To support the replication of ?Human Object Interaction Detection via Multi-level Conditioned Network", which was presented at ICMR'20, this companion paper provides the details of the artifacts. Human Object Interaction Detection (HOID) aims to recognize fine-grained object-specific human actions, which demands the capabilities of both visual perception and reasoning. In this paper, we explain the file structure of the source code and publish the details of our experiments settings. We also provide a program for component analysis to assist other researchers with experiments on alternative models that are not included in our experiments. Moreover, we provide a demo program for facilitating the use of our model.

Supplementary Material

MP4 File (ICMR22-rp2.mp4)

This work is a reproducibility paper of ?Human Object Interaction Detection via Multi-level Conditioned Network? which was published in ICMR 2020. In the video, we first introduce the task definition of Human-Object Interaction Detection, and clarify our motivation as bridging the gap between the low-level visual information of pixels and complex semantics of HOIs. Then we briefly describe the general framework of the proposed MLCNet. Afterwards, we show how we set up the experiments. Meanwhile, the main experimental results and some visualization results are presented. Finally, we report some efforts in the process of the reproducibility work, including code refactoring and the support of compatibility of multi-version of dependencies.

Download
793.71 MB

References

[1]

Yu-Wei. Chao, Yunfan. Liu, Xieyang. Liu, Huayi. Zeng, and Jia. Deng. 2018. Learning to Detect Human-Object Interactions. In WACV.

Google Scholar

[2]

Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, and Jia Deng. 2015. HICO: A Benchmark for Recognizing Human-Object Interactions in Images. In ICCV.

Google Scholar

[3]

Hao-Shu Fang, Guansong Lu, Xiaolin Fang, Jianwen Xie, Yu-Wing Tai, and Cewu Lu. 2018. Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer. In CVPR.

Google Scholar

[4]

Chen Gao, Yuliang Zou, and Jia-Bin Huang. 2018. iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection. In BMVC.

Google Scholar

[5]

Saurabh Gupta and Jitendra Malik. 2015. Visual Semantic Role Labeling. CoRR (2015).

Google Scholar

[6]

Yong-Lu Li, Siyuan Zhou, Xijie Huang, Liang Xu, Ze Ma, Hao-Shu Fang, Yanfeng Wang, and Cewu Lu. 2019. Transferable Interactiveness Knowledge for Human- Object Interaction Detection. In CVPR.

Google Scholar

[7]

Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In ECCV.

Google Scholar

[8]

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NeurIPS.

Google Scholar

[9]

Xu Sun, Yunqing He, Tongwei Ren, and Gangshan Wu. 2021. Spatial-Temporal Human-Object Interaction Detection. In ICME.

Google Scholar

[10]

Xu Sun, Xinwen Hu, Tongwei Ren, and Gangshan Wu. 2020. Human Object Interaction Detection via Multi-Level Conditioned Network. In ICMR.

Google Scholar

[11]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. NeurIPS (2017).

Google Scholar

[12]

Jianwei Yang, Jiasen Lu, Dhruv Batra, and Devi Parikh. 2017. A Faster Pytorch Implementation of Faster R-CNN. https://github.com/jwyang/faster-rcnn.pytorch (2017).

Google Scholar

Index Terms

Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Scene understanding

Recommendations

Human Object Interaction Detection via Multi-level Conditioned Network
ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval

As one of the essential problems in scene understanding, human object interaction detection (HOID) aims to recognize fine-grained object-specific human actions, which demands the capabilities of both visual perception and reasoning. Existing methods ...
Human object interaction detection based on feature optimization and key human-object enhancement
Abstract
Aiming at the problem of unclear or missing human object interaction behavior objects in complex background, we propose a human object interaction detection algorithm based on feature optimization and key human-object enhancement. In ...
Object Centric Body Part Attention Network for Human-Object Interaction Detection
Pattern Recognition and Computer Vision
Abstract
The current transformer-based human object interaction (HOI) detection methods have achieved great progress, however, these methods adopt the same structre of decoder to detect human and object, which limits the accuracy of object feature ...

Comments

Information & Contributors

Information

Published In

ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

June 2022

714 pages

ISBN:9781450392389

DOI:10.1145/3512527

General Chairs:
Vincent Oria
New Jersey Institute of Technology, USA
,
Maria Luisa Sapino
Università degli Studi di Torino, Italy
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Brigitte Kerhervé
Université du Québec à Montréal, Canada
,
Program Chairs:
Wen-Huang Cheng
National Yang Ming Chao Tung University, Taiwan
,
Ichiro Ide
Nagoya University, Japan
,
Vivek Singh
Rutgers University, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Natural Science Foundation of Jiangsu Province
Collaborative Innovation Center of Novel Software Technology and Industrialization
National Science Foundation of China

Conference

ICMR '22

Sponsor:

SIGMM

ICMR '22: International Conference on Multimedia Retrieval

June 27 - 30, 2022

NJ, Newark, USA

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
73
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)0

Reflects downloads up to 02 Sep 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Index Terms

Recommendations

Human Object Interaction Detection via Multi-level Conditioned Network

Human object interaction detection based on feature optimization and key human-object enhancement

Object Centric Body Part Attention Network for Human-Object Interaction Detection

Comments

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Other Metrics

Article Metrics

Other Metrics

Login options

Full Access

PDF

eReader

Abstract

Supplementary Material

References

Index Terms

Recommendations

Human Object Interaction Detection via Multi-level Conditioned Network

Human object interaction detection based on feature optimization and key human-object enhancement

Object Centric Body Part Attention Network for Human-Object Interaction Detection

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations