Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3512527.3531438acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network

Published: 27 June 2022 Publication History

Abstract

To support the replication of ?Human Object Interaction Detection via Multi-level Conditioned Network", which was presented at ICMR'20, this companion paper provides the details of the artifacts. Human Object Interaction Detection (HOID) aims to recognize fine-grained object-specific human actions, which demands the capabilities of both visual perception and reasoning. In this paper, we explain the file structure of the source code and publish the details of our experiments settings. We also provide a program for component analysis to assist other researchers with experiments on alternative models that are not included in our experiments. Moreover, we provide a demo program for facilitating the use of our model.

Supplementary Material

MP4 File (ICMR22-rp2.mp4)
This work is a reproducibility paper of ?Human Object Interaction Detection via Multi-level Conditioned Network? which was published in ICMR 2020. In the video, we first introduce the task definition of Human-Object Interaction Detection, and clarify our motivation as bridging the gap between the low-level visual information of pixels and complex semantics of HOIs. Then we briefly describe the general framework of the proposed MLCNet. Afterwards, we show how we set up the experiments. Meanwhile, the main experimental results and some visualization results are presented. Finally, we report some efforts in the process of the reproducibility work, including code refactoring and the support of compatibility of multi-version of dependencies.

References

[1]
Yu-Wei. Chao, Yunfan. Liu, Xieyang. Liu, Huayi. Zeng, and Jia. Deng. 2018. Learning to Detect Human-Object Interactions. In WACV.
[2]
Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, and Jia Deng. 2015. HICO: A Benchmark for Recognizing Human-Object Interactions in Images. In ICCV.
[3]
Hao-Shu Fang, Guansong Lu, Xiaolin Fang, Jianwen Xie, Yu-Wing Tai, and Cewu Lu. 2018. Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer. In CVPR.
[4]
Chen Gao, Yuliang Zou, and Jia-Bin Huang. 2018. iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection. In BMVC.
[5]
Saurabh Gupta and Jitendra Malik. 2015. Visual Semantic Role Labeling. CoRR (2015).
[6]
Yong-Lu Li, Siyuan Zhou, Xijie Huang, Liang Xu, Ze Ma, Hao-Shu Fang, Yanfeng Wang, and Cewu Lu. 2019. Transferable Interactiveness Knowledge for Human- Object Interaction Detection. In CVPR.
[7]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In ECCV.
[8]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NeurIPS.
[9]
Xu Sun, Yunqing He, Tongwei Ren, and Gangshan Wu. 2021. Spatial-Temporal Human-Object Interaction Detection. In ICME.
[10]
Xu Sun, Xinwen Hu, Tongwei Ren, and Gangshan Wu. 2020. Human Object Interaction Detection via Multi-Level Conditioned Network. In ICMR.
[11]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. NeurIPS (2017).
[12]
Jianwei Yang, Jiasen Lu, Dhruv Batra, and Devi Parikh. 2017. A Faster Pytorch Implementation of Faster R-CNN. https://github.com/jwyang/faster-rcnn.pytorch (2017).

Index Terms

  1. Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval
    June 2022
    714 pages
    ISBN:9781450392389
    DOI:10.1145/3512527
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 June 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. conditioned network
    2. feature transformation
    3. human object interaction detection
    4. multilevel visual representation
    5. multimodal feature fusion

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    ICMR '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 73
      Total Downloads
    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 02 Sep 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media