Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3474085.3478325acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Fast and Flexible Human Pose Estimation with HyperPose

Published: 17 October 2021 Publication History
  • Get Citation Alerts
  • Abstract

    Estimating human pose is an important yet challenging task in multimedia applications. Existing pose estimation libraries target reproducing standard pose estimation algorithms. When it comes to customising these algorithms for real-world applications, none of the existing libraries can offer both the flexibility of developing custom pose estimation algorithms and the high-performance of executing these algorithms on commodity devices. In this paper, we introduce Hyperpose, a novel flexible and high-performance pose estimation library. Hyperpose provides expressive Python APIs that enable developers to easily customise pose estimation algorithms for their applications. It further provides a model inference engine highly optimised for real-time pose estimation. This engine can dynamically dispatch carefully designed pose estimation tasks to CPUs and GPUs, thus automatically achieving high utilisation of hardware resources irrespective of deployment environments. Extensive evaluation results show that Hyperpose can achieve up to 3.1x~7.3x higher pose estimation throughput compared to state-of-the-art pose estimation libraries without compromising estimation accuracy. By 2021, Hyperpose has received over 1000 stars on GitHub and attracted users from both industry and academy.

    References

    [1]
    Zhe Cao, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2017. Realtime multi-person 2d pose estimation using part affinity fields. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7291--7299.
    [2]
    Hao Dong, Akara Supratak, Luo Mai, Fangde Liu, Axel Oehmichen, Simiao Yu, and Yike Guo. 2017. TensorLayer: A Versatile Library for Efficient Deep Learning Development. In Proceedings of the 25th ACM International Conference on Multimedia (Mountain View, California, USA) (MM '17). Association for Computing Machin- ery, New York, NY, USA, 1201--1204. https://doi.org/10.1145/3123266.3129391
    [3]
    Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, and Cewu Lu. 2017. Rmpe: Regional multi-person pose estimation. In Proceedings of the IEEE international conference on computer vision. 2334--2343.
    [4]
    Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, and Jingdong Wang. 2021. Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14676--14686.
    [5]
    Ginés Hidalgo 2020. OpenPose: Real-time multi-person keypoint detection library. https://github.com/Carnegie Mellon University-Perceptual-Computing-Lab/openpose
    [6]
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
    [7]
    Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).
    [8]
    Dongwoo Kim Ildoo Kim. 2019. tf-pose-estimation. Retrieved June 7, 2021 from https://github.com/ildoonet/tf-pose-estimation
    [9]
    Alexandros Koliousis, Pijika Watcharapichat, Matthias Weidlich, Luo Mai, Paolo Costa, and Peter Pietzuch. [n.d.]. CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers. Proceedings of the VLDB Endowment 12, 11 ([n. d.]).
    [10]
    Sven Kreiss, Lorenzo Bertoni, and Alexandre Alahi. 2019. Pifpaf: Composite fields for human pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11977--11986.
    [11]
    Luo Mai, Alexandros Koliousis, Guo Li, Andrei-Octavian Brabete, and Peter Pietzuch. 2019. Taming hyper-parameters in deep learning systems. ACM SIGOPS Operating Systems Review 53, 1 (2019), 52--58.
    [12]
    Luo Mai, Guo Li, Marcel Wagenländer, Konstantinos Fertakis, Andrei-Octavian Brabete, and Peter Pietzuch. 2020. KungFu: Making Training in Distributed Machine Learning Adaptive. In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). 937--954.
    [13]
    Luo Mai, Kai Zeng, Rahul Potharaju, Le Xu, Steve Suh, Shivaram Venkataraman, Paolo Costa, Terry Kim, Saravanan Muthukrishnan, Vamsi Kuppa, et al. 2018. Chi: A scalable and programmable control plane for distributed stream processing systems. Proceedings of the VLDB Endowment 11, 10 (2018), 1303--1316.
    [14]
    Taiki Sekii. 2018. Pose proposal networks. In Proceedings of the European Confer- ence on Computer Vision (ECCV). 342--357.
    [15]
    Sven Kreiss 2021. OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association. https://github.com/openpifpaf/openpifpaf
    [16]
    Satoshi Terasaki. 2018. Chainer implementation of Pose Proposal Networks. https: //github.com/Idein/chainer-pose-proposal-net
    [17]
    Le Xu, Shivaram Venkataraman, Indranil Gupta, Luo Mai, and Rahul Potharaju. 2021. Move Fast and Meet Deadlines: Fine-grained Real-time Stream Processing with Cameo. In 18th {USENIX}Symposium on Networked Systems Design and Implementation ({NSDI}21). 389--405.
    [18]
    Wei Yang. 2019. pytorch-pose. https://github.com/bearpaw/pytorch-pose
    [19]
    Xingyi Zhou, Dequan Wang, and Philipp Krähenbühl. 2019. Objects as points. arXiv preprint arXiv:1904.07850 (2019)

    Cited By

    View all
    • (2024)Визначення правильної постави велосипедиста засобами комп'ютерного зоруScientific Bulletin of UNFU10.36930/4034031134:3(87-95)Online publication date: 28-Mar-2024
    • (2024)Shadow of HyperPose: New Animation SystemACM SIGGRAPH 2024 Talks10.1145/3641233.3664311(1-2)Online publication date: 18-Jul-2024
    • (2023)Automated Implementation of the Edinburgh Visual Gait Score (EVGS) Using OpenPose and Handheld Smartphone VideoSensors10.3390/s2310483923:10(4839)Online publication date: 17-May-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '21: Proceedings of the 29th ACM International Conference on Multimedia
    October 2021
    5796 pages
    ISBN:9781450386517
    DOI:10.1145/3474085
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. computer vision
    2. high-performance computing
    3. pose estimation

    Qualifiers

    • Short-paper

    Funding Sources

    Conference

    MM '21
    Sponsor:
    MM '21: ACM Multimedia Conference
    October 20 - 24, 2021
    Virtual Event, China

    Acceptance Rates

    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)50
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 26 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Визначення правильної постави велосипедиста засобами комп'ютерного зоруScientific Bulletin of UNFU10.36930/4034031134:3(87-95)Online publication date: 28-Mar-2024
    • (2024)Shadow of HyperPose: New Animation SystemACM SIGGRAPH 2024 Talks10.1145/3641233.3664311(1-2)Online publication date: 18-Jul-2024
    • (2023)Automated Implementation of the Edinburgh Visual Gait Score (EVGS) Using OpenPose and Handheld Smartphone VideoSensors10.3390/s2310483923:10(4839)Online publication date: 17-May-2023
    • (2023)Markerless human pose estimation for biomedical applications: a surveyFrontiers in Computer Science10.3389/fcomp.2023.11531605Online publication date: 6-Jul-2023
    • (2023)Real-Time Localization for Closed-Loop Control of Assistive FurnitureIEEE Robotics and Automation Letters10.1109/LRA.2023.32873658:8(4799-4806)Online publication date: Aug-2023
    • (2023)Deep Learning-based Feature Fusion for Action Recognition Using Skeleton Information2023 International Conference on Robotics and Automation in Industry (ICRAI)10.1109/ICRAI57502.2023.10089577(1-6)Online publication date: 3-Mar-2023
    • (2023)Smart Vision Software Application using Machine Learning2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS)10.1109/ICAIS56108.2023.10073814(537-541)Online publication date: 2-Feb-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media