MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

Chen, Xingyu; Liu, Yufeng; Dong, Yajiao; Zhang, Xiong; Ma, Chongyang; Xiong, Yanmin; Zhang, Yuan; Guo, Xiaoyan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2112.02753 (cs)

[Submitted on 6 Dec 2021 (v1), last revised 31 Mar 2022 (this version, v2)]

Title:MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

Authors:Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo

View PDF

Abstract:In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence. Specifically, for 2D encoding, we propose lightweight yet effective stacked structures. Regarding 3D decoding, we provide an efficient graph operator, namely depth-separable spiral convolution. Moreover, we present a novel feature lifting module for bridging the gap between 2D and 3D representations. This module begins with a map-based position regression (MapReg) block to integrate the merits of both heatmap encoding and position regression paradigms for improved 2D accuracy and temporal coherence. Furthermore, MapReg is followed by pose pooling and pose-to-vertex lifting approaches, which transform 2D pose encodings to semantic features of 3D vertices. Overall, our hand reconstruction framework, called MobRecon, comprises affordable computational costs and miniature model size, which reaches a high inference speed of 83FPS on Apple A14 CPU. Extensive experiments on popular datasets such as FreiHAND, RHD, and HO3Dv2 demonstrate that our MobRecon achieves superior performance on reconstruction accuracy and temporal coherence. Our code is publicly available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.02753 [cs.CV]
	(or arXiv:2112.02753v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.02753
Journal reference:	CVPR2022

Submission history

From: Xingyu Chen [view email]
[v1] Mon, 6 Dec 2021 03:01:24 UTC (8,356 KB)
[v2] Thu, 31 Mar 2022 03:30:50 UTC (6,928 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators