3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing

Huang, Binghao; Wang, Yixuan; Yang, Xinyi; Luo, Yiyue; Li, Yunzhu

Computer Science > Robotics

arXiv:2410.24091 (cs)

[Submitted on 31 Oct 2024 (v1), last revised 6 Jan 2025 (this version, v2)]

Title:3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing

Authors:Binghao Huang, Yixuan Wang, Xinyi Yang, Yiyue Luo, Yunzhu Li

View PDF HTML (experimental)

Abstract:Tactile and visual perception are both crucial for humans to perform fine-grained interactions with their environment. Developing similar multi-modal sensing capabilities for robots can significantly enhance and expand their manipulation skills. This paper introduces \textbf{3D-ViTac}, a multi-modal sensing and learning system designed for dexterous bimanual manipulation. Our system features tactile sensors equipped with dense sensing units, each covering an area of 3$mm^2$. These sensors are low-cost and flexible, providing detailed and extensive coverage of physical contacts, effectively complementing visual information. To integrate tactile and visual data, we fuse them into a unified 3D representation space that preserves their 3D structures and spatial relationships. The multi-modal representation can then be coupled with diffusion policies for imitation learning. Through concrete hardware experiments, we demonstrate that even low-cost robots can perform precise manipulations and significantly outperform vision-only policies, particularly in safe interactions with fragile items and executing long-horizon tasks involving in-hand manipulation. Our project page is available at \url{this https URL}.

Comments:	Accepted at Conference on Robot Learning (CoRL) 2024
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2410.24091 [cs.RO]
	(or arXiv:2410.24091v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2410.24091

Submission history

From: Binghao Huang [view email]
[v1] Thu, 31 Oct 2024 16:22:53 UTC (8,683 KB)
[v2] Mon, 6 Jan 2025 22:23:50 UTC (8,683 KB)

Computer Science > Robotics

Title:3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators