research-article

DuploTrack: a real-time system for authoring and guiding duplo block assembly

Authors:

Michael CohenAuthors Info & Claims

UIST '12: Proceedings of the 25th annual ACM symposium on User interface software and technology

Pages 389 - 402

https://doi.org/10.1145/2380116.2380167

Published: 07 October 2012 Publication History

Abstract

We demonstrate a realtime system which infers and tracks the assembly process of a snap-together block model using a Kinect® sensor. The inference enables us to build a virtual replica of the model at every step. Tracking enables us to provide context specific visual feedback on a screen by augmenting the rendered virtual model aligned with the physical model. The system allows users to author a new model and uses the inferred assembly process to guide its recreation by others. We propose a novel way of assembly guidance where the next block to be added is rendered in blinking mode with the tracked virtual model on screen. The system is also able to detect any mistakes made and helps correct them by providing appropriate feedback. We focus on assemblies of Duplo® blocks.

We discuss the shortcomings of existing methods of guidance - static figures or recorded videos - and demonstrate how our method avoids those shortcomings. We also report on a user study to compare our system with standard figure-based guidance methods found in user manuals. The results of the user study suggest that our method is able to aid users' structural perception of the model better, leads to fewer assembly errors, and reduces model construction time.

Supplementary Material

JPG File (paper_0104-file3.jpg)

Download
11.27 KB

suppl.mov (paper_0104-file3.mp4)

Supplemental video

Download
72.11 MB

References

[1]

Agrawala, M., Li, W., and Berthouzoz, F. Design principles for visual communication. Communications of the ACM (Apr. 2011), 60--69.

Digital Library

[2]

Agrawala, M., Phan, D., Heiser, J., Haymaker, J., Klingner, J., Hanrahan, P., and Tversky, B. Designing effective step-by-step assembly instructions. In Proc. SIGGRAPH '03, ACM Press (2003), 828--837.

Digital Library

[3]

Anderson, D., Frankel, J. L., Marks, J., Agarwala, A., Beardsley, P., Hodgins, J., Leigh, D., Ryall, K., Sullivan, E., and Yedidia, J. S. Tangible interaction + graphical interpretation: a new approach to 3d modeling. In Proc. SIGGRAPH '00, ACM Press/Addison-Wesley Publishing Co. (2000), 393--402.

Digital Library

[4]

Besl, P., and McKay, N. A method for registration of 3-d shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992), 239--256.

Digital Library

[5]

Boud, A. C., Haniff, D. J., Baber, C., and Steiner, S. J. Virtual reality and augmented reality as a training tool for assembly tasks. In Proc. IV '99, IEEE Computer Society (1999), 32--36.

Digital Library

[6]

Casey, M. B., Winner, E., Brabeck, M. M., Sullivan, K., Gilhooly, K. J., Keane, M. T. G., Logie, R. H., and Erdos, G. Visual-spatial abilities in art, maths and science majors: Effects of sex, family handedness and spatial experience. Lines of thinking: Reflections on the psychology of thought 2 (1990), 275--294.

[7]

Domini, F., and Caudek, C. 3-d structure perceived from dynamic information: a new theory. Trends in Cognitive Sciences 7, 10 (2003), 444--449.

[8]

Ferris, S. H. Motion parallax and absolute distance. Journal of Experimental Psychology 95, 2 (1972), 258--263.

[9]

Goldstein, D., Haldane, D., and Mitchell, C. Sex differences in visual-spatial ability: The role of performance factors. Memory & Cognition 18, 5 (1990), 546--550.

[10]

Heiser, J., Phan, D., Agrawala, M., Tversky, B., and Hanrahan, P. Identification and validation of cognitive design principles for automated generation of assembly instructions. In Proc. AVI '04, ACM Press (2004), 311--319.

Digital Library

[11]

Henderson, S. J., and Feiner, S. K. Augmented reality in the psychomotor phase of a procedural task. In Proc. ISMAR '11, IEEE Computer Society (2011), 191--200.

Digital Library

[12]

Hou, L., and Wang, X. Using augmented reality to cognitively facilitate product assembly process. Augmented Reality (2010), 99--112.

[13]

Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe, R., Kohli, P., Shotton, J., Hodges, S., Freeman, D., Davison, A., and Fitzgibbon, A. Kinectfusion: real-time 3d reconstruction and interaction using a moving depth camera. In Proc. UIST '11, ACM Press (2011), 559--568.

Digital Library

[14]

Jota, R., and Benko, H. Constructing virtual 3d models with physical building blocks. In Proc. CHI EA '11, ACM Press (2011), 2173--2178.

Digital Library

[15]

Ju, W., Bonanni, L., Fletcher, R., Hurwitz, R., Judd, T., Post, R., Reynolds, M., and Yoon, J. Origami desk: integrating technological innovation and human-centric design. In Proc. DIS '02, ACM Press (2002), 399--405.

Digital Library

[16]

Kraut, R. E., Fussell, S. R., and Siegel, J. Visual information as a conversational resource in collaborative physical tasks. Hum.-Comput. Interact. 18, 1 (June 2003), 13--49.

Digital Library

[17]

Li, W., Agrawala, M., Curless, B., and Salesin, D. Automated generation of interactive 3d exploded view diagrams. In Proc. SIGGRAPH '08, ACM Press (2008), 101:1--101:7.

Digital Library

[18]

Miller, A., White, B., Charbonneau, E., Kanzler, Z., and LaViola Jr., J. J. Interactive 3d model acquisition and tracking of building block structures. IEEE Transactions on Visualization and Computer Graphics 18, 4 (Apr. 2012), 651--659.

Digital Library

[19]

Mitra, N. J., Yang, Y.-L., Yan, D.-M., Li, W., and Agrawala, M. Illustrating how mechanical assemblies work. In Proc. SIGGRAPH '10, ACM Press (2010), 58:1--58:12.

Digital Library

[20]

Molineros, J. M. Computer vision and augmented reality for guiding assembly. PhD thesis, Pennsylvania State University, University Park, PA, USA, 2002.

Digital Library

[21]

Pongnumkul, S., Dontcheva, M., Li, W., Wang, J., Bourdev, L., Avidan, S., and Cohen, M. F. Pause-and-play: automatically linking screencast video tutorials with applications. In Proc. UIST '11, ACM Press (2011), 135--144.

Digital Library

[22]

Preiss, D. D., and Sternberg, R. J. Effects of technology on verbal and visual-spatial abilities. Cognitive Technology 11, 1 (2006), 14--22.

[23]

Rubner, Y., Tomasi, C., and Guibas, L. J. A metric for distributions with applications to image databases. In Proc. ICCV '98, IEEE Computer Society (1998), 59--66.

Digital Library

[24]

Shepard, R. N., and Metzler, J. Mental rotation of three-dimensional objects. Science 171 (1971), 701--703.

[25]

Stumpf, H., and Eliot, J. A structural analysis of visual spatial ability in academically talented students. Learning and Individual Differences 11, 2 (1999), 137--151.

[26]

Surhone, L., Timpledon, M., and Marseken, S. Lego Digital Designer. VDM Verlag Dr. Mueller AG & Company Kg, 2010.

[27]

Tang, A., Owen, C., Biocca, F., and Mou, W. Comparative effectiveness of augmented reality in object assembly. In Proc. CHI '03, ACM Press (2003), 73--80.

Digital Library

[28]

Wallach, H., and O'Connell, D. N. The kinetic depth effect. Journal of Experimental Psychology 45, 4 (1953).

Cited By

Sadprasid BGutwin CBateman S(2024)Improving Video Navigation for Spatial Task Tutorials by Spatially Segmenting and Situating How-To VideosProceedings of the 2024 ACM Symposium on Spatial User Interaction10.1145/3677386.3682103(1-13)Online publication date: 7-Oct-2024
https://dl.acm.org/doi/10.1145/3677386.3682103
Keelawat PSuzuki R(2024)Transforming Procedural Instructions into In-Situ Augmented Reality Guides with InstructARAdjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3672539.3686321(1-3)Online publication date: 13-Oct-2024
https://dl.acm.org/doi/10.1145/3672539.3686321
Dominiak JWalczak AStefanidi EAdamkiewicz KGrudzień KNiess JWoźniak P(2024)ProtoBricks: A Research Toolkit for Tangible Prototyping & Data PhysicalizationProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661573(476-495)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.1145/3643834.3661573
Show More Cited By

Index Terms

DuploTrack: a real-time system for authoring and guiding duplo block assembly

Recommendations

Mixing Realities at Ismar 2009: Scary and Wondrous

The Eighth IEEE International Symposium on Mixed and Augmented Reality (Ismar 2009) combined a traditional science-and-technology track with an art, media, and humanities track to provide a nontraditional cross-disciplinary view of an increasingly ...
O-Displaying: an orientation-based augmented reality display on a smart glass with a user tracking from a depth camera
SA '17: SIGGRAPH Asia 2017 Posters

Augmented reality (AR) contents are widely viewed from mobile devices, e.g. smart phones, tablets, and smart glasses [Olsson and Salo 2011]. To properly display AR contents on a screen, color cameras are widely used to detect markers, feature points, ...
Interacting with virtual reality scenes on mobile devices
MobileHCI '05: Proceedings of the 7th international conference on Human computer interaction with mobile devices & services

This paper discusses alternative approaches for interacting with virtual reality scenes on mobile devices, based upon work conducted as part of the locus project [4]. Three prototypes are introduced that adopt different interaction paradigms for mobile ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

UIST '12: Proceedings of the 25th annual ACM symposium on User interface software and technology

October 2012

608 pages

ISBN:9781450315807

DOI:10.1145/2380116

General Chair:
Rob Miller
MIT CSAIL, USA
,
Program Chairs:
Hrvoje Benko
Microsoft Research, USA
,
Celine Latulipe
University of North Carolina at Charlotte, USA

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

UIST '12

Sponsor:

UIST '12: The 25th Annual ACM Symposium on User Interface Software and Technology

October 7 - 10, 2012

Massachusetts, Cambridge, USA

Acceptance Rates

Overall Acceptance Rate 561 of 2,567 submissions, 22%

Upcoming Conference

UIST '25

Sponsor:
sigchi
sigchi

The 38th Annual ACM Symposium on User Interface Software and Technology

September 28 - October 1, 2025

Busan , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

89
Total Citations
View Citations
1,331
Total Downloads

Downloads (Last 12 months)39
Downloads (Last 6 weeks)1

Reflects downloads up to 25 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sadprasid BGutwin CBateman S(2024)Improving Video Navigation for Spatial Task Tutorials by Spatially Segmenting and Situating How-To VideosProceedings of the 2024 ACM Symposium on Spatial User Interaction10.1145/3677386.3682103(1-13)Online publication date: 7-Oct-2024
https://dl.acm.org/doi/10.1145/3677386.3682103
Keelawat PSuzuki R(2024)Transforming Procedural Instructions into In-Situ Augmented Reality Guides with InstructARAdjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3672539.3686321(1-3)Online publication date: 13-Oct-2024
https://dl.acm.org/doi/10.1145/3672539.3686321
Dominiak JWalczak AStefanidi EAdamkiewicz KGrudzień KNiess JWoźniak P(2024)ProtoBricks: A Research Toolkit for Tangible Prototyping & Data PhysicalizationProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661573(476-495)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.1145/3643834.3661573
Yang JQiu LCorona-Moreno EShi LBui HLam MLanday J(2024)AMMA: Adaptive Multimodal Assistants Through Automated State Tracking and User Model-Directed Guidance Planning2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR)10.1109/VR58804.2024.00108(892-902)Online publication date: 16-Mar-2024
https://doi.org/10.1109/VR58804.2024.00108
Walsman AZhang MFishman AFarhadi AFox D(2024)Learning to Build by Building Your Own InstructionsComputer Vision – ECCV 202410.1007/978-3-031-73024-5_16(261-278)Online publication date: 24-Nov-2024
https://doi.org/10.1007/978-3-031-73024-5_16
Evangelista Belo JWissing JFeuchtner TGrønbæk K(2023)CADTrack: Instructions and Support for Orientation Disambiguation of Near-Symmetrical ObjectsProceedings of the ACM on Human-Computer Interaction10.1145/36264627:ISS(1-20)Online publication date: 1-Nov-2023
https://dl.acm.org/doi/10.1145/3626462
Huang XMiller KSample ABanovic N(2023)StructureSenseProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35703436:4(1-25)Online publication date: 11-Jan-2023
https://dl.acm.org/doi/10.1145/3570343
Liu ZZhu ZJiang EHuang FVillanueva AQian XWang TRamani K(2023)InstruMentAR: Auto-Generation of Augmented Reality Tutorials for Operating Digital Instruments Through Recording Embodied DemonstrationProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581442(1-17)Online publication date: 19-Apr-2023
https://dl.acm.org/doi/10.1145/3544548.3581442
Ramakers RLeen DKim JLuyten KHouben SVeuskens T(2023)Measurement Patterns: User-Oriented Strategies for Dealing with Measurements and Dimensions in Making ProcessesProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581157(1-17)Online publication date: 19-Apr-2023
https://dl.acm.org/doi/10.1145/3544548.3581157
Myers MWright NMcGough AMartin N(2023)Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)10.1109/WACVW58289.2023.00052(1-10)Online publication date: Jan-2023
https://doi.org/10.1109/WACVW58289.2023.00052
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten