Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2380116.2380167acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
research-article

DuploTrack: a real-time system for authoring and guiding duplo block assembly

Published: 07 October 2012 Publication History

Abstract

We demonstrate a realtime system which infers and tracks the assembly process of a snap-together block model using a Kinect® sensor. The inference enables us to build a virtual replica of the model at every step. Tracking enables us to provide context specific visual feedback on a screen by augmenting the rendered virtual model aligned with the physical model. The system allows users to author a new model and uses the inferred assembly process to guide its recreation by others. We propose a novel way of assembly guidance where the next block to be added is rendered in blinking mode with the tracked virtual model on screen. The system is also able to detect any mistakes made and helps correct them by providing appropriate feedback. We focus on assemblies of Duplo® blocks.
We discuss the shortcomings of existing methods of guidance - static figures or recorded videos - and demonstrate how our method avoids those shortcomings. We also report on a user study to compare our system with standard figure-based guidance methods found in user manuals. The results of the user study suggest that our method is able to aid users' structural perception of the model better, leads to fewer assembly errors, and reduces model construction time.

Supplementary Material

JPG File (paper_0104-file3.jpg)
suppl.mov (paper_0104-file3.mp4)
Supplemental video

References

[1]
Agrawala, M., Li, W., and Berthouzoz, F. Design principles for visual communication. Communications of the ACM (Apr. 2011), 60--69.
[2]
Agrawala, M., Phan, D., Heiser, J., Haymaker, J., Klingner, J., Hanrahan, P., and Tversky, B. Designing effective step-by-step assembly instructions. In Proc. SIGGRAPH '03, ACM Press (2003), 828--837.
[3]
Anderson, D., Frankel, J. L., Marks, J., Agarwala, A., Beardsley, P., Hodgins, J., Leigh, D., Ryall, K., Sullivan, E., and Yedidia, J. S. Tangible interaction + graphical interpretation: a new approach to 3d modeling. In Proc. SIGGRAPH '00, ACM Press/Addison-Wesley Publishing Co. (2000), 393--402.
[4]
Besl, P., and McKay, N. A method for registration of 3-d shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992), 239--256.
[5]
Boud, A. C., Haniff, D. J., Baber, C., and Steiner, S. J. Virtual reality and augmented reality as a training tool for assembly tasks. In Proc. IV '99, IEEE Computer Society (1999), 32--36.
[6]
Casey, M. B., Winner, E., Brabeck, M. M., Sullivan, K., Gilhooly, K. J., Keane, M. T. G., Logie, R. H., and Erdos, G. Visual-spatial abilities in art, maths and science majors: Effects of sex, family handedness and spatial experience. Lines of thinking: Reflections on the psychology of thought 2 (1990), 275--294.
[7]
Domini, F., and Caudek, C. 3-d structure perceived from dynamic information: a new theory. Trends in Cognitive Sciences 7, 10 (2003), 444--449.
[8]
Ferris, S. H. Motion parallax and absolute distance. Journal of Experimental Psychology 95, 2 (1972), 258--263.
[9]
Goldstein, D., Haldane, D., and Mitchell, C. Sex differences in visual-spatial ability: The role of performance factors. Memory & Cognition 18, 5 (1990), 546--550.
[10]
Heiser, J., Phan, D., Agrawala, M., Tversky, B., and Hanrahan, P. Identification and validation of cognitive design principles for automated generation of assembly instructions. In Proc. AVI '04, ACM Press (2004), 311--319.
[11]
Henderson, S. J., and Feiner, S. K. Augmented reality in the psychomotor phase of a procedural task. In Proc. ISMAR '11, IEEE Computer Society (2011), 191--200.
[12]
Hou, L., and Wang, X. Using augmented reality to cognitively facilitate product assembly process. Augmented Reality (2010), 99--112.
[13]
Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe, R., Kohli, P., Shotton, J., Hodges, S., Freeman, D., Davison, A., and Fitzgibbon, A. Kinectfusion: real-time 3d reconstruction and interaction using a moving depth camera. In Proc. UIST '11, ACM Press (2011), 559--568.
[14]
Jota, R., and Benko, H. Constructing virtual 3d models with physical building blocks. In Proc. CHI EA '11, ACM Press (2011), 2173--2178.
[15]
Ju, W., Bonanni, L., Fletcher, R., Hurwitz, R., Judd, T., Post, R., Reynolds, M., and Yoon, J. Origami desk: integrating technological innovation and human-centric design. In Proc. DIS '02, ACM Press (2002), 399--405.
[16]
Kraut, R. E., Fussell, S. R., and Siegel, J. Visual information as a conversational resource in collaborative physical tasks. Hum.-Comput. Interact. 18, 1 (June 2003), 13--49.
[17]
Li, W., Agrawala, M., Curless, B., and Salesin, D. Automated generation of interactive 3d exploded view diagrams. In Proc. SIGGRAPH '08, ACM Press (2008), 101:1--101:7.
[18]
Miller, A., White, B., Charbonneau, E., Kanzler, Z., and LaViola Jr., J. J. Interactive 3d model acquisition and tracking of building block structures. IEEE Transactions on Visualization and Computer Graphics 18, 4 (Apr. 2012), 651--659.
[19]
Mitra, N. J., Yang, Y.-L., Yan, D.-M., Li, W., and Agrawala, M. Illustrating how mechanical assemblies work. In Proc. SIGGRAPH '10, ACM Press (2010), 58:1--58:12.
[20]
Molineros, J. M. Computer vision and augmented reality for guiding assembly. PhD thesis, Pennsylvania State University, University Park, PA, USA, 2002.
[21]
Pongnumkul, S., Dontcheva, M., Li, W., Wang, J., Bourdev, L., Avidan, S., and Cohen, M. F. Pause-and-play: automatically linking screencast video tutorials with applications. In Proc. UIST '11, ACM Press (2011), 135--144.
[22]
Preiss, D. D., and Sternberg, R. J. Effects of technology on verbal and visual-spatial abilities. Cognitive Technology 11, 1 (2006), 14--22.
[23]
Rubner, Y., Tomasi, C., and Guibas, L. J. A metric for distributions with applications to image databases. In Proc. ICCV '98, IEEE Computer Society (1998), 59--66.
[24]
Shepard, R. N., and Metzler, J. Mental rotation of three-dimensional objects. Science 171 (1971), 701--703.
[25]
Stumpf, H., and Eliot, J. A structural analysis of visual spatial ability in academically talented students. Learning and Individual Differences 11, 2 (1999), 137--151.
[26]
Surhone, L., Timpledon, M., and Marseken, S. Lego Digital Designer. VDM Verlag Dr. Mueller AG & Company Kg, 2010.
[27]
Tang, A., Owen, C., Biocca, F., and Mou, W. Comparative effectiveness of augmented reality in object assembly. In Proc. CHI '03, ACM Press (2003), 73--80.
[28]
Wallach, H., and O'Connell, D. N. The kinetic depth effect. Journal of Experimental Psychology 45, 4 (1953).

Cited By

View all
  • (2024)Improving Video Navigation for Spatial Task Tutorials by Spatially Segmenting and Situating How-To VideosProceedings of the 2024 ACM Symposium on Spatial User Interaction10.1145/3677386.3682103(1-13)Online publication date: 7-Oct-2024
  • (2024)Transforming Procedural Instructions into In-Situ Augmented Reality Guides with InstructARAdjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3672539.3686321(1-3)Online publication date: 13-Oct-2024
  • (2024)ProtoBricks: A Research Toolkit for Tangible Prototyping & Data PhysicalizationProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661573(476-495)Online publication date: 1-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
UIST '12: Proceedings of the 25th annual ACM symposium on User interface software and technology
October 2012
608 pages
ISBN:9781450315807
DOI:10.1145/2380116
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. active visual feedback
  2. assembly tasks
  3. augmented reality
  4. depth camera
  5. tracking
  6. virtual reality

Qualifiers

  • Research-article

Conference

UIST '12

Acceptance Rates

Overall Acceptance Rate 561 of 2,567 submissions, 22%

Upcoming Conference

UIST '25
The 38th Annual ACM Symposium on User Interface Software and Technology
September 28 - October 1, 2025
Busan , Republic of Korea

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)39
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Improving Video Navigation for Spatial Task Tutorials by Spatially Segmenting and Situating How-To VideosProceedings of the 2024 ACM Symposium on Spatial User Interaction10.1145/3677386.3682103(1-13)Online publication date: 7-Oct-2024
  • (2024)Transforming Procedural Instructions into In-Situ Augmented Reality Guides with InstructARAdjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology10.1145/3672539.3686321(1-3)Online publication date: 13-Oct-2024
  • (2024)ProtoBricks: A Research Toolkit for Tangible Prototyping & Data PhysicalizationProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661573(476-495)Online publication date: 1-Jul-2024
  • (2024)AMMA: Adaptive Multimodal Assistants Through Automated State Tracking and User Model-Directed Guidance Planning2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR)10.1109/VR58804.2024.00108(892-902)Online publication date: 16-Mar-2024
  • (2024)Learning to Build by Building Your Own InstructionsComputer Vision – ECCV 202410.1007/978-3-031-73024-5_16(261-278)Online publication date: 24-Nov-2024
  • (2023)CADTrack: Instructions and Support for Orientation Disambiguation of Near-Symmetrical ObjectsProceedings of the ACM on Human-Computer Interaction10.1145/36264627:ISS(1-20)Online publication date: 1-Nov-2023
  • (2023)StructureSenseProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35703436:4(1-25)Online publication date: 11-Jan-2023
  • (2023)InstruMentAR: Auto-Generation of Augmented Reality Tutorials for Operating Digital Instruments Through Recording Embodied DemonstrationProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581442(1-17)Online publication date: 19-Apr-2023
  • (2023)Measurement Patterns: User-Oriented Strategies for Dealing with Measurements and Dimensions in Making ProcessesProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581157(1-17)Online publication date: 19-Apr-2023
  • (2023)Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)10.1109/WACVW58289.2023.00052(1-10)Online publication date: Jan-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media