research-article

Creating consistent scene graphs using a probabilistic grammar

Authors: Siddhartha Chaudhuri, Vladimir G. Kim, Niloy J. Mitra, Thomas Funkhouser

ACM Transactions on Graphics (TOG), Volume 33, Issue 6

Article No.: 211, Pages 1 - 12

https://doi.org/10.1145/2661229.2661243

Published: 19 November 2014

Abstract

Growing numbers of 3D scenes in online repositories provide new opportunities for data-driven scene understanding, editing, and synthesis. Despite the plethora of data now available online, most of it cannot be used effectively for data-driven applications because it lacks the consistent segmentations, category labels, and/or functional groupings required for co-analysis. In this paper, we develop algorithms that infer such information by parsing with a probabilistic grammar learned from examples. First, given a collection of scene graphs with consistent hierarchies and labels, we train a probabilistic hierarchical grammar to represent the distributions of shapes, cardinalities, and spatial relationships of semantic objects within the collection. Then, we use the learned grammar to parse new scenes, assigning them segmentations, labels, and hierarchies consistent with the collection. In experiments with these algorithms, we find that they work effectively on scene graphs for indoor scenes commonly found online (bedrooms, classrooms, and libraries); they outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy; they require relatively small sets of training data; they are robust to moderate over-segmentation in the inputs; and they can robustly transfer labels from one data set to another. As a result, the proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class.
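
The two-stage idea in the abstract (learn a grammar from consistently labeled hierarchies, then use it to judge labelings of new scenes) can be illustrated with a toy sketch. This is not the paper's method: it models only child cardinalities, ignores shapes and spatial relationships, and scores a given candidate hierarchy rather than searching for the best parse. The node representation, the function names (`learn_grammar`, `cardinality_prob`, `log_score`), and the smoothing parameters `alpha` and `max_k` are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical representation: a scene-graph node is (label, [children]).
# Leaves have an empty child list.

def learn_grammar(scenes):
    """Count how often each child label occurs k times under each parent label."""
    counts = defaultdict(Counter)     # (parent, child) -> Counter over k >= 1
    totals = Counter()                # parent -> number of instances seen
    child_labels = defaultdict(set)   # parent -> child labels ever observed

    def visit(node):
        label, children = node
        totals[label] += 1
        for child, k in Counter(c[0] for c in children).items():
            counts[(label, child)][k] += 1
            child_labels[label].add(child)
        for c in children:
            visit(c)

    for scene in scenes:
        visit(scene)
    return counts, totals, child_labels

def cardinality_prob(grammar, parent, child, k, max_k=10, alpha=1.0):
    """Smoothed estimate of P(child appears k times | parent), for k in 0..max_k."""
    counts, totals, _ = grammar
    seen = counts.get((parent, child), Counter())
    # Instances of the parent with zero such children are never counted
    # explicitly, so recover them from the total.
    n_k = seen[k] if k > 0 else totals[parent] - sum(seen.values())
    return (n_k + alpha) / (totals[parent] + alpha * (max_k + 1))

def log_score(grammar, node):
    """Log-probability of the cardinalities in a candidate labeled hierarchy."""
    _, _, child_labels = grammar
    label, children = node
    present = Counter(c[0] for c in children)
    total = 0.0
    for child in child_labels.get(label, set()) | set(present):
        total += math.log(cardinality_prob(grammar, label, child, present[child]))
    return total + sum(log_score(grammar, c) for c in children)

# Toy training collection: two consistently labeled bedrooms.
training = [
    ("bedroom", [("bed", []), ("nightstand", []), ("nightstand", [])]),
    ("bedroom", [("bed", []), ("nightstand", [])]),
]
grammar = learn_grammar(training)
plausible = ("bedroom", [("bed", []), ("nightstand", [])])
implausible = ("bedroom", [("bed", []), ("bed", []), ("bed", [])])
print(log_score(grammar, plausible) > log_score(grammar, implausible))
```

A full parser would additionally have to propose segmentations and groupings of an unlabeled scene and search over them; the scoring function above only indicates how a learned grammar could rank competing candidates.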

Index Terms

1. Computing methodologies
  1. Computer graphics
    1. Shape modeling
2. Theory of computation
  1. Randomness, geometry and discrete structures

Published In

ACM Transactions on Graphics, Volume 33, Issue 6, November 2014, 704 pages

ISSN: 0730-0301

EISSN: 1557-7368

DOI: 10.1145/2661229

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States
