Article

Scene Parsing by Integrating Function, Geometry and Appearance Models

Authors:

Yibiao Zhao,

Song-Chun ZhuAuthors Info & Claims

CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition

Pages 3119 - 3126

https://doi.org/10.1109/CVPR.2013.401

Published: 23 June 2013 Publication History

Abstract

Indoor functional objects exhibit large view and appearance variations, thus are difficult to be recognized by the traditional appearance-based classification paradigm. In this paper, we present an algorithm to parse indoor images based on two observations: i) The functionality is the most essential property to define an indoor object, e.g. "a chair to sit on", ii) The geometry (3D shape) of an object is designed to serve its function. We formulate the nature of the object function into a stochastic grammar model. This model characterizes a joint distribution over the function-geometry-appearance (FGA) hierarchy. The hierarchical structure includes a scene category, functional groups, functional objects, functional parts and 3D geometric shapes. We use a simulated annealing MCMC algorithm to find the maximum a posteriori (MAP) solution, i.e. a parse tree. We design four data-driven steps to accelerate the search in the FGA space: i) group the line segments into 3D primitive shapes, ii) assign functional labels to these 3D primitive shapes, iii) fill in missing objects/parts according to the functional labels, and iv) synthesize 2D segmentation maps and verify the current parse tree by the Metropolis-Hastings acceptance probability. The experimental results on several challenging indoor datasets demonstrate the proposed approach not only significantly widens the scope of indoor scene parsing algorithm from the segmentation and the 3D recovery to the functional object recognition, but also yields improved overall performance.

Cited By

View all

Jiang HWeng DDongye XLuo LZhang Z(2023)Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual RealityACM Transactions on Graphics10.1145/361832042:6(1-18)Online publication date: 5-Dec-2023
https://dl.acm.org/doi/10.1145/3618320
Astolfi GRezende FPorto JMatsubara EPistori H(2021)Syntactic Pattern Recognition in Computer VisionACM Computing Surveys10.1145/344724154:3(1-35)Online publication date: 17-Apr-2021
https://dl.acm.org/doi/10.1145/3447241
Hassanin MKhan STahtali M(2021)Visual Affordance and Function UnderstandingACM Computing Surveys10.1145/344637054:3(1-35)Online publication date: 17-Apr-2021
https://dl.acm.org/doi/10.1145/3446370
Show More Cited By

Index Terms

Scene Parsing by Integrating Function, Geometry and Appearance Models
1. Computing methodologies
  1. Computer graphics
    1. Shape modeling

Index terms have been assigned to the content through auto-classification.

Recommendations

Single-View 3D Scene Parsing by Attributed Grammar
CVPR '14: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition

In this paper, we present an attributed grammar for parsing man-made outdoor scenes into semantic surfaces, and recovering its 3D model simultaneously. The grammar takes superpixels as its terminal nodes and use five production rules to generate the ...
Indoor Scene Understanding with Geometric and Semantic Contexts

Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by ...
LLLR parsing
SAC '13: Proceedings of the 28th Annual ACM Symposium on Applied Computing

The idea of an LLLR parsing is presented. An LLLR(k) parser can be constructed for any LR(k) grammar but it produces the left parse of the input string in linear time (in respect to the length of the derivation) without backtracking. If used as a basis ...

Comments

Information & Contributors

Information

Published In

CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition

June 2013

3752 pages

ISBN:9780769549897

Publisher

IEEE Computer Society

United States

Publication History

Published: 23 June 2013

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

13
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 30 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Jiang HWeng DDongye XLuo LZhang Z(2023)Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual RealityACM Transactions on Graphics10.1145/361832042:6(1-18)Online publication date: 5-Dec-2023
https://dl.acm.org/doi/10.1145/3618320
Astolfi GRezende FPorto JMatsubara EPistori H(2021)Syntactic Pattern Recognition in Computer VisionACM Computing Surveys10.1145/344724154:3(1-35)Online publication date: 17-Apr-2021
https://dl.acm.org/doi/10.1145/3447241
Hassanin MKhan STahtali M(2021)Visual Affordance and Function UnderstandingACM Computing Surveys10.1145/344637054:3(1-35)Online publication date: 17-Apr-2021
https://dl.acm.org/doi/10.1145/3446370
Huang SChen YYuan TQi SZhu YZhu SWallach HLarochelle HBeygelzimer Ad'Alché-Buc FFox E(2019)PerspectiveNetProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3455086(8905-8917)Online publication date: 8-Dec-2019
https://dl.acm.org/doi/10.5555/3454287.3455086
Zou CGuo RLi ZHoiem D(2019)Complete 3D Scene Parsing from an RGBD ImageInternational Journal of Computer Vision10.1007/s11263-018-1133-z127:2(143-162)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s11263-018-1133-z
Huang SQi SXiao YZhu YWu YZhu S(2018)Cooperative holistic scene understandingProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3326963(206-217)Online publication date: 3-Dec-2018
https://dl.acm.org/doi/10.5555/3326943.3326963
Ismail ASeifelnasr MGuo H(2018)Understanding Indoor SceneProceedings of the 3rd International Conference on Multimedia Systems and Signal Processing10.1145/3220162.3220182(64-70)Online publication date: 28-Apr-2018
https://dl.acm.org/doi/10.1145/3220162.3220182
Liang WZhao YZhu YZhu S(2016)What is whereProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence10.5555/3061053.3061099(3418-3424)Online publication date: 9-Jul-2016
https://dl.acm.org/doi/10.5555/3061053.3061099
Hueting MPătrăucean VOvsjanikov MMitra NGuthe MHullin MStamminger MWeinkauf T(2016)Scene structure inference through scene map estimationProceedings of the Conference on Vision, Modeling and Visualization10.5555/3056901.3056909(45-52)Online publication date: 10-Oct-2016
https://dl.acm.org/doi/10.5555/3056901.3056909
Yu LYeung STerzopoulos D(2016)The ClutterpaletteIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2015.241757522:2(1138-1148)Online publication date: 1-Feb-2016
https://dl.acm.org/doi/10.1109/TVCG.2015.2417575
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

Cited By

Index Terms

Recommendations

Single-View 3D Scene Parsing by Attributed Grammar

Indoor Scene Understanding with Geometric and Semantic Contexts

LLLR parsing

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations