Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/CVPR.2013.401guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Scene Parsing by Integrating Function, Geometry and Appearance Models

Published: 23 June 2013 Publication History

Abstract

Indoor functional objects exhibit large view and appearance variations, thus are difficult to be recognized by the traditional appearance-based classification paradigm. In this paper, we present an algorithm to parse indoor images based on two observations: i) The functionality is the most essential property to define an indoor object, e.g. "a chair to sit on", ii) The geometry (3D shape) of an object is designed to serve its function. We formulate the nature of the object function into a stochastic grammar model. This model characterizes a joint distribution over the function-geometry-appearance (FGA) hierarchy. The hierarchical structure includes a scene category, functional groups, functional objects, functional parts and 3D geometric shapes. We use a simulated annealing MCMC algorithm to find the maximum a posteriori (MAP) solution, i.e. a parse tree. We design four data-driven steps to accelerate the search in the FGA space: i) group the line segments into 3D primitive shapes, ii) assign functional labels to these 3D primitive shapes, iii) fill in missing objects/parts according to the functional labels, and iv) synthesize 2D segmentation maps and verify the current parse tree by the Metropolis-Hastings acceptance probability. The experimental results on several challenging indoor datasets demonstrate the proposed approach not only significantly widens the scope of indoor scene parsing algorithm from the segmentation and the 3D recovery to the functional object recognition, but also yields improved overall performance.

Cited By

View all
  • (2023)Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual RealityACM Transactions on Graphics10.1145/361832042:6(1-18)Online publication date: 5-Dec-2023
  • (2021)Syntactic Pattern Recognition in Computer VisionACM Computing Surveys10.1145/344724154:3(1-35)Online publication date: 17-Apr-2021
  • (2021)Visual Affordance and Function UnderstandingACM Computing Surveys10.1145/344637054:3(1-35)Online publication date: 17-Apr-2021
  • Show More Cited By

Index Terms

  1. Scene Parsing by Integrating Function, Geometry and Appearance Models
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition
    June 2013
    3752 pages
    ISBN:9780769549897

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 23 June 2013

    Author Tags

    1. affordance
    2. function
    3. functionality
    4. image parsing
    5. scene parsing
    6. stochastic scene grammar

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Commonsense Knowledge-Driven Joint Reasoning Approach for Object Retrieval in Virtual RealityACM Transactions on Graphics10.1145/361832042:6(1-18)Online publication date: 5-Dec-2023
    • (2021)Syntactic Pattern Recognition in Computer VisionACM Computing Surveys10.1145/344724154:3(1-35)Online publication date: 17-Apr-2021
    • (2021)Visual Affordance and Function UnderstandingACM Computing Surveys10.1145/344637054:3(1-35)Online publication date: 17-Apr-2021
    • (2019)PerspectiveNetProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3455086(8905-8917)Online publication date: 8-Dec-2019
    • (2019)Complete 3D Scene Parsing from an RGBD ImageInternational Journal of Computer Vision10.1007/s11263-018-1133-z127:2(143-162)Online publication date: 1-Feb-2019
    • (2018)Cooperative holistic scene understandingProceedings of the 32nd International Conference on Neural Information Processing Systems10.5555/3326943.3326963(206-217)Online publication date: 3-Dec-2018
    • (2018)Understanding Indoor SceneProceedings of the 3rd International Conference on Multimedia Systems and Signal Processing10.1145/3220162.3220182(64-70)Online publication date: 28-Apr-2018
    • (2016)What is whereProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence10.5555/3061053.3061099(3418-3424)Online publication date: 9-Jul-2016
    • (2016)Scene structure inference through scene map estimationProceedings of the Conference on Vision, Modeling and Visualization10.5555/3056901.3056909(45-52)Online publication date: 10-Oct-2016
    • (2016)The ClutterpaletteIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2015.241757522:2(1138-1148)Online publication date: 1-Feb-2016
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media