DOI: 10.1145/3264856.3264859
research-article

ASYSST: A Framework for Synopsis Synthesis Empowering Visually Impaired

Published: 15 October 2018

Abstract

In an indoor scenario, visually impaired people lack information about their surroundings and find it difficult to navigate from room to room. Sensor-based solutions are expensive and may not always be comfortable for end users. In this paper, we focus on the problem of synthesizing a textual description from a given floor plan image to assist the visually impaired. The textual description, combined with text-reading software, can aid a visually impaired person while moving inside a building. In this work, for the first time, we propose an end-to-end framework (ASYSST) for textual description synthesis from digitized building floor plans. We introduce a novel Bag of Decor (BoD) feature to learn 5 room classes from 1355 samples under a supervised learning paradigm. The learned labels are fed into a description synthesis framework to yield a holistic description of a floor plan image. Experimental analysis on a real, publicly available floor plan dataset demonstrates the superiority of our framework.
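
As a rough illustration of the final stage only (room labels in, one descriptive sentence out), a minimal Python sketch follows. The room label names, the function `synthesize_description`, and the sentence template are all hypothetical; the abstract does not name the five classes or specify how the description is generated, so this should not be read as the ASYSST method.

```python
from collections import Counter


def synthesize_description(room_labels):
    """Build a one-sentence summary from predicted room labels.

    Hypothetical template for illustration; not the paper's synthesis method.
    """
    if not room_labels:
        return "No rooms were detected in the floor plan."
    counts = Counter(room_labels)
    parts = []
    # Sort by label so the output is deterministic.
    for label, n in sorted(counts.items()):
        noun = label if n == 1 else label + "s"  # naive pluralization
        parts.append(f"{n} {noun}")
    if len(parts) == 1:
        listing = parts[0]
    else:
        listing = ", ".join(parts[:-1]) + " and " + parts[-1]
    return f"The floor plan contains {listing}."


# Example labels, assumed to come from an upstream room classifier.
labels = ["bedroom", "kitchen", "bedroom", "bathroom"]
print(synthesize_description(labels))
```

A sentence of this kind could then be passed to text-reading software, which is the use the abstract has in mind.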

Published In

MAHCI'18: Proceedings of the 2018 Workshop on Multimedia for Accessible Human Computer Interface
October 2018
38 pages
ISBN: 9781450359801
DOI: 10.1145/3264856
Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. abstractive synopsis
  2. floor plan
  3. morphology
  4. text recognition

Qualifiers

  • Research-article

Funding Sources

  • Science and Engineering Research Board

Conference

MM '18: ACM Multimedia Conference
October 22, 2018
Seoul, Republic of Korea

Cited By

  • (2024) FVCap: An Approach to Understand Scanned Floor Plan Images Using Deep Learning and its Applications. SN Computer Science 5:4. DOI: 10.1007/s42979-024-02708-5. Online publication date: 30-Mar-2024
  • (2024) A Hybrid Network for Scanned Floor Plan Image Recognition and Room Labeling. Pattern Recognition and Machine Intelligence, pp. 470-478. DOI: 10.1007/978-3-031-12700-7_48. Online publication date: 24-Jul-2024
  • (2021) Knowledge-driven description synthesis for floor plan interpretation. International Journal on Document Analysis and Recognition (IJDAR). DOI: 10.1007/s10032-021-00367-3. Online publication date: 26-Apr-2021
  • (2019) BRIDGE: Building Plan Repository for Image Description Generation, and Evaluation. 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1071-1076. DOI: 10.1109/ICDAR.2019.00174. Online publication date: Sep-2019
