Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- surveyJune 2024
From Detection to Application: Recent Advances in Understanding Scientific Tables and Figures
ACM Computing Surveys (CSUR), Volume 56, Issue 10Article No.: 261, Pages 1–39https://doi.org/10.1145/3657285Tables and figures are usually used to present information in a structured and visual way in scientific documents. Understanding the tables and figures in scientific documents is significant for a series of downstream tasks, such as academic search, ...
- research-articleDecember 2023
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation
Proceedings of the ACM on Management of Data (PACMMOD), Volume 1, Issue 4Article No.: 272, Pages 1–24https://doi.org/10.1145/3626766Relational Web tables provide valuable resources for numerous downstream applications, making table understanding, especially column annotation that identifies semantic types and relations of columns, a hot topic in the field of data management. Despite ...
- ArticleNovember 2023
Dependency-Aware Core Column Discovery for Table Understanding
AbstractIn a relational table, core columns represent the primary subject entities that other columns in the table depend on. While discovering core columns is crucial for understanding a table’s semantic column types, column relations, and entities, it ...
- ArticleSeptember 2023
Enhancing Table Retrieval with Dual Graph Representations
Machine Learning and Knowledge Discovery in Databases: Research TrackSep 2023, Pages 107–123https://doi.org/10.1007/978-3-031-43421-1_7AbstractTable retrieval aims to rank candidate tables for answering natural language query, in which the most critical problem is how to learn informative representations for structured tables. Most previous methods roughly flatten the table and send it ...
- research-articleAugust 2023
GetPt: Graph-enhanced General Table Pre-training with Alternate Attention Network
- Ran Jia,
- Haoming Guo,
- Xiaoyuan Jin,
- Chao Yan,
- Lun Du,
- Xiaojun Ma,
- Tamara Stankovic,
- Marko Lozajic,
- Goran Zoranovic,
- Igor Ilic,
- Shi Han,
- Dongmei Zhang
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningAugust 2023, Pages 941–950https://doi.org/10.1145/3580305.3599366Tables are widely used for data storage and presentation due to their high flexibility in layout. The importance of tables as information carriers and the complexity of tabular data understanding attract a great deal of research on large-scale pre-...
-
- research-articleOctober 2022
End-to-End Compound Table Understanding with Multi-Modal Modeling
MM '22: Proceedings of the 30th ACM International Conference on MultimediaOctober 2022, Pages 4112–4121https://doi.org/10.1145/3503161.3547885Table is a widely used data form in webpages, spreadsheets, or PDFs to organize and present structural data. Although studies on table structure recognition have been successfully used to convert image-based tables into digital structural formats, ...
- research-articleJune 2022
Annotating Columns with Pre-trained Language Models
SIGMOD '22: Proceedings of the 2022 International Conference on Management of DataJune 2022, Pages 1493–1503https://doi.org/10.1145/3514221.3517906Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information. In this paper, we study the problem of ...
- research-articleAugust 2021
Numerical Formula Recognition from Tables
KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data MiningAugust 2021, Pages 1986–1996https://doi.org/10.1145/3447548.3467425Claims over the numerical relationships among some measures are commonly expressed in tabular forms, and widely exist in the published documents on the Web. This paper introduces the problem of numerical formula recognition from tables, namely ...
- keynoteJune 2021
Deep Data Integration
SIGMOD '21: Proceedings of the 2021 International Conference on Management of DataJune 2021, Page 2https://doi.org/10.1145/3448016.3460534We are witnessing the widespread adoption of deep learning techniques as avant-garde solutions to different computational problems in recent years. In data integration, the use of deep learning techniques has helped establish several state-of-the-art ...
- short-paperJuly 2020
Summarizing and Exploring Tabular Data in Conversational Search
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information RetrievalJuly 2020, Pages 1537–1540https://doi.org/10.1145/3397271.3401205Tabular data provide answers to a significant portion of search queries. However, reciting an entire result table is impractical in conversational search systems. We propose to generate natural language summaries as answers to describe the complex ...
- research-articleMay 2019
Generating Titles for Web Tables
WWW '19: The World Wide Web ConferenceMay 2019, Pages 638–647https://doi.org/10.1145/3308558.3313399Descriptive titles provide crucial context for interpreting tables that are extracted from web pages and are a key component of search features such as tabular featured snippets from Google and Bing. Prior approaches have attempted to produce titles by ...
- abstractSeptember 2016
Table Modelling, Extraction and Processing
DocEng '16: Proceedings of the 2016 ACM Symposium on Document EngineeringSeptember 2016, Pages 1–2https://doi.org/10.1145/2960811.2967173This tutorial is targeted at academics and practitioners, both within and outside of the Document Engineering community, who are confronted with table processing tasks such as information extraction and conversion, or have an interest in the topic, and ...
- research-articleSeptember 2015
TEXUS: A Task-based Approach for Table Extraction and Understanding
DocEng '15: Proceedings of the 2015 ACM Symposium on Document EngineeringSeptember 2015, Pages 25–34https://doi.org/10.1145/2682571.2797069In this paper, we propose a precise, comprehensive model of table processing which aims to remedy some of the problems in the discussion of table processing in the literature. The model targets application-independent, end-to-end table processing, and ...
- ArticleAugust 2013
ICDAR 2013 Table Competition
ICDAR '13: Proceedings of the 2013 12th International Conference on Document Analysis and RecognitionAugust 2013, Pages 1449–1453https://doi.org/10.1109/ICDAR.2013.292Table understanding is a well studied problem in document analysis, and many academic and commercial approaches have been developed to recognize tables in several document formats, including plain text, scanned page images and born-digital, object-based ...
- ArticleSeptember 2011
Table Content Understanding in SmartFIX
ICDAR '11: Proceedings of the 2011 International Conference on Document Analysis and RecognitionSeptember 2011, Pages 488–492https://doi.org/10.1109/ICDAR.2011.104The analysis of table structures and the retrieval of table contents is widely agreed to be a difficult challenge in the area of document analysis systems. Instead of extracting the layout of tables, we are interested in understanding their content. In ...
- ArticleJuly 2011
Enabling efficient browsing and manipulation of web tables on smartphone
HCII'11: Proceedings of the 14th international conference on Human-computer interaction: towards mobile and intelligent interaction environments - Volume Part IIIJuly 2011, Pages 117–126Tables are very important carriers of the vast information on the Internet and are widely used in web pages. However, most designs of web tables are only for desktop PCs and just focus on how to visually and logically show large amount of data without ...
- articleSeptember 2005
Towards Ontology Generation from Tables
World Wide Web (WWWJ), Volume 8, Issue 3September 2005, Pages 261–285https://doi.org/10.1007/s11280-005-0360-8At the heart of today's information-explosion problems are issues involving semantics, mutual understanding, concept matching, and interoperability. Ontologies and the Semantic Web are offered as a potential solution, but creating ontologies for real-...
- articleJuly 2005
Automating the extraction of data from HTML tables with unknown structure
Data & Knowledge Engineering (DAKE), Volume 54, Issue 1July 2005, Pages 3–28https://doi.org/10.1016/j.datak.2004.10.004Data on the Web in HTML tables is mostly structured, but we usually do not know the structure in advance. Thus, we cannot directly query for data of interest. We propose a solution to this problem based on document-independent extraction ontologies. Our ...
- ArticleNovember 2000