Leventidis A, Di Rocco L, Gatterbauer W, Miller R and Riedewald M.
(2023). DomainNet: Homograph Detection and Understanding in Data Lake Disambiguation. ACM Transactions on Database Systems. 48:3. (1-40). Online publication date: 30-Sep-2023.
Jiménez P, Roldán J and Corchuelo R.
(2022). A hybrid quantum approach to leveraging data from HTML tables. Knowledge and Information Systems. 10.1007/s10115-021-01636-7.
Jiménez P, Roldán J and Corchuelo R.
(2022). A coral-reef approach to extract information from HTML tables. Applied Soft Computing. 115:C. Online publication date: 1-Jan-2022.
Roldán J, Jiménez P, Szekely P and Corchuelo R.
(2022). TOMATE. Information Sciences: an International Journal. 577:C. (49-68). Online publication date: 1-Oct-2021.
Günther M, Thiele M, Gonsior J and Lehner W. Pre-Trained Web Table Embeddings for Table Discovery. Fourth Workshop in Exploiting AI Techniques for Data Management. (24-31).
Bleifus T, Bornemann L, Kalashnikov D, Naumann F and Srivastava D.
(2021). Structured Object Matching across Web Page Revisions 2021 IEEE 37th International Conference on Data Engineering (ICDE). 10.1109/ICDE51399.2021.00115. 978-1-7281-9184-3. (1284-1295).
Wang P, Shea R, Wang J and Wu E. Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment. Proceedings of the 2019 International Conference on Management of Data. (229-246).
Ibrahim Y, Riedewald M, Weikum G and Zeinalipour-Yazti D.
(2019). Bridging Quantities in Tables and Text 2019 IEEE 35th International Conference on Data Engineering (ICDE). 10.1109/ICDE.2019.00094. 978-1-5386-7474-1. (1010-1021).
Eberius J, Thiele M and Lehner W.
(2017). Exploratory Ad-Hoc Analytics for Big Data. Handbook of Big Data Technologies. 10.1007/978-3-319-49340-4_11. (365-407).
Meusel R, Ritze D and Paulheim H.
(2016). Towards More Accurate Statistical Profiling of Deployed schema.org Microdata. Journal of Data and Information Quality. 8:1. (1-31). Online publication date: 29-Nov-2016.
Abedjan Z, Morcos J, Ilyas I, Ouzzani M, Papotti P and Stonebraker M.
(2016). DataXFormer: A robust transformation discovery system 2016 IEEE 32nd International Conference on Data Engineering (ICDE). 10.1109/ICDE.2016.7498319. 978-1-5090-2020-1. (1134-1145).
Lehmberg O, Ritze D, Meusel R and Bizer C. A Large Public Corpus of Web Tables containing Time and Context Metadata. Proceedings of the 25th International Conference Companion on World Wide Web. (75-76).
Ahmadov A, Thiele M, Eberius J, Lehner W and Wrembel R.
(2015). Towards a Hybrid Imputation Approach Using Web Tables 2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC). 10.1109/BDC.2015.38. 978-0-7695-5696-3. (21-30).
Eberius J, Braunschweig K, Hentsch M, Thiele M, Ahmadov A and Lehner W.
(2015). Building the Dresden Web Table Corpus: A Classification Approach 2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC). 10.1109/BDC.2015.30. 978-0-7695-5696-3. (41-50).
Eberius J, Thiele M, Braunschweig K and Lehner W. DrillBeyond. Proceedings of the 27th International Conference on Scientific and Statistical Database Management. (1-12).
Meusel R, Primpeli A, Meilicke C, Paulheim H and Bizer C.
(2015). Exploiting Microdata Annotations to Consistently Categorize Product Offers at Web Scale. E-Commerce and Web Technologies. 10.1007/978-3-319-27729-5_7. (83-99).