Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A bottom-up, knowledge-aware approach to integrating and querying web data services

Published: 01 November 2013 Publication History

Abstract

As a wealth of data services is becoming available on the Web, building and querying Web applications that effectively integrate their content is increasingly important. However, schema integration and ontology matching with the aim of registering data services often requires a knowledge-intensive, tedious, and error-prone manual process.
We tackle this issue by presenting a bottom-up, semi-automatic service registration process that refers to an external knowledge base and uses simple text processing techniques in order to minimize and possibly avoid the contribution of domain experts in the annotation of data services. The first by-product of this process is a representation of the domain of data services as an entity-relationship diagram, whose entities are named after concepts of the external knowledge base matching service terminology rather than being manually created to accommodate an application-specific ontology. Second, a three-layer annotation of service semantics (service interfaces, access patterns, service marts) describing how services “play” with such domain elements is also automatically constructed at registration time. When evaluated against heterogeneous existing data services and with a synthetic service dataset constructed using Google Fusion Tables, the approach yields good results in terms of data representation accuracy.
We subsequently demonstrate that natural language processing methods can be used to decompose and match simple queries to the data services represented in three layers according to the preceding methodology with satisfactory results. We show how semantic annotations are used at query time to convert the user's request into an executable logical query. Globally, our findings show that the proposed registration method is effective in creating a uniform semantic representation of data services, suitable for building Web applications and answering search queries.

References

[1]
Bellahsene, Z., Bonifati, A., and Rahm, E. 2011. Schema Matching and Mapping. Springer.
[2]
Bergamaschi, S., Po, L., Sorrentino, S., and Corni, A. 2010. Uncertainty in data integration systems: Automatic generation of probabilistic relationships. In Management of the Interconnected World, Springer Physica-Verlag, Berlin, 221--228.
[3]
Bizer, C., Heath, T., Idehen, K., and Berners-Lee, T. 2008. Linked data on the Web. In Proceedings of WWW. 1265--1266.
[4]
Bozzon, A., Braga, D., Brambilla, M., Ceri, S., Corcoglioniti, F., Fraternali, P., and Vadacca, S. 2011. Search computing: Multi-domain search on ranked data. In Proceedings of SIGMOD. 1267--1270.
[5]
Bozzon, A., Brambilla, M., Ceri, S., and Fraternali, P. 2010. Liquid query: Multi-domain exploratory search on the Web. In Proceedings of WWW. 161--170.
[6]
Braga, D., Ceri, S., Corcoglioniti, F., and Grossniklaus, M. 2010. Panta Rhei: Flexible execution engine for search computing queries. In Search Computing: Challenges and Directions, S. Ceri and M. Brambilla, (Eds.), Springer-Verlag, Berlin, Heidelberg, Chapter 12, 225--243.
[7]
Braga, D., Ceri, S., Daniel, F., and Martinenghi, D. 2008. Optimization of multi-domain queries on the web. Proc. VLDB 1, 1, 562--573.
[8]
Brambilla, M., Campi, A., Ceri, S., and Quarteroni, S. 2011. Semantic resource framework. In Search Computing, Lecture Notes in Computer Science, vol. 6585, Springer, Berlin, 73--84.
[9]
Brambilla, M., Ceri, S., Cinefra, N., Das Sarma, A., Forghieri, F., and Quarteroni, S. 2012. Google fusion tables: Making sense of heterogeneous data. In Search Computing: Broadening Web Search. Lecture Notes in Computer Science, vol. 7538. Springer, Berlin, 53--67.
[10]
Calvanese, D., Giacomo, G. D., Lenzerini, M., and Rosati, R. 2012. View-based query answering in description logics: Semantics and complexity. Comput. Syst. Sci. 78, 1, 26--46.
[11]
Carenini, A., Cerizza, D., Comerio, M., Valle, E. D., Paoli, F. D., Maurino, A., Palmonari, M., and Turati, A. 2008. Glue2: A Web service discovery engine with non-functional properties. In Proceedings of the 6th European Conference on Web Services (ECWS). 21--30.
[12]
Ceri, S. and Brambilla, M. 2010. Search computing: Challenges and directions. In Objects and Databases. Lecture Notes in Computer Science, vol. 5950. Springer-Verlag, Berlin Heidelberg, 1--5.
[13]
Ceri, S. and Brambilla, M., eds. 2011. Search Computing: Trends and Developments. Lecture Notes in Computer Science, vol. 6585, Springer-Verlag, Berlin Heidelberg.
[14]
Choi, N., Song, I.-Y., and Han, H. 2006. A survey on ontology mapping. SIGMOD Rec. 35, 3, 34--41.
[15]
Ciglan, M., Norvag, K., and Hluchy, L. 2012. The semsets model for ad-hoc semantic list search. In Proceedings of WWW. 131--140.
[16]
Dalvi, N., Kumar, R., Pang, B., Ramakrishnan, R., Tomkins, A., Bohannon, P., Keerthi, S., and Merugu, S. 2009. A web of concepts. In Proceedings of the Symposium on Principles of Database Systems (PODS). ACM, 1--12.
[17]
Damljanovic, D., Agatonovic, M., and Cunningham, H. 2010a. Natural language interfaces to ontologies: Combining syntactic analysis and ontology-based lookup through the user interaction. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC). N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner, and D. Tapias, Eds. European Language Resources Association (ELRA), Valletta, Malta, 19--21.
[18]
Damljanovic, D., Agatonovic, M., and Cunningham, H. 2010b. Natural language interfaces to ontologies: Combining syntactic analysis and ontology-based lookup through the user interaction. In Proceedings of the 7th Extended Semantic Web Conference (ESWC). Lecture Notes in Computer Science, Springer, vol. 6088, Berlin, 106--120.
[19]
Damljanovic, D. and Bontcheva, K. 2009. Towards enhanced usability of natural language interfaces to knowledge bases. In Web 2.0 & Semantic Web, V. Devedic and D. Gaevic, Eds. Annals of Information Systems, vol. 6. Springer, Berlin, 105--133.
[20]
Das Sarma, A., Fang, L., Gupta, N., Halevy, A., Lee, H., Wu, F., Xin, R., and Yu, C. 2012. Finding related tables. In Proceedings of the SIGMOD. ACM, New York, NY, 817--828.
[21]
De Virgilio, R. and Bianchini, D. 2010. A metamodel approach to flexible semantic Web service discovery. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM'10). ACM, New York, NY, 1309--1312.
[22]
Doan, A., Halevy, A., and Ives, Z. 2012. Principles of Data Integration. Morgan Kauffman.
[23]
Doan, A. and Halevy, A. Y. 2005. Semantic integration research in the database community: A brief survey. AI Mag. 26, 1, 83--94.
[24]
Dong, X., Halevy, A., Madhavan, J., Nemes, E., and Zhang, J. 2004. Similarity search for Web services. In Proceedings of VLDB. 372--383.
[25]
Fazzinga, B. and Lukasiewicz, T. 2010. Semantic search on the Web. Semantic Web J. 1, 1--2, 89--96.
[26]
Fensel, D., Facca, F., Simperl, E., and Toma, I., eds. 2011. Semantic Web Services. Springer, Berlin.
[27]
Ferragina, P. and Scaiella, U. 2010. TAGME: On-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the International Conference on Information and Knowledge Management (CIKM). ACM, 1625--1628.
[28]
Foster, H., Uchitel, S., Magee, J., and Kramer, J. 2003. Model-based verification of Web service compositions. In Proceedings of Automated Software Engineering. 152--161.
[29]
Giunchiglia, F., Kharkevich, U., and Zaihrayeu, I. 2009. Concept search. In Proceedings of the 6th Extended Semantic Web Coference (ESWC). Lecture Notes in Computer Science, vol. 5559, Springer, Berlin, 429--444.
[30]
Granitzer, M., Sabol, V., Onn, K. W., Lukose, D., and Tochtermann, K. 2010. Ontology alignment: A survey with focus on visually supported semi-automatic techniques. Future Internet 2, 3, 238--258.
[31]
Halpin, T., Morgan, A., and Morgan, T. 2008. Information Modeling and Relational Databases. Morgan Kaufmann.
[32]
Herzig, D. M. and Tran, T. 2012. Heterogeneous Web data search using relevance-based on the fly data integration. In Proceedings of WWW. 141--150.
[33]
Joachims, T. 1999. Making Large-Scale Support Vector Machine Learning Practical. MIT Press, Cambridge, MA, 169--184.
[34]
Lafferty, J., McCallum, A., and Pereira, F. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML). 282--289.
[35]
Lenzerini, M. 2002. Data integration: A theoretical perspective. In Proceedings of the Symposium on Principles of Database Systems (PODS). ACM, 233--246.
[36]
Li, X. 2010. Understanding the semantic structure of noun phrase queries. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL). 1337--1345.
[37]
Lim, J. and Lee, K. 2010. Constructing composite Web services from natural language requests. Web Semantics: Science, Services and Agents on the World Wide Web 8, 1, 1--13.
[38]
Manolescu, I., Brambilla, M., Ceri, S., Comai, S., and Fraternali, P. 2005. Model-driven design and deployment of service-enabled web applications. ACM Trans. Internet Technol. 5, 3, 439--479.
[39]
Martin, D., Burstein, M., Mcdermott, D., Mcilraith, S., Paolucci, M., Sycara, K., Mcguinness, D. L., Sirin, E., and Srinivasan, N. 2007. Bringing semantics to Web services with OWL-S. World Wide Web 10, 3, 243--277.
[40]
Miller, G. 1995. WordNet: A lexical database for English. Comm. ACM 38, 11, 39--41.
[41]
Porter, M. 1980. An algorithm for suffix stripping. Program: Electr. Lib. Inf. Sys. 14, 3, 130--137.
[42]
Pound, J., Mika, P., and Zaragoza, H. 2010. Ad-hoc object retrieval in the Web of data. In Proceedings of WWW. 771--780.
[43]
Preda, N., Kasneci, G., Suchanek, F. M., Neumann, T., Yuan, W., and Weikum, G. 2010. Active knowledge: Dynamically enriching RDF knowledge bases by Web services. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'10). ACM, New York, NY, 399--410.
[44]
Quarteroni, S., Guerrisi, V., and La Torre, P. 2012. Evaluating multi-focus natural language queries over data services. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC). European Language Resources Association (ELRA).
[45]
Rahm, E. and Bernstein, P. A. 2001. A survey of approaches to automatic schema matching. VLDB 10, 4, 334--350.
[46]
Rajaraman, A., Sagiv, Y., and Ullman, J. D. 1995. Answering queries using templates with binding patterns (extended abstract). In Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS'95). ACM, New York, NY, 105--112.
[47]
Ranganathan, A., Riabov, A., and Udrea, O. 2009. Mashup-based information retrieval for domain experts. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM'09). ACM, New York, NY, 711--720.
[48]
Roman, D., Keller, U., Lausen, H., de Bruijn, J., Lara, R., Stollberg, M., Polleres, A., Feier, C., Bussler, C., and Fensel, D. 2005. Web service modeling ontology. Appl. Ontol. 1, 1, 77--106.
[49]
Roy Chowdhury, S., Rodríguez, C., Daniel, F., and Casati, F. 2012. Baya: Assisted mashup development as a service. In Proceedings of WWW - Companion volume. ACM, New York, NY, 409--412.
[50]
Suchanek, F., Kasneci, G., and Weikum, G. 2007. YAGO: A core of semantic knowledge. In Proceedings of WWW. 697--706.
[51]
Suchanek, F. M., Bozzon, A., Valle, E. D., Campi, A., and Ronchi, S. 2011. Towards an ontological representation of services in search computing. In Search Computing: Trends and Developments. Lecture Notes in Computer Science, vol. 6585. Springer, Berlin, 101--112.
[52]
Ullman, J. D. 1997. Information integration using logical views. In Proceedings of ICDT, F. N. Afrati and P. G. Kolaitis, Eds., Lecture Notes in Computer Science, vol. 1186. Springer, Berlin, 19--40.
[53]
Vitvar, T., Kopecký, J., Viskova, J., and Fensel, D. 2008. WSMO-lite annotations for Web services. In Proceedings of the 5th Extended Semantic Web Conference (ESWC), Lecture Notes in Computer Science, vol. 5021, Springer, Berlin, Heidelberg, 674--689.
[54]
Wang, C., Xiong, M., Zhou, Q., and Yu, Y. 2007. Panto: A portable natural language interface to ontologies. In The Semantic Web: Research and Applications, E. Franconi, M. Kifer, and W. May, Eds., Lecture Notes in Computer Science, vol. 4519. Springer, Berlin, 473--487.
[55]
Wu, J., Chen, L., Xie, Y., and Zheng, Z. 2012. Titan: A system for effective Web service discovery. In Proceedings of the WWW - Companion volume. ACM, New York, NY, 441--444.
[56]
Zhang, D. and Lee, W. 2003. Question classification using support vector machines. In Proceedings of SIGIR. ACM, 26--32.

Cited By

View all
  • (2024)Assessing the impact of digital service innovation (DSI) on business performance: the mediating effect of Artificial Intelligence (AI)Journal of Enterprise Information Management10.1108/JEIM-02-2024-0095Online publication date: 22-Oct-2024
  • (2018)CrumbTrail: An efficient methodology to reduce multiple inheritance in knowledge graphsKnowledge-Based Systems10.1016/j.knosys.2018.03.030151(180-197)Online publication date: Jul-2018
  • (2018)Towards a UML and IFML Mapping to GraphQLCurrent Trends in Web Engineering10.1007/978-3-319-74433-9_13(149-155)Online publication date: 22-Feb-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on the Web
ACM Transactions on the Web  Volume 7, Issue 4
October 2013
220 pages
ISSN:1559-1131
EISSN:1559-114X
DOI:10.1145/2540635
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 November 2013
Accepted: 01 June 2013
Revised: 01 November 2012
Received: 01 April 2012
Published in TWEB Volume 7, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Web data integration
  2. Web data services
  3. Web services
  4. natural language Web query
  5. service querying
  6. structured Web search

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Assessing the impact of digital service innovation (DSI) on business performance: the mediating effect of Artificial Intelligence (AI)Journal of Enterprise Information Management10.1108/JEIM-02-2024-0095Online publication date: 22-Oct-2024
  • (2018)CrumbTrail: An efficient methodology to reduce multiple inheritance in knowledge graphsKnowledge-Based Systems10.1016/j.knosys.2018.03.030151(180-197)Online publication date: Jul-2018
  • (2018)Towards a UML and IFML Mapping to GraphQLCurrent Trends in Web Engineering10.1007/978-3-319-74433-9_13(149-155)Online publication date: 22-Feb-2018
  • (2017)Automatic semantic enrichment of data servicesProceedings of the 19th International Conference on Information Integration and Web-based Applications & Services10.1145/3151759.3151783(415-424)Online publication date: 4-Dec-2017
  • (2017)Extracting Emerging Knowledge from Social MediaProceedings of the 26th International Conference on World Wide Web10.1145/3038912.3052697(795-804)Online publication date: 3-Apr-2017
  • (2017)Services Discovery and Recommendation for Multi-datasource Access: Exploiting Semantic and Social TechnologiesA Comprehensive Guide Through the Italian Database Research Over the Last 25 Years10.1007/978-3-319-61893-7_22(375-390)Online publication date: 31-May-2017
  • (2017)Example-Driven Web API Specification DiscoveryModelling Foundations and Applications10.1007/978-3-319-61482-3_16(267-284)Online publication date: 20-Jun-2017
  • (2016)The role of developers’ social relationships in improving service selectionInternational Journal of Web Information Systems10.1108/IJWIS-04-2016-002212:4(477-503)Online publication date: 7-Nov-2016
  • (2015)Developers' networks contribution to web application designProceedings of the 17th International Conference on Information Integration and Web-based Applications & Services10.1145/2837185.2837241(1-10)Online publication date: 11-Dec-2015
  • (2015)Deriving Intensional Descriptions for Web ServicesProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806447(971-980)Online publication date: 17-Oct-2015
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media