Abstract
We introduce the BioFuice approach for integrating data from private and public data sources and ontologies. BioFuice follows a peer-to-peer-like data integration approach based on bidirectional mappings. Sources and mappings are associated with a domain model to support semantically meaningful interoperability. BioFuice extends the generic iFuice integration platform, which uses specific operators for data fusion and workflow-like script programs. BioFuice supports explorative data analysis as well as query and search capabilities. We illustrate the integration approach with a scenario and outline the architecture of BioFuice and its query interface.
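To make the idea of bidirectional mappings tied to a domain model more concrete, the following is a minimal Python sketch. It is not the BioFuice or iFuice API: the `Mapping` class, the `traverse` method, the source names, and all object identifiers are hypothetical and chosen only for illustration. It assumes a mapping can be represented as a set of instance correspondences between two sources, each of which is assigned an object type in the domain model.

```python
from collections import defaultdict

# Hypothetical domain model: each source is associated with one object type.
DOMAIN_MODEL = {"GeneSource": "Gene", "ProteinSource": "Protein"}


class Mapping:
    """Instance correspondences between two sources, usable in both directions."""

    def __init__(self, source, target, pairs):
        self.source = source
        self.target = target
        self._forward = defaultdict(set)   # source id -> target ids
        self._backward = defaultdict(set)  # target id -> source ids
        for s_id, t_id in pairs:
            self._forward[s_id].add(t_id)
            self._backward[t_id].add(s_id)

    def semantic_type(self):
        """Describe the mapping in terms of the domain model's object types."""
        return (DOMAIN_MODEL[self.source], DOMAIN_MODEL[self.target])

    def traverse(self, ids, reverse=False):
        """Return all objects that correspond to the given ids."""
        index = self._backward if reverse else self._forward
        result = set()
        for obj_id in ids:
            result |= index.get(obj_id, set())
        return result


# Made-up identifiers, for illustration only.
gene_to_protein = Mapping(
    "GeneSource",
    "ProteinSource",
    [("gene_A", "prot_1"), ("gene_A", "prot_2"), ("gene_B", "prot_3")],
)

proteins = gene_to_protein.traverse({"gene_A"})              # genes -> proteins
genes = gene_to_protein.traverse({"prot_3"}, reverse=True)   # proteins -> genes
```

Storing both directions of the correspondence set is one simple way to let the same mapping be followed from either source; the actual system may realize this differently.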
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kirsten, T., Rahm, E. (2006). BioFuice: Mapping-Based Data Integration in Bioinformatics. In: Leser, U., Naumann, F., Eckman, B. (eds) Data Integration in the Life Sciences. DILS 2006. Lecture Notes in Computer Science, vol 4075. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11799511_12
DOI: https://doi.org/10.1007/11799511_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36593-8
Online ISBN: 978-3-540-36595-2
eBook Packages: Computer Science (R0)