Abstract
Molecular biology laboratories widely need software systems to support the internal management of data and documents. Genotyping procedures are a typical case: they produce a large number of documents whose content may constitute a potentially important knowledge base. Exploiting this information requires a proper classification of the elements in the knowledge base, which can be achieved effectively with concepts and tools from Semantic Web research. In particular, genotyping-related documents can be handled through a DMS (Document Management System) that can also deal with semantic metadata, e.g. in the form of tags. The use of semantic tagging at this operating level is currently hampered by the lack of proper tools. In this paper, drawing on experience from a practical case, we present an integrated approach to managing relevant genotyping documents and their semantic tagging. A preliminary study of the test-procedure workflow is crucial to understanding the document production processes. The semantic annotation employs terms taken from domain ontologies in the biomedical field. The annotation tool must be seamlessly integrated into the supporting DMS; its flexibility and usability ensure a low overhead for the annotation process, paving the way for widespread adoption of semantic tagging for genotyping-related documents.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Bechini, A., Giannini, R. (2011). Management of Genotyping-Related Documents by Integrated Use of Semantic Tagging. In: Hameurlain, A., Küng, J., Wagner, R., Böhm, C., Eder, J., Plant, C. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems IV. Lecture Notes in Computer Science, vol 6990. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23740-9_2
DOI: https://doi.org/10.1007/978-3-642-23740-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23739-3
Online ISBN: 978-3-642-23740-9
eBook Packages: Computer Science (R0)