Abstract
Selecting the right set of descriptors for the annotation of a specific dataset can be a hard problem in research data management. Considering a dataset in an arbitrary domain, an application profile is complex to build because of the abundance of metadata standards, ontologies and other descriptor sources available for different domains. We propose to partially automate the process of data description by generating application profile recommendations based on a research data asset knowledge base. Our approach builds on existing technologies for exploring linked data and results in a process which can be tightly coupled with the research workflow, giving researchers more control over the description of their data. Preliminary experiments show that we can build on state-of-the-art technologies for search indexes, graph databases and triple stores to explore existing sources of linked data for our profile generation.
Supported by Ph.D. grant SFRH/BD/77092/2011, provided by the FCT (Fundação para a Ciência e Tecnologia).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Al-Khalifa, H.S., Davis, H.C.: The evolution of metadata from standards to semantics in E-learning applications. In: Proceedings of the Seventeenth Conference on Hypertext and Hypermedia - HYPERTEXT 2006, p. 69 (2006)
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - A crystallization point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web 7(3), 154–165 (2009)
Brickley, D., Miller, L.: FOAF Vocabulary Specification 0.98 (2010)
Calais, E.: Gravity and the figure of the Earth (2012), http://web.ics.purdue.edu/~ecalais/teaching/eas450/Gravity1.pdf
Dublin Core Metadata Initiative. DCMI Metadata Terms (2012), http://dublincore.org/documents/dcmi-terms/#terms-creator
Fire, M., Tenenboim, L., Lesser, O., Puzis, R., Rokach, L., Elovici, Y.: Link Prediction in Social Networks Using Computationally Efficient Topological Features. In: 2011 IEEE Third Int’l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int’l Conference on Social Computing, pp. 73–80 (October 2011)
Google Freebase. Freebase Documentation (2012), http://wiki.freebase.com/wiki/Main_Page
Haase, K.: Context for semantic metadata. In: Proceedings of the 12th Annual ACM International, pp. 204–211 (2004)
Hasan, M.A., Chaoji, V., Salem, S.: Link prediction using supervised learning. In: SDM 2006: Workshop on Link (2006)
Huang, Z.: Link Prediction Based on Graph Topology: The Predictive Value of the Generalized Clustering Coefficient (2006)
Jones, S., Ross, S., Ruusalepp, R.: Data Audit Framework Methodology (2009)
Kleinberg, J.M.: Authoritative Sources in a Hyperlinked Environment. Journal of the ACM (JACM) 46(5), 604–632 (1999)
LibenNowell, D.: The link prediction problem for social networks. In: CIKM 2003 Proceedings of the Twelfth International Conference on Information and Knowledge Management, pp. 556–559 (November 2004)
Lichtenwalter, R.N., Dame, N., Chawla, N.V.: Vertex Collocation Profiles: Subgraph Counting for Link Analysis and Prediction (1019), 1019–1028 (2012)
Lyon, L.: Dealing with Data: Roles, Rights, Responsibilities and Relationships. Technical report (2007)
Martinez-Uribe, L., Macdonald, S.: User Engagement in Research Data Curation. In: Agosti, M., Borbinha, J., Kapidakis, S., Papatheodorou, C., Tsakonas, G. (eds.) ECDL 2009. LNCS, vol. 5714, pp. 309–314. Springer, Heidelberg (2009)
P. A. A. i. D. Media. Digital preservation strategies. Workbook on Digital Private Papers, pp. 222–246 (2008)
Morfeo Project. Measurement Units Ontology (2008), http://forge.morfeo-project.org/wiki_en/index.php/Units_of_measurement_ontology
Oracle ThinkQuest. Information Internet: Chemistry Gravimetry (2012), http://library.thinkquest.org/10679/chemistry/gravimet.html
Piwowar, H.A., Day, R.B., Fridsma, D.S.: Sharing detailed research data is associated with increased citation rate. PLoS One 2(3) (2007)
Treloar, A., Wilkinson, R.: Rethinking Metadata Creation and Management in a Data-Driven Research World. In: 2008 IEEE Fourth International Conference on eScience, pp. 782–789 (December 2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
da Silva, J.R., Ribeiro, C., Lopes, J.C. (2012). Semi-automated Application Profile Generation for Research Data Assets. In: Dodero, J.M., Palomo-Duarte, M., Karampiperis, P. (eds) Metadata and Semantics Research. MTSR 2012. Communications in Computer and Information Science, vol 343. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35233-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-35233-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35232-4
Online ISBN: 978-3-642-35233-1
eBook Packages: Computer ScienceComputer Science (R0)