Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2740769.2740815acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

CED2AR: the comprehensive extensible data documentation and access repository

Published: 08 September 2014 Publication History
  • Get Citation Alerts
  • Abstract

    We describe the design, implementation, and deployment of the Comprehensive Extensible Data Documentation and Access Repository (CED2AR). This is a metadata repository system that allows researchers to search, browse, access, and cite confidential data and metadata through either a web-based user interface or programmatically through a search API, all the while re-reusing and linking to existing archive and provider generated metadata. CED2AR is distinguished from other metadata repository-based applications due to requirements that derive from its social science context. These include the need to cloak confidential data and metadata and manage complex provenance chains.

    References

    [1]
    Abowd, J., Vilhuber, L., and Block, W. A Proposed Solution to the Archiving and Curation of Confidential Scientific Inputs. In J. Domingo-Ferrer and I. Tinnirello, eds., Privacy in Statistical Databases (LNCS 7756). Springer Berlin / Heidelberg, 2012, 216--225.
    [2]
    Barga, R. S. and Digiampietri, L. A. Automatic capture and efficient storage of e-Science experiment provenance. Concurrency and Computation: Practice and Experience 20, (2008), 419--429.
    [3]
    Bosch, T., Cyganiak, R., Gregory, A., and Wackerow, J. DDI-RDF Discovery Vocabulary: A Metadata Vocabulary for Documenting Research and Survey Data. Linked Data on the Web Workshop, (2013).
    [4]
    Bosch, T., Cyganiak, R., Wackerow, J., and Zapilko, B. Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences. International Conference on Dublin Core and Metadata Applications; DC-2012--The Kuching Proceedings, (2012).
    [5]
    Cheney, J., Chong, S., Foster, N., Seltzer, M., and Vansummeren, S. Provenance. Proceeding of the 24th ACM SIGPLAN conference companion on Object oriented programming systems languages and applications - OOPSLA '09, ACM Press (2009), 957.
    [6]
    Chetty, R. The Transformative Potential of Administrative Data for Microeconometric Research. 2012. http://conference.nber.org/confer/2012/SI2012/LS/ChettySlides.pdf.
    [7]
    Crosas, M. The Dataverse Network®: An Open-Source Application for Sharing, Discovering and Preserving Data. D-Lib Magazine 17, 1/2 (2011).
    [8]
    Duerr, R. E., Downs, R. R., Tilmes, C., et al. On the utility of identification schemes for digital earth science data: an assessment and recommendations. Earth Science Informatics 4, 3 (2011), 139--160.
    [9]
    Evans, T., Zayatz, L., and Slanta, J. Using noise for disclosure limitation of establishment tabular data. Journal of Official Statistics 14, 4 (1998), 537--551.
    [10]
    Frew, J., Janee, G., and Slaughter, P. Automatic Provenance Collection and Publishing in a Science Data Production Environment - Early Results. Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, {IPAW} 2010, Troy, {NY}, {USA}, June 15-16, 2010. Revised Selected Papers, (2010), 27--33.
    [11]
    Greenberg, J., White, H. C., Carrier, S., and Scherle, R. A Metadata Best Practice for a Scientific Data Repository. Journal of Library Metadata 9, 3-4 (2009), 194--212.
    [12]
    Groth, P. and Moreau, L. PROV-Overview: An Overview of the PROV Family of Documents. 2013.
    [13]
    Hagedorn, K. OAIster: a "no dead ends" OAI service provider. 21, 2 (2003), 170--181.
    [14]
    Hahnel, M. Exclusive: figshare a new open data project that wants to change the future of scholarly publishing. Impact of Social Sciences blog, 2012. http://eprints.lse.ac.uk/51893/1/blogs.lse.ac.uk-Exclusive_figshare_a_new_open_data_project_that_wants_to_change_the_future_of_scholarly_publishing.pdf.
    [15]
    Haltiwanger, J. C., Jarmin, R. S., and Miranda, J. Business Dynamics Statistics: An Overview. SSRN Electronic Journal, (2009).
    [16]
    Heath, T. and Bizer, C. Linked Data: Evolving the Web into a Global Data Space. Synthesis Lectures on the Semantic Web: Theory and Technology 1, 1 (2011), 1--136.
    [17]
    Jarmin, R. and Miranda, J. The Longtitudinal Business Database. 2002.
    [18]
    King, G. The Social Science Data Revolution. Horizons in Political Science, 2011. http://gking.harvard.edu/files/gking/files/evbase-horizonsp.pdf.
    [19]
    King, G. Ensuring the data-rich future of the social sciences. Science (New York, N.Y.) 331, 6018 (2011), 719--21.
    [20]
    Kinney, S. K., Reiter, J. P., Reznek, A. P., Miranda, J., Jarmin, R. S., and Abowd, J. M. Towards Unrestricted Public Use Business Microdata: The Synthetic Longitudinal Business Database. International Statistical Review 79, 3 (2011), 362--384.
    [21]
    Kramer, S., Leahey, A., Southall, H., Vampras, J., and Wackerow, J. Using RDF to describe and link social science data to related resources on the Web: leveraging the Data Documentation Initiative (DDI) model. 2012. http://eprints.port.ac.uk/9029/1/UsingRDFToDescribeAndLinkSocialScienceDataToRelatedResourcesOnTheWeb.pdf.
    [22]
    Lagoze, C., Arms, W. Y., Gan, S., et al. Core Services in the Architecture of the National Digital Library for Science Education (NSDL). ACM/IEEE (2002).
    [23]
    Lagoze, C., Block, W., Williams, J., Abowd, J. M., and Vilhuber, L. Data Management of Confidential Data. International Data Curation Conference, (2013).
    [24]
    Lagoze, C., Vilhuber, L., Williams, J., and Block, W. Encoding Provenance of Social Science Data: Integrating PROV with DDI. Proceedings of EDDI13 5th Annual European DDI User Conference, (2013).
    [25]
    Lagoze, C., Williams, J., and Vilhuber, L. Encoding Provenance Metadata for Social Science Datasets. MTSR 2013 - 7th Metadata and Semantics Research Conference, (2013).
    [26]
    Lagoze, C. Keeping Dublin Core Simple: Cross Domain Discovery or Resource Description? D-Lib Magazine 7, 1 (2001).
    [27]
    Mayernik, M. S., Choudhury, G. S., DiLauro, T., et al. The Data Conservancy Instance: Infrastructure and Organizational Services for Research Data Curation. D-Lib Magazine 18, 2012.
    [28]
    McDonough, J. Structural Metadata and the Social Limitation of Interoperability: A Sociotechnical View of XML and Digital Library Standards Development. Basilage The Markup Conference 2008, (2008).
    [29]
    Michener, W., Vieglais, D., Vision, T., Kunze, J., Cruse, P., and Janée, G. DataONE: Data Observation Network for Earth --- Preserving Data and Enabling Innovation in the Biological and Environmental Sciences. D-Lib Magazine 17, 1/2 (2011).
    [30]
    Missier, P., Belhajjame, K., and Cheney, J. The W3C PROV family of specifications for modelling provenance metadata. EDBT/ICDT '13, ACM Press (2013), 773--776.
    [31]
    Moreau, L. and Lebo, T. Linking across Provenance Bundles. 2013.
    [32]
    Moreau, L. and Missier, P. PROV-N: The Provenance Notation. 2013.
    [33]
    Moreau, L. PROV-XML: the PROV-XML Schema. 2013.
    [34]
    National Science Foundation. NSF Award Search: Award#1131848 - NCRN-MN: Cornell Census-NSF Research Node: Integrated Research Support, Training and Data Documentation. 2011. http://www.nsf.gov/awardsearch/showAward?AWD_ID=1131848.
    [35]
    Office of Science and Technology Policy. Increasing Access to the Results of Federally Funded Scientific Research. Washington D.C., 2013.
    [36]
    Peek, R. Digital Public Library of America. Information Today 29, (2012), 24.
    [37]
    Plale, B., McDonald, R. H., Chandrasekar, K., et al. The SEAD datanet prototype: Data preservation services for sustainability science. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, (2013), 439--440.
    [38]
    Pollard, T. J. and Wilkinson, J. M. Making Datasets Visible and Accessible: DataCite's First Summer Meeting. Ariadne, 64 (2010).
    [39]
    Renear, A. H., Sacchi, S., and Wicket, K. M. Definitions of Dataset in the Scientific and Technical Literature. Proceedings of the 73rd ASIS&T Annual Meeting, (2010).
    [40]
    Ruggles, S., Alexander, J. T., Genadek, K., Goeken, R., Schroeder, M. B., and Sobek, M. Integrated Public Use Microdata Series: Version 5.0 {Machine-readable database}. 2010.
    [41]
    Treloar, A. Design and Implementation of the Australian National Data Service. International Journal of Digital Curation 4, 2009.
    [42]
    Vardigan, M., Heus, P., and Thomas, W. Data Documentation Initiative: Toward a Standard for the Social Sciences. The International Journal of Digital Curation 3, 1 (2008).
    [43]
    Zimmerman, A. New Knowledge from Old Data Sharing and Reuse of Ecological Data. Science Technology Human Values2 33, 5 (2008), 631--652.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    JCDL '14: Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries
    September 2014
    498 pages
    ISBN:9781479955695

    Sponsors

    Publisher

    IEEE Press

    Publication History

    Published: 08 September 2014

    Check for updates

    Author Tags

    1. metadata
    2. standards

    Qualifiers

    • Research-article

    Conference

    JCDL '14
    Sponsor:
    JCDL '14: 14th ACM/IEEE-CS Joint Conference on Digital Libraries
    September 8 - 12, 2014
    London, United Kingdom

    Acceptance Rates

    Overall Acceptance Rate 415 of 1,482 submissions, 28%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 90
      Total Downloads
    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media