Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Semantic Publishing Challenge – Assessing the Quality of Scientific Output by Information Extraction and Interlinking

  • Conference paper
  • First Online:
Semantic Web Evaluation Challenges (SemWebEval 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 548))

Included in the following conference series:

Abstract

The Semantic Publishing Challenge series aims at investigating novel approaches for improving scholarly publishing using Linked Data technology. In 2014 we had bootstrapped this effort with a focus on extracting information from non-semantic publications – computer science workshop proceedings volumes and their papers – to assess their quality. The objective of this second edition was to improve information extraction but also to interlink the 2014 dataset with related ones in the LOD Cloud, thus paving the way for sophisticated end-user services.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    See https://www.force11.org/meetings/beyond-pdf-2, http://sepublica.info, and http://linkedscience.org/category/workshop/.

  2. 2.

    As no one participated in Task 3, our work on this task ended with step 3.

  3. 3.

    Anastasia Dimou, a co-author of one Task 1 submission [5], did not vote in this task.

  4. 4.

    https://github.com/angelobo/SemPubEvaluator.

  5. 5.

    Licensing issues slowed down progress: from Vol-1265 the metadata are open under CC0, whereas for older volumes CEUR-WS.org does not have the editors’ explicit permission to republish derivatives such as extracted RDF. Opinions diverge on the copyrightability of metadata [3]; DBLP actually republishes CEUR-WS.org metadata under ODC-BY. Still, CEUR-WS.org decided not to publish old metadata under their domain; instead, we will publish them as an outcome of this Challenge.

  6. 6.

    http://www.colinda.org/.

  7. 7.

    See http://eventseer.net/ and http://www.wikicfp.com/.

  8. 8.

    See http://www.geonames.org/ and http://dbpedia.org/.

  9. 9.

    http://dblp.l3s.de/dblp++.php.

  10. 10.

    http://www.semanticlancet.eu/.

  11. 11.

    http://data.semanticweb.org/.

  12. 12.

    http://lod.springer.com/.

References

  1. Allen, B.P., et al.: Improving future research communication and e-scholarship. FORCE11 Manifesto (2011). https://www.force11.org/about/manifesto

  2. Bryl, V., et al.: What’s in the proceedings? combining publisher’s and researcher’s perspectives. In: SePublica, vol. 1155 (2014). CEUR-WS.org

  3. Coyle, K.: Metadata and Copyright. Libr. J. (2013). http://lj.libraryjournal.com/2013/02/opinion/peer-to-peer-review/metadataand-copyright-peer-to-peer-review

  4. Dimou, A., Vander Sande, M., Colpaert, P., De Vocht, L., Verborgh, R., Mannens, E., Van de Walle, R.: Extraction and semantic annotation of workshop proceedings in HTML using RML. In: Presutti, V., et al. (eds.) SemWebEval 2014. CCIS, vol. 475, pp. 114–119. Springer, Heidelberg (2014)

    Google Scholar 

  5. Heyvaert, P., et al.: Semantically annotating CEUR-WS workshop proceedings with RML. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 165–176. Springer, Heidelberg (2015)

    Google Scholar 

  6. Klampfl, S., Kern, R.: Machine learning techniques for automatically extracting contextual information from scientific publications. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 105–116. Springer, Heidelberg (2015)

    Google Scholar 

  7. Kolchin, M., Kozlov, F.: A template-based information extraction from web sites with unstable markup. In: Presutti, V., et al. (eds.) SemWebEval 2014. CCIS, vol. 475, pp. 89–94. Springer, Heidelberg (2014)

    Google Scholar 

  8. Kolchin, M., et al.: CEUR-WS-LOD: conversion of CEUR-WS workshops to linked data. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 142–152. Springer, Heidelberg (2015)

    Google Scholar 

  9. Kovriguina, L., et al.: Metadata extraction from conference proceedings using template-based approach. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 153–164. Springer, Heidelberg (2015)

    Google Scholar 

  10. Lange, C., Di Iorio, A.: Semantic publishing challenge – assessing the quality of scientific output. In: Presutti, V., et al. (eds.) SemWebEval 2014. CCIS, vol. 475, pp. 61–76. Springer, Heidelberg (2014)

    Google Scholar 

  11. Milicka, M., Burget, R.: Information extraction from web sources based on multi-aspect content analysis. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 81–92. Springer, Heidelberg (2015)

    Google Scholar 

  12. Nuzzolese, A.G., Peroni, S., Recupero, D.R.: MACJa: metadata and citations jailbreaker. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 117–128. Springer, Heidelberg (2015)

    Google Scholar 

  13. Ronzano, F., del Bosque, G.C., Saggion, H.: Semantify CEUR-WS proceedings: towards the automatic generation of highly descriptive scholarly publishing linked datasets. In: Presutti, V., et al. (eds.) SemWebEval 2014. CCIS, vol. 475, pp. 83–88. Springer, Heidelberg (2014)

    Google Scholar 

  14. Ronzano, F., et al.: On the automated generation of scholarly publishing linked datasets: the case of CEUR-WS proceedings. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 177–188. Springer, Heidelberg (2015)

    Google Scholar 

  15. Sateli, B., Witte, R.: Automatic construction of a semantic knowledge base from CEUR workshop proceedings. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 129–141. Springer, Heidelberg (2015)

    Google Scholar 

  16. Presutti, V., et al. (eds.): SemWebEval 2014. CCIS, vol. 475. Springer, Heidelberg (2014)

    Google Scholar 

  17. Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548. Springer, Heidelberg (2015)

    Google Scholar 

  18. The DBLP Computer Science Bibliography. http://dblp.uni-trier.de

  19. Tkaczyk, D., Bolikowski, L.: Extracting contextual information from scientific literature using CERMINE system. In: Gandon, F., et al. (eds.) SemWebEval 2015. CCIS, vol. 548, pp. 93–104. Springer, Heidelberg (2015)

    Google Scholar 

  20. Verborgh, R., et al.: Low-cost queryable linked data through triple pattern fragments. In: ISWC Posters and Demonstrations, vol. 1272 (2014). CEUR-WS.org

Download references

Acknowledgements

We thank our reviewers, our sponsors Springer and Mendeley, and our participants for their hard work, creative solutions and useful suggestions. This work has been partially funded by the European Commission under grant agreement no. 643410.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sahar Vahdati .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Iorio, A.D., Lange, C., Dimou, A., Vahdati, S. (2015). Semantic Publishing Challenge – Assessing the Quality of Scientific Output by Information Extraction and Interlinking. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25518-7_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25517-0

  • Online ISBN: 978-3-319-25518-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics