The purpose of this paper is to investigate the application of text classification in Hypatia, th... more The purpose of this paper is to investigate the application of text classification in Hypatia, the digital library of Technological Educational Institute of Athens, in order to provide an automated classification tool as an alternative to manual assignments. The crucial point in text classification is the selection of the most important term-words for document representation. Classic weighting method TF.IDF was investigated. Our document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Classification was conducted utilizing 14 classifiers available on WEKA. Classification process yielded an excellent ~97% precision score.
The effects of book and paper conservation treatments on the intrinsic data of the artifacts are ... more The effects of book and paper conservation treatments on the intrinsic data of the artifacts are examined. The tangible data present in an object are grouped in three layers, with the third layer being associated with the object’s material properties. The wealth of information that can be drawn from the data of the third layer and their significance is discussed. The obfuscation of critical data or their complete loss after specific treatments is a possible outcome, and conservators, stakeholders and the public should be aware of what may be lost after a conservation intervention.
Proceedings of the International Conference on Theory and Practice of Digital Libraries (TPDL 2011), Sep 30, 2011
Europeana has put in a stretch many known procedures in digital libraries, imposing requirements ... more Europeana has put in a stretch many known procedures in digital libraries, imposing requirements difficult to be implemented in many small institutions, often without dedicated systems support personnel. Although there are freely available open source software platforms that provide most of the commonly needed functionality such as OAI-PMH support, the migration from legacy software may not be easy, possible or desired. Furthermore, advanced requirements like selective harvesting according to complex criteria are not widely supported. To accommodate these needs and help institutions contribute their content to Europeana, we developed a series of tools. For the majority of small content providers that are running DSpace, we developed a DSpace plug-in, to convert and augment the Dublin Core metadata according to Europeana ESE requirements. For sites with different software, incompatible with OAI-PMH, we developed wrappers enabling repeatable generation and harvesting of ESE-compatible metadata via OAI-PMH. In both cases, the system is able to select and harvest only the desired metadata records, according to a variety of configuration criteria of arbitrary complexity. We applied our tools to providers with sophisticated needs, and present the benefits they achieved.
The purpose of this paper is to investigate the application of text classification in Hypatia, th... more The purpose of this paper is to investigate the application of text classification in Hypatia, the digital library of Technological Educational Institute of Athens, in order to provide an automated classification tool as an alternative to manual assignments. The crucial point in text classification is the selection of the most important term-words for document representation. Classic weighting method TF.IDF was investigated. Our document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Classification was conducted utilizing 14 classifiers available on WEKA. Classification process yielded an excellent ~97% precision score.
The effects of book and paper conservation treatments on the intrinsic data of the artifacts are ... more The effects of book and paper conservation treatments on the intrinsic data of the artifacts are examined. The tangible data present in an object are grouped in three layers, with the third layer being associated with the object’s material properties. The wealth of information that can be drawn from the data of the third layer and their significance is discussed. The obfuscation of critical data or their complete loss after specific treatments is a possible outcome, and conservators, stakeholders and the public should be aware of what may be lost after a conservation intervention.
Proceedings of the International Conference on Theory and Practice of Digital Libraries (TPDL 2011), Sep 30, 2011
Europeana has put in a stretch many known procedures in digital libraries, imposing requirements ... more Europeana has put in a stretch many known procedures in digital libraries, imposing requirements difficult to be implemented in many small institutions, often without dedicated systems support personnel. Although there are freely available open source software platforms that provide most of the commonly needed functionality such as OAI-PMH support, the migration from legacy software may not be easy, possible or desired. Furthermore, advanced requirements like selective harvesting according to complex criteria are not widely supported. To accommodate these needs and help institutions contribute their content to Europeana, we developed a series of tools. For the majority of small content providers that are running DSpace, we developed a DSpace plug-in, to convert and augment the Dublin Core metadata according to Europeana ESE requirements. For sites with different software, incompatible with OAI-PMH, we developed wrappers enabling repeatable generation and harvesting of ESE-compatible metadata via OAI-PMH. In both cases, the system is able to select and harvest only the desired metadata records, according to a variety of configuration criteria of arbitrary complexity. We applied our tools to providers with sophisticated needs, and present the benefits they achieved.
Proceedings of the 11th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2007), Budapest, Hungary, September 16-21, 2007, 2007
We present the results of a questionnaire survey for the access and reproduction policies of 67 d... more We present the results of a questionnaire survey for the access and reproduction policies of 67 digital collections in 34 libraries (national, academic, public, special etc) from 13 countries. We examine and analyze the above policies in relation to specific factors, such as, the acquisition method, copyright ownership, library type (national, academic, etc.), content creation (digitized, born-digital) and content type (audio, video, etc.); how these factors affect the policies of the examined digital collections. Responses were received from a range of library sectors but by far the best responses came from academic libraries, in which we focus. We extract policy (access, reproduction) rules and alternatives according to these factors that lead to a policy decision tree on digital information management for academic libraries. The resulting decision tree is based on a policy model; the model and tree are divided into two parts: for digitized and born-digital content.
Proceedings of the 9th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2005), Vienna, Austria, September 18-23, 2005, 2005
The access and reproduction policies of the digital collections of fifteen leading academic and n... more The access and reproduction policies of the digital collections of fifteen leading academic and national digital libraries worldwide are classified according to factors such as the creation type of the material, acquisition method and copyright ownership. The relationship of these factors and policies is analyzed and quantitative remarks are extracted. We propose a policy model for the digital content of the national and academic libraries. The model consists of rules, supplemented by their exceptions, about which factors lead to specific policies. We derive new policy rules on access and reproduction when different copyright terms are applied. We conclude with findings on policies. Finally, we compare national and academic library policies, showing interesting results that arise on their similarities and differences.
Proceedings of the 1st Hellenic Society for Systemic Studies National Conference 2005 (HSSS 2005), in co-ordination with the University of Peloponnese, Tripolis, Greece, May 12-14, 2005, 2005
Proceedings of the 1st Hellenic Society for Systemic Studies National Conference 2005 (HSSS 2005), in co-ordination with the University of Peloponnese, Tripolis, Greece, May 12-14, 2005, 2005
The digital content of libraries is different from commercial digital products, such as, computer... more The digital content of libraries is different from commercial digital products, such as, computer applications, software tools, and computer code or data streams, but they have the same sharing, reproduction and distribution digital properties and similar knowledge management problems. We examine the policies applied to commercial and library digital content. We classify the commercial digital products, according to their use
ABSTRACT This paper presents the EuropeanaLocal project, funded by the European Commission that a... more ABSTRACT This paper presents the EuropeanaLocal project, funded by the European Commission that aims to assist cultural repositories to provide access to digital content that they hold to Europeana, the European Digital Library. The paper emphasizes at the effort of Greek Institutional Repositories to provide cultural content to Europeana via their involvement at the EuropeanaLocal project. Greek libraries, museums and archives content providers trying to adopt interoperability standards, the automated metadata harvesting model of Europeana, enrich their metadata by applying the European Semantic Elements metadata schemas in order to be able to migrate their metadata and make available their content of immense value to Europeana. Finally, the paper analyzes the results of a research conducted in Greek repositories participating in the project, which demonstrates their readiness for this project, problems encountered, the diversity of materials and standards that they are used, etc. Finally, the paper illustrates the growing opportunities arise for libraries and other cultural institutions (archives, museums) through their participation in the EuropeanLocal Project.
Uploads
Refereed journal publications by Alexandros Koulouris
Refereed conference publications by Alexandros Koulouris