DOI: 10.1145/1352694.1352703
research-article

Digital libraries and engines of search: new information systems in the context of the digital preservation

Published: 14 May 2007 Publication History

Abstract

The first digital library projects began some years ago with digitization, but it was only in 1996 that the first web archiving initiatives started to appear. They were driven by the growth of the Internet and its increasing use, which offered an opportunity to transform and readapt traditional library services. In this context, search engines play a fundamental supporting role in the new paradigm of knowledge: by capturing, storing and providing access to resources, they allow a digital library to exist on every computer with Internet access. In this article we analyze the ways of developing a digital library, paying particular attention to the web harvesting technique, and present the capabilities and limitations of digital libraries. We then summarize relevant projects and initiatives, and finally study the role of search engines in digital preservation, access and the dissemination of information.




    Published In

    EATIS '07: Proceedings of the 2007 Euro American conference on Telematics and information systems
    May 2007
    498 pages
    ISBN:9781595935984
    DOI:10.1145/1352694

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. digital libraries
    2. digital preservation
    3. information systems
    4. search engines
    5. web archiving
    6. web harvesting

    Qualifiers

    • Research-article

    Conference

    EATIS07

    Acceptance Rates

    Overall Acceptance Rate 17 of 64 submissions, 27%
