Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3343413.3377980acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

We Could, but Should We?: Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections

Published: 14 March 2020 Publication History

Abstract

We live in an era in which the ways that we can make sense of our past are evolving as more artifacts from that past become digital. At the same time, the responsibilities of traditional gatekeepers who have negotiated the ethics of historical data collection and use, such as librarians and archivists, are increasingly being sidelined by the system builders who decide whether and how to provide access to historical digital collections, often without sufficient reflection on the ethical issues at hand. It is our aim to better prepare system builders to grapple with these issues. This paper focuses discussions around one such digital collection from the dawn of the web, asking what sorts of analyses can and should be conducted on archival copies of the GeoCities web hosting platform that dates to 1994.

References

[1]
A. Babenko, A. Slesarev, A. Chigorin, and V. Lempitsky. 2014. Neural Codes for Image Retrieval. In ECCV. 584--599.
[2]
J. Baker. 2020. GeoCities and Diaries on the EarlyWeb. In The Diary, B. Ben-Amos and D. Ben-Amos (Eds.). Indiana University Press.
[3]
M. Bastian, S. Heymann, and M. Jacomy. 2009. Gephi: An Open Source Software for Exploring and Manipulating Networks. In ICWSM.
[4]
K. Beelen, T. Thijm, C. Cochrane, K. Halvemaan, G. Hirst, M. Kimmins, S. Lijbrink, M. Marx, N. Naderi, L. Rheault, R. Polyanovsky, and T. Whyte. 2017. Digitization of the Canadian Parliamentary Debates. Canadian Journal of Political Science 50, 3 (2017), 849 -- 864.
[5]
M. Braga. 2015. Google, a Search Company, Has Made Its Internet Archive Impossible to Search. Motherboard (Feb. 2015). https: //www.vice.com/en_us/article/jp5a77/google-a-search-company-has-madeits- internet-archive-impossible-to-search
[6]
N. Brügger. 2018. The Archived Web. Doing History in the Digital Age. MIT Press.
[7]
K. Christen. 2011. Opening Archives: Respectful Repatriation. The American Archivist 74, 1 (2011), 185--210.
[8]
H. Christenson. 2010. HathiTrust: A Research Library at Web Scale. Library Resources and Technical Services 55, 2 (2010), 93--102.
[9]
H. Chu and M. Rosenthal. 1996. Search Engines for the World Wide Web: A Comparative Study and Evaluation Methodology. In ASIS. 127--135.
[10]
D. Cohen, F. Gibbs, T. Hitchcock, G. Rockwell, J. Sander, R. Shoemaker, S. Sinclair, W. Turkel, C. Briquet, J. McLaughlin, M. Radzikowska, J. Simpson, and K. Uszkalo. 2011. Data Mining with Criminal Intent: Final White Paper.
[11]
P. Dent. 2000. "Ego-Surfing" Derides Valid, Prudent Activity. In Online Journalism Review, USC Annenberg School for Communication.
[12]
C. Dummitt. 2017. Unbuttoned: A History of Mackenzie King's Secret Life. McGill- Queen's University Press.
[13]
C. Ess. 2006. Ethical Pluralism and Global Information Ethics. Ethics and Information Technology 8, 4 (2006), 215--226.
[14]
C. Fiesler and N. Proferes. 2018. "Participant" Perceptions of Twitter Research Ethics. Social Media + Society 4, 1 (2018).
[15]
C. Frankel, M. Swain, and V. Athitsos. 1996. WebSeer: An Image Search Engine for the World Wide Web. Technical Report 96--14. University of Chicago.
[16]
S. Graham, I. Milligan, and S. Weingart. 2015. Exploring Big Historical Data: The Historian's Macroscope. Imperial College Press.
[17]
B. Hallinan, J. Brubaker, and C. Fiesler. 2019. Unexpected Expectations: Public Reaction to the Facebook Emotional Contagion Study. New Media & Society (2019).
[18]
S. High. 2015. Oral History at the Crossroads: Sharing Life Stories of Survival and Displacement. UBC Press.
[19]
H. Jenkins. 2006. Convergence Culture: Where Old and New Media Collide. NYU Press.
[20]
B. Jules, E. Summers, and V. Mitchell. 2018. Ethical Considerations for Archiving Social Media Content Generated by Contemporary Social Movements: Challenges, Opportunities, and Recommendations.
[21]
K. Keats-Rohan (Ed.). 2007. Prosopography Approaches and Applications: A Handbook. Oxford.
[22]
M. Kirschenbaum, R. Ovenden, and G. Redwine. 2010. Digital Forensics and Born-Digital Content in Cultural Heritage Collections. CLIR Publication No. 149. Council on Library and Information Resources.
[23]
E. Klein, B. Alex, C. Grover, C. Coates, A. Quigley, U. Hinrichs, J. Reid, N. Osborne, and I. Fieldhouse. 2014. Trading Consequences: Final White Paper.
[24]
Y. LeCun, Y. Bengio, and G. Hinton. 2015. Deep Learning. Nature 521, 7553 (2015), 436--444.
[25]
J. Lin, I. Milligan, J. Wiebe, and A. Zhou. 2017. Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives. ACM Journal on Computing and Cultural Heritage 10, 4 (2017), Article 22.
[26]
S. Lomborg. 2013. Personal Internet Archives and Ethics. Research Ethics 9, 1 (2013), 20--31.
[27]
A. Markham and E. Buchanan. 2012. Ethical Decision-Making and Internet Research: Recommendations from the AOIR Ethics Working Committee (Version 2.0).
[28]
D. Maron, D. Berry, F. Payton, S. Lakin, and E. White. 2017. "CanWe Really Show This"? Ethics, Representation and Social Justice in Sensitive Digital Space. In JCDL. 354--355.
[29]
S. McTavish. 2018. West Hollywood Goes Global: Exploring Queer Identity on GeoCities. In Global Digital Humanities Symposium.
[30]
J. Metcalf and K. Crawford. 2016. Where Are Human Subjects in Big Data Research? The Emerging Ethics Divide. Big Data & Society 3, 1 (2016).
[31]
I. Milligan. 2016. Lost in the Infinite Archive: The Promise and Pitfalls of Web Archives. International Journal of Humanities and Arts Computing 10, 1 (2016), 78--94.
[32]
I. Milligan. 2017. Welcome to the Web: The Online Community of GeoCities and the Early Years of the World Wide Web. In The Web as History, N. Brügger and R. Schroeder (Eds.). UCL Press.
[33]
I. Milligan. 2019. GeoCities. In SAGE Handbook of Web History, Niels Brügger and Ian Milligan (Eds.). SAGE Publications.
[34]
I. Milligan. 2019. History in the Age of Abundance? How the Web is Transforming Historical Research. McGill-Queen's University Press.
[35]
F. Moretti. 2007. Graphs, Maps, Trees: Abstract Models for Literary History. Verso.
[36]
J. Nicholas. 2014. A Debt to the Dead? Ethics, Photography, History, and the Study of Freakery. Histoire sociale/Social history 47, 93 (2014), 139--155.
[37]
H. Nissenbaum. 2011. A Contextual Approach to Privacy Online. Daedalus 140, 4 (2011), 32--48.
[38]
K. Ocamb. 2012. David Bohnett: Social Change through Community Commitment. Frontiers (2012).
[39]
Office of the Secretary of the National Commission for the Protection of Human Subjects of Biomedical and Behavioral Research. 1979. The Belmont Report: Ethical Principles and Guidelines for the Protection of Human Subjects of Research. Technical Report. Department of Health, Education, and Welfare.
[40]
Interagency Advisory Panel on Research Ethics. 2018. Tri-Council Policy Statement: Ethical Conduct for Research Involving Humans - TCPS 2.
[41]
R. Peterson. 1997. Eight Internet Search Engines Compared. First Monday 2, 2 (1997).
[42]
L. Putnam. 2016. The Transnational and the Text-Searchable: Digitized Sources and the Shadows They Cast. The American Historical Review 121, 2 (2016), 377-- 402.
[43]
R. Rosenzweig. 2003. Scarcity or Abundance? Preserving the Past in a Digital Era. The American Historical Review 108, 3 (2003), 735--762.
[44]
N. Ruest, J. Lin, I. Milligan, and S. Fritz. 2020. The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives. arXiv:2001.05399 (2020).
[45]
B. Sawyer and D. Greely. 1999. Creating GeoCities Web Sites. Muska and Lipman Publishing.
[46]
J. Smith, J. Burgoyne, I. Fujinaga, D. De Roure, and J. Downie. 2011. Design and Creation of a Large-Scale Database of Structural Annotations. In ISMIR. 555--560.
[47]
M. Smith. 1999. Invisible Crowds in Cyberspace. In Communities in Cyberspace, M. Smith and P. Kollock (Eds.). Psychology Press.
[48]
J. Vitak, K. Shilton, and Z.Ashktorab. 2016. Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community. In CSCW. 939--951.
[49]
H. Webb, M. Jirotka, B. Stahl, W. Housley, A. Edwards, M. Williams, R. Procter, O. Rana, and P. Burnap. 2017. The Ethical Challenges of Publishing Twitter Data for Research Dissemination. In WebSci. 339--348.
[50]
H. Yang, L. Liu, I. Milligan, N. Ruest, and J. Lin. 2019. Scalable Content-Based Analysis of Images inWeb Archives with TensorFlow and the Archives Unleashed Toolkit. In JCDL. 436--437.
[51]
M. Zimmer. 2018. Addressing Conceptual Gaps in Big Data Research Ethics: An Application of Contextual Integrity. Social Media + Society 4, 2 (2018).
[52]
M. Zimmer. 2018. How Contextual Integrity can help us with Research Ethics in Pervasive Data. https://medium.com/pervade-team/how-contextual-integritycan- help-us-with-research-ethics-in-pervasive-data-ef633c974cc1

Cited By

View all
  • (2024)Making Changes in Webpages Discoverable: A Change-Text Search Interface for Web ArchivesProceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries10.1109/JCDL57899.2023.00021(71-81)Online publication date: 26-Jun-2024
  • (2023)Egodokumentalne ślady człowieka w Internecie i ich archiwizacjaArcheion10.4467/26581264ARC.23.006.17866124(190-213)Online publication date: 14-Dec-2023
  • (2023)Imagining permanence on the web: Tracing the meanings of long-term preservation among the subjects of web archivesNew Media & Society10.1177/1461444823118703127:2(898-913)Online publication date: 22-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CHIIR '20: Proceedings of the 2020 Conference on Human Information Interaction and Retrieval
March 2020
596 pages
ISBN:9781450368926
DOI:10.1145/3343413
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 March 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. contextual integrity
  2. distant reading
  3. ethical frameworks
  4. re-identification
  5. search

Qualifiers

  • Research-article

Funding Sources

  • Start Smart Labs
  • Social Sciences and Humanities Research Council of Canada
  • Natural Sciences and Engineering Research Council of Canada
  • Andrew W. Mellon Foundation
  • US National Science Foundation
  • Compute Canada

Conference

CHIIR '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 55 of 163 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Making Changes in Webpages Discoverable: A Change-Text Search Interface for Web ArchivesProceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries10.1109/JCDL57899.2023.00021(71-81)Online publication date: 26-Jun-2024
  • (2023)Egodokumentalne ślady człowieka w Internecie i ich archiwizacjaArcheion10.4467/26581264ARC.23.006.17866124(190-213)Online publication date: 14-Dec-2023
  • (2023)Imagining permanence on the web: Tracing the meanings of long-term preservation among the subjects of web archivesNew Media & Society10.1177/1461444823118703127:2(898-913)Online publication date: 22-Jul-2023
  • (2023)To Re-experience the Web: A Framework for the Transformation and Replay of Archived Web PagesACM Transactions on the Web10.1145/358920617:4(1-49)Online publication date: 11-Jul-2023
  • (2022)A Scoping Review of Ethics Across SIGCHIProceedings of the 2022 ACM Designing Interactive Systems Conference10.1145/3532106.3533511(137-154)Online publication date: 13-Jun-2022
  • (2021)Remembering is a form of honouring: preserving the COVID-19 archival recordFACETS10.1139/facets-2020-01156:1(545-568)Online publication date: 1-Jan-2021
  • (2021)Multi-generational Stories of Urban Renewal: Preliminary Interviews for Map-Based StorytellingDiversity, Divergence, Dialogue10.1007/978-3-030-71305-8_26(319-326)Online publication date: 19-Mar-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media