Although UK museum agencies are providing funding and developing policies to manage the impact of COVID-19, they do not have established mechanisms for gathering comprehensive data on the UK museum sector, for tracking the ways in which museums have responded to the pandemic. The
Museums in the Pandemic (MIP) project
3 seeks to provide timely data on which museums close, which remain resilient, and how the profile of the UK museum sector changes as a result of COVID-19. The project’s research draws on multi-disciplinary expertise from museum studies, computer science, data science and geographical information science, and combines both quantitative and qualitative methods, including natural language processing,
Machine Learning (ML), data visualisation, interview-based research and primary data collection.
In the earlier
Mapping Museums (MM) project, conducted by the authors and their collaborators,
4 we gathered authoritative longitudinal data on the entire UK museum sector, from 1960 to date, including the dates of museum permanent closures. That data is publicly available
5 through a browsable database, search app and visualisations. However, it is missing data relating to museums’ responses to the COVID-19 pandemic and hence cannot be used to investigate how the profile of the UK museum sector may be changing as a result of COVID-19. This need has motivated our research into extracting and analysing data from museums’ public websites and social media posts to investigate museums’ responses to the COVID-19 pandemic. In this context, museum closures are particularly significant events, and several types of closure were observed: temporary, indefinite or permanent.
This article focuses on the design, implementation and evaluation of computational techniques we have developed in the MIP project, utilising data scraped from museums’ public websites and social media posts to detect a set of high-level activity indicators for museums that are relevant to the COVID-19 pandemic. As social media platforms, we considered Facebook and Twitter (now X), which are used respectively by 51 and 19 million people in the UK.
6 We have developed visualisations to track occurrences of these indicators over the duration of the MIP project, across all museums and split according to one or more key museum attributes such as Governance, Size, Subject matter, Accreditation status and Location,
7 to allow the project’s museum studies experts to assess whether the pandemic is having a disproportionate impact on particular types of museums.
The contributions of this article are our methodology and methods for extracting meaningful data from web-based resources relating to a large number of institutions. Our research is novel in aiming to analyse the behaviour of the entire UK museum sector at an institutional level, in contrast to works that focus on aspects such as cultural artefacts, cultural production, museum services, visitors and visitor-generated content (see Section
1.1). These methods are transferable to large-scale analysis of other aspects of museums’ online content, beyond our specific focus here on the response of the UK’s museums to the COVID-19 pandemic. Although this article has a methodological focus, detailed analyses of online content production and engagement are conducted in forthcoming studies by the MIP project team [
25,
26]. The data and materials used in this article are available online as open data (see the Data Availability Statement).
1.1 Related Work
We view our research as falling into the broad field of Digital Humanities [
15], lying at the intersection of traditionally qualitative humanistic disciplines, such as museum studies, and the use of digital methods and data science. More specifically, our research relates to and contributes to the following:
•
web science, which studies the structure and evolution of online resources, linkages and content [
18], in our case, museums’ websites and social media posts as relating to their responses to the COVID-19 pandemic;
•
web analytics, which seeks to understand users’ online behaviour at a large scale [
39], in our case, the “user” entities being the museums in the UK; and
•
social media analytics, which seeks to gain insights into users’ behaviours, sentiments and preferences on social media platforms, in our case, how such platforms are being used by the UK’s museums to communicate their responses to the COVID-19 pandemic, through the application of web scraping, data cleansing, and data analysis and visualisation techniques [
6,
47].
A related area is that of
cultural analytics which uses data science techniques and “big data” to study cultural artefacts and cultural production at large scale [
31]. However, in contrast to this, our research falls into a new application area that we term
museum analytics, characterised by the large-scale application of data science methods to analyse museums’ online presence via their institutional websites and social media at the scale of an entire museum sector (the UK museum sector) rather than to study cultural artefacts or production.
A number of papers perform analyses of museum collections or other data held by museums. For example, the collection of papers in the work of Belhi et al. [
7] investigates the classification of cultural assets, the analysis and retrieval of visually linked paintings, image reconstruction methods, enhancing the end-user experience, the analysis and restoration of historical manuscripts, and the use of named entity recognition methods for the analysis of historical text. The chapter on “Museum Big Data” [
37] reviews methods and techniques to identify new and uncover hidden information, patterns, clusters and relationships within museum data. Such data comprises museum artefacts and services, data related to museum visits, and visitor-generated data on the web and social media. Other papers perform analyses specifically on museum visitors. For example, a classification of online visitors into six categories using both web analytics and traditional surveys is undertaken in the work of Villaespesa [
49]. One goal of the work is to suggest to museums that the online experience should be differentiated for the various categories of visitors. By contrast, physical visitors to museums are studied in the work of Widdop and Cutts [
50]. Here a multilevel logistic model is used to show that the places where individuals reside impact on museum participation.
Several studies apply ML to explore museum data about collections and visitors [
30]. Shao et al. [
46] use topic modelling on online reviews to understand visitor experiences in a London museum. Sentiment analysis can produce insights about online opinions about tourist destinations, including museums, at a large scale [
8]. ML is also deployed to make museum collection data more accessible, inter-linking entities as Linked Open Data [
13]. Social media analytics can also provide creative input to museum curators, extracting concepts and ideas from visitor comments and responses [
14]. Compared to these studies, our research is concerned with neither museum assets nor visitors. Instead, our focus is on analysing information posted by museums themselves, whether on their websites or through their social media channels, as it relates to the COVID-19 pandemic.
There have been numerous studies investigating museums’ responses to the pandemic (summarised in the following paragraphs). In contrast to our work, these are relatively small scale in relation to the size of the museum sector concerned, sampling a selection of a country’s or countries’ museums, whereas we aim to extract and analyse data from museums’ online presence covering the entire UK museum sector.
Some studies have considered museums’ responses through their web-based activities. Using a geographically representative sample of UK museums, King et al. [
21] identified 88 temporary exhibitions that would have opened during the first lockdown in the UK and analysed the 21 online exhibitions that were put in place as a substitute. They identified themes of access, embodiment and human connection emerging from the exhibition content, and raised questions around the conceptualisation, presentation and value of digital collections. Samaroudi et al. [
45] analysed the web-based content produced during the first lockdown period (April–July 2020) by a sample of 48 UK and 35 U.S. “memory institutions,” such as museums. They identified trends in how institutions restructured their digital content in terms of different types of content, museums and audiences, and made recommendations on how institutions could enhance the value of their digital content. Burke et. al. [
10] discussed three types of digital content offered by a small sample of Norwegian and international museums during the pandemic: virtual tours, online exhibitions and crowdsourced art creation. Gutowski and Kłos-Adamkiewicz [
17] evaluated a sample of 136 virtual tours of Polish museums and monuments taken in April 2020, finding no significant increase in digital content compared with a pre-pandemic sample taken in August 2019. Jin and Min [
19] examined how more than 1,300 Chinese museums provided new online exhibitions, educational programmes and livestreaming services to connect with their audiences during the first pandemic lockdown, analysing these offerings and their effects on audience engagement and making recommendations for museums’ ongoing development of their digital communication strategy. Raimo et al. [
42] studied the effect of the pandemic on the digitisation processes of three Italian museums, finding more frequent website updates, increased use of social media and more virtual exhibitions.
Several studies have considered museums’ responses through their use of social media. Agostino et al. [
3] analysed data from Italy’s Ministry for Cultural Heritage and Tourism about the 100 most-visited Italian state museums, finding a doubling of these museums’ use of social media during lockdown and creation of new types on online content. Magliacani and Sorrentino [
29] surveyed 34 Italian university museums to investigate how they maintained audience experience during lockdown, finding that the time they spent on social media management did not change significantly but that more than half offered video narrations of their collections and more than a quarter offered live online events. Kyprianos and Kontou [
24] undertook a questionnaire-based survey of 101 museums in the Attica region of Greece, finding that most of the 52 museums that responded had increased the time they spent on social media management during the pandemic and 50% had seen a moderate or significant rise in user traffic on their social media accounts. Ryder et al. [
44] surveyed the types of digital content 66 cultural institutions in the United States produced during lockdown and the effect on audiences’ engagement with their social media accounts; they found an increase in both live and serialised digital content and almost all institutions reporting an increase in social media engagement.
Other studies include those of of Mackay [
28], who conducted interviews with 10 UK museum operations professionals to investigate how museums dealt with the initial stages of the pandemic, identifying themes of emotional impact on employees, importance of staff adaptability and flexibility, disappointment with the crisis response of the UK governments, and mutual support of professionals in the sector, and Marzano and Castellini [
32], who surveyed approximately 1,500 Italian museums to investigate whether they activated new digital channels during lockdown, finding that only a minority of museums were working towards the provision of new digital activities.
Internationally, major bodies such as UNESCO, ICOM and NEMO have surveyed samples of museums from many countries to investigate trends in museums’ behaviours during the pandemic. UNESCO [
48] and ICOM [
36] reported on the increased online presence of museums across a variety of digital activities, the economic impact of the pandemic, and concerns about laying-off of staff and permanent museum closures. NEMO [
35] similarly reported an increase in digital services in a majority of museums, increased online visits and significant loss of income caused by closure during lockdown. Summaries of these and other studies are presented by Noerher et al. [
34] and Raved and Yahel [
43].