Michaela Mahlberg

Gender inequality and female body language in children’s literature

Digital Scholarship in the Humanities

In this paper, we aim to situate corpus linguistic approaches to literary texts within the wider ... more In this paper, we aim to situate corpus linguistic approaches to literary texts within the wider context of digital humanities. With an exploratory case study of gendered body language in children’s literature, we illustrate the relationship between quantitative and qualitative analysis. The case study is focused on female body language descriptions and how the presentation of body language has changed over time. We work with two corpora of children’s literature: 19th century and contemporary fiction. Our analysis confirms the substantial imbalance in the representation of female and male characters that has been identified by earlier studies and also shows a more nuanced picture of emerging subtle changes.

Fiction -one register or two? Speech and narration in novels

In this paper our focus is on analyzing register variation within fiction, rather than between fi... more In this paper our focus is on analyzing register variation within fiction, rather than between fiction and other registers. By working with subcorpora that separate text within and outside of quotation marks, we appromixate fictional speech and narration. This enables us to identify and compare linguistic features with regard to different situational contexts in the fictional world. We focus in particular on the novels of Charles Dickens and a reference corpus of other 19th-century fiction. Our main method for the register analysis is Multi-dimensional Analysis (MDA) for which we draw on altogether four dimensions from two previous MDAs. The linguistic distinctions we identify highlight similarities between fictional speech and involved registers such as face-to-face communication, and between narration and more informational and narrative prose. In addition to the detailed information on register features that characterize speech and narration, the paper raises more general questions about the ability of register studies to deal with situational contexts within fiction.

Speech-bundles in the 19th-century English novel

Language and Literature, 2019

We propose a lexico-grammatical approach to speech in fiction based on the centrality of 'fiction... more We propose a lexico-grammatical approach to speech in fiction based on the centrality of 'fictional speech-bundles' as the key element of fictional talk. To identify fictional speech-bundles, we use three corpora of 19th-century fiction that are available through the corpus stylistic web application CLiC (Corpus Linguistics in Context). We focus on the 'quotes' subsets of the corpora, i.e. text within quotation marks, which is mostly equivalent to direct speech. These quotes subsets are compared across the fiction corpora and with the spoken component of the British National Corpus 1994. The comparisons illustrate how fictional speech-bundles can be described on a continuum from lexical bundles in real spoken language to repeated sequences of words that are specific to individual fictional characters. Typical functions of fictional speech-bundles are the description of interactions and interpersonal relationships of fictional characters. While our approach crucially depends on an innovative corpus linguistic methodology, it also draws on theoretical insights into spoken grammar and characterisation in fiction in order to question traditional notions of realism and authenticity in fictional speech.

Point and CLiC Teaching literature with corpus stylistic tools

This chapter looks at the corpus tool CLiC, a web application specifically designed for the study... more This chapter looks at the corpus tool CLiC, a web application specifically designed for the study of literary texts. It allows students to run concordances or generate keywords, for instance. It gives students the opportunity to work with a corpus of Dickens novels, but also with other 19th century authors. Unlike more general corpus tools, CLiC enables searches that help to address research questions particular to literary texts. We investigate the question as to what kind of corpus exercises can be designed to help students understand the variety of opportunities that corpus approaches to literary texts offer. We deal with issues of frequency, but also with links between concepts in literary linguistics and corpus linguistics, specifically characterization and mind-modelling. We focus on examples from Charles Dickens's Oliver Twist for an illustrative case-study.

Translating fictional characters_Cermakova_Mahlberg.pdf

by Anna Cermakova and Michaela Mahlberg

The Corpus Linguistics Discourse. In honour of Wolfgang Teubert, 2018

In this chapter, we propose a novel theoretical framework for the literary translation of fiction... more In this chapter, we propose a novel theoretical framework for the literary translation of fictional characters. This framework develops the cognitive corpus linguistic notion of mind- modelling to account for process-, product- and function-oriented aspects of literary translation. We use the examples of Alice and the Queen from Alice’s Adventures in Wonderland to compare character cues across the English original and a Czech translation. The character cues we focus on are reporting verbs. Reporting verbs, as part of the presentation of fictional speech, form a central component of narrative fiction and so provide an ideal evidential basis for our theoretical framework. The translation shifts we found through our comparison of source and target text specifically include gendered uses of reporting verbs. By approaching the target text as both a translation and a reading of the text in its own right we are able to view translation shifts as a reflection of shifts in the mind- modelling of fictional characters.

Lexical cohesion: Corpus linguistic theory and its application in English language teaching

International Journal of Corpus Linguistics, 2006

Reading Dickens’s characters: Employing psycholinguistic methods to to investigate the cognitive reality of patterns in texts

by Michaela Mahlberg, Kathy Conklin, and Marie-Josee Bisson

CLiC Dickens: novel uses of concordances for the integration of corpus stylistics and cognitive poetics

by Michaela Mahlberg and Peter Stockwell

This paper introduces the web application CLiC, which we developed as part of a research project ... more This paper introduces the web application CLiC, which we developed as part of a research project bringing together insights from both cognitive poetics and corpus stylistics, with Dickens's novels as a case study. CLiC supports the analysis of discourse in narrative fiction with search options that make it possible to focus on stretches of text within and outside quotation marks. We argue that such search options open up novel ways of using concordances to link lexico-grammatical and textual patterns. We focus specifically on patterns for the creation of fictional characters. From a technical point of view, we explain the XML annotation that CLiC works with. Our discussion of textual examples focusses on phrases in fictional speech that illustrate significant differences between text within and outside quotation marks. In terms of theory, we argue that CLiC supports the identification of textual patterns that can provide insights into fictional minds and contribute to the exploration of readerly effects within the wider framework of mind-modelling.

MICHAELA MAHLBERG (2009). Patterns in News Stories: A Corpus Approach to Teaching Discourse Analysis.

In L. Lombardo (ed.). Using Corpora to Learn about Language and Discourse. Bern: Peter Lang., 2009

Analysing the opinions of UK veterinarians on practice-based research using corpus linguistic and mathematical methods

by Michaela Mahlberg, Viola Wiegand, and Rachel Dean

The use of corpus linguistic techniques and other related mathematical analyses have rarely, if e... more The use of corpus linguistic techniques and other related mathematical analyses have rarely, if ever, been applied to qualitative data collected from the veterinary field. The aim of this study was to explore the use of a combination of corpus linguistic analyses and mathematical methods to investigate a free-text questionnaire dataset collected from 3796 UK veterinarians on evidence-based veterinary medicine, specifically, attitudes towards practice-based research (PBR) and improving the veterinary knowledge base. The corpus methods of key word, concordance and collocate analyses were used to identify patterns of meanings within the free text responses. Key words were determined by comparing the questionnaire data with a wordlist from the British National Corpus (representing general English text) using cross-tabs and log-likelihood comparisons to identify words that occur significantly more frequently in the questionnaire data. Concordance and collocation analyses were used to account for the contextual patterns in which such key words occurred, involving qualitative analysis and Mutual Information Analysis (MI3). Additionally, a mathematical topic modelling approach was used as a comparative analysis; words within the free text responses were grouped into topics based on their weight or importance within each response to find starting points for analysis of textual patterns. Results generated from using both qualitative and quantitative techniques identified that the perceived advantages of taking part in PBR centred on the themes of improving knowledge of both individuals and of the veterinary profession as a whole (illustrated by patterns around the words learning, improving, contributing). Time constraints (lack of time, time issues, time commitments) were the main concern of respondents in relation to taking part in PBR. Opinions of what vets could do to improve the veterinary knowledge base focussed on the collecting and sharing of information (record, report), particularly recording and discussing clinical cases (interesting cases), and undertaking relevant continuing professional development activities. The approach employed here demonstrated how corpus linguistics and mathematical methods can help to both identify and contextualise relevant linguistic patterns in the questionnaire responses. The results of the study inform those seeking to coordinate PBR initiatives about the motivators of veterinarians to participate in such initiatives and what concerns need to be addressed. The approach used in this study demonstrates a novel way of analysing textual data in veterinary research.

Key words and translated cohesion in Lovecraft's At the Mountains of Madness and one of its Italian translations

In this paper, we explore the potential of a corpus approach to study translated cohesion. We use... more In this paper, we explore the potential of a corpus approach to study translated cohesion. We use key words as starting points for identifying cohesive networks in Lovecraft's At the Mountains of Madness and discuss how these networks contribute to the construction of literary meanings in the text. We focus on the role of repetition as a key element in establishing cohesive networks between lexical items. We specifically discuss the implications of our method for the analysis of cohesion in translated texts. A comparison of Lovecraft's original novel and a translation into Italian provides us with a nuanced understanding of the complex nature of cohesive networks. Finally, we discuss the broader issue of applying models and methods from corpus linguistics to corpus stylistic analysis.

Exploring text-initial words, clusters and concgrams in a newspaper corpus

A fresh view of the structure of hard news stories

'Mind-modelling with corpus stylistics in David Copperfield'

(with Michaela Mahlberg) Language and Literature 24 (2) (2015)

Phrases in literary contexts: Patterns and distributions of suspensions in Dickens’s novels

A case for corpus stylistics Ian Fleming’s Casino Royal, English Text Construction 4:2 (2011), 204–227 . doi 10.1075/etc.

Dickens, the suspended quotation and the corpus

Corpus Linguistics and the Study of Nineteenth-Century Fiction

Journal of Victorian Culture, 2010

Corpus Linguistics in Action: The Fireplace Pose in 19th Century Fiction

The Programming Historian, Sep 21, 2017

Fiction — One Register or Two ? Narrative and Fictional Speech in Dickens ’ s Novels

Gender inequality and female body language in children’s literature

Digital Scholarship in the Humanities

Fiction -one register or two? Speech and narration in novels

Speech-bundles in the 19th-century English novel

Language and Literature, 2019

Point and CLiC Teaching literature with corpus stylistic tools

Translating fictional characters_Cermakova_Mahlberg.pdf

by Anna Cermakova and Michaela Mahlberg

The Corpus Linguistics Discourse. In honour of Wolfgang Teubert, 2018

Lexical cohesion: Corpus linguistic theory and its application in English language teaching

International Journal of Corpus Linguistics, 2006

Reading Dickens’s characters: Employing psycholinguistic methods to to investigate the cognitive reality of patterns in texts

by Michaela Mahlberg, Kathy Conklin, and Marie-Josee Bisson

CLiC Dickens: novel uses of concordances for the integration of corpus stylistics and cognitive poetics

by Michaela Mahlberg and Peter Stockwell

MICHAELA MAHLBERG (2009). Patterns in News Stories: A Corpus Approach to Teaching Discourse Analysis.

In L. Lombardo (ed.). Using Corpora to Learn about Language and Discourse. Bern: Peter Lang., 2009

Analysing the opinions of UK veterinarians on practice-based research using corpus linguistic and mathematical methods

by Michaela Mahlberg, Viola Wiegand, and Rachel Dean

Key words and translated cohesion in Lovecraft's At the Mountains of Madness and one of its Italian translations

Exploring text-initial words, clusters and concgrams in a newspaper corpus

A fresh view of the structure of hard news stories

'Mind-modelling with corpus stylistics in David Copperfield'

(with Michaela Mahlberg) Language and Literature 24 (2) (2015)

Phrases in literary contexts: Patterns and distributions of suspensions in Dickens’s novels

A case for corpus stylistics Ian Fleming’s Casino Royal, English Text Construction 4:2 (2011), 204–227 . doi 10.1075/etc.

Dickens, the suspended quotation and the corpus

Corpus Linguistics and the Study of Nineteenth-Century Fiction

Journal of Victorian Culture, 2010

Textual patterns of dreaming and the unconscious mind in Dickens

Dickens Day 2017

Eye language -body part collocations and textual contexts in the nineteenth-century novel

Phraséologie et stylistique de la langue littéraire / Phraseology and Stylistics of Literary Language. Approches interdisciplinaires / Interdisciplinary Approaches, 2020

The description of body language is an important autho-rial technique of characterisation. In thi... more The description of body language is an important autho-rial technique of characterisation. In this chapter, we offer a corpus linguistic approach to the study of body language that enables us to combine detailed qualitative analysis with the observation of more general textual patterns. We take the example of the body part noun eyes to identify patterns of non-verbal communication. Our approach centres on the comparison of collocation across fictional speech, narration and suspensions, as a means to identify local textual functions of eye language. The general principles we demonstrate are applicable beyond the example of eyes to the study of body part nouns more generally. The analysis employs the Cor-poraCoCo R package, the web application CLiC, and it also makes use of semantic annotation with the USAS tagger.