This paper describes the linguistic analysis of a corpus of patient narratives that was used to d... more This paper describes the linguistic analysis of a corpus of patient narratives that was used to develop and test software to carry out sentiment analysis on the aforementioned corpus. There is a growing body of research on the relationship between sentiment analysis, social media (for example, Twitter) and health care, but less research on sentiment analysis of patient narratives (being longer and more complex texts). The motivation for this research is that patient narratives of experiences of the National Health Service (NHS) in the UK provide rich data of the treatment received.The corpus threw up some unexpected results that may be of benefit for researchers of sentiment analysis. The linguistic problems encountered have been divided into three sections: the noisy nature of large corpora; the idiomatic nature of language; the nature of language in the clinical domain. This article gives an overview of the project and describes the linguistic problems that arose out of the projec...
This article presents research carried out on a corpus of newspaper articles about the financial ... more This article presents research carried out on a corpus of newspaper articles about the financial crisis in Spain (Corpus de la Crisis Financiera - CCF). The genesis and compilation of the CCF coincided with a growing body of publications about the financial situation in Spain, a severe economic downturn involving a banking crisis, a burst housing bubble, a dramatic increase in unemployment, and cuts in social services. In this paper, we are going to focus on the semantics and rhetorical functions in the different texts that make up the corpus. Our main objective is to explore the realizations of evaluative meaning in our corpus, either overtly expressed by the journalist or implicitly transmitted in texts by means of rhetorical devices such as metaphors.We will provide examples from our corpus to show how the recurrence and coexistence of such linguistic features play a cohesive role providing texts consistency and texture. These linguistic resources persuade individual readers and ...
This article describes research undertaken in order to design a methodology for the reticular rep... more This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politecnica de Valencia, UPV) was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis) that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results ...
This article describes research undertaken in order to design a methodology for the reticular rep... more This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politécnica de Valencia, UPV) was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis) that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results ...
This paper describes the linguistic analysis of a corpus of patient narratives that was used to d... more This paper describes the linguistic analysis of a corpus of patient narratives that was used to develop and test software to carry out sentiment analysis on the aforementioned corpus. There is a growing body of research on the relationship between sentiment analysis, social media (for example, Twitter) and health care, but less research on sentiment analysis of patient narratives (being longer and more complex texts). The motivation for this research is that patient narratives of experiences of the National Health Service (NHS) in the UK provide rich data of the treatment received. The corpus threw up some unexpected results that may be of benefit for researchers of sentiment analysis. The linguistic problems encountered have been divided into three sections: the noisy nature of large corpora; the idiomatic nature of language; the nature of language in the clinical domain. This article gives an overview of the project and describes the linguistic problems that arose out of the proje...
This paper describes the linguistic analysis of a corpus of patient narratives that was used to d... more This paper describes the linguistic analysis of a corpus of patient narratives that was used to develop and test software to carry out sentiment analysis on the aforementioned corpus. There is a growing body of research on the relationship between sentiment analysis, social media (for example, Twitter) and health care, but less research on sentiment analysis of patient narratives (being longer and more complex texts). The motivation for this research is that patient narratives of experiences of the National Health Service (NHS) in the UK provide rich data of the treatment received.The corpus threw up some unexpected results that may be of benefit for researchers of sentiment analysis. The linguistic problems encountered have been divided into three sections: the noisy nature of large corpora; the idiomatic nature of language; the nature of language in the clinical domain. This article gives an overview of the project and describes the linguistic problems that arose out of the projec...
This article presents research carried out on a corpus of newspaper articles about the financial ... more This article presents research carried out on a corpus of newspaper articles about the financial crisis in Spain (Corpus de la Crisis Financiera - CCF). The genesis and compilation of the CCF coincided with a growing body of publications about the financial situation in Spain, a severe economic downturn involving a banking crisis, a burst housing bubble, a dramatic increase in unemployment, and cuts in social services. In this paper, we are going to focus on the semantics and rhetorical functions in the different texts that make up the corpus. Our main objective is to explore the realizations of evaluative meaning in our corpus, either overtly expressed by the journalist or implicitly transmitted in texts by means of rhetorical devices such as metaphors.We will provide examples from our corpus to show how the recurrence and coexistence of such linguistic features play a cohesive role providing texts consistency and texture. These linguistic resources persuade individual readers and ...
This article describes research undertaken in order to design a methodology for the reticular rep... more This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politecnica de Valencia, UPV) was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis) that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results ...
This article describes research undertaken in order to design a methodology for the reticular rep... more This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politécnica de Valencia, UPV) was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis) that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results ...
This paper describes the linguistic analysis of a corpus of patient narratives that was used to d... more This paper describes the linguistic analysis of a corpus of patient narratives that was used to develop and test software to carry out sentiment analysis on the aforementioned corpus. There is a growing body of research on the relationship between sentiment analysis, social media (for example, Twitter) and health care, but less research on sentiment analysis of patient narratives (being longer and more complex texts). The motivation for this research is that patient narratives of experiences of the National Health Service (NHS) in the UK provide rich data of the treatment received. The corpus threw up some unexpected results that may be of benefit for researchers of sentiment analysis. The linguistic problems encountered have been divided into three sections: the noisy nature of large corpora; the idiomatic nature of language; the nature of language in the clinical domain. This article gives an overview of the project and describes the linguistic problems that arose out of the proje...
Uploads
Papers by Ana Botella