Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline?

Beltrami, Daniela; Gagliardi, Gloria; Rossini Favretti, Rema; Ghidoni, Enrico; Tamburini, Fabio; Calzà, Laura

doi:10.3389/fnagi.2018.00369

ORIGINAL RESEARCH article

Front. Aging Neurosci., 13 November 2018

Sec. Neurocognitive Aging and Behavior

Volume 10 - 2018 | https://doi.org/10.3389/fnagi.2018.00369

Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline?

$\r\nDaniela Beltrami,�$ Daniela Beltrami^1,2^†

Gloria Gagliardi^1,3^†

Rema Rossini Favretti³

Enrico Ghidoni²

Fabio Tamburini³

Laura Calzà^1,4^*

¹Interdepartmental Centre for Industrial Research in Health Sciences and Technologies, University of Bologna, Bologna, Italy
²Clinical Neuropsychology Unit, Arcispedale S. Maria Nuova di Reggio Emilia, Reggio Emilia, Italy
³Department of Classical Philology and Italian Studies, University of Bologna, Bologna, Italy
⁴Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy

Background: The discovery of early, non-invasive biomarkers for the identification of “preclinical” or “pre-symptomatic” Alzheimer's disease and other dementias is a key issue in the field, especially for research purposes, the design of preventive clinical trials, and drafting population-based health care policies. Complex behaviors are natural candidates for this. In particular, recent studies have suggested that speech alterations might be one of the earliest signs of cognitive decline, frequently noticeable years before other cognitive deficits become apparent. Traditional neuropsychological language tests provide ambiguous results in this context. In contrast, the analysis of spoken language productions by Natural Language Processing (NLP) techniques can pinpoint language modifications in potential patients. This interdisciplinary study aimed at using NLP to identify early linguistic signs of cognitive decline in a population of elderly individuals.

Methods: We enrolled 96 participants (age range 50–75): 48 healthy controls (CG) and 48 cognitively impaired participants: 16 participants with single domain amnestic Mild Cognitive Impairment (aMCI), 16 with multiple domain MCI (mdMCI) and 16 with early Dementia (eD). Each subject underwent a brief neuropsychological screening composed by MMSE, MoCA, GPCog, CDT, and verbal fluency (phonemic and semantic). The spontaneous speech during three tasks (describing a complex picture, a typical working day and recalling a last remembered dream) was then recorded, transcribed and annotated at various linguistic levels. A multidimensional parameter computation was performed by a quantitative analysis of spoken texts, computing rhythmic, acoustic, lexical, morpho-syntactic, and syntactic features.

Results: Neuropsychological tests showed significant differences between controls and mdMCI, and between controls and eD participants; GPCog, MoCA, PF, and SF also discriminated between controls and aMCI. In the linguistic experiments, a number of features regarding lexical, acoustic and syntactic aspects were significant in differentiating between mdMCI, eD, and CG (non-parametric statistical analysis). Some features, mainly in the acoustic domain also discriminated between CG and aMCI.

Conclusions: Linguistic features of spontaneous speech transcribed and analyzed by NLP techniques show significant differences between controls and pathological states (not only eD but also MCI) and seems to be a promising approach for the identification of preclinical stages of dementia. Long duration follow-up studies are needed to confirm this assumption.

Introduction

The increasing prevalence of dementia among the elderly population is a major societal challenge, leading to a growing demand of diagnostic services for defects in memory and cognitive performance. One major diagnostic focus would be the early distinction among memory and cognitive complains likely to evolve as neurodegenerative disease, from functional symptoms, or non-neurological disorders. In this area, terminology and diagnostic criteria are still under discussion. For example, the general description of “a person reporting the feeling of an impairment of the cognitive function” is named “subjective cognitive impairment” or “subjective cognitive decline,” or “subjective memory complains,” or “functional memory disorder” etc., and robust diagnostic criteria are not yet available (Stewart, 2012; Burmester et al., 2016), although a specific working group proposed research criteria (Jessen et al., 2014). Moreover, extensive research over the past decades in the dementia field, have recognized “an intermediate state of cognitive function between the changes seen in aging and those fulfilling the criteria for dementia and often Alzheimer disease (AD),” named Mild Cognitive Impairment (MCI, Petersen, 2011). From a clinical point of view, MCI has been then categorized in two major subtypes, i.e., amnestic MCI (aMCI) and non-amnesic MCI (naMCI), each of them including one (single) or more (multiple) cognitive domains (Petersen et al., 2014), which might or not evolve in dementia. When evolving as dementia, MCI is preceded by a very long biological history of the disease, as suggested by longitudinal models of the alteration of AD biomarkers including Ab42 and tau in the cerebrospinal fluid (CSF), amyloid deposition at PET, MRI alterations and FDG PET abnormalities (Selkoe and Hardy, 2016). This leads to the identification of new entities to be considered as research criteria, referred as “prodromal AD” by the International Working Group-2 (IWG-2; Dubois et al., 2014) and “MCI due to AD” by the AD group at the National Institute of Aging-Alzheimer Association (NIA-AA; Albert et al., 2011). This preclinical period could offer a window of opportunity for drug development, risk assessment, and prevention (Calzà et al., 2015; Epelbaum et al., 2017; Ritchie et al., 2017).

Overall these studies addressed research attention on the feasibility of detecting early cognitive changes, and several initiatives and researches are in progress, focusing on identifying the best predictive among the available cognitive tests (Mortamais et al., 2017). Memory is probably the most investigated domain. Episodic memory functioning seems to be a robust predictor of dementia in prospective studies based on in vivo amyloid imaging (Bäckman et al., 2005; Hedden et al., 2013). Some aspects of language have also been the subject of growing interest, and most of these studies focused on verbal ability, verbal learning and memory, naming, category or letter verbal fluency, verbal episodic memory, etc. The evaluation of the linguistic functions is usually performed by means of traditional pencil-and-paper or corresponding computer-assisted tests (Ostberg et al., 2005; Duong et al., 2006; Cuetos et al., 2009; Joubert et al., 2010; Pakhomov et al., 2012). Composite scores exploring both memory and language have also been proposed, such as the Alzheimer's Disease Cooperative Study Preclinical Alzheimer Cognitive Composite (ADCS-PACC) (Donohue et al., 2014) and the Alzheimer prevention initiative (API) composite score (Langbaum et al., 2014). The API score is composed of seven test scores, i.e., category fluency—fruits and vegetables—, Boston naming test, Logical Memory-delayed recall, east Boston naming test immediate recall, Ravens progressive matrices subset, symbol digit modalities, and the Mini-Mental State Examination (MMSE) orientation to time items. Composite scores are now being used as primary end-point in secondary prevention trials in AD involving presenilin 1 E280A mutation carriers (API trial, Ayutyanont et al., 2014) or in anti-amyloid treatments in asymptomatic individuals that show early amyloid accumulation (Donohue et al., 2014).

While sometimes significant differences between the MCI and normal elderly participants have been recognized by these tests, the range of variation of the scores in MCI often overlaps with that of normal people, making their clinical use unreliable in categorizing individual participants (Taler and Phillips, 2008). Even more confused results emerged from the few studies on Subjective Cognitive Complaints (Martins et al., 2012).

New perspectives are being opened up by the interest toward computerized analysis of spoken language (Natural Language Processing techniques), together with the availability of numerous algorithms for analysis and classification of “speech.” The experience gained in the “Electronic linguistic corpora” studies, chosen by virtue of their representativeness in characterizing a particular language or linguistic variety, are now opening up new perspectives for language analysis in clinical contexts, also considering that these approaches might quantify many aspects of language, both at the segmental and suprasegmental level, such as prosody and rhythm, that are not explored by conventional language tests.

When applied to “pathological language” (i.e., linguistic productions of subjects affected by a developmental or acquired speech and language disorder), this approach and related technologies would also have the significant advantage of representing a natural and spontaneous language record, outside the diagnostic set-up of the conventional neuropsychological test of language, potentially applicable to large sections of the population using low-cost tools.

With this connection established, we intended to investigate whether the analysis of the spontaneous speech performed by Natural Language Processing techniques could reveal alterations of the language performance in early cognitive decline. This proof-of-concept study analyzed, by using the Natural Language Processing techniques, the spontaneous speech used by the participants to answer to three specific tasks, i.e., the description of a drawing, details of a last dream and the description of a working day. The study included 96 participants, divided into a control group (CG, N = 48) and three pathological groups (PG), e.g., amnestic MCI (aMCI, N = 16), multiple domain MCI (mdMCI, N = 16) and early dementia (eD, N = 16).

Materials and Methods

Participants

The study was approved by the Ethical Committee of Azienda Ospedaliera Reggio Emilia (n. 2013/0013438). We enrolled 96 participants (48 males; 48 females) between the ages of 50 and 75 (according to the age range of people admitted to our outpatient clinical service) and with at least a junior high school certificate (8 years of education) or primary school certificate (5 years of education) with high intellectual interests throughout the life span. All of them provided informed and written consent. The sample was composed of a Control Group (CG) and a Pathological Group (PG). The CG included 48 participants. The PG included 48 participants from two outpatient clinical services involved in care and diagnostic evaluation of cognitive disorders and dementia. Inclusion criteria are outlined in Table 1.

TABLE 1

Table 1. Inclusion criteria for participant enrollment in control and pathological groups.

The PG refers to three categories: (i) amnestic Mild Cognitive Impairment (aMCI; 16 participants: 8 females, 8 males) characterized by an isolated memory deficit detected by standardized tests; (ii) multiple-domain MCI (mdMCI; 16 participants: 8 females, 8 males) where two or more cognitive abilities are affected (12 amnestic-md-MCI; 4 non-amnestic-md-MCI); (iii) early Dementia (e-D; 16 participants: 8 females, 8 males). Within this group, four sub-categories can be identified: probable Alzheimer Dementia (AD; six cases); Fronto-Temporal Dementia (FTD; one case of Primary Progressive Aphasia—PPA; one subject presenting the executive and behavioral variant; one case of semantic dementia); Mixed Dementia (MD; six cases) and Lewy Body Dementia (LBD; one case).

In the first two conditions (MCI), the cognitive changes are serious enough to be detected by neuropsychological assessment, but not so severe to interfere with everyday activities, as evaluated by the conventional scales (e.g., ADL -Activities of Daily Living-Petersen, 2004; Winblad et al., 2004); in the eD, the cognitive deficits partially interfere in everyday life (however, the Mini Mental State Examination score is equal or greater than 18).

Conventional Neuropsychological Evaluation

All the participants of the CG and PG were requested to complete the anamnestic interview (anagraphic data; occupation/retirement; children; familiarity with neurodegenerative pathologies; clinical history and pharmacotherapy), a neurological assessment and other medical examinations planned in the diagnostic work-up and the traditional cognitive battery aimed to evaluate several cognitive domains: logic, memory, attention, language, visuo-spatial, praxic, and executive functions.

The battery was composed of those tests which are most used in the clinical practice to assess cognitive decline (Velayudhan et al., 2014; Tsoi et al., 2015), with an Italian standardization and short administration time. In particular: Mini Mental State Examination (MMSE); Phonemic (PF) and Semantic (SF) verbal fluency tests; Clock Drawing Test (CDT), Montreal Cognitive Assessment (MoCA); General Practitioner Assessment of Cognition (GPCog). MMSE is a 30-point questionnaire providing measures of orientation, encoding (immediate memory), short-term memory as well as language functioning; Phonemic and Semantic Fluency test are commonly used verbal tasks in which the subject is requested to produce, in 1 min, words relevant to a given category; they allow a quick evaluation of the lexical access, whose functioning can be compromised early on in neurodegenerative disorders (Auriacombe et al., 2006; Clark et al., 2009). The verbal fluency test is included in the ACE-R (Mioshi et al., 2006), which is indicated as an accurate tool to identify MCI and early dementia. CDT is a brief cognitive task which mainly assesses praxis and planning abilities. It is very accurate in detecting dementia, whilst remains unclear in its accuracy to recognize MCI (Lee et al., 2011); MoCA is a 30-point test which assesses several cognitive domains such a verbal memory, visuospatial and executive abilities, attention and language; GP-Cog is a brief screening tool for cognitive impairment, designed for general practitioners and primary care physicians. It is composed of two different parts: the patient assessment (registration and recall of verbal information, temporal orientation, visuospatial abilities, and language); a caregiver interview which has to be submitted only when the scoring of the first one (range 0–9) is lower than 9. For this reason, in this study we considered and reported only the first part.

In order to compare the cognitive performances of the different groups (CG vs. aMCI vs. mdMCI vs. eD), we corrected the raw scores of the different cognitive tests (MMSE, MoCA, PF, SF) for age and education, as indicated in the respective standardization procedure (SF, Novelli et al., 1986; MMSE, Measso et al., 1993; PF, Carlesimo et al., 1995; MoCA, Conti et al., 2015), thus obtaining a standardized score, as it is usually done in clinical practice. For those (CDT, GP-Cog), which lack correction, we used the original raw scores. For each test the raw scores were transformed into adjusted scores (with correction for age and education) and classified as Equivalent Scores (ES), a 5-point scale that offers a solution to the problem of standardizing neuropsychological scores after adjustment for age and education. ES = 0 reflects a pathological performance, ES > 0 (1–4) means performance in the range of normality. Subjects with ES = 0 in the single memory domain or in more different cognitive areas, autonomous in the activities of daily living, MMSE score ≥18, were respectively classified as aMCI or mdMCI. Subjects with ES = 0 in more than one test, with need of support in one or more activities of daily living, MMSE score ≥18, were classified as e-D. Subjects with MMSE < 18 were discarded.

Speech Analysis

All participants were required to record their spontaneous speech during the execution of three tasks, elicited by these input sentences: “Could you please describe this picture?” (the picture illustrated a living room with some characters carrying out certain actions; Ciurli et al., 1996); “Could you please describe a typical working day?”; “Could you please describe the last dream you remember?” Spontaneous speech samples were recorded in a quiet room with an Olympus—Linear PCM Recorder LS-5 (in WAV files; 44.1 KHz, 16 bit) placed on a table in front of the subject.

Speech samples were collected during test sessions in the form of audio files and were manually transcribed in order to produce the dialogue orthographic transcription through the use of the Transcriber software package (http://trans.sourceforge.net). We chose the “utterance” as the reference unit in the speech continuum, defined as the counterpart of a speech act (Finegan, 2011). The latter can be considered as “the minimal linguistic entity that can be pragmatically interpreted.” Utterances are demarcated by prosody in the speech flow, therefore the identification of their boundaries is achieved through the detection of “prosodic breaks.” As a matter of fact, a large body of evidence suggests that the perception of prosodic breaks is a function of the simultaneous activation of some acoustic cues (e.g., F0 reset, final lengthening, drop in intensity, pause, initial rush in the next prosodic unit) reaching high inter-rater agreement in annotation. The manual detection of such breaks has been achieved by listening the utterances and splitting the entire dialogues into turns. In addition, during the transcription process, a series of paralinguistic phenomena such as empty and filled pauses (e.g., “mmh,” “eeh,” “ehm”), disfluencies (e.g., hesitation, stuttering, false start, lapsus…) and non-verbal phenomena (e.g., inspirations, laughs, coughing fits, throat clearing) were annotated inserting also temporal information. In order to consistently transcribe subjects' productions, we developed strict transcription guidelines imposing the transcribers to comply to the designed transcription protocol. This ensured consistency among them during the entire project.

The word turns of the test subjects were isolated, removing all speech fragments containing speech produced by the interviewer, and the selected utterances underwent automatic morphological and syntactic annotation. More precisely, they were Part-of-Speech tagged and syntactically parsed with the dependency model used by the Turin University Linguistic Environment—TULE (Lesmo, 2007), based on the TUT–Turin University TreeBank tagset (Bosco et al., 2000) in order to explicit all the morphological, syntactic and lexical information. Afterwards, all the automatically inserted annotations were manually checked to remove all the errors introduced by the annotation procedures.

A multidimensional parameter computation was performed on the data, evaluating rhythmic, acoustic, lexical, and morpho-syntactic quantitative features of the spoken utterances.

The Table in Supplementary Material outlines the complete list of the linguistic/stylometric cues considered in the study: indexes proposed in the literature as statistically relevant on languages other than Italian. Moreover, some new parameters have been tested. More precisely, we checked in the cited literature for linguistic features that, in the referred study, were able to successfully discriminate in a statistically significant way between controls and MCI or eD subjects. All these linguistic features have been considered for this study and proper computational tools for computing them have been developed and carefully tested.

With regard to the parameters derived from the speech acoustics, we used the “ssvad” Voice Activity Detector (Mak and Yu, 2014), especially developed for interview speech, to segment the recordings and identify speech vs. non-speech regions. Moreover, we used a forced alignment system developed for this study by using the Kaldi-DNN-ASR (http://kaldi.sourceforge.net/about.html) package trained on the APASCI Italian Corpus (Angelini et al., 1994), for obtaining the temporally aligned phonetic transcriptions needed to compute various rhythmic and acoustic features.

Data Availability and Statistical Analysis

All data are stored in anonymous form at the experimentation sites. Due to the Italian privacy policy, data (recordings, transcriptions, anagraphic, and clinical data, etc.) are protected and availability restricted to the project participants. Demographic variables are presented in Table 2, as mean ± SD Results of the neuropsychological tests are presented as median and interquartile range. Because of the small sample size, the non-parametric Mann-Whitney U-test was used to compare performances in neuropsychological tests and language between two groups of subjects (eD vs. CG), and the non-parametric Kruskal-Wallis test with Dunn's multiple comparison having CG as control, when significant, was used to compare aMCI and mdMCI vs. CG. A probability level of P < 0.05 was considered to be statistically significant. All statistical tests were two-sided. The Prism version 6.0 (GraphPad, La Jolla, CA) was used for the statistical analysis.

TABLE 2

Table 2. Level of education and demographic characteristics of participants.

Results

Participants and Neuropsychological Tests

The focus of the study was the analysis of spontaneous speech in MCI subgroups, compared to healthy CG. We included aMCI and mdMCI, being the memory domain the isolated defect in aMCI, while other cognitive domains are affected in mdMCI. This would identify the best candidate to reveal subclinical disorders in speech production analysis. The eD group was introduced as frankly pathological control.

Age and level of education of enrolled participants in the CG and the PG (aMCI, mdMCI, eD) are reported in Table 2. The statistical analysis (the non-parametric Kruskal-Wallis tests with Dunn's multiple comparison) indicated that no age differences were observed between the subgroups, while the level of education of the eD group is significantly lower compared with the CG (p-value: 0.0171).

Results of the conventional neuropsychological examination are summarized in Figure 1. MCI subgroups and eD group were separately compared to the CG, by the Kruskal-Wallis tests followed by multiple comparison, and Mann-Whitney U-test, respectively. As expected, the mean scores of the CG result significantly higher than those obtained by the eD in all tests. GP-Cog is the best test that can discriminate between controls and aMCI. This result is probably due to the fact that aMCI is defined as an isolated memory deficit, and GP-Cog is mainly based on memory performance. MoCA, PF, and SF recognize both aMCI and mdMCI, even with a substantial greater statistical power in mdMCI. MMSE and CDT can discriminate between CG and mdMCI, but they cannot discriminate CG from aMCI.

FIGURE 1

Figure 1. Results of the conventional neuropsycological test performed at the enrolment of the study. The graphs report median and interquartile range. The statistical analysis was performed by the non-parametric Kruskal-Wallis test with Dunn's multiple comparison having CG as control, where *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. MMSE, Mini Mental State Examination; MoCA, Montreal Cognitive Assessment; CDT, Clock Drawing Test; GPCog-a, General Practitioner Assessment of Cognition; PF, Phonemic verbal fluency tests; SF, Semantic verbal fluency tests.

Spontaneous Speech Analysis

The natural language in CG and PG was explored by three different tasks, e.g., two narrative tasks elicited by specific questions (“Could you please describe a typical working day?” “Could you please describe the last dream you remember?”), and a descriptive task using a visual stimulus (a picture drawing of a living room).

For each audio file, the acoustic, lexical, rhythmic and syntactic linguistic features were extracted and analyzed. Due to the file's poor audio quality, 4 subjects belonging to the CG group were excluded from the analysis. The complete list of investigated features, the respective explanations and related references are reported in Supplementary Materials.

The statistical analysis was performed by comparing aMCI and mdMCI vs. CG by the non-parametric Kruskal-Wallis test, and eD vs. CG by the non-parametric Mann-Whitney U-test. Results are presented in Table 3, for 4 groups of features referred to the lexical (A), rhythmic (B), acoustic (C), and syntactic (D) features of the speech. The significant p-value is indicated for the corresponding feature and the corresponding speech task (figure, working day, dream).

TABLE 3

Table 3. The table reports the results of the spontaneous speech analysis.

The acoustic is the language category more altered in the PG. The acoustic parameters investigate temporal features, quantifying speech rate and pauses in the signal (e.g., silence segments duration, speech segments duration, verbal rate, transformed phonation rate, standardized phonation time, and standardized pause rate) as well as some spectral properties of the voice. Most of the considered acoustic features are mainly affected in the PGs compared to the CG group. Notably, these features seem to be able to distinguish, not only eD and mdMCI, but also aMCI from the CG.

On the contrary, the rhythmic parameters that quantify changes in speech rhythm and in utterance composition in terms of vowel and consonant alternation, are poorly modified during the cognitive decline, with the notable exception of the dream description task in both aMCI and mdMCI.

Lexical parameters, that attempt to gauge lexical knowledge and retrieval in spontaneous speech, are substantially altered in eD; among these features content density (i.e., the ratio of open-class words to closed-class words) is consistently reduced, especially for the picture description task, in both a- and mdMCI. In general, some features (namely LEX_ContDens, LEX_PoS_ADJ and the lexical richness parameters), indicating a global impoverishment of the speech production from the lexical point of view, resulted significant in distinguishing PG from CG. Reducing the amount of content words as well as using less modifiers (e.g., adjectives), results in poorer productions, even if the sentences are still correct from a formal point of view.

The syntactic features are designed to measure the degree of utterance structural complexity, and most of the investigated parameters are altered in the picture task in early AD subjects. Moreover, some syntactic features such as utterance length are altered not only in eD, but also in aMCI and mdMCI, using the “working day” and “dream” as speech tasks, which require the construction of a complete and structured narration, showing a general simplification. As a matter of fact, syntactic structures produced by PG, despite being grammatically correct and coherent, contains less complex relations among phrases and fewer embedded structures. Notably, narrative tasks, involving more than merely the naming of persons and objects as in the figure task, typically realized by both CG and PG in the form of coordinative structure or simple lists, are sensitive, with regard to syntactic features, in discriminating also aMCI.

Discussion

The early recognition of cognitive decline is widely shared goal in the aging global population, and the focus is rapidly moving from defined clinical entities, such as MCI, to pre-clinical or asymptomatically stages in a general and still poorly defined frame addressed as “cognitive frailty” (Calzà et al., 2015). Specifically, early recognition helps to diagnose early dementia; identify dementia in at-risk individuals; design preventive clinical trials; identify reversible cognitive deficit in systemic diseases–metabolic, renal, cardiovascular, etc. – in depression or inappropriate pharmacological regimens; for secondary and tertiary prevention; and to define more appropriate health and social policies (Sugimoto et al., 2018; Vella Azzopardi et al., 2018).

Language has a central role among the cognitive domains that may reveal early signs of decline, becoming an established topic of research and clinical monitoring of AD progression (reviewed by Bucks et al., 2000; Kempler and Goral, 2008; Taler and Phillips, 2008; Shafto and Tyler, 2014; Szatloczki et al., 2015). Extensive literature on the use of traditional tests for the language assessment, especially with lexical and semantic access tasks, provides evidence that the lexico-semantic system is already affected in the initial stages of the disease, and patients have difficulties in tasks such as picture naming (Jacobson et al., 2002) and phonemic and semantic verbal fluency (Marczinski and Kertesz, 2006; Rascovsky et al., 2007). On the contrary, the phonological, morphological and syntactic systems are believed to be relatively preserved in the initial stages of the disease, as indicated by such tasks involving reading letters and words (Stilwell et al., 2016).

Studies dedicated to evaluate early language signs in prodromal or preclinical stages, such as MCI (Taler and Phillips, 2008; Drummond et al., 2015; Szatloczki et al., 2015; Hernández-Domínguez et al., 2018), reports inhomogeneous results. For example some authors described significant differences between controls and MCI in semantic fluency and naming tests (Ostberg et al., 2005; Duong et al., 2006; Radanovic et al., 2007; Cuetos et al., 2009; Joubert et al., 2010; Ahmed et al., 2013; Mueller et al., 2016), while others did not confirm differences in Boston Naming, semantic and phonemic verbal fluency (Bschor et al., 2001).

Moreover, the conventional neuropsychological language tests used in these studies fail to explore different levels of the cognitive network involved in complex linguistic activities (phonological, morphosyntactic, semantic-lexical, semantic-pragmatic). Thus, the spontaneous speech analysis is raising increasing interest in the neuropsychological research for the early detection of cognitive decline (Drummond et al., 2015; Aramaki et al., 2016; Pistono et al., 2016), also because of the high complexity of tasks that require not just lexical-semantic abilities, but also memory and executive functions. These novel approaches in language analysis may offer an opportunity to detect subclinical language changes, that may be present several years before the clinical phase of the disease and can be considered as one of the prodromal (or preclinical) manifestations of the disease.

The analysis at the discourse level is today possible by using the computational tools of NLP that allow the automatic detection of acoustic, lexical, semantic, syntactic and pragmatic parameters (Roark et al., 2011; Satt et al., 2013; König et al., 2015), thus leading to, a quantitative description and analysis of speech elicited by visual stimuli (“please describe this picture”), or by episodic memory (“please describe your last dream”). This approach has been also applied to the DementiaBank corpus, including narrative samples from 167 patients with “possible” or “probable” AD. By using two machine-learning classifiers, four factors distinguished AD vs. control narrative samples: semantic impairment, acoustic abnormality, syntactic impairment, and information impairment (Fraser et al., 2016). Other studies in small cohorts of AD patients (mild, moderate, and severe) have indicated alterations in articulation rate, speech tempo, hesitation ratio, and rate of grammatical errors (Hoffmann et al., 2010); and in acoustic measurements, such as pitch level, pitch modulation, and speaking rate (Horley et al., 2010). However, it should be noted that the potentiality of spontaneous speech analysis is poor in AD patients, even at early stages of the disease, due to the already severe alteration of the language performance.

Thus, increasing interest is directed toward subjects with subjective cognitive impairment or subjective memory complaints (Cuetos et al., 2009), a condition that could be a preclinical phase of the MCI condition (Jessen et al., 2014; Eichler et al., 2015; Mendonça et al., 2016), and MCI. In this proof-of-concept study we used the NLP tools for the analysis of the spontaneous discourse in early cognitive decline (aMCI and mdMCI) and in early Alzheimer disease (eD), included in the study as “positive control.” According to other studies, MCI group's results could represent an intermediate stage between CG and eD (Drummond et al., 2015). However, we demonstrated that aspects of the language not considered in conventional neuropsychological tests are deeply affected in MCI compared to CG. In particular, the acoustic features of language–e.g., pause duration, speech segment duration, and phonation rate-conveying linguistic and paralinguistic information such as illocution, modality, emphasis, attitude, and emotion (Finegan, 2011) seem to be sensitive markers of early cognitive decline, also distinguishing amnestic from multiple domain MCI. Notably, pause alterations during autobiographic discourse collected by the EPITOUL ecological task (exploring the episodic memory test) has been also described in MCI by others (Pistono et al., 2016). Consistent with previous scientific literature, the deterioration of verbal fluency, lexical retrieval process and discourse planning may result in longer hesitations, increased pauses and lower phonation rate. These acoustic features may discriminate between control groups and aMCI (König et al., 2015).

On the contrary, the speech rhythm seems to be rather preserved in the PGs included in our study, while in a study including eAD patients (MMSE > 24), a high variability of syllabic interval was reported (Martínez-Sánchez et al., 2017).

A number of studies have already demonstrated that lexical-semantic system is often impaired in MCI and dementia: our results confirm the finding, showing that patient's linguistic productions are semantically impoverished. Moreover, even though the correctness of grammatical form is generally preserved, syntax shows to be overall simplified.

Our study provides strong evidence to the emerging, but still puzzling literature, supporting spontaneous speech analysis as a potential tool for early detection of cognitive decline. The need to make these evaluation tools applicable on a large scale and at low cost, has prompted researchers to devise automated forms of analysis of collected samples of speech, recorded and manually transcribed according to appropriate coding systems, by software that can detect a series of acoustic and lexical variables, with detection of acoustic, lexical, semantic, syntactic, and pragmatic parameters (Thomas et al., 2005; Roark et al., 2011; Pakhomov et al., 2012; Satt et al., 2013). In these studies, speech was assessed through the collection of linguistic production samples obtained using various types of tasks.

An additional contribution from this study derived from the use of three different speech tasks. In spite of the fact that the description of a complex picture is the most widely used task (Goodglass et al., 2000; Bschor et al., 2001; Cuetos et al., 2009; König et al., 2015), we observed that the description of a “working day” and “the last dream” seem to be more sensitive tasks, probably because require memory recall and a more structured narration. Some other researchers have investigated different aspects of language as communication abilities (Toledo et al., 2018), reading comprehension (Hudon et al., 2006; Schmitter-Edgecombe and Creamer, 2010), the repetition of complex sentences (König et al., 2015; Lust et al., 2015) or the ability to recognize the grammatical correctness (Taler and Jarema, 2004). Other groups of researchers have applied the analysis of discourse on verbal productions recorded during the classic episodic memory tests as the Wechsler logical memory (Roark et al., 2007, 2011). The most ambitious studies have also used analytical tools applicable directly on the voice recordings of subjects (Meilán et al., 2014), showing a good correlation between the automated classification and that based on clinical and manual data processing (Roark et al., 2011; Satt et al., 2013; Hernández-Domínguez et al., 2018).

Conclusion

Results from this proof-of-concept study suggest that computerized speech analysis identifies alterations in MCI for language features not explored by conventional diagnostic neuropsychological tests, also including language tests such as phonological and lexical fluency. Numerous acoustic features can distinguish between healthy controls and aMCI subjects, and lexical, rhythmic, and syntactic features may be also relevant, depending on the type of language task evaluated. While longitudinal studies (in progress) are necessary to confirm this hypothesis, we suggest that (i) speech analysis should be included as exploratory end-point in AD prevention studies; (ii) speech analysis coupled to imaging studies in cognitive decline could provide new information for language neuroanatomy; (iii) computerized speech analysis should be considered also for the development of novel tests for preclinical AD, thus contributing to the research priorities (prevention and identification of AD risk) identified by the WHO (Ministerial Conference on Global Action against Dementia, Shah et al., 2016).

Author Contributions

DB performed neuropsychological testing and speech recording, GG performed language analysis, RR worked out the linguistic planning and performed the critic review of the language data, EG performed the clinical study design, patient enrolment and neuropsychological test, FT performed language analysis and critical review of the language data, LC ideated and designed the study, analyzed, performed statistical analysis, interpreted the data and drafted the manuscript. All authors have contributed, read and approved the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The reviewer GGE and handling Editor declared their shared affiliation.

Acknowledgments

This work was supported by the OPLON project (Opportunities for active and healthy LONgevity, Smart Cities, Ministero Università e Ricerca, SCN_00176) (to LC). The contribution of Amici di Casa Insieme (Mercato Saraceno, Forlì-Cesena, Italy) and IRET Foundation (Ozzano Emilia, Italy) is also acknowledged. The contribution of the project manager Roberta Torricella is gratefully acknowledged.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2018.00369/full#supplementary-material

References

Ahmed, S., Haigh, A. M. F., de Jager, C. A., and Garrard, P. (2013). Connected speech as a marker of disease progression in autopsy-proven Alzheimer's disease. Brain 136, 3727–3737. doi: 10.1093/brain/awt269

PubMed Abstract | CrossRef Full Text | Google Scholar

Albert, M. S., DeKosky, S. T., Dickson, D., Dubois, B., Feldman, H. H., Fox, N. C., et al. (2011). The diagnosis of mild cognitive impairment due to Alzheimer's disease: recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 7, 270–279. doi: 10.1016/j.jalz.2011.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Angelini, B., Brugnara, F., Falavigna, D., Giuliani, D., Gretter, R., and Omologo, M. (1994). “Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus,” in Proceedings of ICSLP 94 (Yokohama), 1391–1394.

Google Scholar

Aramaki, E., Shikata, S., Miyabe, M., Usuda, Y., Asada, K., Ayaya, S., et al. (2016). Understanding the relationship between social cognition and word difficulty. A language based analysis of individuals with autism spectrum disorder. Methods Inf Med. 54, 522–529. doi: 10.3414/ME15-01-0038

PubMed Abstract | CrossRef Full Text

Auriacombe, S., Lechevallier, N., Amieva, H., Harston, S., Raoux, N., and Dartigues, J. F. (2006). A longitudinal study of quantitative and qualitative features of category verbal fluency in incident Alzheimer's disease subjects: results from the PAQUID study. Dement. Geriatr. Cogn. Disord. 21, 260–266. doi: 10.1159/000091407

PubMed Abstract | CrossRef Full Text | Google Scholar

Ayutyanont, N., Langbaum, J. B., Hendrix, S. B., Chen, K., Fleisher, A. S., Friesenhahn, M., et al. (2014). The Alzheimer's prevention initiative composite cognitive test score: sample size estimates for the evaluation of preclinical Alzheimer's disease treatments in presenilin 1 E280A mutation carriers. J. Clin. Psychiatry 75, 652–660. doi: 10.4088/JCP.13m08927

PubMed Abstract | CrossRef Full Text | Google Scholar

Bäckman, L., Jones, S., Berger, A. K., Laukka, E. J., and Small, B. J. (2005). Cognitive impairment in preclinical Alzheimer's disease: a meta-analysis. Neuropsychology 19, 520–531. doi: 10.1037/0894-4105.19.4.520

PubMed Abstract | CrossRef Full Text | Google Scholar

Bosco, C., Lombardo, V., Vassallo, D., and Lesmo, L. (2000). “Building a Treebank for Italian: a data-driven annotation schema,” in Proceedings of LREC-2000 (Athens).

Google Scholar

Bschor, T., Kühl, K. P., and Reischies, F. M. (2001). Spontaneous speech of patients with dementia of the Alzheimer type and mild cognitive impairment. Int. Psychogeriatr. 13, 289–298. doi: 10.1017/S1041610201007682

PubMed Abstract | CrossRef Full Text | Google Scholar

Bucks, R. S., Singh, S., Cuerden, J. M., and Wilcock, G. K. (2000). Analysis of spontaneous, conversational speech in dementia of Alzheimer type: evaluation of an objective technique for analysing lexical performance. Aphasiology 14, 71–91. doi: 10.1080/026870300401603

CrossRef Full Text | Google Scholar

Burmester, B., Leathem, J., and Merrick, P. (2016). Subjective cognitive complaints and objective cognitive function in aging: a systematic review and meta-analysis of recent cross-sectional findings. Neuropsychol. Rev. 26, 376–393. doi: 10.1007/s11065-016-9332-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Calzà, L., Beltrami, D., Gagliardi, G., Ghidoni, E., Marcello, N., Rossini-Favretti, R., et al. (2015). Should we screen for cognitive decline and dementia? Maturitas 82, 28–35. doi: 10.1016/j.maturitas.2015.05.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Carlesimo, G. A., Caltagirone, C., Nocentini, U., Fadda, L., Marfia, G., Gainotti, G., et al. (1995). Batteria per la valutazione del deterioramento mentale: standardizzazione ed affidabilità diagnostica nell'identificazione di pazienti affetti da sindrome demenziale. Archivio di Psicologia Neurologia e Psichiatria 56, 471–488.

Google Scholar

Ciurli, P., Marangolo, P., and Basso, A. (1996). Esame del Linguaggio II. Firenze: Giunti Organizzazioni Speciali.

Clark, L. J., Gatz, M., Zheng, L., Chen, Y. L., McCleary, C., and Mack, W. J. (2009). Longitudinal verbal fluency in normal aging, preclinical, and prevalent Alzheimer's disease. Am. J. Alzheimers. Dis. Other Demen. 24, 461–468. doi: 10.1177/1533317509345154

PubMed Abstract | CrossRef Full Text | Google Scholar

Conti, S., Bonazzi, S., Laiacona, M., Masina, M., and Vanelli Coralli, M. (2015). Montreal Cognitive Assessment (MoCA) – Italian version: regression-based norms and equivalent scores. Neurol. Sci. 26, 209–214. doi: 10.1007/s10072-014-1921-3

CrossRef Full Text | Google Scholar

Cuetos, F., Rodríguez-Ferreiro, J., and Menéndez, M. (2009). Semantic markers in the diagnosis of neurodegenerative dementias. Dement. Geriatr. Cogn. Disord. 28, 267–274. doi: 10.1159/000242438

PubMed Abstract | CrossRef Full Text | Google Scholar

Donohue, M. C., Sperling, R. A., Salmon, D. P., Rentz, D. M., Raman, R., Thomas, R. G., et al. (2014). Australian imaging, biomarkers, and lifestyle flagship study of ageing; Alzheimer's disease neuroimaging initiative; Alzheimer's disease cooperative study. The preclinical Alzheimer cognitive composite: measuring amyloid-related decline. JAMA Neurol. 71, 961–970. doi: 10.1001/jamaneurol.2014.803

CrossRef Full Text | Google Scholar

Drummond, C., Coutinho, G., Fonseca, R. P., Assunção, N., Teldeschi, A., de Oliveira-Souza, R., et al. (2015). Deficits in narrative discourse elicited by visual stimuli are already present in patients with mild cognitive impairment. Front. Aging Neurosci. 7:96. doi: 10.3389/fnagi.2015.00096

PubMed Abstract | CrossRef Full Text | Google Scholar

Dubois, B., Feldman, H. H., Jacova, C., Hampel, H., Molinuevo, J. L., Blennow, K., et al. (2014). Advancing research diagnostic criteria for Alzheimer's disease: the IWG-2 criteria. Lancet Neurol. 13, 614–629. doi: 10.1016/S1474-4422(14)70090-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Duong, A., Whitehead, V., Hanratty, K., and Chertkow, H. (2006). The nature of lexico-semantic processing deficits in mild cognitive impairment. Neuropsychologia 44, 1928–1935. doi: 10.1016/j.neuropsychologia.2006.01.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Eichler, T., Thyrian, J. R., Hertel, J., Wucherer, D., Michalowsky, B., Reiner, K., et al. (2015). Subjective memory impairment: no suitable criteria for case-finding of dementia in primary care. Alzheimers Dement. 1, 179–186. doi: 10.1016/j.dadm.2015.02.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Epelbaum, S., Genthon, R., Cavedo, E., Habert, M. O., Lamari, F., Gagliardi, G., et al. (2017). Preclinical Alzheimer's disease: a systematic review of the cohorts underlying the concept. Alzheimers Dement. 13, 454–467. doi: 10.1016/j.jalz.2016.12.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Finegan, E. (2011). Language. Its Structure and Use. Boston, MA: Wadsworth.

Google Scholar

Fraser, K. C., Meltzer, J. A., and Rudzicz, F. (2016). Linguistic features identify Alzheimer's disease in narrative speech. J. Alzheimers Dis. 49, 407–422. doi: 10.3233/JAD-150520

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodglass, H., Lindfield, K. C., and Alexander, M. P. (2000). Semantic capacities of the right hemisphere as seen in two cases of pure word blindness. J. Psycholinguist. Res. 29, 399–422. doi: 10.1023/A:1005155228509

PubMed Abstract | CrossRef Full Text | Google Scholar

Hedden, T., Oh, H., Younger, A. P., and Patel, T. A. (2013). Meta-analysis of amyloid-cognition relations in cognitively normal older adults. Neurology 80, 1341–1348. doi: 10.1212/WNL.0b013e31828ab35d

PubMed Abstract | CrossRef Full Text | Google Scholar

Hernández-Domínguez, L., Ratté, S., Sierra-Martínez, G., and Roche-Bergua, A. (2018). Computer-based evaluation of Alzheimer's disease and mild cognitive impairment patients during a picture description task. Alzheimer's Dement. 10, 260–268. doi: 10.1016/j.dadm.2018.02.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoffmann, I., Nemeth, D., Dye, C. D., Pákáski, M., Irinyi, T., and Kálmán, J. (2010). Temporal parameters of spontaneous speech in Alzheimer's disease. Int. J. Speech Lang. Pathol. 12, 29–34. doi: 10.3109/17549500903137256

PubMed Abstract | CrossRef Full Text | Google Scholar

Horley, K., Reid, A., and Burnham, D. (2010). Emotional prosody perception and production in dementia of the Alzheimer's type. J. Speech Lang. Hear. Res. 53, 1132–1146. doi: 10.1044/1092-4388(2010/09-0030)

PubMed Abstract | CrossRef Full Text | Google Scholar

Hudon, C., Belleville, S., Souchay, C., Gély-Nargeot, M. C., Chertkow, H., and Gauthier, S. (2006). Memory for gist and detail information in Alzheimer's disease and mild cognitive impairment. Neuropsychology 20, 566–577. doi: 10.1037/0894-4105.20.5.566

PubMed Abstract | CrossRef Full Text | Google Scholar

Jacobson, M. W., Delis, D. C., Bondi, M. W., and Salmon, D. P. (2002). Do neuropsychological tests detect preclinical Alzheimer's disease: individual-test versus cognitive-discrepancy score analyses. Neuropsychology 16, 132–139. doi: 10.1037/0894-4105.16.2.132

PubMed Abstract | CrossRef Full Text | Google Scholar

Jessen, F., Amariglio, R. E., van Boxtel, M., Breteler, M., Ceccaldi, M., Chételat, G., et al. (2014). A conceptual framework for research on subjective cognitive decline in preclinical Alzheimer's disease. Alzheimers Dement. 10, 844–852. doi: 10.1016/j.jalz.2014.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Joubert, S., Brambati, S. M., Ansado, J., Barbeau, E. J., Felician, O., Didic, M., et al. (2010). The cognitive and neural expression of semantic memory impairment in mild cognitive impairment and early Alzheimer's disease. Neuropsychologia 48, 978–988. doi: 10.1016/j.neuropsychologia.2009.11.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Kempler, D., and Goral, M. (2008). Language and dementia: neuropsychological aspects. Annu. Rev. Appl. Linguist. 28, 73–90. doi: 10.1017/S0267190508080045

PubMed Abstract | CrossRef Full Text | Google Scholar

König, A., Satt, A., Sorin, A., Hoory, R., Toledo-Ronen, O., Derreumaux, A., et al. (2015). Automatic speech analysis for the assessment of patients with predementia and Alzheimer's disease. Alzheimers Dement. 1, 112–124. doi: 10.1016/j.dadm.2014.11.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Langbaum, J. B., Hendrix, S. B., Ayutyanont, N., Chen, K., Fleisher, A. S., Shah, R. C., et al. (2014). An empirically derived composite cognitive test score with improved power to track and evaluate treatments for preclinical Alzheimer's disease. Alzheimers Dement. 10, 666–674. doi: 10.1016/j.jalz.2014.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, J. H., Oh, E. S., Jeong, S. H., Sohn, E. H., Lee, T. Y., and Lee, A. Y. (2011). Longitudinal changes in clock drawing test (CDT) performance according to dementia subtypes and severity. Arch. Gerontol. Geriatr. 53, 179–182. doi: 10.1016/j.archger.2010.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Lesmo, L. (2007). “Il parser basato su regole del Gruppo NLP dell'Università di Torino,” in Intelligenza Artificiale, IV, 46–47.

Google Scholar

Lust, B., Flynn, S., Cohen Sherman, J., Gair, J., Henderson, C. R. Jr., Cordella, C., et al. (2015). Reversing ribot: does regression hold in language of prodromal Alzheimer's disease? Brain Lang. 143, 1–10. doi: 10.1016/j.bandl.2015.01.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Mak, M. W., and Yu, H. B. (2014). A study of voice activity detection techniques for NIST speaker recognition evaluations. Comput. Speech Lang. 28, 295–313 doi: 10.1016/j.csl.2013.07.003

CrossRef Full Text | Google Scholar

Marczinski, C. A., and Kertesz, A. (2006). Category and letter fluency in semantic dementia, primary progressive aphasia, and Alzheimer's disease. Brain Lang. 97, 258–265. doi: 10.1016/j.bandl.2005.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Martínez-Sánchez, F., Meilán, J. J. G., Vera-Ferrandiz, J. A., Carro, J., Pujante-Valverde, I. M., Ivanova, O., et al. (2017). Speech rhythm alterations in Spanish-speaking individuals with Alzheimer's disease. Neuropsychol. Dev. Cogn. B Aging Neuropsychol. Cogn. 24, 418–434. doi: 10.1080/13825585.2016.1220487

PubMed Abstract | CrossRef Full Text | Google Scholar

Martins, I. P., Mares, I., and Stilwell, P. A. (2012). How subjective are subjective language complaints. Eur. J. Neurol. 19, 666–671. doi: 10.1111/j.1468-1331.2011.03635.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Measso, G., Cavarzeran, F., Zappalà, G., Lebowitz, B. D., Crook, T. H., Pirozzolo, F. J., et al. (1993). The mini-mental state examination: normative study of an italian random sample. Dev. Neuropsychol. 9, 77–95. doi: 10.1080/87565649109540545

CrossRef Full Text | Google Scholar

Meilán, J. J., Martínez-Sánchez, F., Carro, J., López, D. E., Millian-Morell, L., and Arana, J. M. (2014). Speech in Alzheimer's disease: can temporal and acoustic parameters discriminate dementia? Dement. Geriatr. Cogn. Disord. 37, 327–334. doi: 10.1159/000356726

PubMed Abstract | CrossRef Full Text | Google Scholar

Mendonça, M. D., Alves, L., and Bugalho, P. (2016). From subjective cognitive complaints to dementia: who is at risk? A systematic review. Am. J. Alzheimers Dis. Other Demen. 31, 105–114. doi: 10.1177/1533317515592331

PubMed Abstract | CrossRef Full Text | Google Scholar

Mioshi, E., Dawson, K., Mitchell, J., Arnold, R., and Hodges, J. R. (2006). The Addenbrooke's Cognitive Examination Revised (ACE-R): a brief cognitive test battery for dementia screening. Int. J. Geriatr. Psychiatry 21, 1078–1085. doi: 10.1002/gps.1610

PubMed Abstract | CrossRef Full Text | Google Scholar

Mortamais, M., Ash, J. A., Harrison, J., Kaye, J., Kramer, J., Randolph, C., et al. (2017) Detecting cognitive changes in preclinical Alzheimer's disease: a review of its feasibility. Alzheimers Dement. 13, 468–492. doi: 10.1016/j.jalz.2016.06.2365

PubMed Abstract | CrossRef Full Text | Google Scholar

Mueller, K. D., Koscik, R. L., Turkstra, L. S., Riedeman, S. K., LaRue, A., Clark, L. R., et al. (2016). Connected language in late middle-aged adults at risk for Alzheimer's disease. J. Alzheimer's Dis. 54, 1539–1550. doi: 10.3233/JAD-160252

PubMed Abstract | CrossRef Full Text | Google Scholar

Novelli, G., Papagno, C., Capitani, E., Laiacona, M., Vallar, G., and Cappa, S. F. (1986). Tre test clinici di ricerca e produzione lessicale. Taratura su soggetti normali. Archivio di Psicologia Neurologia e Psichiatria 4, 477–506.

Google Scholar

Ostberg, P., Fernaeus, S. E., Hellström, K., Bogdanović, N., and Wahlund, L. O. (2005). Impaired verb fluency: a sign of mild cognitive impairment. Brain Lang. 95, 273–279. doi: 10.1016/j.bandl.2005.01.010

PubMed Abstract | CrossRef Full Text

Pakhomov, S. V., Hemmy, L. S., and Lim, K. O. (2012). Automated semantic indices related to cognitive function and rate of cognitive decline. Neuropsychologia 50, 2165–2175. doi: 10.1016/j.neuropsychologia.2012.05.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, R. C. (2004). Mild cognitive impairment as a diagnostic entity. J. Intern. Med. 256, 183–194. doi: 10.1111/j.1365-2796.2004.01388.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, R. C. (2011). Clinical practice. Mild cognitive impairment. N. Engl. J. Med. 364, 2227–2234. doi: 10.1056/NEJMcp0910237

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, R. C., Caracciolo, B., Brayne, C., Gauthier, S., Jelic, V., and Fratiglioni, L. (2014). Mild cognitive impairment: a concept in evolution. J. Intern. Med. 275, 214–228. doi: 10.1111/joim.12190

PubMed Abstract | CrossRef Full Text | Google Scholar

Pistono, A., Jucla, M., Barbeau, E. J., Saint-Aubert, L., Lemesle, B., Calvet, B., et al. (2016). Pauses during autobiographical discourse reflect episodic memory processes in early Alzheimer's disease. J. Alzheimers Dis. 50, 687–698. doi: 10.3233/JAD-150408

PubMed Abstract | CrossRef Full Text | Google Scholar

Radanovic, M., Carthery-Goulart, M. T., Charchat-Fichman, H., Herrera, E. Jr, Lima, E. E. P., Smid, J., et al. (2007). Analysis of brief language tests in the detection of cognitive decline and dementia. Dement Neuropsychol. 1, 37–45. doi: 10.1590/S1980-57642008DN10100007

PubMed Abstract | CrossRef Full Text | Google Scholar

Rascovsky, K., Salmon, D. P., Hansen, L. A., Thal, L. J., and Galasko, D. (2007). Disparate letter and semantic category fluency deficits in autopsy-confirmed frontotemporal dementia and Alzheimer's disease. Neuropsychology 21, 20–30. doi: 10.1037/0894-4105.21.1.20

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, K., Ropacki, M., Albala, B., Harrison, J., Kaye, J., Kramer, J., et al. (2017). Recommended cognitive outcomes in preclinical Alzheimer's disease: consensus statement from the European prevention of Alzheimer's dementia project. Alzheimers Dement. 13, 186–195. doi: 10.1016/j.jalz.2016.07.154

PubMed Abstract | CrossRef Full Text | Google Scholar

Roark, B., Mitchell, M., and Hollingshead, K. (2007). “Syntactic complexity measures for detecting mild cognitive impairment,” in Proceedings of the Workshop BioNLP 2007: Biological, Translational, and Clinical Language Processing, ACL - Association for Computational Linguistics, eds K. B. Cohen, D. Demner-Fushman, C. Frieman, L. Hirschman, J. Pestian (Prague), 1–8. doi: 10.3115/1572392.1572394

CrossRef Full Text | Google Scholar

Roark, B., Mitchell, M., Hosom, J. P., Hollingshead, K., and Kaye, J. A. (2011). Spoken language derived measures for detecting Mild Cognitive Impairment. IEEE Trans. Audio Speech Lang. Processing 19, 2081–2090. doi: 10.1109/TASL.2011.2112351

PubMed Abstract | CrossRef Full Text | Google Scholar

Satt, A., Sorin, A., Toledo-Ronen, O., Barkan, O., Kompatsiaris, I., Kokonozi, A., et al. (2013). “Evaluation of speech-based protocol for detection of early-stage dementia,” in Proceedings Interspeech (Lyon), 1692–1696.

Google Scholar

Schmitter-Edgecombe, M., and Creamer, S. (2010). Assessment of strategic processing during narrative comprehension in individuals with mild cognitive impairment. J. Int. Neuropsychol. Soc. 16, 661–671. doi: 10.1017/S1355617710000433

PubMed Abstract | CrossRef Full Text | Google Scholar

Selkoe, D. J., and Hardy, J. (2016). The amyloid hypothesis of Alzheimer's disease at 25 years. EMBO Mol. Med. 8, 595–608. doi: 10.15252/emmm.201606210

PubMed Abstract | CrossRef Full Text | Google Scholar

Shafto, M. A., and Tyler, L. K. (2014). Language in the aging brain: the network dynamics of cognitive decline and preservation. Science 346, 583–587. doi: 10.1126/science.1254404

PubMed Abstract | CrossRef Full Text | Google Scholar

Shah, H., Albanese, E., Duggan, C., Rudan, I., Langa, K. M., Carrillo, M. C., et al. (2016). Research priorities to reduce the global burden of dementia by 2025. Lancet Neurol. 15, 1285–1294. doi: 10.1016/S1474-4422(16)30235-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, S., Bucks, R. S., and Cuerden, J. M. (2001). An evaluation of an objective technique for analysing temporal variables in DAT spontaneous speech. Aphasiology 15, 571–583. doi: 10.1080/02687040143000041

CrossRef Full Text | Google Scholar

Stewart, R. (2012). Subjective cognitive impairment. Curr. Opin. Psychiatry 25, 445–450. doi: 10.1097/YCO.0b013e3283586fd8

PubMed Abstract | CrossRef Full Text | Google Scholar

Stilwell, B. L., Dow, R. M., Lamers, C., and Woods, R. T. (2016). Language changes in bilingual individuals with Alzheimer's disease. Int. J. Lang. Commun. Disord. 51, 113–127. doi: 10.1111/1460-6984.12190

PubMed Abstract | CrossRef Full Text | Google Scholar

Sugimoto, T., Sakurai, T., Ono, R., Kimura, A., Saji, N., Niida, S., et al. (2018). Epidemiological and clinical significance of cognitive frailty: a mini review. Ageing Res. Rev. 44, 1–7. doi: 10.1016/j.arr.2018.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Szatloczki, G., Hoffmann, I., Vincze, V., Kalman, J., and Pakaski, M. (2015). Speaking in Alzheimer's disease, is that an early sign? Importance of changes in language abilities in Alzheimer's disease. Front. Aging Neurosci. 7:195. doi: 10.3389/fnagi.2015.00195

PubMed Abstract | CrossRef Full Text | Google Scholar

Taler, V., and Jarema, G. (2004). Processing of mass/count information in Alzheimer's disease and mild cognitive impairment. Brain Lang. 90, 262–275. doi: 10.1016/S0093-934X(03)00439-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Taler, V., and Phillips, N. A. (2008). Language performance in Alzheimer's disease and mild cognitive impairment: a comparative review. J. Clin. Exp. Neuropsychol. 30, 501–556. doi: 10.1080/13803390701550128

PubMed Abstract | CrossRef Full Text | Google Scholar

Thomas, C., Keselj, V., Cercone, N., Rockwood, K., and Asp, E. (2005). “Automatic detection and rating of dementia of Alzheimer type through lexical analysis of spontaneous speech,” in Proceedings of the IEEE International Conference on Mechatronics and Automation, eds J. Gu, and P. X. Liu (Nova Scotia), 1569–1574. doi: 10.1109/ICMA.2005.1626789

CrossRef Full Text | Google Scholar

Toledo, C. M., Aluísio, S. M., Dos Santos, L. B., Brucki, S. M. D., très, E. S., de Oliveira, M. O., et al. (2018). Analysis of macrolinguistic aspects of narratives from individuals with Alzheimer's disease, mild cognitive impairment, and no cognitive impairment. Alzheimers Dement. 10, 31–40. doi: 10.1016/j.dadm.2017.08.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsoi, K. K., Chan, J. Y., Hirai, H. W., Wong, S. Y., and Kwok, T. C. (2015). Cognitive tests to detect dementia: a systematic review and meta-analysis. JAMA Intern. Med. 175, 1450–1458. doi: 10.1001/jamainternmed.2015.2152

PubMed Abstract | CrossRef Full Text | Google Scholar

Velayudhan, L., Ryu, S. H., Raczek, M., Philpot, M., Lindesay, J., Critchfield, M., et al. (2014). Review of brief cognitive tests for patients with suspected dementia. Int. Psychogeriatr. 26, 1247–1262. doi: 10.1017/S1041610214000416

PubMed Abstract | CrossRef Full Text | Google Scholar

Vella Azzopardi, R., Beyer, I., Vermeiren, S., Petrovic, M., Van Den Noortgate, N., Bautmans, I., et al. (2018). Gerontopole Brussels study group. Increasing use of cognitive measures in the operational definition of frailty-a systematic review. Ageing Res. Rev. 43, 10–16. doi: 10.1016/j.arr.2018.01.003

CrossRef Full Text | Google Scholar

Winblad, B., Palmer, K., Kivipelto, M., Jelic, V., Fratiglioni, L., Wahlund, L. O., et al. (2004). Mild cognitive impairment–beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive Impairment. J. Intern. Med. 256, 240–246. doi: 10.1111/j.1365-2796.2004.01380.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: cognitive decline, language, Natural Language Processing, preclinical Alzheimer, speech analysis, mild cognitive impairment

Citation: Beltrami D, Gagliardi G, Rossini Favretti R, Ghidoni E, Tamburini F and Calzà L (2018) Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline? Front. Aging Neurosci. 10:369. doi: 10.3389/fnagi.2018.00369

Received: 19 July 2018; Accepted: 24 October 2018;
Published: 13 November 2018.

Edited by:

Muthuraman Muthuraman, Universität Mainz, Germany

Reviewed by:

Gabriel Gonzalez-Escamilla, Universitätsmedizin Mainz, Germany
Dumitru Ciolac, Nicolae Testemitanu State University of Medicine and Pharmacy, Moldova

Copyright © 2018 Beltrami, Gagliardi, Rossini Favretti, Ghidoni, Tamburini and Calzà. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Laura Calzà, laura.calza@unibo.it

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.