Picture Interpretation Test (PIT) 360Â°: An Innovative Measure of Executive Functions

Serino, Silvia; Baglio, Francesca; Rossetto, Federica; Realdon, Olivia; Cipresso, Pietro; Parsons, Thomas D.; Cappellini, Giacomo; Mantovani, Fabrizia; De Leo, Gianluca; Nemni, Raffaello; Riva, Giuseppe

doi:10.1038/s41598-017-16121-x

Download PDF

Article
Open access
Published: 22 November 2017

Picture Interpretation Test (PIT) 360Â°: An Innovative Measure of Executive Functions

Silvia SerinoÂ ORCID: orcid.org/0000-0002-8422-1358^1,2,
Francesca Baglio³,
Federica Rossetto^2,3,
Olivia Realdon⁴,
Pietro CipressoÂ ORCID: orcid.org/0000-0002-0662-7678^1,2,
Thomas D. ParsonsÂ ORCID: orcid.org/0000-0003-0331-5019^5,6,
Giacomo Cappellini⁷,
Fabrizia Mantovani⁴,
Gianluca De Leo⁸,
Raffaello Nemni^3,9 &
â¦
Giuseppe Riva^1,2Â

Scientific Reports volumeÂ 7, ArticleÂ number:Â 16000 (2017) Cite this article

4425 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The assessment of executive functions poses researchers with several challenges related to both the complexity of the construct of executive functions itself and/or the methodological difficulties related to its evaluation. The main objective of the current study was to evaluate a 360Â° version of an ecologically valid assessment called the Picture Interpretation Test (PIT). Participants included 19 patients with Parkinsonâs disease (PD) and 19 healthy controls. All participants endorsed globally positive experiences of the PIT 360Â°. Furthermore, findings indicated that patients with PD took longer to correctly interpret the PIT 360Â° scene and tended to significantly focus on details of the 360Â° scene instead of the most informative elements. The time needed for a correct interpretation of the presented scene also correlated significantly with performance in conventional paper and pencil tests of executive functions for patients with PD. Classification analysis indicated the potential of the PIT 360Â° for distinguishing between patients with PD and healthy controls. Overall, these data provide preliminary evidence in support of the PIT 360Â° for evaluating executive functions.

Psychometric validation for a brand-new tool for the assessment of executive functions using 360Â° technology

Article Open access 27 May 2023

Unveiling Trail Making Test: visual and manual trajectories indexing multiple executive processes

Article Open access 22 August 2022

Introducing the tablet-based Oxford Cognitive Screen-Plus (OCS-Plus) as an assessment tool for subtle cognitive impairments

Article Open access 12 April 2021

Introduction

The assessment of executive functions poses researchers with several challenges related to both the complexity of the construct of executive functions itself (see for example¹) and/or the methodological difficulties related to its evaluation, specifically in predicting behaviors in real-life contexts^{2,3,4,5,6,7,8,9}. In term of complexity, Chan and co-workers¹⁰ have described executive functions as âan umbrella term comprising a wide range of cognitive processes and behavioral competencies which include verbal reasoning, problem-solving, planning, sequencing, the ability to sustain attention, resistance to interference, utilization of feedback, multitasking, cognitive flexibility, and the ability to deal with noveltyâ (pg. 201). Others have proposed four cognitive constructs: volition, planning, purposive action, and effective performance¹¹. Still others reduce this list to three cognitive constructs, namely inhibitory control, working memory, and cognitive flexibility^12,13,14.

In an attempt to refine executive functions assessment, Burgess et al.⁸ advanced the idea of developing neuropsychological assessments based on models derived from directly observable everyday behaviors. Such an approach allows for an examination of the ways in which a sequence of actions leads to a given behavior in normal functioning. This âfunction-ledâ approach differs from the emphasis on abstract cognitive âconstructsâ without regard for their ability to predict the complexity of âfunctionalâ behaviors found in real-life situations^2,3,4,5. Burgess and colleagues advanced the Multiple Errands test as a measure of executive functioning in real-life scenarios. While there are notable aspects of such naturalistic assessments, there are concerns about their many limitations in terms of time consumption, cost, poor control, and lack of safety¹⁵.

A potential alternative is the use of virtual reality (VR) technology for function-led assessments of executive functioning¹⁶. Indeed, VR permits the development of such assessments simulating everyday activities, allowing a secure and ecologically valid measure of executive functions^9,17. A virtual Multiple Errands Test (VMET) has been developed and tested in various clinical populations^18,19,20. The VMET allows for the evaluation of patientsâ abilities in formulating and checking a list of goals to effectively respond to environmental demands to achieve a series of tasks (e.g., buy a specific product, ask the examiner information about a product to be purchased). A recent study found that VMET was an effective tool for detecting early executive deficits in non-demented patients with Parkinsonâs Disease (PD)¹⁹. Furthermore, results demonstrated that patients with PD made more errors in the VMET tasks and showed a poorer ability in using effective strategies to complete the tasks in comparison to a control group. Interestingly, these two groups did not differ in their performance when compared on a traditional assessment of executive functioning.

A new technology for presenting neuropsychological stimuli is found in 360Â° environments (immersive photographs or videos) delivered via smartphones. The potentiality of 360Â° technology can be better understood by considering the âvirtuality continuumâ proposed by Milgram²¹, in which stimuli are presented in a manner ranging from completely real (real environment) to âvirtualâ (virtual environment). The space between extremes, called âmixed realityâ, is the area wherein real and virtual may co-exist producing new experiences. Advances in 360Â° technologies allow participants to be immersed into a real situation that they experience from a first-person perspective. This platform allows for sequential focusing upon various elements and portions of the environment at different times. Moreover, this permits a sequential planning of visual search.

In this direction, we developed a 360Â° version of the Picture Interpretation Test (PIT)^22,23 that leverages Luria and colleaguesâ²⁴ use of a Russian picture entitled âUnexpected Returnâ to investigate active visual perception in patients with frontal lobe damage. The approach was developed from their belief that the interpretation of a meaningful picture engages the patientâs neurocognitive system via recursive selection of the most informative elements observed during the visual search. This allows for the elaboration and testing of hypotheses regarding its meaning. Rosci and colleagues²² validated the PIT for detecting executive deficits in an Italian sample of 196 normal adults and 12 patients with pre-frontal brain lesions, who were asked to interpret what was happening in a reproduction of the famous painting âIl Sorcioâ (âThe Mouseâ). Findings revealed that 60 percent of the patients were unable to interpret the picture. Moreover, a similar failure rate was found in patient performance on a verbal fluency task and the Trail Making Test. These results suggest the potential of the Italian version of the PIT for testing of pre-frontal patients, thus making it one of the most used neuropsychological tests in the Italian context²⁵.

The current study was aimed to evaluate a 360Â° version of the PIT for detecting executive deficits through a function-led approach that combines experimental control with real-world engaging background. The study included patients with Parkinsonâs Disease (PD) because of the substantial research findings revealing a cognitive profile characterized by a dysexecutive syndrome^6,26,27.

To investigate the quality of the experience associated with the experiencing the PIT 360Â°, we examined participant self-reports (e.g., perceived balance between challenge and skills, as well as patientsâ intrinsic motivation in being confronted with the task). To specifically investigate the capability of PIT 360Â° for detecting executive deficits, we compared the performance of patients with PD to that of healthy controls (HC) using indices obtained from PIT 360Â°. Furthermore, we compared performance between the two groups on traditional construct-driven neuropsychological assessments of executive functioning. Finally, to evaluate the predictive validity of indices obtained from PIT 360Â°, we investigated how all these measures would be able to distinguish patients with PD from HC into their respective groups.

Results

User experience assessment

TableÂ 1 presents descriptive data obtained from the user experience assessment divided between the two groups.

Table 1 Scores obtained from the user experience assessment for Parkinsonâs Disease patient (PD Group) and Older Controls (HC Group). Data are shown as means and standard deviations (SD).

Full size table

Comparison of data between groups using the MannâWhitney U test did not reveal statistically significant differences (all psâ>â0.05). Results obtained from the Friedman Test indicated a significant difference among the four quadrants of GEW in terms of the mean number of reported felt emotions [Ï²(3)â=â87.572; pâ<â0.001] and their intensities [Ï²(3)â=â91,377; pâ<â0.001]. Specifically, Wilcoxon tests on mean number of self-reported emotion within the different quadrants revealed that all participants experienced more emotions with positive valence and high goal conduciveness. The same findings resulted for the intensities of self-reported emotions. See TablesÂ 2 and 3 for full statistics.

Table 2 Results obtained from Wilcoxon Test comparisons on different quadrants of Geneva Emotion Wheel. Mean number of reported felt emotion.

Full size table

Table 3 Results obtained from Wilcoxon Test comparisons on different quadrants of Geneva Emotion Wheel. Intensities of reported felt emotion.

Full size table

Conventional neuropsychological assessment

TableÂ 4 offers an overview of the scores obtained from the traditional neuropsychological evaluation divided between HC and patients with PD.

Table 4 Scores obtained from the neuropsychological assessment for Parkinsonâs Disease patient (PD Group) and Older Controls (HC). Data are shown as means and standard deviations (SD).

Full size table

When controlling for age and education, results showed that the PD group had significantly lower scores on MoCa when compared to HC [F(1, 34)â=â10.252; pâ=â0.003; Partial Î·²â=â0.232], they also performed significantly poorer on the phonemic fluency task [F(1, 34)â=â6.390; pâ=â0.016; Partial Î·²â=â0.158) and in the two TMT sub-tests, namely the TMT-A [F(1, 34)â=â7.075; pâ=â0.012; Partial Î·²â=â0.172] and the TMT-B [Fâ=â(1, 34) 4.240; pâ=â0.047; Partial Î·²â=â0.111)].

Performance on PIT 360Â°

Fourteen patients with PD (73.7%) correctly interpreted the scene proposed in the PIT 360Â°, while 5 patients with PD (26.3%) failed in the recognition. As concerns the HC group, sixteen participants (84.2%) recognized the scene, while 3 participants (15.8%) didnât succeed in the task. There was no significant difference in the proportion of participants successfully completing the task between groups [Ï²(1)â=â0.426; pâ=â0. 693].

When controlling for age and education, results showed that patients took longer (meanâ=â106.418; SDâ=â66.851) in comparison with HC group (meanâ=â70.808; SDâ=â52.782) for giving an interpretation of the scene proposed [F(1,34)â=â4.624; pâ=â0.039; Partial Î·²â=â0.120]. Moreover, the PD group (meanâ=â10.00, SDâ=â7.039) provided a more detailed description of the scene in comparison to the HC group (meanâ=â5.789; SDâ=â3.505) [F(1,34)â=â5.695; pâ=â0.023; Partial Î·²â=â0.143].

Correlations between neuropsychological tests and performance on PIT 360Â°

There were no significant correlations between neuropsychological tests and indexes of PIT 360Â° for HC group (see Fig.Â 1). For patients with PD, the time needed for a correct interpretation of the PIT 360Â° (Correct Interpretation) was found to positively correlate with performance on TMT-A (râ=â0.509; pâ=â0.026) and negatively with that on the phonemic fluency task (râ=ââ0.577; pâ=â0.009) (see Fig.Â 1). At the same time, there was a trend for correlation between Correct Interpretation and TMT-B (râ=â0.429; pâ=â0.066).

Classification of Healthy Controls or Patients with PD

Performance of the classifiers was evaluated by carrying out a relative operating characteristic (ROC) analysis²⁸. The area under the ROC curve (AUC) provides a single measure of overall prediction accuracy, Precision represents the proportion of true positives among all the instances classified as positive, CA (Classification accuracy) represents the proportion of the instances that were classified correctly, F1 indicates the harmonic mean of precision (P) and Recall (R), and Recall (R) is the proportion of cases which were classified as positive, among all instances which truly were positives. Results from nonlinear stochastic approximation (i.e., machine learning approach) methods showed a Precision between 55.6% and 68.8% for the conventional neuropsychological assessment of executive functions (TableÂ 5), while it ranged from 50.0% to 71.4% for PIT 360Â° (TableÂ 6). According to these results (TablesÂ 5 and 6), Random Forest has a Precision under the 60%, thus making this algorithm not reliable for the classification of cases into two groups using both traditional neuropsychological tests and indices from PIT 360Â°. On the other hand, results obtained with the Logistic Regression showed a good Precision only for traditional neuropsychological tests, but not for PIT 360Â°. Finally, we opted to use both Support Vector Machine and NaÃ¯ve Bayes that showed a good precision (over than 60%).

Table 5 Stratified 10-fold Cross validation for the neuropsychological assessment battery¹.

Full size table

Table 6 Stratified 10-fold Cross validation for the indices of PIT 360Â°¹.

Full size table

Although the ability to predict control group membership is quite similar between the two types of assessment (slightly better for the traditional neuropsychological assessment), it is interesting to note that NaÃ¯ve Bayes and Support Vector Machine algorithms showed that the indices from PIT 360Â° had a higher capability in predicting PD Group membership (see Fig.Â 2).

Discussion

The main objective of the current study was to evaluate the 360Â° version of the Picture Interpretation Test (PIT)^22,23, for providing assessment of executive functions processing in PD using an innovative and ecologically valid tool. Following immersion in the PIT 360Â°, HCs and patients with Parkinsonâs Disease (PD) were surveyed on their affective reactions to the 360Â° scene, perceived level of challenge and skills, appreciation of the tool, and their sense of presence while immersed in the PIT 360Â°. Then, to specifically evaluate the ability of PIT 360Â° in detecting executive deficits, we compared the performance of patients with PD and healthy controls comparing conventional neuropsychological assessments with the PIT 360Â°. Correlations between the conventional neuropsychological tests of executive functions and performance on PIT 360 were also explored. Finally, we investigated the predictive validity of indices obtained from PIT 360Â° in distinguishing PD patients from the healthy controls. Results from user experience assessment of the PIT 360Â° revealed that all participants endorsed positive reactions to their experience of the PIT 360Â°. This was apparent in the higher scores in the first quadrants of Geneva Emotion Wheel (GEW^29,30), which includes interest, joy, happiness, satisfaction, elation and pride. In particular, patients with PD did not endorse affective responding with low valence and high control (such as anger or irritation). Moreover, both groups perceived a high level of their own skills in the context of a demanding task (the interaction with PIT 360Â°), which resulted in perceived balanced level of challenge-skills. As emerged by mean scores of Intrinsic Motivation Inventory (IMI³¹), the PIT 360Â° was reported to be an interesting and an enjoyable activity. Finally, all participants reported a very high level of presence during the interaction with PIT 360Â°.

All subjects were preliminarily assessed by a neuropsychological assessment and all of them obtained scores within the normal range. This confirms that our patients were in a relatively well-preserved clinical state. Only a statistical group comparison revealed differences between the two groups. As expected, these behavioral findings indicate that patients with early PD and no clinical evidence of cognitive impairment may already exhibit sub-clinical abnormalities, as previously reported^32,33. In line with the pattern of results from conventional neuropsychological assessment, the PIT 360Â° analysis revealed different performances in patients with PD compared to HC. Although the percentage of PD patients that failed in correctly interpreting the scene is quite similar to that of HC (26.3% vs. 15.8%) confirming that they showed a relatively well-functioning cognitive status, analyses on the two PIT 360Â° indices showed significant differences between the two groups. Specifically, patients with PD took longer to provide a correct interpretation of the scene proposed and provided significantly more details about the objects found in the scene. While the patients gave significantly richer descriptions of the scene, they appeared more prone to distractor interference (âThere is a white coat, there are two chairs in front of the TV. There is a big TV. On the floor, there is maybe a scale. Then, I see a wardrobe that may be a fridge. There is a man who is working on jumper cables near the wardrobe. I see an electric device on the table. I think that the man is repairing something. The man is curled up behind a white-board, or something similar, a spot where it is possible to hang sheets.â). These findings are in line with Luriaâs view²⁴, suggesting that this test is able to capture deficits in active visual perception. Our data indicated that patients with PD demonstrated more difficulties when compared to healthy controls in focusing on the most important components for a correct interpretation of the scene. Thus, PD patients appear to focus on details instead on the most informative elements. They were not able to find important elements for a correct interpretation of the whole scene nor did they match these elements with their hypotheses about the meaning. In most cases, a poor interpretation based only on the details was given.

Interestingly, results from correlation analyses indicated that neuropsychological tests correlate significantly with indexes of PIT 360Â° only for patients with PD. Specifically, the time needed for giving a correct interpretation of the PIT 360Â° scene did not correlate with the Montreal Cognitive Assessment (i.e., a measure of global cognitive level), but it was significantly correlated with the Trail Making Test and the phonemic verbal fluency task, thus tapping both verbal and visuospatial aspects of executive functioning and motor aspects. These findings suggest that PIT 360Â° can be considered as a quick, ecological and useful screening instrument able to evaluate different aspects of dysexecutive impairment in patients with PD.

Results obtained from classifiers clearly indicated the potential of PIT 360Â° scene assessment in distinguishing between patients with PD and HC. Two of the algorithms used indicated that PIT 360Â° had a higher capability in predicting PD group membership with respect to a traditional neuropsychological assessment. Although machine learning approaches have been traditionally applied to the analysis of very complex medical datasets³⁴, recent studies have also applied them for classifying patients according to their cognitive impairment and consequently reduce the number of onerous tests required for their diagnosis^35,36,37.

While the findings of the current study are promising, there are some limitations that should be considered. First of all, in order to fully evaluate the potentiality of PIT 360Â° as a new screening tool of executive functions, future studies are needed to assess its testâretest reliability and validity. A large validation study with a sample of participants across the lifespan including the PIT 360Â°, the original PIT^22,23, as well as other conventional neuropsychological measures should be performed. Moreover, it will be important to investigate the value of PIT 360Â° in detecting executive impairments in other clinical populations who are known to have executive dysfunctioning.

Conclusions

This study provides the first evidence that the 360Â° technology may play a role in the future of neuropsychological assessment. Moreover, this technology may be integrated with other portable devices, such as an eye-tracker. As suggested by pioneering study of Luria²⁴, it would be particularly interesting to investigate patientsâ eye movements during the interpretation of the scene proposed. Indeed, Luria found that disturbances in the active visual perception were reflected by a corresponding disorganized scanning gaze movements. In conclusion, although preliminary, our findings provide encouraging evidence in support of the use of immersive 360Â° environments in general, and the PIT 360Â° specifically for innovative evaluation of executive impairments.

Materials and Methods

Participants

Thirty-eight participants took part in the study: 19 patients with Parkinsonâs Disease (PD group), and 19 healthy controls matched for age and education with the PD group (HC group).

Outpatients meeting the diagnostic criteria for probable PD³⁸ were consecutively recruited from the Neurorehabilitation Unit of Don Carlo Gnocchi Foundation, IRCCS. All patients were at a mild to moderate stage of the disease, scoring between stages 1 and 2 of the Hoehn and Yahr (H&Y) Scale³⁹. None had any report of cognitive problems or any evidence of cognitive deficits in their daily living activities. A Mini Mental State Examination (MMSE) was used to exclude any patient who reported scores outside the normal range (MMSE cut-off score 23, 8^40,41). All subjects (patients and HCs) were right handed as assessed by the Edinburgh Inventory⁴². Exclusion criteria included any major systemic, psychiatric, or other neurological illnesses. Particular attention was used to exclude those patients who experienced visual hallucinations, had episodes of severe depression or autonomic failure, manifested resistance to dopaminergic drugs and were at an unstable dosage of antiparkinsonian treatment during the 3 months prior to study entry. The PD group was composed of 3 women and 16 men, while the HC group included 9 women and 10 men. The mean age for the PD group was 66.53 (SDâ=â9.43), with an average of 12.47 (SDâ=â3.47) years of education of, while the mean age for the HC group was 67.58 (SDâ=â7.86), with an average of 14.37 (SDâ=â3.48) years of education. The two groups did not differ significantly in terms of age [t(36)â=ââ0.377; pâ=â0.708] or education [t(36)â=ââ1.680; pâ=â0.102]. However, there were significantly less women in the PD group [Ï²(1)â=â4.835; pâ=â0.036].

The study was conducted in compliance with the Helsinki Declaration of 1975, as revised in 2008. Local Ethics Committee (Don Carlo Gnocchi Foundation) approval and written informed consent to be included in the study was obtained by participants before study initiation.

Procedure of the study

Participants underwent a conventional neuropsychological assessment to obtain their global cognitive profile and level of executive functioning (pre-task evaluation). Subsequently participants were asked to complete the PIT) 360Â° (PIT 360Â° session). The PIT 360Â° was designed and administered through an innovative mobile application (PIT 360) that allows participants to explore an immersive 360Â° experience. At the end of PIT 360 task evaluation, subjects were asked to rate their affective reactions, perceived levels of challenge and skill, appreciation and sense of presence experienced while performing the PIT 360 task (post-task evaluation).

Pre-task evaluation: neuropsychological measures

In the pre-task evaluation we administered the following conventional paper and pencil tests: the Montreal Cognitive Assessment (MoCa⁴³) as a measure of global cognitive level; the Trail Making Test (in two specific sub-tests: TMT-A and TMT-B⁴⁴) and the phonemic verbal fluency task (FAS⁴⁵) as measures of executive functioning.

PIT 360Â° session: The PIT 360Â° development, description and administration

The PIT 360Â° is the 360Â° version of the Picture Interpretation Test^22,23. In the PIT test, a small-scale color reproduction (19âÃâ13) of the famous painting âIl Sorcioâ (âThe Mouseâ) of the Italian painter Giacomo Favretto is used as a test stimulus. In this painting, a room in disarray is presented with three frightened girls standing on chairs and a boy who is searching for something on the floor. Although not visible, it is apparent that there is mouse hidden behind a piece of furniture. Participants are asked to interpret what is happening in the scene in a limited time frame (180âseconds), while the time to say the word âmouseâ is the outcome measure.

The PIT 360Â° was developed with the Ricoh Theta S Digital Camera that permits the creation of 360Â° spherical imageries. The camera is able to capture a 360Â° scene by stitching two 180Â° scans via integrated software at a resolution of 1792 by 3584 pixels. This allows for a presentation of an immersive stereoscopic 360Â° experience directly on a virtual reality headset (including mobile phone) via the Ricoh Theta S application. For the present study, the PIT 360Â° was rendered trough the mobile application of the Ricoh Theta S on an iPhone 6 Plus. Two scenes were recorded: one, to be used in the Familiarization phase (see Fig.Â 3), in which a meeting room appeared with tables, chairs, a sink, a television, some dressers with several objects spread on them. The second scene (see Fig.Â 4), to be used in experimental phase, was designed in line with the Favrettoâs painting âIl Sorcioâ: in the same room, with the same furniture and objects spread throughout it along with a boy searching for something on the floor, while three frightened girls standing on chairs watch him.

The neuropsychologist started the administrations by inviting participants to sit on a swivel chair and to wear the virtual reality headset (connected to the iPhone 6 Plus). This allowed participants to explore an interactive 360Â° experience. In case of presbyopia, participants were asked to wear their own glasses. Then participants underwent a familiarization phase (3âminutes) aimed at familiarizing them with the technology and control for potential side effects (e.g., dizziness, nausea). The examiner followed a cessation rule in which experimental sessions should be stopped if severe side effects occurred. The examiner asked participants to keep their eyes closed, and started time registration (in seconds) and audio recording coinciding with the instruction âOpen your eyesâ. Participants were then presented with the 360Â° scene of the room including a table (in the center), a sink with a mirror (on the participantâs right), a television on a table, two dressers (on the participantâs left), and various chairs and objects spread throughout the room. They were asked to find five objects in the scene to answer the experimentersâ questions (i.e. âLetâs search for the agenda. Where is the agenda?â). Upon completion of the three-minute familiarization phase, participants were asked to close again their eyes. The experimental session began with time registration (in seconds) and audio recording coinciding with the examinerâs instruction âOpen your eyesâ. In this phase, participants were asked to freely explore the scene derived from Favrettoâs âIl Sorcioâ and to tell the examiner what is happening as quickly as possible (maximum time: 180âseconds). Time registration lasted until the instant in which the participant said the word âmouseâ or something similar (e.g., âsnakeâ, âroachâ, etc; generic classifications were allowed). After participants pronounced the word âmouseâ (or similar generic classifications), the experimenter asked âWhat do you mean?â in order to confirm the participantâs understanding of the situation.

The following indices were calculated:

1)
Correct Interpretation: The time in seconds registered from the time in which the experimenter said the words âOpen your eyesâ until the participant provided a correct interpretationâ (i.e., âmouseâ, âanimalâ, etc.). The maximum time allowed was 180âseconds. If the participants failed to interpret the scene in the allotted 180âseconds, then a time of 180âseconds was assigned as the outcome (as suggested by Rosci and colleagues²²);
2)
Number of Scene Elements: The sum of the scene elements that were verbalized during the interpretation of the scene.

The post-task evaluation: user experience assessment

In the post-task evaluation, participants were asked to rate their experience during the task on the following instruments:

Geneva Emotion Wheel (GEW^29,30).

It consists of 20 discrete emotion terms that are systematically aligned in a circle. Underlying the alignment of the emotion terms are the two dimensions â valence/goal conduciveness (negative to positive) and control/coping potential (low to high) separating the emotions into four quadrants, each meant as an emotion family: Positive valence/High coping potential, Positive valence/Low coping potential, Negative valence/Low coping potential, and Negative valence/High coping potential. In all the four quadrants, the single emotion terms are considered as indexes âreflecting a unique experience of mental and bodily changes in the context of being confronted with a particular eventâ²⁹. The mean number of emotions labels chosen within each of the four quadrants and the reported intensity in feeling it show how participants shaped their subjective feeling in performing the PIT 360Â° task along the dimensions of valence/goal conduciveness and control/coping potential.

Perceived fit of demands and skills (from Flow Short Scale⁴⁶). LandhÃ¤uÃer and Keller⁴⁷ highlighted that, although researchers seem to equalize the skill-demands compatibility with the experience of flow itself (e.g.^48,49) in many studies, the balance between skills of the individual and perceived challenges of the task cannot be considered as a measure of the flow experience per se. Therefore, we administered the three items from the Flow Short Scale (5-points scale) that assess this specific component of the flow experience in performing the PIT 360Â° task. The first item asked participants to evaluate their perceived level of skills in coping with the task (âPerceived coping skillsâ), whereas the second item is related to the perceived level of challenges (âPerceived challengeâ). Finally, participants were asked to indicate the perceived challenge-skill balance (âPerceived challenge- skill balanceâ) in 5-point-scale with 1 indicating that the current challenge is too low for onesâ perceived skills, 3 indicating that the current challenge fit exactly to onesâ skills and 5 indicating that the challenge is too high.

Intrinsic Motivation Inventory (IMI³¹). Participants responded to five items (7-points scale) from the subscale âEnjoymentâ of the Intrinsic Motivation Inventory (IMI). These items were chosen to evaluate participantsâ appreciation to the proposed activity, including the items âThis activity was fun to doâ and âWhile I was doing this activity, I was thinking about how much I enjoyed itâ. The mean of the item scores is considered.

The Slater-Usoh-Steed Questionnaire (SUS⁵⁰). It consists of 7-points questionnaire which evaluates the sense of presence with three items: 1) the sense of being in the scene depicted in the 360Â° scene, 2) the extent to which the 360Â° scene became the dominant reality, and 3) the extent to which the 360Â° scene was remembered as a place. The mean of the item scores was considered.

Data analysis

The normality of data distribution was assessed using the Kolmogorov-Smirnov test. Since data were not normally distributed, non-parametric tests were used to investigate the quality of the experience associated with the interaction with PIT 360Â° and potential differences between PD and HC group on user experience variables (i.e., GEW, Flow Short Scale, IMI, and SUS). Moreover, for the GEW^29,30, differences within the four quadrants in the number and intensity of self-reported emotions were also investigated using the Friedman Test. Next, Wilcoxon tests, with Bonferroniâs adjustment, were computed to break down significant findings. Subsequently, between-group comparisons of performance on the conventional neuropsychological assessment and the indexes of PIT 360Â° (Correct Interpretation and Number of Scene Element) were made by univariate analysis of covariance (ANCOVA), using age and education as covariates. These statistical analyses were performed using the Statistical Package for the Social Sciences for Windows (IBM Corp Armonk, NY, USA), version 23. Finally, Pearson correlation coefficient was used to examine correlations between conventional neuropsychological tests and the indexes of PIT 360Â°. These statistical analyses were carried out using the software MedCalc (MedCalc Software, Ostend, Belgium), version 16.8.4.

Nonlinear stochastic approximation (i.e., machine learning) methods were used to compare the classification accuracy of traditional neuropsychological assessments versus the PIT 360Â° indices for classifying participants into either the âPatients with PDâ or âHealthy Controlsâ groups. Machine learning approaches are devoted to prediction and it is thought to be explorative rather than explicative⁵¹; accordingly, we used different algorithms to compare the predictive value of each one of them to understand which one was the best based on their accuracy. Because our analyses are based on relatively small sample sizes, a Leave-one-out cross-validation (LOOCV)^52,53. Different algorithms were employed, namely: a) a Logistic Regression classification algorithm with ridge regularization; b) a Random Forest classification to classify using an ensemble of decision trees; c) a Support Vector Machine (SVM) to map inputs to higher-dimensional feature spaces that best separate different classes; d) a NaÃ¯ve Bayes classification, for discriminating between the two groups, even without any particular assumption for the distribution for the features. All these analyses were computed using Python 3.4 with the Orange 3.3.5 data mining suite, which is freely available as open source code (https://github.com/biolab/orange3).

References

Stuss, D. T. & Alexander, M. P. Executive functions and the frontal lobes: a conceptual view. Psychological research 63, 289â298 (2000).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Barker, L. A., Andrade, J. & Romanowski, C. A. J. Impaired implicit cognition with intact executive function after extensive bilateral prefrontal pathology: A case study. Neurocase 10, 233â248 (2004).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Goldstein, G. In Ecological Validity of Neuropsychological Testing (eds R. J. Sbordone & C. J. Long) 75â89 (FL:GsRPress/St.LuciePress, 1996).
Shallice, T. & Burgess, P. W. Deficits in strategy application following frontal lobe damage in man. Brain 114, 727â741 (1991).
ArticleÂ PubMedÂ Google ScholarÂ
Chaytor, N. & Schmitter-Edgecombe, M. The ecological validity of neuropsychological tests: A review of the literature on everyday cognitive skills. Neuropsychology review 13, 181â197 (2003).
ArticleÂ PubMedÂ Google ScholarÂ
Kudlicka, A., Clare, L. & Hindle, J. V. Executive functions in Parkinsonâs disease: Systematic review and metaâanalysis. Movement Disorders 26, 2305â2315 (2011).
ArticleÂ PubMedÂ Google ScholarÂ
Spooner, D. M. & Pachana, N. A. Ecological validity in neuropsychological assessment: a case for greater consideration in research with neurologically intact populations. Archives of clinical neuropsychology 21, 327â337 (2006).
ArticleÂ PubMedÂ Google ScholarÂ
Burgess, P. W. et al. The case for the development and use of âecologically validâ measures of executive function in experimental and clinical neuropsychology. Journal of the international neuropsychological society 12, 194â209 (2006).
ArticleÂ PubMedÂ Google ScholarÂ
Parsons, T. D. Virtual Reality for Enhanced Ecological Validity and Experimental Control in the Clinical, Affective, and Social Neurosciences. Frontiers in Human Neuroscience, 1â19 (2015).
Chan, R. C. K., Shum, D., Toulopoulou, T. & Chen, E. Y. H. Assessment of executive functions: Review of instruments and identification of critical issues. Archives of clinical neuropsychology 23, 201â216 (2008).
ArticleÂ PubMedÂ Google ScholarÂ
Lezak, M. D. Neuropsychological Assessment (4th Ed.). (Oxford University Press, 2004).
Lehto, J. E., JuujÃ¤rvi, P., Kooistra, L. & Pulkkinen, L. Dimensions of executive functioning: Evidence from children. British Journal of Developmental Psychology 21, 59â80 (2003).
ArticleÂ Google ScholarÂ
Miyake, A. et al. The unity and diversity of executive functions and their contributions to complex âfrontal lobeâ tasks: A latent variable analysis. Cognitive psychology 41, 49â100 (2000).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Diamond, A. Executive functions. Annual review of psychology 64, 135 (2013).
ArticleÂ PubMedÂ Google ScholarÂ
Logie, R. H., Trawley, S. & Law, A. Multitasking: Multiple, domain-specific cognitive functions in a virtual environment. Memory & cognition 39, 1561â1574 (2011).
ArticleÂ Google ScholarÂ
Parsons, T. D., Carlew, A. R., Magtoto, J. & Stonecipher, K. The potential of function-led virtual environments for ecologically valid measures of executive function in experimental and clinical neuropsychology. Neuropsychological rehabilitation, 37(5), 777â807 (2017).
Bohil, C. J., Alicea, B. & Biocca, F. A. Virtual reality in neuroscience research and therapy. Nature reviews neuroscience 12, 752â762 (2011).
CASÂ PubMedÂ Google ScholarÂ
Raspelli, S. et al. Validating the Neuro VR-based virtual version of the Multiple ErrandsTest: preliminary results. Presence: Teleoperators and Virtual Environments 21, 31â42 (2012).
ArticleÂ Google ScholarÂ
Cipresso, P. et al. Virtual multiple errands test (VMET): a virtual reality-based tool to detect early executive functions deficit in Parkinsonâs disease. Frontiers in behavioral neuroscience 8 (2014).
Cipresso, P. et al. Break in volition: A virtual reality study in patients with obsessive-compulsive disorder. Experimental brain research 229, 443â449 (2013).
ArticleÂ PubMedÂ Google ScholarÂ
Milgram, P. & Kishino, F. A taxonomy of mixed reality visual displays. IEICE TRANSACTIONS on Information and Systems 77, 1321â1329 (1994).
Google ScholarÂ
Rosci, C., Sacco, D., Laiacona, M. & Capitani, E. Interpretation of a complex picture and its sensitivity to frontal damage: a reappraisal. Neurological Sciences 25, 322â330 (2005).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Bisiach, E., Cappa, S. & Vallar, G. Guida allâesame neuropsicologico. (R. Cortina, 1983).
Luria, A. R., Karpov, B. A. & Yarbuss, A. L. Disturbances of active visual perception with lesions of the frontal lobes. Cortex 2, 202â212 (1966).
ArticleÂ Google ScholarÂ
Bianchi, A. & Dai PrÃ , M. Twenty years after Spinnler and Tognoni: new instruments in the Italian neuropsychologistâs toolbox. Neurological sciences 29, 209â217 (2008).
ArticleÂ PubMedÂ Google ScholarÂ
Dirnberger, G. & Jahanshahi, M. Executive dysfunction in Parkinsonâs disease: a review. Journal of neuropsychology 7, 193â224 (2013).
ArticleÂ PubMedÂ Google ScholarÂ
Litvan, I. et al. Diagnostic criteria for mild cognitive impairment in Parkinsonâs disease: Movement Disorder Society Task Force guidelines. Movement Disorders 27, 349â356 (2012).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Swets, J. A. Measuring the accuracy of diagnostic systems. Science 240, 1285â1293 (1988).
ArticleÂ ADSÂ CASÂ PubMedÂ MATHÂ MathSciNetÂ Google ScholarÂ
Scherer, K. R. What are emotions? And how can they be measured? Social science information 44, 695â729 (2005).
ArticleÂ Google ScholarÂ
Scherer, K. R., Shuman, V., Fontaine, J. R. J. & Soriano, C. The GRID meets the Wheel: Assessing emotional feeling via self-report. Components of emotional meaning: A sourcebook, 281â298 (2013).
Deci, E. L., Eghrari, H., Patrick, B. C. & Leone, D. R. Facilitating internalization: The selfâdetermination theory perspective. Journal of personality 62, 119â142 (1994).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Pillon, B., Czernecki, V. & Dubois, B. Dopamine and cognitive function. Current opinion in neurology 16, S17âS22 (2003).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Baglio, F. et al. Functional brain changes in early Parkinsonâs disease during motor response and motor inhibition. Neurobiology of aging 32, 115â124 (2011).
ArticleÂ PubMedÂ Google ScholarÂ
Kononenko, I. Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in medicine 23, 89â109 (2001).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Buscema, M. et al. Artificial neural networks and artificial organisms can predict Alzheimer pathology in individual patients only on the basis of cognitive and functional status. Neuroinformatics 2, 399â415 (2004).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Parsons, T. D., Rizzo, A. A. & Buckwalter, J. G. Backpropagation and regression: comparative utility for neuropsychologists. Journal of Clinical and Experimental Neuropsychology 26, 95â104 (2004).
ArticleÂ PubMedÂ Google ScholarÂ
Weakley, A., Williams, J. A., Schmitter-Edgecombe, M. & Cook, D. J. Neuropsychological test selection for cognitive impairment classification: A machine learning approach. Journal of clinical and experimental neuropsychology 37, 899â916 (2015).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Gelb, D. J., Oliver, E. & Gilman, S. Diagnostic criteria for Parkinson disease. Archives of neurology 56, 33â39 (1999).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Fahn, S. & Elton, R. L. Unified rating scale for Parkinsonâs disease. Recent developments in Parkinsonâs disease. Florham Park. New York: Macmillan, 153â163 (1987).
Measso, G. et al. The miniâmental state examination: Normative study of an Italian random sample. Developmental Neuropsychology 9, 77â85 (1993).
ArticleÂ Google ScholarÂ
Folstein, M. F., Robins, L. N. & Helzer, J. E. The mini-mental state examination. Archives of general psychiatry 40, 812â812 (1983).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Oldfield, R. C. Handedness in musicians. British Journal of Psychology 60, 91â99 (1969).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Conti, S., Bonazzi, S., Laiacona, M., Masina, M. & Coralli, M. V. Montreal Cognitive Assessment (MoCA)-Italian version: regression based norms and equivalent scores. Neurological Sciences 36, 209â214 (2015).
ArticleÂ PubMedÂ Google ScholarÂ
Giovagnoli, A. R. et al. Trail making test: normative values from 287 normal adult controls. The Italian journal of neurological sciences 17, 305â309 (1996).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Carlesimo, G. A. et al. The mental deterioration battery: normative data, diagnostic reliability and qualitative analyses of cognitive impairment. European neurology 36, 378â384 (1996).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Rheinberg, F., Vollmeyer, R. & Engeser, S. In Diagnostik von Motivation und Selbstkonzept (eds J. Stiensmeier-Pelster & F. Rheinberg) 261â279 (2003).
LandhÃ¤uÃer, A. & Keller, J. In Advances in flow research (ed S. Engeser) 65â85 (Springer, 2012).
Csikszentmihalyi, M. & LeFevre, J. Optimal experience in work and leisure. Journal of personality and social psychology 56, 815â822 (1989).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Eisenberger, R., Jones, J. R., Stinglhamber, F., Shanock, L. & Randall, A. T. Flow experiences at work: For high need achievers alone? Journal of Organizational Behavior 26, 755â775 (2005).
ArticleÂ Google ScholarÂ
Usoh, M., Catena, E., Arman, S. & Slater, M. Using presence questionnaires in reality. Presence: Teleoperators and Virtual Environments 9, 497â503 (2000).
ArticleÂ Google ScholarÂ
Mitchell, T. Machine Learning., (McGraw Hill, 1997).
Caruana, R. & Niculescu-Mizil, A. InProceedings of the 23rd international conference on Machine learning. 161â168 (ACM).
Suthaharan, S. In Machine Learning Models and Algorithms for Big Data Classification 183â206 (Springer, 2016).

Download references

Acknowledgements

This work was partially supported by the Italian funded project âHigh-end and Low-End Virtual Reality Systems for the Rehabilitation of Frailty in the Elderlyâ (PE-2013-02355948), by the research project Tecnologia Positiva e Healthy Aging (Positive Technology and Healthy Aging) (Grant D.3.2., 2014) and by the research project âAgeing and Healthy Living: A Human Centered Approach in Research and innovation as Source of Quality Lifeâ, funded by Fondazione Cariplo within the 2014. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication

Author information

Authors and Affiliations

Applied Technology for Neuro-Psychology Lab, IRCCS Istituto Auxologico Italiano, Via Magnasco, 2 20149, Milan, Italy
Silvia Serino,Â Pietro CipressoÂ &Â Giuseppe Riva
Department of Psychology, UniversitÃ Cattolica del Sacro Cuore, Largo Gemelli, 1, 20100, Milan, Italy
Silvia Serino,Â Federica Rossetto,Â Pietro CipressoÂ &Â Giuseppe Riva
IRCCS, Fondazione don Carlo Gnocchi ONLUS, Via Capecelatro 66, 20148, Milan, Italy
Francesca Baglio,Â Federica RossettoÂ &Â Raffaello Nemni
Department of Human Sciences for Education, UniversitÃ degli Studi di Milano-Bicocca, Milan, Italy
Olivia RealdonÂ &Â Fabrizia Mantovani
Computational Neuropsychology and Simulation Laboratory, University of North Texas, 1155 Union Circle #311280, Denton, Texas, 76203-5017, USA
Thomas D. Parsons
Department of Psychology, University of North Texas, 1155 Union Circle #311280, Denton, Texas, 76203-5017, USA
Thomas D. Parsons
National Research Council of Italy, Institute for the Dynamics of Environmental Processes, Piazza della Scienza, 1, 20126, Milan, Italy
Giacomo Cappellini
Department of Clinical and Digital Health Sciences, College of Allied Health Sciences, Augusta University, 987 St. Sebastian Way, EC 4316, Augusta, Georgia, 30912, USA
Gianluca De Leo
Department of Pathophysiology and Transplantation, UniversitÃ degli Studi di Milano, via Francesco Sforza, 35, 20122, Milan, Italy
Raffaello Nemni

Authors

Silvia Serino
View author publications
You can also search for this author in PubMedÂ Google Scholar
Francesca Baglio
View author publications
You can also search for this author in PubMedÂ Google Scholar
Federica Rossetto
View author publications
You can also search for this author in PubMedÂ Google Scholar
Olivia Realdon
View author publications
You can also search for this author in PubMedÂ Google Scholar
Pietro Cipresso
View author publications
You can also search for this author in PubMedÂ Google Scholar
Thomas D. Parsons
View author publications
You can also search for this author in PubMedÂ Google Scholar
Giacomo Cappellini
View author publications
You can also search for this author in PubMedÂ Google Scholar
Fabrizia Mantovani
View author publications
You can also search for this author in PubMedÂ Google Scholar
Gianluca De Leo
View author publications
You can also search for this author in PubMedÂ Google Scholar
Raffaello Nemni
View author publications
You can also search for this author in PubMedÂ Google Scholar
Giuseppe Riva
View author publications
You can also search for this author in PubMedÂ Google Scholar

Contributions

S.S., F.B., F.R., O.R., and G.R. developed the study concept. All authors contributed to the study design. F.R. was involved in the data collection. G.C. was responsible for the technical development of the PIT. S.S. and O.R performed the data analysis and interpretation under the supervision of F.M., R.M., G.D.L., T.P.C. performed computational data analysis. S.S., F.B., F.R., and O.R. wrote the first draft of the manuscript. All authors were involved in a critical revision of the manuscript for important intellectual content. All the authors approved the final version of the manuscript for submission.

Corresponding author

Correspondence to Silvia Serino.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Serino, S., Baglio, F., Rossetto, F. et al. Picture Interpretation Test (PIT) 360Â°: An Innovative Measure of Executive Functions. Sci Rep 7, 16000 (2017). https://doi.org/10.1038/s41598-017-16121-x

Download citation

Received: 22 March 2017
Accepted: 01 November 2017
Published: 22 November 2017
DOI: https://doi.org/10.1038/s41598-017-16121-x