Abstract
Diagnosing autism spectrum disorder (ASD) conventionally demands significant time and resources. Language deficits are key markers of ASD, particularly in constructing narratives. This study leverages computational models to analyze story book narratives from seven children with ASD and 16 typically-developing (TD) peers. By transcribing and training models on limited data using augmentation techniques, our best model achieved over 90% accuracy, sensitivity, and specificity-outperforming previous models by 20% in ASD detection. This research showcases the efficacy of our approach in efficiently assessing language abilities and identifying ASD tendencies. The method holds promise for enhancing diagnostic efficiency and providing comprehensive language evaluations to support children with ASD and their caregivers.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The collected dataset can only be shared after the consent from the participants can be obtained and documented.
References
American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders, 5th edn. Retrieved from https://www.psychiatry.org/psychiatrists/practice/dsm
Baixauli I, Colomer C, Roselló B, Miranda A (2016) Narratives of children with high-functioning autism spectrum disorder: A meta-analysis. Res Development Disabilit 59:234–254
Baron-Cohen S, Leslie AM, Frith U (1985) Does the autistic child have a “theory of mind’’? Cognition 21(1):37–46
Capps L, Losh M, Thurber C (2000) “The frog ate the bug and made his mouth sad’’: Narrative competence in children with autism. J Abnormal Child Psychol 28:193–204
Chojnicka I, Wawer A (2020) Social language in autism spectrum disorder: A computational analysis of sentiment and linguistic abstraction. PLoS One 15(3):e0229985
Christensen DL (2016) Prevalence and characteristics of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, united states, 2012. MMWR. Surveillance summaries 65
Colle L, Baron-Cohen S, Wheelwright S, Van Der Lely HK (2008) Narrative discourse in adults with high-functioning autism or asperger syndrome. J Autism Development Disorders 38:28–40
Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805
Dey RK, Das AK (2023) Modified term frequency-inverse document frequency based deep hybrid framework for sentiment analysis. Multimed Tools Appl 82(21):32967–32990
Dey RK, Das AK (2024) Neighbour adjusted dispersive flies optimization based deep hybrid sentiment analysis framework. Multimed Tools Appl pp 1–24
Diehl JJ, Bennetto L, Young EC (2006) Story recall and narrative coherence of high-functioning children with autism spectrum disorders. J Abnormal Child Psychol 34:83–98
Education Mo (2014) Handbook of identification methods for students with disabilities and gifted talents. https://spe-girc.ntnu.edu.tw/wp-content/uploads/2021/08/jianding201608.pdf
Farooq MS, Tehseen R, Sabir M, Atal Z (2023) Detection of autism spectrum disorder (asd) in children and adults using machine learning. Scientific Reports 13(1):9605
Fusaroli R, Grossman R, Bilenberg N, Cantio C, Jepsen JRM, Weed E (2022) Toward a cumulative science of vocal markers of autism: A cross-linguistic meta-analysis-based investigation of acoustic markers in american and danish autistic children. Autism Res 15(4):653–664
Huang R, Wu S, Tsia I, Huang T, Zheng Z (2016) Chinese Language Sample Analysis Guide. Psychological Publishing
Hung Y, Chen I, Wu C (2019) Detection of autism spectrum disorder in different settings: Accuracy of the modified checklist for autism in toddlers. Bullet Special Educ 44(3):33–61
Jiao X, Yin Y, Shang L, Jiang X, Chen X, Li L, Wang F, Liu Q (2019) Tinybert: Distilling bert for natural language understanding. arXiv:1909.10351
Kuijper SJ, Hartman CA, Bogaerds-Hazenberg S, Hendriks P (2017) Narrative production in children with autism spectrum disorder (asd) and children with attention-deficit/hyperactivity disorder (adhd): Similarities and differences. J Abnormal Psychol 126(1):63
Lee C (1993) Spit the Seeds. Hsin-yi
Loomes R, Hull L, Mandy WPL (2017) What is the male-to-female ratio in autism spectrum disorder? a systematic review and meta-analysis. J American Academy Child Adolescent Psychiatry 56(6):466–474
Lord C, Rutter M, DiLavore P, Risi S, Gotham K, Bishop S, et al (2012) Autism diagnostic observation schedule–2nd edition (ados-2). Los Angeles, CA: Western Psychological Corporation 284
MacQueen J (1967) Classification and analysis of multivariate observations. In: Proceedings of the 5th berkeley symposium on mathematical statistics and probability, pp 281–297
MacWhinney B, Snow C (1990) The child language data exchange system: An update. J Child Language 17(2):457–472
Marini A, Ozbič M, Magni R, Valeri G (2020) Toward a definition of the linguistic profile of children with autism spectrum disorder. Front Psychol 11:808
Parsons L, Cordier R, Munro N, Joosten A, Speyer R (2017) A systematic review of pragmatic language interventions for children with autism spectrum disorder. PloS one 12(4):e0172242
Pearson K (1901) Liii. on lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin philosophical magazine and journal of science 2(11):559–572
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al (2011) Scikit-learn: Machine learning in python. J Mach Learn Res 12:2825–2830
Prevention CfDCa (2022) Screening and diagnosis of autism spectrum disorder. https://www.cdc.gov/ncbddd/autism/screening.html
Raj S, Masood S (2020) Analysis and detection of autism spectrum disorder using machine learning techniques. Procedia Comput Sci 167:994–1004
Rumpf AL, Kamp-Becker I, Becker K, Kauschke C (2012) Narrative competence and internal state language of children with asperger syndrome and adhd. Res Develop Disabilit 33(5):1395–1407
Rutter M, Bailey A, Lord C (2003) The Social Communication Questionnaire: Manual. Western Psychological Services
Schaaf CP, Betancur C, Yuen RK, Parr JR, Skuse DH, Gallagher L, Bernier RA, Buchanan JA, Buxbaum JD, Chen CA et al (2020) A framework for an evidence-based gene list relevant to autism spectrum disorder. Nature Rev Genetics 21(6):367–376
May WY, Chang KJ (2003) Introduction to CKIP Chinese word segmentation system for the first international Chinese Word Segmentation Bakeoff. Proc ACL, Second SIGHAN Workshop on Chinese Language Processing: 168–171
Sparck Jones K (1972) A statistical interpretation of term specificity and its application in retrieval. J Documentation 28(1):11–21
of Social Welfare in Taiwan ND (2023) The annual statistics of people with disabilities in 2023. https://dosw.gov.taipei/cp.aspx?n=BA5B128CF7454DDC
Goto456 (2019) Stopwords (hit_stopwords.txt). GitHub repository. Retrieved from https://github.com/goto456/stopwords
Tsai IF (2009) A study of Chinese language sample analysis for 3-5 years old children. [Master’s thesis, Taipei City University of Education]. National Digital Library of Theses and Dissertations in Taiwan. https://hdl.handle.net/11296/3pa2mt
Tsay JS (2005) Taiwan child language corpus: data collection and annotation. In: Proceedings of the Fifth Workshop on Asian Language Resources (ALR-05) and First Symposium on Asian Language Resources Network (ALRN)
Tseng Y, Liu H (2017) Examining performance of expository and conversational discourse in mandarin-speaking children with language impairment. J Special Educ 46:1–30
Tseng Y, Liu H (2023) Investigating performance of expository discourse in mandarin-speaking children with language impairment: Language sampling analysis. Bullet Special Educ 48(1):31–60
Wawer A, Chojnicka I (2022) Detecting autism from picture book narratives using deep neural utterance embeddings. Int J Language Commun Disorders 57(5):948–962
Wei J, Zou K (2019) Eda: Easy data augmentation techniques for boosting performance on text classification tasks. arXiv:1901.11196
Wiesner D (1991) Tuesday. Houghton Mifflin Harcourt
Wong YS (2018) Utility of the Screening Tool for Autism in Two-year-olds (STAT) and the Autism Diagnostic Observation Schedule (ADOS) for detecting autism spectrum disorder in toddlers under aged 24 months: A follow-up study. [Master’s thesis, Kaohsiung Medical University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0011-2307201816313300
Wu CY (2013) Urban and rural differences in age at initial diagnosis and healthcare utilization among pre-school children with autism. [Master’s thesis, Taipei Medical University]. National Digital Library of Theses and Dissertations in Taiwan. https://hdl.handle.net/11296/3w4d84
Wu CC (2002) The reliability of production measures on children with specific impairment. [Master’s thesis, National Chiayi University]. National Digital Library of Theses and Dissertations in Taiwan. https://hdl.handle.net/11296/728hps
Zeidan J, Fombonne E, Scorah J, Ibrahim A, Durkin MS, Saxena S, Yusuf A, Shih A, Elsabbagh M (2022) Global prevalence of autism: A systematic review update. Autism Res 15(5):778–790
Zhou J (2009) Research on Chinese Children’s Language Development: Application and Development of International Children’s Corpus Research Methods. Educational Science Press
Acknowledgements
We extend our heartfelt gratitude to the children and their families who took part in this study, generously contributing to our dataset. We are also grateful to the Zhulian Elementary School in East District of Hsinchu City and the Hsinchu Autism Association for their cooperation in recruiting the participants for the study and providing the experiment venue.
Funding
This research was funded by [109-2221-E-468-014-MY3] (awarded to Dr. Arbee L.P. Chen) from National Science and Technology Council, Republic of China.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Ruihan Sun. The first draft of the manuscript was written by Ruihan Sun and all other authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.
Ethical Approval
This study was reviewed and approved by the Institutional Review Board at China Medical University (IRB number: CRREC-112-002).
Consent to Participate
We obtained the written consent from the legal guardian of each participant.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sun, R., Wong, J., Chen, E.E. et al. Using Computational Models to Detect Autistic Tendencies for Children from their Story Book Narratives. Multimed Tools Appl (2025). https://doi.org/10.1007/s11042-025-20600-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-025-20600-z