Predicting Human Psychometric Properties Using Computational Language Models

Laverghetta Jr., Antonio; Nighojkar, Animesh; Mirzakhalov, Jamshidbek; Licato, John

Computer Science > Computation and Language

arXiv:2205.06203 (cs)

[Submitted on 12 May 2022]

Title:Predicting Human Psychometric Properties Using Computational Language Models

Authors:Antonio Laverghetta Jr., Animesh Nighojkar, Jamshidbek Mirzakhalov, John Licato

View PDF

Abstract:Transformer-based language models (LMs) continue to achieve state-of-the-art performance on natural language processing (NLP) benchmarks, including tasks designed to mimic human-inspired "commonsense" competencies. To better understand the degree to which LMs can be said to have certain linguistic reasoning skills, researchers are beginning to adapt the tools and concepts from psychometrics. But to what extent can benefits flow in the other direction? In other words, can LMs be of use in predicting the psychometric properties of test items, when those items are given to human participants? If so, the benefit for psychometric practitioners is enormous, as it can reduce the need for multiple rounds of empirical testing. We gather responses from numerous human participants and LMs (transformer- and non-transformer-based) on a broad diagnostic test of linguistic competencies. We then use the human responses to calculate standard psychometric properties of the items in the diagnostic test, using the human responses and the LM responses separately. We then determine how well these two sets of predictions correlate. We find that transformer-based LMs predict the human psychometric data consistently well across most categories, suggesting that they can be used to gather human-like psychometric data without the need for extensive human trials.

Comments:	To appear in Quantitative Psychology, The 86th Annual Meeting of the Psychometric Society, Virtual. arXiv admin note: substantial text overlap with arXiv:2106.06849
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.06203 [cs.CL]
	(or arXiv:2205.06203v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.06203

Submission history

From: Antonio Laverghetta Jr. [view email]
[v1] Thu, 12 May 2022 16:40:12 UTC (5,611 KB)

Computer Science > Computation and Language

Title:Predicting Human Psychometric Properties Using Computational Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Predicting Human Psychometric Properties Using Computational Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators