Multimodal Methods for Analyzing Learning and Training Environments: A Systematic Literature Review

Cohn, Clayton; Davalos, Eduardo; Vatral, Caleb; Fonteles, Joyce Horn; Wang, Hanchen David; Ma, Meiyi; Biswas, Gautam

arXiv:2408.14491 (cs)

[Submitted on 22 Aug 2024]

Abstract: Recent technological advancements have enhanced our ability to collect and analyze rich multimodal data (e.g., speech, video, and eye gaze) to better inform learning and training experiences. While previous reviews have focused on parts of the multimodal pipeline (e.g., conceptual models and data fusion), a comprehensive literature review on the methods informing multimodal learning and training environments has not been conducted. This literature review provides an in-depth analysis of research methods in these environments, proposing a taxonomy and framework that encapsulates recent methodological advances in this field and characterizes the multimodal domain in terms of five modality groups: Natural Language, Video, Sensors, Human-Centered, and Environment Logs. We introduce a novel data fusion category -- mid fusion -- and a graph-based technique for refining literature reviews, termed citation graph pruning. Our analysis reveals that leveraging multiple modalities offers a more holistic understanding of the behaviors and outcomes of learners and trainees. Even when multimodality does not enhance predictive accuracy, it often uncovers patterns that contextualize and elucidate unimodal data, revealing subtleties that a single modality may miss. However, there remains a need for further research to bridge the divide between multimodal learning and training studies and foundational AI research.

Comments:	Submitted to ACM Computing Surveys. Currently under review
Subjects:	Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2408.14491 [cs.LG]
	(or arXiv:2408.14491v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.14491

Submission history

From: Clayton Cohn
[v1] Thu, 22 Aug 2024 22:42:23 UTC (2,395 KB)
