Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition (Accepted by LREC-COLING 2024)

Emotion Recognition in Conversation (ERC) is a crucial task in natural language processing and affective computing. This paper proposes MultiDAG+CL, a novel approach for multimodal ERC that employs Directed Acyclic Graphs (DAG) to integrate textual, acoustic, and visual features within a unified framework. The model is further enhanced with Curriculum Learning (CL) to address challenges related to emotional shifts and data imbalance: by presenting training samples in a meaningful order, from easier to harder, the curriculum improves the model's handling of emotional variations and imbalanced classes. Experimental results on the IEMOCAP and MELD datasets demonstrate that the MultiDAG+CL models outperform baseline models.

Requirements

Datasets and Utterance Features

You can download the datasets and extracted utterance features from https://drive.google.com/drive/folders/1zCfjx-HhqEY2tQlxvg1X_6T7sB6hVA2T?usp=sharing. Multimodal features are available only for IEMOCAP and MELD and are marked with "_mm" in their file names.
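To sanity-check a download before training, you can inspect the per-utterance 'cls' field, which (as the read-function snippets in the Training section suggest) holds one feature vector per modality. This is a rough sketch under assumptions: the file name and the dialogue/utterance nesting are guesses, and the expected dimensions follow from the --emb_dim arithmetic below.

```python
import pickle

# Hypothetical file name; substitute whichever "_mm" file you downloaded.
PATH = "IEMOCAP_features_mm.pkl"

with open(PATH, "rb") as f:
    dialogues = pickle.load(f)  # assumed: dialogues containing utterance dicts

u = dialogues[0][0]             # assumed layout: first utterance of first dialogue
dims = [len(vec) for vec in u["cls"]]
print(dims)                     # expected [1024, 1582, 342]: textual, acoustic, visual
assert sum(dims) == 2948        # must match --emb_dim for three-modality training
```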

Training

To train the model with all three modalities (without curriculum learning): python run.py --dataset_name IEMOCAP --gnn_layers 4 --lr 0.0005 --batch_size 16 --epochs 30 --dropout 0.4 --emb_dim 2948 (2948 = 1024 textual + 1582 acoustic + 342 visual)

In the read function (dataset.py), concatenate all three modality features: features.append(u['cls'][0] + u['cls'][1] + u['cls'][2])

To train with text + visual features only: python run.py --dataset_name IEMOCAP --gnn_layers 4 --lr 0.0005 --batch_size 16 --epochs 30 --dropout 0.4 --emb_dim 1366 (1366 = 1024 textual + 342 visual)

In the read function (dataset.py), concatenate only the text and visual features: features.append(u['cls'][0] + u['cls'][2])
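Since the two read-function edits differ only in which indices of u['cls'] are concatenated, a small helper makes the modality switch and the matching --emb_dim explicit. This is an illustrative sketch, not code from the repository; the index convention (0 = text, 1 = acoustic, 2 = visual) and per-modality sizes are inferred from the snippets and dimension arithmetic above.

```python
from typing import Dict, List, Sequence

# Per-modality feature sizes inferred from the README's emb_dim arithmetic.
MODALITY_DIMS = {0: 1024, 1: 1582, 2: 342}  # 0 = text, 1 = acoustic, 2 = visual


def select_modalities(u: Dict, indices: Sequence[int]) -> List[float]:
    """Mirror of features.append(u['cls'][i] + ...) in dataset.py's read function."""
    out: List[float] = []
    for i in indices:
        out += u["cls"][i]  # Python list concatenation, as in the original snippet
    return out


def emb_dim(indices: Sequence[int]) -> int:
    """The --emb_dim value that matches a given modality selection."""
    return sum(MODALITY_DIMS[i] for i in indices)


print(emb_dim([0, 1, 2]))  # 2948 -> all three modalities
print(emb_dim([0, 2]))     # 1366 -> text + visual
```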

To evaluate (requires a saved model): python evaluate.py --dataset_name IEMOCAP --state_dict_file /path/to/saved/model --gnn_layers 4 --lr 0.0005 --batch_size 16 --dropout 0.4 --emb_dim 2948
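For reference, evaluate.py's core steps boil down to restoring the saved state dict and running inference with gradients disabled. A minimal PyTorch sketch with a stand-in classifier (the real architecture lives in this repository and is not reproduced here; the 6 output classes match IEMOCAP's emotion labels):

```python
import torch
import torch.nn as nn

# Stand-in classifier so this sketch runs end to end; NOT the MultiDAG+CL model.
model = nn.Sequential(nn.Linear(2948, 512), nn.ReLU(), nn.Linear(512, 6))

# Simulate a saved checkpoint, then do what evaluation conceptually does:
# restore the state dict, switch to eval mode, and predict per utterance.
torch.save(model.state_dict(), "checkpoint.pt")
state = torch.load("checkpoint.pt", map_location="cpu")
model.load_state_dict(state)
model.eval()

with torch.no_grad():
    batch = torch.randn(16, 2948)        # dummy batch: 16 utterance vectors
    preds = model(batch).argmax(dim=-1)  # predicted emotion class ids
    print(preds.shape)                   # torch.Size([16])
```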

To train the model with curriculum learning, add the --curriculum and --bucket_number parameters.

To train with all three modalities and curriculum learning: python run.py --dataset_name IEMOCAP --gnn_layers 4 --lr 0.0005 --batch_size 16 --epochs 30 --dropout 0.4 --emb_dim 2948 --curriculum --bucket_number 12
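Conceptually, --bucket_number controls how many difficulty buckets the training dialogues are split into before being introduced progressively. The sketch below shows one common "baby steps" schedule, assuming difficulty is scored by how frequently the emotion shifts between consecutive utterances; the scoring function and all names are illustrative, not the repository's actual implementation.

```python
# "Baby steps" curriculum sketch: rank dialogues by an assumed difficulty
# score, split them into --bucket_number buckets, and grow the training set
# one bucket at a time from easiest to hardest.
from typing import List, Sequence


def emotion_shift_ratio(labels: Sequence[int]) -> float:
    """Assumed difficulty proxy: how often the emotion changes between turns."""
    if len(labels) < 2:
        return 0.0
    shifts = sum(a != b for a, b in zip(labels, labels[1:]))
    return shifts / (len(labels) - 1)


def make_buckets(label_seqs: List[Sequence[int]], bucket_number: int) -> List[List[int]]:
    """Sort dialogue indices easy-to-hard and split them into buckets."""
    order = sorted(range(len(label_seqs)), key=lambda i: emotion_shift_ratio(label_seqs[i]))
    size = max(1, -(-len(order) // bucket_number))  # ceiling division
    return [order[i:i + size] for i in range(0, len(order), size)]


if __name__ == "__main__":
    # Toy label sequences standing in for per-dialogue emotion annotations.
    dialogues = [[0, 0, 0], [0, 1, 0, 2], [1, 1, 2], [2, 2, 2, 2], [0, 1, 2, 1]]
    seen: List[int] = []
    for stage, bucket in enumerate(make_buckets(dialogues, bucket_number=3)):
        seen.extend(bucket)
        print(f"stage {stage}: train on dialogues {seen}")  # cumulative schedule
```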
