Combining difficulty ranking with multi-armed bandits to sequence educational content

A Segal, Y Ben David, JJ Williams, K Gal… - Artificial Intelligence in …, 2018 - Springer
Artificial Intelligence in Education: 19th International Conference, AIED 2018 …, 2018Springer
We address the problem of how to personalize educational content to students in order to
maximize their learning gains over time. We present a new computational approach to this
problem called MAPLE (Multi-Armed Bandits based Personalization for Learning
Environments) that combines difficulty ranking with multi-armed bandits. Given a set of target
questions MAPLE estimates the expected learning gains for each question and uses an
exploration-exploitation strategy to choose the next question to pose to the student. It …
Abstract
We address the problem of how to personalize educational content to students in order to maximize their learning gains over time. We present a new computational approach to this problem called MAPLE (Multi-Armed Bandits based Personalization for Learning Environments) that combines difficulty ranking with multi-armed bandits. Given a set of target questions MAPLE estimates the expected learning gains for each question and uses an exploration-exploitation strategy to choose the next question to pose to the student. It maintains a personalized ranking over the difficulties of question in the target set and updates it in real-time according to students’ progress. We show in simulations that MAPLE was able to improve students’ learning gains compared to approaches that sequence questions in increasing level of difficulty, or rely on content experts. When implemented in a live e-learning system in the wild, MAPLE showed promising initial results.
Springer