Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3430895.3460979acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesl-at-sConference Proceedingsconference-collections
Work in Progress

Analyzing Patterns in Student SQL Solutions via Levenshtein Edit Distance

Published: 08 June 2021 Publication History

Abstract

Structured Query Language (SQL), the standard language for relational database management systems, is an essential skill for software developers, data scientists, and professionals who need to interact with databases. SQL is highly structured and presents diverse ways for learners to acquire this skill. However, despite the significance of SQL to other related fields, little research has been done to understand how students learn SQL as they work on homework assignments. In this paper, we analyze students' SQL submissions to homework problems of the Database Systems course offered at the University of Illinois at Urbana-Champaign. For each student, we compute the Levenshtein Edit Distances between every submission and their final submission to understand how students reached their final solution and how they overcame any obstacles in their learning process. Our system visualizes the edit distances between students' submissions to a SQL problem, enabling instructors to identify interesting learning patterns and approaches. These findings will help instructors target their instruction in difficult SQL areas for the future and help students learn SQL more effectively.

Supplementary Material

MP4 File (L-at-S21-lswp152.mp4)
Structured Query Language (SQL) is highly structured and presents diverse ways for learners to acquire this skill, including data scientists and professionals who must interact with databases. Despite the significance of SQL to related fields, little research has been done to understand how students learn SQL as they work on assignments. We analyze students' SQL submissions to homework problems of the Database Systems course at the University of Illinois at Urbana-Champaign. For each student, we compute the Levenshtein Edit Distances between each current and final submission to understand how students reached their final solution by overcoming any obstacles in their learning process. Our system visualizes the edit distances between students' submissions to a SQL problem, enabling instructors to identify learning patterns and approaches. These findings will help instructors target their instruction in difficult SQL areas for the future and help students learn SQL more effectively.

References

[1]
A. Ahadi, J. Prior, V. Behbood, and R. Lister. 2015. A Quantitative Study of the Relative Difficulty for Novices of Writing Seven Different Types of SQL Queries. In Proceedings of the 2015 ACM Conference on Innovation and Technology in Computer Science Education (ITiCSE '15). ACM, New York, NY, USA, 201--206.
[2]
L. Cagliero, L. De Russis, L. Farinetti, and T. Montanaro. 2018. Improving the Effectiveness of SQL Learning Practice: A Data-Driven Approach. In 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC), Vol. 01. 980--989.
[3]
J. Danaparamita and W. Gatterbauer. 2011. QueryViz: Helping Users Understand SQL Queries and Their Patterns. In Proceedings of the 14th International Conference on Extending Database Technology (EDBT/ICDT '11). ACM, New York, NY, USA.
[4]
Ashley DiFranza. 2020. 5 Reasons SQL is the Need-to-Know Skill for Data Analysts. (2020).
[5]
Robert C. Jinkens. 2009. Nontraditional Students: Who Are They? SIGCSE Bull. 43, 4 (Dec. 2009), 979--987.
[6]
A. Mitrovic. 1998. Learning SQL with a Computerized Tutor. In Proceedings of the Twenty-Ninth SIGCSE Technical Symposium on Computer Science Education (SIGCSE '98). ACM, New York, NY, USA, 307--311.

Cited By

View all
  • (2024)Understanding the Characteristics of Students’ Behavioral Processes in Solving Computational Thinking Problems Based on the Behavioral SequencesJournal of Educational Computing Research10.1177/07356331241251397Online publication date: 29-Apr-2024
  • (2023)Mining SQL Problem Solving Patterns using Advanced Sequence Processing AlgorithmsProceedings of the 2nd International Workshop on Data Systems Education: Bridging education practice with education research10.1145/3596673.3596973(37-43)Online publication date: 23-Jun-2023
  • (2023)Assessing Student Learning Across Various Database Query Languages2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343409(1-9)Online publication date: 18-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
L@S '21: Proceedings of the Eighth ACM Conference on Learning @ Scale
June 2021
380 pages
ISBN:9781450382151
DOI:10.1145/3430895
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 June 2021

Check for updates

Author Tags

  1. SQL
  2. database education
  3. levenshtein edit distance
  4. online assessment

Qualifiers

  • Work in progress

Conference

L@S '21
L@S '21: Eighth (2021) ACM Conference on Learning @ Scale
June 22 - 25, 2021
Virtual Event, Germany

Acceptance Rates

Overall Acceptance Rate 117 of 440 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)4
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Understanding the Characteristics of Students’ Behavioral Processes in Solving Computational Thinking Problems Based on the Behavioral SequencesJournal of Educational Computing Research10.1177/07356331241251397Online publication date: 29-Apr-2024
  • (2023)Mining SQL Problem Solving Patterns using Advanced Sequence Processing AlgorithmsProceedings of the 2nd International Workshop on Data Systems Education: Bridging education practice with education research10.1145/3596673.3596973(37-43)Online publication date: 23-Jun-2023
  • (2023)Assessing Student Learning Across Various Database Query Languages2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343409(1-9)Online publication date: 18-Oct-2023
  • (2023)Comparison of Student Learning Outcomes Among SQL Problem-Solving Patterns2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343395(1-9)Online publication date: 18-Oct-2023
  • (2023)Uncovering Patterns of SQL Errors in Student Assignments: A Comparative Analysis of Different Assignment Types2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343207(01-09)Online publication date: 18-Oct-2023
  • (2022)Analyzing Student SQL Solutions via Hierarchical Clustering and Sequence Alignment ScoresProceedings of the 1st International Workshop on Data Systems Education10.1145/3531072.3535319(10-15)Online publication date: 12-Jun-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media