Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3430665.3456314acmconferencesArticle/Chapter ViewAbstractPublication PagesiticseConference Proceedingsconference-collections
research-article
Open access

A Quantitative Analysis of Student Solutions to Graph Database Problems

Published: 26 June 2021 Publication History
  • Get Citation Alerts
  • Abstract

    As data grow both in size and in connectivity, the interest to use graph databases in the industry has been proliferating. However, there has been little research on graph database education. In response to the need to introduce college students to graph databases, this paper is the first to analyze students' errors in homework submissions of queries written in Cypher, the query language for Neo4j---the most prominent graph database. Based on 40,093 student submissions from homework assignments in an upper-level computer science database course at one university, this paper provides a quantitative analysis of students' learning when solving graph database problems. The data shows that students struggle the most to correctly use Cypher's WITH clause to define variable names before referencing in the WHERE clause and these errors persist over multiple homework problems requiring the same techniques, and we suggest a further improvement on the classification of syntactic errors.

    References

    [1]
    Alireza Ahadi, Vahid Behbood, Arto Vihavainen, Julia Prior, and Raymond Lister. 2016. Students' Syntactic Mistakes in Writing Seven Different Types of SQL Queries and Its Application to Predicting Students' Success. In Proceedings of the 47th ACM Technical Symposium on Computing Science Education (Memphis, Tennessee, USA) (SIGCSE '16). ACM, New York, NY, USA, 401--406. https://doi.org/10.1145/2839509.2844640
    [2]
    Alireza Ahadi, Julia Prior, Vahid Behbood, and Raymond Lister. 2015. A Quantitative Study of the Relative Difficulty for Novices of Writing Seven Different Types of SQL Queries. In Proceedings of the 2015 ACM Conference on Innovation and Technology in Computer Science Education (Vilnius, Lithuania) (ITiCSE '15). ACM, New York, NY, USA, 201--206. https://doi.org/10.1145/2729094.2742620
    [3]
    Abdussalam Alawini. 2018. Database Systems. https://alawini.com/teaching/cs411-database-systems/
    [4]
    Ridha Alkhabaz, Seth Poulsen, Mei Chen, and Abdussalam Alawini. 2021. Insights from Student Solutions to MongoDB Homework Problems. In Proceedings of the 2021 ACM Conference on Innovation and Technology in Computer Science Education (Virtual Event, Germany) (ITiCSE '21). Association for Computing Machinery, New York, NY, USA, 7.
    [5]
    Peter Brusilovsky, Sergey Sosnovsky, Michael V. Yudelson, Danielle H. Lee, Vladimir Zadorozhny, and Xin Zhou. 2010. Learning SQL Programming with Interactive Tools: From Integration to Personalization. ACM Trans. Comput. Educ., Vol. 9, 4, Article 19 (Jan. 2010), 15 pages. https://doi.org/10.1145/1656255.1656257
    [6]
    Mike Buerli. 2012. The current state of graph databases. Department of Computer Science, Cal Poly San Luis Obispo, mbuerli@ calpoly. edu, Vol. 32, 3 (2012), 67--83.
    [7]
    Edgar F Codd. 2002. A relational model of data for large shared data banks. In Software pioneers. Springer, 263--294.
    [8]
    Susan Davidson. 2020. Data Management in the Cloud. https://www.seas.upenn.edu/ cis550/
    [9]
    María del Pilar Ángeles. [n.d.]. NoSQL systems. https://www.coursera.org/learn/nosql-databases
    [10]
    Diogo Fernandes and Jorge Bernardino. 2018. Graph Databases Comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB. In DATA. 373--380.
    [11]
    Ghislain Fourny. 2019. Big Data. https://video.ethz.ch/lectures/d-infk/2019/autumn/263--3010-00L.html
    [12]
    Brad Fowler, Joy Godin, and Margaret E Geddy. 2016. Teaching Case: Introduction to NoSQL in a Traditional Database Course. J. Inf. Syst. Educ., Vol. 27 (2016), 99--104.
    [13]
    José Guia, Valéria Gonçalves Soares, and Jorge Bernardino. 2017a. Graph Databases: Neo4j Analysis. In ICEIS .
    [14]
    José Guia, Valéria Goncc alves Soares, and Jorge Bernardino. 2017b. Graph Databases: Neo4j Analysis. In ICEIS (1). 351--356.
    [15]
    Amarnath Gupta. [n.d.]. Graph Analytics for Big Data. https://www.coursera.org/learn/big-data-graph-analytics
    [16]
    Dimitrios Kotsifakos, Dimitrios Magetos, Alexandros Veletsos, and Christos Douligeris. 2019. Teaching the Basic Commands of NoSQL Databases Using Neo4j in Vocational Education and Training (VET). European Journal of Engineering Research and Science CIE (Apr. 2019), 13--18. https://doi.org/10.24018/ejers.2019.0.CIE.1291
    [17]
    Josep Lluís Larriba-Pey, Norbert Martínez-Bazán, and David Domínguez-Sal. 2014. Introduction to Graph Databases .Springer International Publishing, Cham, 171--194. https://doi.org/10.1007/978--3--319--10587--1_4
    [18]
    Jo ao Ricardo Lourencc o, Bruno Cabral, Paulo Carreiro, Marco Vieira, and Jorge Bernardino. 2015. Choosing the right NoSQL database for the job: a quality attribute evaluation. Journal of Big Data, Vol. 2, 1 (2015), 1--26.
    [19]
    Davin McCall and Michael Kölling. 2014. Meaningful categorisation of novice programmer errors. In 2014 IEEE Frontiers in Education Conference (FIE) Proceedings. IEEE, 1--8.
    [20]
    Justin J Miller. 2013. Graph database applications and concepts with Neo4j. In Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA, Vol. 2324.
    [21]
    Sriram Mohan. 2018. Teaching NoSQL Databases to Undergraduate Students: A Novel Approach. In Proceedings of the 49th ACM Technical Symposium on Computer Science Education (Baltimore, Maryland, USA) (SIGCSE '18). Association for Computing Machinery, New York, NY, USA, 314--319. https://doi.org/10.1145/3159450.3159554
    [22]
    Neo4j, Inc. 2019. Neo4j. https://neo4j.com/
    [23]
    Seth Poulsen, Liia Butler, Abdussalam Alawini, and Geoffrey L. Herman. 2020. Insights from Student Solutions to SQL Homework Problems. In Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education (Trondheim, Norway) (ITiCSE '20). Association for Computing Machinery, New York, NY, USA, 404--410. https://doi.org/10.1145/3341525.3387391
    [24]
    Rabi Prasad, Padhy Manas, Ranjan Patra, and Suresh Chandra Satapathy. 2011. RDBMS to NoSQL: Reviewing Some Next-Generation Non-Relational Databases., 15--30 pages.
    [25]
    Phyllis Reisner. 1981. Human Factors Studies of Database Query Languages: A Survey and Assessment. ACM Comput. Surv., Vol. 13, 1 (March 1981), 13--31. https://doi.org/10.1145/356835.356837
    [26]
    Facts & Factors Research. Feb, 2020. Graph Database Market By Product Type (Resource Description Framework and Property Graph), and By Application (BFSI, IT & Telecom, Healthcare & Life Sciences, Transportation & Logistics, Retail & Ecommerce, Government & Public, and Others): Global Industry Perspective, Comprehensive Analysis, and Forecast, 2019 - 2026. https://www.fnfresearch.com/graph-database-market-by-product-type-resource-description
    [27]
    Kristin Tuft and David Maier. 2014. Data Management in the Cloud. http://datalab.cs.pdx.edu/education/clouddbms-win2014/page.php?content=index
    [28]
    Chad Vicknair, Michael Macias, Zhendong Zhao, Xiaofei Nan, Yixin Chen, and Dawn Wilkins. 2010. A Comparison of a Graph Database and a Relational Database: A Data Provenance Perspective. In Proceedings of the 48th Annual Southeast Regional Conference (Oxford, Mississippi) (ACM SE '10). Association for Computing Machinery, New York, NY, USA, Article 42, 6 pages. https://doi.org/10.1145/1900008.1900067
    [29]
    Matthew West, Geoffrey L. Herman, and Craig Zilles. 2015. PrairieLearn: Mastery-based Online Problem Solving with Adaptive Scoring and Recommendations Driven by Machine Learning. In 2015 ASEE Annual Conference & Exposition. ASEE Conferences, Seattle, Washington, 26.1238.1--26.1238.14. https://peer.asee.org/24575.

    Cited By

    View all
    • (2024)Teaching Multiple Data Models and Query LanguagesProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653628(234-240)Online publication date: 3-Jul-2024
    • (2023)Assessing Student Learning Across Various Database Query Languages2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343409(1-9)Online publication date: 18-Oct-2023
    • (2022)Teaching about Data and DatabasesACM SIGMOD Record10.1145/3552490.355250451:2(52-60)Online publication date: 29-Jul-2022

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ITiCSE '21: Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education V. 1
    June 2021
    611 pages
    ISBN:9781450382144
    DOI:10.1145/3430665
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 June 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Neo4j
    2. database education
    3. online assessment

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    ITiCSE 2021
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 552 of 1,613 submissions, 34%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)132
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Teaching Multiple Data Models and Query LanguagesProceedings of the 2024 on Innovation and Technology in Computer Science Education V. 110.1145/3649217.3653628(234-240)Online publication date: 3-Jul-2024
    • (2023)Assessing Student Learning Across Various Database Query Languages2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10343409(1-9)Online publication date: 18-Oct-2023
    • (2022)Teaching about Data and DatabasesACM SIGMOD Record10.1145/3552490.355250451:2(52-60)Online publication date: 29-Jul-2022

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media