Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3545945.3569767acmconferencesArticle/Chapter ViewAbstractPublication PagessigcseConference Proceedingsconference-collections
research-article
Open access

Towards a Validated Self-Efficacy Scale for Data Management

Published: 03 March 2023 Publication History

Abstract

We propose a self-efficacy scale for data management. The scale assesses students' perceived capabilities in mastering the breadth and depth of modern data management, as well as hands-on skills for effective management of data. Such capabilities are critical to computing and data science students. We have conducted factor analysis to validate the scale. The analysis produced a factor model with high internal consistencies. Group analyses using the factor solution and statistical testing show that (1) males and females have similar self-efficacy, except for the depth of knowledge where females showed higher confidences; and (2) CS students had much higher self-efficacy than non-CS students. To the best of our knowledge, this is the first self-efficacy scale for data management.

References

[1]
Albert Bandura and Sebastian Wessels. 1994. Self-efficacy (in Encyclopedia of Human Behavior). Vol. 4. Academic Press.
[2]
Coen Bernaards, Robert Jennrich, and Maintainer Paul Gilbert. 2022. Package GPArotation. (2022).
[3]
Ismail Bile Hassan, Thanaa Ghanem, David Jacobson, Simon Jin, Katherine Johnson, Dalia Sulieman, and Wei Wei. 2021. Data science curriculum design: A case study. In Proceedings of the 52nd ACM Technical Symposium on Computer Science Education. 529--534.
[4]
J Martin Bland and Douglas G Altman. 1997. Statistics notes: Cronbach's alpha. Bmj, Vol. 314, 7080 (1997), 572.
[5]
Jennifer M Blaney and Jane G Stout. 2017. Examining the relationship between introductory computing course experiences, self-efficacy, and belonging among first-generation college women. In Proceedings of the 2017 ACM SIGCSE technical symposium on computer science education. 69--74.
[6]
Ryan Bockmon, Stephen Cooper, Jonathan Gratch, and Mohsen Dorodchi. 2020. Validating a CS attitudes instrument. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education. 899--904.
[7]
Richard A Brown. 2009. Hadoop at home: large-scale computing at a small college. In Proceedings of the 40th ACM Technical Symposium on Computer Science Education. 106--110.
[8]
Tor Busch. 1995. Gender differences in self-efficacy and attitudes toward computers. Journal of educational computing research, Vol. 12, 2 (1995), 147--158.
[9]
Bill Chambers and Matei Zaharia. 2018. Spark: The definitive guide: Big data processing made simple. "O'Reilly Media, Inc.".
[10]
Jacob Cohen. 2013. Statistical power analysis for the behavioral sciences. Routledge.
[11]
Gregory W Corder and Dale I Foreman. 2014. Nonparametric statistics: A step-by-step approach. John Wiley & Sons.
[12]
Sarah Dahlby Albright, Titus H Klinge, and Samuel A Rebelsky. 2018. A functional approach to data science in CS1. In Proceedings of the 49th ACM Technical Symposium on Computer Science Education. 1035--1040.
[13]
Holger Danielsiek, Laura Toma, and Jan Vahrenhold. 2018. An instrument to assess self-efficacy in introductory algorithms courses. ACM Inroads, Vol. 9, 1 (2018), 56--65.
[14]
Richard D De Veaux, Mahesh Agarwal, Maia Averett, Benjamin S Baumer, Andrew Bray, Thomas C Bressoud, Lance Bryant, Lei Z Cheng, Amanda Francis, Robert Gould, et al. 2017. Curriculum guidelines for undergraduate programs in data science. Annu Rev Stat Appl, Vol. 4 (2017), 15--30.
[15]
Debzani Deb, Muztaba Fuad, and Keith Irwin. 2019. A module-based approach to teaching big data and cloud computing topics at cs undergraduate level. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education. 2--8.
[16]
Brian Dorn and Allison Elliott Tew. 2015. Empirical validation and application of the computing attitudes survey. Computer Science Education, Vol. 25, 1 (2015), 1--36.
[17]
Allison Elliott Tew, Brian Dorn, and Oliver Schneider. 2012. Toward a validated computing attitudes survey. In Proceedings of the ninth annual international conference on International computing education research. 135--142.
[18]
Catherine O Fritz, Peter E Morris, and Jennifer J Richler. 2012. Effect size estimates: current use, calculations, and interpretation. Journal of experimental psychology: General, Vol. 141, 1 (2012), 2.
[19]
Gene V Glass. 1965. A ranking variable analogue of biserial correlation: Implications for short-cut item analysis. Journal of Educational Measurement, Vol. 2, 1 (1965), 91--95.
[20]
James W Grice. 2001. Computing and evaluating factor scores. Psychological methods, Vol. 6, 4 (2001), 430.
[21]
IBM. 2020. The Data Science Skills Competency Model. https://www.ibm.com/downloads/cas/7109RLQM.
[22]
Richard Arnold Johnson, Dean W Wichern, et al. 2014. Applied multivariate statistical analysis. Vol. 6. Pearson London, UK:.
[23]
Maria Kallia and Sue Sentance. 2019. Learning to use functions: The relationship between misconceptions and self-efficacy. In Proceedings of the 50th ACM technical symposium on computer science education. 752--758.
[24]
Dave S Kerby. 2014. The simple difference formula: An approach to teaching nonparametric correlation. Comprehensive Psychology, Vol. 3 (2014), 11--IT.
[25]
Suneuy Kim. 2020. Seamless Integration of NoSQL class into the Database Curriculum. In Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education. 314--320.
[26]
Bruce M King, Patrick J Rosopa, and Edward W Minium. 2018. Statistical reasoning in the behavioral sciences. John Wiley & Sons.
[27]
James M Lattin, J Douglas Carroll, and Paul E Green. 2003. Analyzing multivariate data. Vol. 1. Thomson Brooks/Cole Pacific Grove, CA.
[28]
W Lenhard and A Lenhard. 2016. Computation of effect sizes. Psychometrica. http://www.psychometrica.de/effect_size.html.
[29]
Robert C MacCallum, Keith F Widaman, Shaobo Zhang, and Sehee Hong. 1999. Sample size in factor analysis. Psychological methods, Vol. 4, 1 (1999), 84.
[30]
Christine A Murphy, Delphine Coover, and Steven V Owen. 1989. Development and validation of the computer self-efficacy scale. Educational and Psychological measurement, Vol. 49, 4 (1989), 893--899.
[31]
Frank Pajares. 1997. Current directions in self-efficacy research. Advances in motivation and achievement, Vol. 10, 149 (1997), 1--49.
[32]
Frank Pajares and Laura Graham. 1999. Self-efficacy, motivation constructs, and mathematics performance of entering middle school students. Contemporary educational psychology, Vol. 24, 2 (1999), 124--139.
[33]
Ariel S Rabkin, Charles Reiss, Randy Katz, and David Patterson. 2012. Experiences teaching MapReduce in the cloud. In Proceedings of the 43rd ACM technical symposium on Computer Science Education. 601--606.
[34]
Vennila Ramalingam, Deborah LaBelle, and Susan Wiedenbeck. 2004. Self-efficacy and mental models in learning to program. In Proceedings of the 9th annual SIGCSE conference on Innovation and technology in computer science education. 171--175.
[35]
Vennila Ramalingam and Susan Wiedenbeck. 1998. Development and validation of scores on a computer programming self-efficacy scale and group analyses of novice programmer self-efficacy. Journal of Educational Computing Research, Vol. 19, 4 (1998), 367--381.
[36]
RDocumentation. 2022. factor.scores: Various ways to estimate factor scores for the factor analysis model.
[37]
William Revelle. 2022. How To: Use the psych package for Factor Analysis and data reduction.
[38]
Mariam Salloum, Daniel Jeske, Wenxiu Ma, Vagelis Papalexakis, Christian Shelton, Vassilis Tsotras, and Shuheng Zhou. 2021. Developing an interdisciplinary data science program. In Proceedings of the 52nd ACM Technical Symposium on Computer Science Education. 509--515.
[39]
Barbara G Tabachnick, Linda S Fidell, and Jodie B Ullman. 2007. Using multivariate statistics. Vol. 5. pearson Boston, MA.
[40]
Jos MF Ten Berge, Wim P Krijnen, Tom Wansbeek, and Alexander Shapiro. 1999. Some new results on correlation-preserving factor scores prediction methods. Linear algebra and its applications, Vol. 289, 1--3 (1999), 311--318.
[41]
Tom White. 2012. Hadoop: The definitive guide. "O'Reilly Media, Inc.".
[42]
Wensheng Wu. 2021a. CS vs non-CS: Analyzing Online Social Behaviors of Data Science Students with Diverse Academic Backgrounds. In Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education V. 2. 643--643.
[43]
Wensheng Wu. 2021b. One Size Doesn't Fit All: Diversifying Data Science Course Projects by Student Background and Interests. In Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education V. 1. 67--73.
[44]
Wensheng Wu. 2021c. SQL2X: Learning SQL, NoSQL, and MapReduce via Translation. In Proceedings of the 52nd ACM Technical Symposium on Computer Science Education. 590--596.
[45]
Wensheng Wu. 2022. Investigating Internship Experiences of Data Science Students for Curriculum Enhancement. In Proceedings of the 27th ACM Conference on on Innovation and Technology in Computer Science Education Vol. 1. 505--511.
[46]
Barry J Zimmerman, Albert Bandura, and Manuel Martinez-Pons. 1992. Self-motivation for academic attainment: The role of self-efficacy beliefs and personal goal setting. American educational research journal, Vol. 29, 3 (1992), 663--676.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGCSE 2023: Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 1
March 2023
1481 pages
ISBN:9781450394314
DOI:10.1145/3545945
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 March 2023

Check for updates

Author Tags

  1. data management
  2. factor analysis
  3. group analysis
  4. self efficacy

Qualifiers

  • Research-article

Conference

SIGCSE 2023
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,595 of 4,542 submissions, 35%

Upcoming Conference

SIGCSE Virtual 2024
1st ACM Virtual Global Computing Education Conference
December 5 - 8, 2024
Virtual Event , NC , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 239
    Total Downloads
  • Downloads (Last 12 months)146
  • Downloads (Last 6 weeks)20
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media