SAGA: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications
Recommendations
Machine Learning and Data Cleaning: Which Serves the Other?
The last few years witnessed significant advances in building automated or semi-automated data quality, data cleaning and data integration systems powered by machine learning (ML). In parallel, large deployment of ML systems in business, science, ...
Data cleaning and machine learning: a systematic literature review
Machine Learning (ML) is integrated into a growing number of systems for various applications. Because the performance of an ML model is highly dependent on the quality of the data it has been trained on, there is a growing interest in approaches ...
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- Research-article
Article Metrics
- Total Citations: 0
- Total Downloads: 531
- Downloads (Last 12 months): 531
- Downloads (Last 6 weeks): 84