Author: Vartak, Manasi : Search

research-article

The last 5+ years in ML have focused on building the best models, hyperparameter optimization, parallel training, massive neural networks, etc. Now that the building of models has become easy, models are being integrated into every piece of software and ...

research-article

Opportunities for data management research in the era of horizontal AI/ML

Proceedings of the VLDB Endowment (PVLDB), Volume 12, Issue 12Page 2323https://doi.org/10.14778/3352063.3352149

AI/ML is becoming a horizontal technology: its application is expanding to more domains, and its integration touches more parts of the technology stack. Given the strong dependence of ML on data, this expansion creates a new space for applying data ...

abstract

DEEM 2019: Workshop on Data Management for End-to-End Machine Learning

SIGMOD '19: Proceedings of the 2019 International Conference on Management of DataPages 2066–2067https://doi.org/10.1145/3299869.3323598

The DEEM workshop brings together researchers and practitioners at the intersection of applied machine learning, data management and systems research, with the goal to discuss the arising data management issues in machine learning application scenarios.

...

research-article

MISTIQUE: A System to Store and Query Model Intermediates for Model Diagnosis

SIGMOD '18: Proceedings of the 2018 International Conference on Management of DataPages 1285–1300https://doi.org/10.1145/3183713.3196934

Model diagnosis is the process of analyzing machine learning (ML) model performance to identify where the model works well and where it doesn't. It is a key part of the modeling process and helps ML developers iteratively improve model accuracy. Often, ...

Article

Free

A meta-learning perspective on cold-start recommendations for items

NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing SystemsPages 6907–6917

Matrix factorization (MF) is one of the most popular techniques for product recommendation, but is known to suffer from serious cold-start problems. Item cold-start problems are particularly acute in settings such as Tweet recommendation where new items ...

research-article

Towards Visualization Recommendation Systems

ACM SIGMOD Record (SIGMOD), Volume 45, Issue 4Pages 34–39https://doi.org/10.1145/3092931.3092937

Data visualization is often used as the first step while performing a variety of analytical tasks. With the advent of large, high-dimensional datasets and significant interest in data science, there is a need for tools that can support rapid visual ...

research-article

ModelDB: a system for machine learning model management

HILDA '16: Proceedings of the Workshop on Human-In-the-Loop Data AnalyticsArticle No.: 14, Pages 1–3https://doi.org/10.1145/2939502.2939516

Building a machine learning model is an iterative process. A data scientist will build many tens to hundreds of models before arriving at one that meets some acceptance criteria (e.g. AUC cutoff, accuracy threshold). However, the current style of model ...

research-article

SeeDB: efficient data-driven visualization recommendations to support visual analytics

Proceedings of the VLDB Endowment (PVLDB), Volume 8, Issue 13Pages 2182–2193https://doi.org/10.14778/2831360.2831371

Data analysts often build visualizations as the first step in their analytical workflow. However, when working with high-dimensional datasets, identifying visualizations that show relevant or desired trends in data can be laborious. We propose SeeDB, a ...

research-article

A demonstration of the BigDAWG polystore system

Proceedings of the VLDB Endowment (PVLDB), Volume 8, Issue 12Pages 1908–1911https://doi.org/10.14778/2824032.2824098

This paper presents BigDAWG, a reference implementation of a new architecture for "Big Data" applications. Such applications not only call for large-scale analytics, but also for real-time streaming support, smaller analytics at interactive speeds, data ...

research-article

SeeDB: automatically generating query visualizations

Proceedings of the VLDB Endowment (PVLDB), Volume 7, Issue 13Pages 1581–1584https://doi.org/10.14778/2733004.2733035

Data analysts operating on large volumes of data often rely on visualizations to interpret the results of queries. However, finding the right visualization for a query is a laborious and time-consuming task. We demonstrate SeeDB, a system that partially ...

research-article

GenBase: a complex analytics genomics benchmark

SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of DataPages 177–188https://doi.org/10.1145/2588555.2595633

This paper introduces a new benchmark designed to test database management system (DBMS) performance on a mix of data management tasks (joins, filters, etc.) and complex analytics (regression, singular value decomposition, etc.) Such mixed workloads are ...

demonstration

CHIC: a combination-based recommendation system

SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of DataPages 981–984https://doi.org/10.1145/2463676.2465270

Current recommender systems are focused largely on recommending items based on similarity. For instance, Netflix can recommend movies similar to previously viewed movies, and Amazon can recommend items based on ratings of similar users. Although ...

demonstration

QRelX: generating meaningful queries that provide cardinality assurance

SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of dataPages 1215–1218https://doi.org/10.1145/1807167.1807323

In many business and consumer applications, queries have cardinality constraints. However, current database systems provide minimal support for cardinality assurance. Consequently, users must adopt a cumbersome trial-and-error approach to find queries ...

research-article

The ASSISTment Builder: Supporting the Life Cycle of Tutoring System Content Creation

IEEE Transactions on Learning Technologies (IEEETLT), Volume 2, Issue 2Pages 157–166https://doi.org/10.1109/TLT.2009.23

Content creation is a large component of the cost of creating educational software. Estimates are that approximately 200 hours of development time are required for every hour of instruction. We present an authoring tool designed to reduce this cost as ...

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

From ML models to intelligent applications: the rise of MLOps

Opportunities for data management research in the era of horizontal AI/ML

DEEM 2019: Workshop on Data Management for End-to-End Machine Learning

MISTIQUE: A System to Store and Query Model Intermediates for Model Diagnosis

A meta-learning perspective on cold-start recommendations for items

Towards Visualization Recommendation Systems

ModelDB: a system for machine learning model management

SeeDB: efficient data-driven visualization recommendations to support visual analytics

A demonstration of the BigDAWG polystore system

SeeDB: automatically generating query visualizations

GenBase: a complex analytics genomics benchmark

CHIC: a combination-based recommendation system

QRelX: generating meaningful queries that provide cardinality assurance

The ASSISTment Builder: Supporting the Life Cycle of Tutoring System Content Creation

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder