It was the training data pruning too!

Mudrakarta, Pramod Kaushik; Taly, Ankur; Sundararajan, Mukund; Dhamdhere, Kedar

arXiv:1803.04579 (cs)

[Submitted on 12 Mar 2018]

Abstract: We study the current best model (KDG) for question answering on tabular data evaluated over the WikiTableQuestions dataset. Previous ablation studies performed against this model attributed the model's performance to certain aspects of its architecture. In this paper, we find that the model's performance also crucially depends on a certain pruning of the data used to train the model. Disabling the pruning step drops the accuracy of the model from 43.3% to 36.3%. The large impact on the performance of the KDG model suggests that the pruning may be a useful pre-processing step in training other semantic parsers as well.
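
To put the reported figures in perspective, the following minimal sketch computes the absolute and relative accuracy drop from the two numbers quoted in the abstract (43.3% with pruning, 36.3% without). The function name `accuracy_drop` is a hypothetical helper introduced only for this illustration; it is not part of the paper or of any KDG code.

```python
def accuracy_drop(with_pruning: float, without_pruning: float) -> tuple[float, float]:
    """Return (absolute drop in percentage points, relative drop in percent)."""
    absolute = with_pruning - without_pruning
    relative = 100.0 * absolute / with_pruning
    return absolute, relative


# Accuracy figures quoted in the abstract (in percent).
absolute, relative = accuracy_drop(43.3, 36.3)
print(f"absolute drop: {absolute:.1f} points")
print(f"relative drop: {relative:.1f}%")
```

Under these figures, disabling pruning costs 7.0 percentage points, which is roughly 16% of the accuracy of the model trained with pruning.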

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:1803.04579 [cs.LG]
	(or arXiv:1803.04579v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1803.04579

Submission history

From: Ankur Taly
[v1] Mon, 12 Mar 2018 23:59:37 UTC (70 KB)
