Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration

Balloccu, Simone; Reiter, Ehud; Kumar, Vivek; Recupero, Diego Reforgiato; Riboni, Daniele

Computer Science > Computation and Language

arXiv:2401.08420 (cs)

[Submitted on 16 Jan 2024]

Title:Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration

Authors:Simone Balloccu, Ehud Reiter, Vivek Kumar, Diego Reforgiato Recupero, Daniele Riboni

View PDF

Abstract:Large Language Models (LLMs), with their flexible generation abilities, can be powerful data sources in domains with few or no available corpora. However, problems like hallucinations and biases limit such applications. In this case study, we pick nutrition counselling, a domain lacking any public resource, and show that high-quality datasets can be gathered by combining LLMs, crowd-workers and nutrition experts. We first crowd-source and cluster a novel dataset of diet-related issues, then work with experts to prompt ChatGPT into producing related supportive text. Finally, we let the experts evaluate the safety of the generated text. We release HAI-coaching, the first expert-annotated nutrition counselling dataset containing ~2.4K dietary struggles from crowd workers, and ~97K related supportive texts generated by ChatGPT. Extensive analysis shows that ChatGPT while producing highly fluent and human-like text, also manifests harmful behaviours, especially in sensitive topics like mental health, making it unsuitable for unsupervised use.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.08420 [cs.CL]
	(or arXiv:2401.08420v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.08420

Submission history

From: Simone Balloccu Mr [view email]
[v1] Tue, 16 Jan 2024 15:07:09 UTC (9,651 KB)

Computer Science > Computation and Language

Title:Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators