Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–16 of 16 results for author: Daheim, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09136  [pdf, other

    cs.CL cs.AI cs.LG

    Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

    Authors: Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

    Abstract: Large language models (LLMs) present an opportunity to scale high-quality personalized education to all. A promising approach towards this means is to build dialog tutoring models that scaffold students' problem-solving. However, even though existing LLMs perform well in solving reasoning questions, they struggle to precisely detect student's errors and tailor their feedback to these errors. Inspi… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Preprint. Nico Daheim and Jakub Macina contributed equally. Code and dataset can be found under: https://github.com/eth-lre/verify-then-generate

  2. arXiv:2403.03307  [pdf, other

    cs.CL

    Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots

    Authors: Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan

    Abstract: Educational chatbots are a promising tool for assisting student learning. However, the development of effective chatbots in education has been challenging, as high-quality data is seldom available in this domain. In this paper, we propose a framework for generating synthetic teacher-student interactions grounded in a set of textbooks. Our approaches capture one aspect of learning interactions wher… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 24 pages, 19 tables, 2 figures

  3. arXiv:2403.03029  [pdf, other

    cs.CL

    Socratic Reasoning Improves Positive Text Rewriting

    Authors: Anmol Goel, Nico Daheim, Iryna Gurevych

    Abstract: Reframing a negative into a positive thought is at the crux of several cognitive approaches to mental health and psychotherapy that could be made more accessible by large language model-based solutions. Such reframing is typically non-trivial and requires multiple rationalization steps to uncover the underlying issue of a negative thought and transform it to be more positive. However, this rationa… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2402.17641  [pdf, other

    cs.LG cs.AI cs.CL math.OC stat.ML

    Variational Learning is Effective for Large Deep Networks

    Authors: Yuesong Shen, Nico Daheim, Bai Cong, Peter Nickl, Gian Maria Marconi, Clement Bazan, Rio Yokota, Iryna Gurevych, Daniel Cremers, Mohammad Emtiyaz Khan, Thomas Möllenhoff

    Abstract: We give extensive empirical evidence against the common belief that variational learning is ineffective for large neural networks. We show that an optimizer called Improved Variational Online Newton (IVON) consistently matches or outperforms Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam but its predictive uncertaint… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Published at International Conference on Machine Learning (ICML), 2024. The first two authors contributed equally. Code is available here: https://github.com/team-approx-bayes/ivon

  5. arXiv:2310.12808  [pdf, other

    cs.LG cs.AI cs.CL

    Model Merging by Uncertainty-Based Gradient Matching

    Authors: Nico Daheim, Thomas Möllenhoff, Edoardo Maria Ponti, Iryna Gurevych, Mohammad Emtiyaz Khan

    Abstract: Models trained on different datasets can be merged by a weighted-averaging of their parameters, but why does it work and when can it fail? Here, we connect the inaccuracy of weighted-averaging to mismatches in the gradients and propose a new uncertainty-based scheme to improve the performance by reducing the mismatch. The connection also reveals implicit assumptions in other schemes such as averag… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Preprint. Under review

  6. arXiv:2307.15703  [pdf, other

    cs.CL cs.AI cs.LG

    Uncertainty in Natural Language Generation: From Theory to Applications

    Authors: Joris Baan, Nico Daheim, Evgenia Ilia, Dennis Ulmer, Haau-Sing Li, Raquel Fernández, Barbara Plank, Rico Sennrich, Chrysoula Zerva, Wilker Aziz

    Abstract: Recent advances of powerful Language Models have allowed Natural Language Generation (NLG) to emerge as an important technology that can not only perform traditional tasks like summarisation or translation, but also serve as a natural language interface to a variety of applications. As such, it is crucial that NLG systems are trustworthy and reliable, for example by indicating when they are likely… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  7. arXiv:2305.14536  [pdf, other

    cs.CL

    MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems

    Authors: Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Tanmay Sinha, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

    Abstract: While automatic dialogue tutors hold great potential in making education personalized and more accessible, research on such systems has been hampered by a lack of sufficiently large and high-quality datasets. Collecting such datasets remains challenging, as recording tutoring sessions raises privacy concerns and crowdsourcing leads to insufficient data quality. To address this, we propose a framew… ▽ More

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Jakub Macina, Nico Daheim, and Sankalan Pal Chowdhury contributed equally to this work. Accepted at EMNLP2023 Findings. Code and dataset available: https://github.com/eth-nlped/mathdial

  8. arXiv:2304.07101  [pdf, other

    cs.CL cs.AI cs.LG

    Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10

    Authors: David Thulke, Nico Daheim, Christian Dugast, Hermann Ney

    Abstract: This paper summarizes our contributions to the document-grounded dialog tasks at the 9th and 10th Dialog System Technology Challenges (DSTC9 and DSTC10). In both iterations the task consists of three subtasks: first detect whether the current turn is knowledge seeking, second select a relevant knowledge document, and third generate a response grounded on the selected document. For DSTC9 we propose… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in IEEE Transactions on Audio, Speech and Language Processing. arXiv admin note: text overlap with arXiv:2112.08844

  9. arXiv:2303.17574  [pdf, other

    cs.CL cs.AI cs.LG

    Elastic Weight Removal for Faithful and Abstractive Dialogue Generation

    Authors: Nico Daheim, Nouha Dziri, Mrinmaya Sachan, Iryna Gurevych, Edoardo M. Ponti

    Abstract: Ideally, dialogue systems should generate responses that are faithful to the knowledge contained in relevant documents. However, many models generate hallucinated responses instead that contradict it or contain unverifiable information. To mitigate such undesirable behaviour, it has been proposed to fine-tune a `negative expert' on negative examples and subtract its parameters from those of a pre-… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  10. arXiv:2301.09919  [pdf, other

    cs.CL cs.AI

    Opportunities and Challenges in Neural Dialog Tutoring

    Authors: Jakub Macina, Nico Daheim, Lingzhi Wang, Tanmay Sinha, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

    Abstract: Designing dialog tutors has been challenging as it involves modeling the diverse and complex pedagogical strategies employed by human tutors. Although there have been significant recent advances in neural conversational systems using large language models (LLMs) and growth in available dialog corpora, dialog tutoring has largely remained unaffected by these advances. In this paper, we rigorously a… ▽ More

    Submitted 27 March, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: EACL 2023 (main conference, camera-ready)

  11. arXiv:2301.09008  [pdf, other

    cs.CL

    Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference

    Authors: Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan

    Abstract: Machine translation quality estimation (QE) predicts human judgements of a translation hypothesis without seeing the reference. State-of-the-art QE systems based on pretrained language models have been achieving remarkable correlations with human judgements yet they are computationally heavy and require human annotations, which are slow and expensive to create. To address these limitations, we def… ▽ More

    Submitted 25 April, 2023; v1 submitted 21 January, 2023; originally announced January 2023.

    Comments: Accepted at EACL23 (main)

  12. arXiv:2210.17418  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Controllable Factuality in Document-Grounded Dialog Systems Using a Noisy Channel Model

    Authors: Nico Daheim, David Thulke, Christian Dugast, Hermann Ney

    Abstract: In this work, we present a model for document-grounded response generation in dialog that is decomposed into two components according to Bayes theorem. One component is a traditional ungrounded response generation model and the other component models the reconstruction of the grounding document based on the dialog context and generated response. We propose different approximate decoding schemes an… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022

  13. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  14. arXiv:2112.08844  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Adapting Document-Grounded Dialog Systems to Spoken Conversations using Data Augmentation and a Noisy Channel Model

    Authors: David Thulke, Nico Daheim, Christian Dugast, Hermann Ney

    Abstract: This paper summarizes our submission to Task 2 of the second track of the 10th Dialog System Technology Challenge (DSTC10) "Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations". Similar to the previous year's iteration, the task consists of three subtasks: detecting whether a turn is knowledge seeking, selecting the relevant knowledge document and finally generating a ground… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted to the DSTC10 workshop at AAAI 2022

  15. arXiv:2106.07275  [pdf, other

    cs.CL cs.AI cs.LG

    Cascaded Span Extraction and Response Generation for Document-Grounded Dialog

    Authors: Nico Daheim, David Thulke, Christian Dugast, Hermann Ney

    Abstract: This paper summarizes our entries to both subtasks of the first DialDoc shared task which focuses on the agent response prediction task in goal-oriented document-grounded dialogs. The task is split into two subtasks: predicting a span in a document that grounds an agent turn and generating an agent response based on a dialog and grounding document. In the first subtask, we restrict the set of vali… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted by 1st DialDoc Workshop at ACL-IJCNLP 2021

  16. arXiv:2102.04643  [pdf, ps, other

    cs.CL

    Efficient Retrieval Augmented Generation from Unstructured Knowledge for Task-Oriented Dialog

    Authors: David Thulke, Nico Daheim, Christian Dugast, Hermann Ney

    Abstract: This paper summarizes our work on the first track of the ninth Dialog System Technology Challenge (DSTC 9), "Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access". The goal of the task is to generate responses to user turns in a task-oriented dialog that require knowledge from unstructured documents. The task is divided into three subtasks: detection, select… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted by DSTC9 Workshop at AAAI-2021