Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 51 results for author: Mitchell, T

Searching in archive cs. Search in all archives.
.
  1. Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions

    Authors: Steven Moore, Robin Schmucker, Tom Mitchell, John Stamper

    Abstract: Knowledge Components (KCs) linked to assessments enhance the measurement of student learning, enrich analytics, and facilitate adaptivity. However, generating and linking KCs to assessment items requires significant effort and domain-specific knowledge. To streamline this process for higher-education courses, we employed GPT-4 to generate KCs for multiple-choice questions (MCQs) in Chemistry and E… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Learning @ Scale 2024

  2. arXiv:2404.17460  [pdf, other

    cs.CL

    Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System

    Authors: Robin Schmucker, Meng Xia, Amos Azaria, Tom Mitchell

    Abstract: Conversational tutoring systems (CTSs) offer learning experiences through interactions based on natural language. They are recognized for promoting cognitive engagement and improving learning outcomes, especially in reasoning tasks. Nonetheless, the cost associated with authoring CTS content is a major obstacle to widespread adoption and to research on effective instructional design. In this paper… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.01420

  3. arXiv:2404.11483  [pdf, other

    cs.AI cs.LG

    AgentKit: Flow Engineering with Graphs, not Coding

    Authors: Yue Wu, Yewen Fan, So Yeon Min, Shrimai Prabhumoye, Stephen McAleer, Yonatan Bisk, Ruslan Salakhutdinov, Yuanzhi Li, Tom Mitchell

    Abstract: We propose an intuitive LLM prompting framework (AgentKit) for multifunctional agents. AgentKit offers a unified framework for explicitly constructing a complex "thought process" from simple natural language prompts. The basic building block in AgentKit is a node, containing a natural language prompt for a specific subtask. The user then puts together chains of nodes, like stacking LEGO pieces. Th… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  4. arXiv:2402.01108  [pdf, other

    cs.CL cs.LG

    Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

    Authors: Pouya Pezeshkpour, Eser Kandogan, Nikita Bhutani, Sajjadur Rahman, Tom Mitchell, Estevam Hruschka

    Abstract: Remarkable performance of large language models (LLMs) in a variety of tasks brings forth many opportunities as well as challenges of utilizing them in production settings. Towards practical adoption of LLMs, multi-agent systems hold great promise to augment, integrate, and orchestrate LLMs in the larger context of enterprise platforms that use existing proprietary data and models to tackle comple… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2310.01557  [pdf, other

    cs.LG cs.AI

    SmartPlay: A Benchmark for LLMs as Intelligent Agents

    Authors: Yue Wu, Xuan Tang, Tom M. Mitchell, Yuanzhi Li

    Abstract: Recent large language models (LLMs) have demonstrated great potential toward intelligent agents and next-gen automation, but there currently lacks a systematic benchmark for evaluating LLMs' abilities as agents. We introduce SmartPlay: both a challenging benchmark and a methodology for evaluating LLMs as agents. SmartPlay consists of 6 different games, including Rock-Paper-Scissors, Tower of Hanoi… ▽ More

    Submitted 17 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  6. arXiv:2310.01420  [pdf, other

    cs.CL cs.AI cs.HC

    Ruffle&Riley: Towards the Automated Induction of Conversational Tutoring Systems

    Authors: Robin Schmucker, Meng Xia, Amos Azaria, Tom Mitchell

    Abstract: Conversational tutoring systems (CTSs) offer learning experiences driven by natural language interaction. They are known to promote high levels of cognitive engagement and benefit learning outcomes, particularly in reasoning tasks. Nonetheless, the time and cost required to author CTS content is a major obstacle to widespread adoption. In this paper, we introduce a novel type of CTS that leverages… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 September, 2023; originally announced October 2023.

    Comments: NeurIPS'23 GAIED, Camera-ready

  7. arXiv:2305.15486  [pdf, other

    cs.AI cs.LG

    SPRING: Studying the Paper and Reasoning to Play Games

    Authors: Yue Wu, Shrimai Prabhumoye, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Tom Mitchell, Yuanzhi Li

    Abstract: Open-world survival games pose significant challenges for AI algorithms due to their multi-tasking, deep exploration, and goal prioritization requirements. Despite reinforcement learning (RL) being popular for solving games, its high sample complexity limits its effectiveness in complex open-world games like Crafter or Minecraft. We propose a novel approach, SPRING, to read the game's original aca… ▽ More

    Submitted 11 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2305.02412  [pdf, other

    cs.CL cs.AI cs.LG

    Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents

    Authors: Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye

    Abstract: Pre-trained large language models (LLMs) capture procedural knowledge about the world. Recent work has leveraged LLM's ability to generate abstract plans to simplify challenging control tasks, either by action scoring, or action modeling (fine-tuning). However, the transformer architecture inherits several constraints that make it difficult for the LLM to directly serve as the agent: e.g. limited… ▽ More

    Submitted 7 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

  9. arXiv:2304.13734  [pdf, other

    cs.CL cs.AI cs.LG

    The Internal State of an LLM Knows When It's Lying

    Authors: Amos Azaria, Tom Mitchell

    Abstract: While Large Language Models (LLMs) have shown exceptional performance in various tasks, one of their most prominent drawbacks is generating inaccurate or false information with a confident tone. In this paper, we provide evidence that the LLM's internal state can be used to reveal the truthfulness of statements. This includes both statements provided to the LLM, and statements that the LLM itself… ▽ More

    Submitted 17 October, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  10. arXiv:2304.13626  [pdf, other

    cs.AI

    The Roles of Symbols in Neural-based AI: They are Not What You Think!

    Authors: Daniel L. Silver, Tom M. Mitchell

    Abstract: We propose that symbols are first and foremost external communication tools used between intelligent agents that allow knowledge to be transferred in a more efficient and effective manner than having to experience the world directly. But, they are also used internally within an agent through a form of self-communication to help formulate, describe and justify subsymbolic patterns of neural activit… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 28 pages

  11. arXiv:2303.13401  [pdf, other

    cs.LG cs.CV

    Optimization and Optimizers for Adversarial Robustness

    Authors: Hengyue Liang, Buyun Liang, Le Peng, Ying Cui, Tim Mitchell, Ju Sun

    Abstract: Empirical robustness evaluation (RE) of deep learning models against adversarial perturbations entails solving nontrivial constrained optimization problems. Existing numerical algorithms that are commonly used to solve them in practice predominantly rely on projected gradient, and mostly handle perturbations modeled by the $\ell_1$, $\ell_2$ and $\ell_\infty$ distances. In this paper, we introduce… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  12. arXiv:2302.04449  [pdf, other

    cs.LG cs.AI cs.CL

    Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals

    Authors: Yue Wu, Yewen Fan, Paul Pu Liang, Amos Azaria, Yuanzhi Li, Tom M. Mitchell

    Abstract: High sample complexity has long been a challenge for RL. On the other hand, humans learn to perform tasks not only from interaction or demonstrations, but also by reading unstructured text documents, e.g., instruction manuals. Instruction manuals and wiki pages are among the most abundant data that could inform agents of valuable features and policies or task-specific environmental dynamics and re… ▽ More

    Submitted 26 October, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  13. arXiv:2212.10708  [pdf, other

    cs.CL

    Zero-shot Triplet Extraction by Template Infilling

    Authors: Bosung Kim, Hayate Iso, Nikita Bhutani, Estevam Hruschka, Ndapa Nakashole, Tom Mitchell

    Abstract: The task of triplet extraction aims to extract pairs of entities and their corresponding relations from unstructured text. Most existing methods train an extraction model on training data involving specific target relations, and are incapable of extracting new relations that were not observed at training time. Generalizing the model to unseen relations typically requires fine-tuning on synthetic t… ▽ More

    Submitted 20 September, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: IJCNLP-AACL 2023 (main)

  14. arXiv:2210.00973  [pdf, ps, other

    cs.LG cs.CV cs.MS eess.SP math.OC

    NCVX: A General-Purpose Optimization Solver for Constrained Machine and Deep Learning

    Authors: Buyun Liang, Tim Mitchell, Ju Sun

    Abstract: Imposing explicit constraints is relatively new but increasingly pressing in deep learning, stimulated by, e.g., trustworthy AI that performs robust optimization over complicated perturbation sets and scientific applications that need to respect physical laws and constraints. However, it can be hard to reliably solve constrained deep learning problems without optimization expertise. The existing d… ▽ More

    Submitted 13 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted by the NeurIPS Workshop on Optimization for Machine Learning (OPT 2022). arXiv admin note: text overlap with arXiv:2111.13984

  15. arXiv:2210.00621  [pdf, other

    cs.LG cs.CV eess.SP math.OC

    Optimization for Robustness Evaluation beyond $\ell_p$ Metrics

    Authors: Hengyue Liang, Buyun Liang, Ying Cui, Tim Mitchell, Ju Sun

    Abstract: Empirical evaluation of deep learning models against adversarial attacks entails solving nontrivial constrained optimization problems. Popular algorithms for solving these constrained problems rely on projected gradient descent (PGD) and require careful tuning of multiple hyperparameters. Moreover, PGD can only handle $\ell_1$, $\ell_2$, and $\ell_\infty$ attack models due to the use of analytical… ▽ More

    Submitted 13 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, 3 tables, accepted by the 14th International OPT Workshop on Optimization for Machine Learning, and submitted to the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  16. arXiv:2202.03980  [pdf, other

    cs.LG cs.CY

    Transferable Student Performance Modeling for Intelligent Tutoring Systems

    Authors: Robin Schmucker, Tom M. Mitchell

    Abstract: Millions of learners worldwide are now using intelligent tutoring systems (ITSs). At their core, ITSs rely on machine learning algorithms to track each user's changing performance level over time to provide personalized instruction. Crucially, student performance models are trained using interaction sequence data of previous learners to analyse data generated by future learners. This induces a col… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  17. arXiv:2111.13984  [pdf, other

    cs.LG cs.CV cs.MS eess.SP math.OC

    NCVX: A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning

    Authors: Buyun Liang, Tim Mitchell, Ju Sun

    Abstract: Optimizing nonconvex (NCVX) problems, especially nonsmooth and constrained ones, is an essential part of machine learning. However, it can be hard to reliably solve such problems without optimization expertise. Existing general-purpose NCVX optimization packages are powerful but typically cannot handle nonsmoothness. GRANSO is among the first optimization solvers targeting general nonsmooth NCVX p… ▽ More

    Submitted 1 January, 2022; v1 submitted 27 November, 2021; originally announced November 2021.

    Comments: NCVX is available at https://ncvx.org

  18. arXiv:2109.08544  [pdf, other

    cs.AI cs.CL cs.LG cs.SC

    Conversational Multi-Hop Reasoning with Neural Commonsense Knowledge and Symbolic Logic Rules

    Authors: Forough Arabshahi, Jennifer Lee, Antoine Bosselut, Yejin Choi, Tom Mitchell

    Abstract: One of the challenges faced by conversational agents is their inability to identify unstated presumptions of their users' commands, a task trivial for humans due to their common sense. In this paper, we propose a zero-shot commonsense reasoning system for conversational agents in an attempt to achieve this. Our reasoner uncovers unstated presumptions from user commands satisfying a general templat… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Appearing in the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  19. arXiv:2109.01753  [pdf, other

    cs.LG cs.CY

    Assessing the Performance of Online Students -- New Data, New Approaches, Improved Accuracy

    Authors: Robin Schmucker, Jingbo Wang, Shijia Hu, Tom M. Mitchell

    Abstract: We consider the problem of assessing the changing performance levels of individual students as they go through online courses. This student performance (SP) modeling problem is a critical step for building adaptive online teaching systems. Specifically, we conduct a study of how to utilize various types and large amounts of student log data to train accurate machine learning (ML) models that predi… ▽ More

    Submitted 8 February, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

  20. arXiv:2106.04072  [pdf, ps, other

    cs.AI cs.LG

    Coarse-to-Fine Curriculum Learning

    Authors: Otilia Stretcu, Emmanouil Antonios Platanios, Tom M. Mitchell, Barnabás Póczos

    Abstract: When faced with learning challenging new tasks, humans often follow sequences of steps that allow them to incrementally build up the necessary skills for performing these new tasks. However, in machine learning, models are most often trained to solve the target tasks directly.Inspired by human learning, we propose a novel curriculum learning approach which decomposes challenging tasks into sequenc… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  21. arXiv:2105.02486  [pdf, other

    cs.CL

    Towards General Natural Language Understanding with Probabilistic Worldbuilding

    Authors: Abulhair Saparov, Tom M. Mitchell

    Abstract: We introduce the Probabilistic Worldbuilding Model (PWM), a new fully-symbolic Bayesian model of semantic parsing and reasoning, as a first step in a research program toward more domain- and task-general NLU and AI. Humans create internal mental models of their observations which greatly aid in their ability to understand and reason about a large variety of problems. In PWM, the meanings of senten… ▽ More

    Submitted 20 December, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted to TACL; pre-MIT Press publication version

  22. arXiv:2102.09103  [pdf, other

    cs.CY

    Gender Bias, Social Bias and Representation: 70 Years of B$^H$ollywood

    Authors: Kunal Khadilkar, Ashiqur R. KhudaBukhsh, Tom M. Mitchell

    Abstract: With an outreach in more than 90 countries, a market share of 2.1 billion dollars and a target audience base of at least 1.2 billion people, Bollywood, aka the Mumbai film industry, is a formidable entertainment force. While the number of lives Bollywood can potentially touch is massive, no comprehensive NLP study on the evolution of social and gender biases in Bollywood dialogues exists. Via a su… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  23. Screen2Vec: Semantic Embedding of GUI Screens and GUI Components

    Authors: Toby Jia-Jun Li, Lindsay Popowski, Tom M. Mitchell, Brad A. Myers

    Abstract: Representing the semantics of GUI screens and components is crucial to data-driven computational methods for modeling user-GUI interactions and mining GUI designs. Existing GUI semantic representations are limited to encoding either the textual content, the visual design and layout patterns, or the app contexts. Many representation techniques also require significant manual data annotation efforts… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Accepted to CHI Conference on Human Factors in Computing Systems (CHI 2021)

  24. arXiv:2101.10112  [pdf, other

    cs.CY cs.CL

    Fringe News Networks: Dynamics of US News Viewership following the 2020 Presidential Election

    Authors: Ashiqur R. KhudaBukhsh, Rupak Sarkar, Mark S. Kamlet, Tom M. Mitchell

    Abstract: The growing political polarization of the American electorate over the last several decades has been widely studied and documented. During the administration of President Donald Trump, charges of "fake news" made social and news media not only the means but, to an unprecedented extent, the topic of political communication. Using data from before the November 3rd, 2020 US Presidential election, rec… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  25. arXiv:2010.02339  [pdf, ps, other

    cs.CL cs.CY

    We Don't Speak the Same Language: Interpreting Polarization through Machine Translation

    Authors: Ashiqur R. KhudaBukhsh, Rupak Sarkar, Mark S. Kamlet, Tom M. Mitchell

    Abstract: Polarization among US political parties, media and elites is a widely studied topic. Prominent lines of prior research across multiple disciplines have observed and analyzed growing polarization in social media. In this paper, we present a new methodology that offers a fresh perspective on interpreting polarization through the lens of machine translation. With a novel proposition that two sub-comm… ▽ More

    Submitted 18 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

  26. arXiv:2009.08424  [pdf, other

    cs.CL cs.AI cs.LG

    Modeling Task Effects on Meaning Representation in the Brain via Zero-Shot MEG Prediction

    Authors: Mariya Toneva, Otilia Stretcu, Barnabas Poczos, Leila Wehbe, Tom M. Mitchell

    Abstract: How meaning is represented in the brain is still one of the big open questions in neuroscience. Does a word (e.g., bird) always have the same representation, or does the task under which the word is processed alter its representation (answering "can you eat it?" versus "can it fly?")? The brain activity of subjects who read the same word while performing different semantic tasks has been shown to… ▽ More

    Submitted 15 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: accepted at NeurIPS 2020

  27. arXiv:2008.13347  [pdf, other

    cs.CL cs.CY cs.LG

    Discovering Bilingual Lexicons in Polyglot Word Embeddings

    Authors: Ashiqur R. KhudaBukhsh, Shriphani Palakodety, Tom M. Mitchell

    Abstract: Bilingual lexicons and phrase tables are critical resources for modern Machine Translation systems. Although recent results show that without any seed lexicon or parallel data, highly accurate bilingual lexicons can be learned using unsupervised methods, such methods rely on the existence of large, clean monolingual corpora. In this work, we utilize a single Skip-gram model trained on a multilingu… ▽ More

    Submitted 30 August, 2020; originally announced August 2020.

  28. arXiv:2008.00045  [pdf

    cs.CY

    From Data to Knowledge to Action: A Global Enabler for the 21st Century

    Authors: Eric Horvitz, Tom Mitchell

    Abstract: A confluence of advances in the computer and mathematical sciences has unleashed unprecedented capabilities for enabling true evidence-based decision making. These capabilities are making possible the large-scale capture of data and the transformation of that data into insights and recommendations in support of decisions about challenging problems in science, society, and government. Key advances… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: A Computing Community Consortium (CCC) white paper, 8 pages

  29. arXiv:2006.10022  [pdf, other

    cs.AI cs.CL cs.LG stat.ML

    Conversational Neuro-Symbolic Commonsense Reasoning

    Authors: Forough Arabshahi, Jennifer Lee, Mikayla Gawarecki, Kathryn Mazaitis, Amos Azaria, Tom Mitchell

    Abstract: In order for conversational AI systems to hold more natural and broad-ranging conversations, they will require much more commonsense, including the ability to identify unstated presumptions of their conversational partners. For example, in the command "If it snows at night then wake me up early because I don't want to be late for work" the speaker relies on commonsense reasoning of the listener to… ▽ More

    Submitted 2 February, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Appearing in the 35th AAAI international Conference on Artificial Intelligence, 2021

  30. arXiv:2004.03473  [pdf, other

    cs.LG stat.ML

    Learning from Imperfect Annotations

    Authors: Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

    Abstract: Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective, inconsistent, and may contain a variety of human biases. To improve the data quality, practitioners often need to collect multiple annotations per example and aggrega… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  31. arXiv:2003.02622  [pdf

    cs.HC cs.AI

    Towards Effective Human-AI Collaboration in GUI-Based Interactive Task Learning Agents

    Authors: Toby Jia-Jun Li, Jingya Chen, Tom M. Mitchell, Brad A. Myers

    Abstract: We argue that a key challenge in enabling usable and useful interactive task learning for intelligent agents is to facilitate effective Human-AI collaboration. We reflect on our past 5 years of efforts on designing, developing and studying the SUGILITE system, discuss the issues on incorporating recent advances in AI with HCI principles in mixed-initiative interactions and multi-modal interactions… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Journal ref: CHI 2020 Workshop on Artificial Intelligence for HCI: A Modern Approach (AI4HCI)

  32. arXiv:2002.06306  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Jelly Bean World: A Testbed for Never-Ending Learning

    Authors: Emmanouil Antonios Platanios, Abulhair Saparov, Tom Mitchell

    Abstract: Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging research… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020

    Journal ref: International Conference on Learning Representations 2020

  33. arXiv:1912.06074  [pdf, other

    cs.LG cs.AI stat.ML

    Game Design for Eliciting Distinguishable Behavior

    Authors: Fan Yang, Liu Leqi, Yifan Wu, Zachary C. Lipton, Pradeep Ravikumar, William W. Cohen, Tom Mitchell

    Abstract: The ability to inferring latent psychological traits from human behavior is key to developing personalized human-interacting machine learning systems. Approaches to infer such traits range from surveys to manually-constructed experiments and games. However, these traditional games are limited because they are typically designed based on heuristics. In this paper, we formulate the task of designing… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  34. arXiv:1910.12795  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Learning Data Manipulation for Augmentation and Weighting

    Authors: Zhiting Hu, Bowen Tan, Ruslan Salakhutdinov, Tom Mitchell, Eric P. Xing

    Abstract: Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of data manipulation. In this work, we propose a new method that supports learning different manipulation schemes with the same gradient-based algorithm. Our approa… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  35. arXiv:1910.12197  [pdf, other

    cs.CL cs.LG

    Look-up and Adapt: A One-shot Semantic Parser

    Authors: Zhichu Lu, Forough Arabshahi, Igor Labutov, Tom Mitchell

    Abstract: Computing devices have recently become capable of interacting with their end users via natural language. However, they can only operate within a limited "supported" domain of discourse and fail drastically when faced with an out-of-domain utterance, mainly due to the limitations of their semantic parser. In this paper, we propose a semantic parser that generalizes to out-of-domain examples by lear… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Comments: 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  36. arXiv:1909.00031  [pdf, other

    cs.HC cs.AI

    Interactive Task and Concept Learning from Natural Language Instructions and GUI Demonstrations

    Authors: Toby Jia-Jun Li, Marissa Radensky, Justin Jia, Kirielle Singarajah, Tom M. Mitchell, Brad A. Myers

    Abstract: Natural language programming is a promising approach to enable end users to instruct new tasks for intelligent agents. However, our formative study found that end users would often use unclear, ambiguous or vague concepts when naturally instructing tasks in natural language, especially when specifying conditionals. Existing systems have limited support for letting the user teach agents new concept… ▽ More

    Submitted 6 January, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: The AAAI-20 Workshop on Intelligent Process Automation (IPA-20)

  37. arXiv:1906.11861  [pdf, other

    cs.CL

    Relating Simple Sentence Representations in Deep Neural Networks and the Brain

    Authors: Sharmistha Jat, Hao Tang, Partha Talukdar, Tom Mitchell

    Abstract: What is the relationship between sentence representations learned by deep recurrent models against those encoded by the brain? Is there any correspondence between hidden layers of these recurrent models and brain regions when processing sentences? Can these deep models be used to synthesize brain data which can then be utilized in other extrinsic tasks? We investigate these questions using sentenc… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: Association for Computational Linguistics (ACL) 2019

  38. Understanding language-elicited EEG data by predicting it from a fine-tuned language model

    Authors: Dan Schwartz, Tom Mitchell

    Abstract: Electroencephalography (EEG) recordings of brain activity taken while participants read or listen to language are widely used within the cognitive neuroscience and psycholinguistics communities as a tool to study language comprehension. Several time-locked stereotyped EEG responses to word-presentations -- known collectively as event-related potentials (ERPs) -- are thought to be markers for seman… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: To appear in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics

  39. arXiv:1903.09848  [pdf, other

    cs.CL cs.LG stat.ML

    Competence-based Curriculum Learning for Neural Machine Translation

    Authors: Emmanouil Antonios Platanios, Otilia Stretcu, Graham Neubig, Barnabas Poczos, Tom M. Mitchell

    Abstract: Current state-of-the-art NMT systems use large neural networks that are not only slow to train, but also often require many heuristics and optimization tricks, such as specialized learning rate schedules and large batch sizes. This is undesirable as it requires extensive hyperparameter tuning. In this paper, we propose a curriculum learning framework for NMT that reduces training time, reduces the… ▽ More

    Submitted 26 March, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

    Journal ref: NAACL 2019

  40. arXiv:1902.09091  [pdf, other

    cs.CL cs.AI cs.LG

    Leveraging Knowledge Bases in LSTMs for Improving Machine Reading

    Authors: Bishan Yang, Tom Mitchell

    Abstract: This paper focuses on how to take advantage of external knowledge bases (KBs) to improve recurrent neural networks for machine reading. Traditional methods that exploit knowledge from KBs encode knowledge as discrete indicator features. Not only do these features generalize poorly, but they require task-specific feature engineering to achieve good performance. We propose KBLSTM, a novel neural mod… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: published at ACL 2017

    Journal ref: ACL 2017

  41. arXiv:1902.08373  [pdf, other

    cs.CL cs.LG

    Learning to Learn Semantic Parsers from Natural Language Supervision

    Authors: Igor Labutov, Bishan Yang, Tom Mitchell

    Abstract: As humans, we often rely on language to learn language. For example, when corrected in a conversation, we may learn from that correction, over time improving our language fluency. Inspired by this observation, we propose a learning algorithm for training semantic parsers from supervision (feedback) expressed in natural language. Our algorithm learns a semantic parser from users' corrections such a… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: published at EMNLP 2018

  42. arXiv:1902.01827  [pdf

    physics.chem-ph cs.HC physics.bio-ph physics.ed-ph

    Interactive molecular dynamics in virtual reality from quantum chemistry to drug binding: An open-source multi-person framework

    Authors: Michael O'Connor, Simon J. Bennie, Helen M. Deeks, Alexander Jamieson-Binnie, Alex J. Jones, Robin J. Shannon, Rebecca Walters, Thomas J. Mitchell, Adrian J. Mulholland, David R. Glowacki

    Abstract: As molecular scientists have made progress in their ability to engineer nano-scale molecular structure, we are facing new challenges in our ability to engineer molecular dynamics (MD) and flexibility. Dynamics at the molecular scale differs from the familiar mechanics of everyday objects, because it involves a complicated, highly correlated, and three-dimensional many-body dynamical choreography w… ▽ More

    Submitted 1 May, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

  43. arXiv:1811.05546  [pdf, other

    cs.CL

    Discourse in Multimedia: A Case Study in Information Extraction

    Authors: Mrinmaya Sachan, Kumar Avinava Dubey, Eduard H. Hovy, Tom M. Mitchell, Dan Roth, Eric P. Xing

    Abstract: To ensure readability, text is often written and presented with due formatting. These text formatting devices help the writer to effectively convey the narrative. At the same time, these help the readers pick up the structure of the discourse and comprehend the conveyed information. There have been a number of linguistic theories on discourse structure of text. However, these theories only conside… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

  44. arXiv:1808.08493  [pdf, other

    cs.CL cs.LG stat.ML

    Contextual Parameter Generation for Universal Neural Machine Translation

    Authors: Emmanouil Antonios Platanios, Mrinmaya Sachan, Graham Neubig, Tom Mitchell

    Abstract: We propose a simple modification to existing neural machine translation (NMT) models that enables using a single universal model to translate between multiple languages while allowing for language specific parameterization, and that can also be used for domain adaptation. Our approach requires no changes to the model architecture of a standard NMT system, but instead introduces a new component, th… ▽ More

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: Published in the proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2018

  45. arXiv:1803.05805  [pdf, other

    cs.HC physics.bio-ph physics.comp-ph q-bio.OT

    Sonifying stochastic walks on biomolecular energy landscapes

    Authors: Robert E. Arbon, Alex J. Jones, Lars A. Bratholm, Tom Mitchell, David R. Glowacki

    Abstract: Translating the complex, multi-dimensional data from simulations of biomolecules to intuitive knowledge is a major challenge in computational chemistry and biology. The so-called "free energy landscape" is amongst the most fundamental concepts used by scientists to understand both static and dynamic properties of biomolecular systems. In this paper we use Markov models to design a strategy for map… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

  46. arXiv:1705.07086  [pdf, other

    cs.LG cs.AI stat.ML

    Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach

    Authors: Emmanouil A. Platanios, Hoifung Poon, Tom M. Mitchell, Eric Horvitz

    Abstract: We propose an efficient method to estimate the accuracy of classifiers using only unlabeled data. We consider a setting with multiple classification problems where the target classes may be tied together through logical constraints. For example, a set of classes may be mutually exclusive, meaning that a data instance can belong to at most one of them. The proposed method is based on the intuition… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

  47. arXiv:1612.05348  [pdf, other

    cs.AI cs.CL

    Machine Reading with Background Knowledge

    Authors: Ndapandula Nakashole, Tom M. Mitchell

    Abstract: Intelligent systems capable of automatically understanding natural language text are important for many artificial intelligence applications including mobile phone voice assistants, computer vision, and robotics. Understanding language often constitutes fitting new information into a previously acquired view of the world. However, many machine reading systems rely on the text alone to infer its me… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: 28 pages

    MSC Class: 68T50

  48. arXiv:1609.03632  [pdf, other

    cs.CL cs.AI

    Joint Extraction of Events and Entities within a Document Context

    Authors: Bishan Yang, Tom Mitchell

    Abstract: Events and entities are closely related; entities are often actors or participants in events and events without entities are uncommon. The interpretation of events and entities is highly contextually dependent. Existing work in information extraction typically models events separately from entities, and performs inference at the sentence level, ignoring the rest of the document. In this paper, we… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: 11 pages, 2 figures, published at NAACL 2016

    Journal ref: Proceedings of NAACL-HLT 2016, pages 289-299

  49. arXiv:1512.00112  [pdf, other

    cs.CL cs.AI cs.SI

    Inferring Interpersonal Relations in Narrative Summaries

    Authors: Shashank Srivastava, Snigdha Chaturvedi, Tom Mitchell

    Abstract: Characterizing relationships between people is fundamental for the understanding of narratives. In this work, we address the problem of inferring the polarity of relationships between people in narrative summaries. We formulate the problem as a joint structured prediction for each narrative, and present a model that combines evidence from linguistic and semantic features, as well as features based… ▽ More

    Submitted 30 November, 2015; originally announced December 2015.

  50. arXiv:1404.3301  [pdf, other

    cs.AI

    Efficient Inference and Learning in a Large Knowledge Base: Reasoning with Extracted Information using a Locally Groundable First-Order Probabilistic Logic

    Authors: William Yang Wang, Kathryn Mazaitis, Ni Lao, Tom Mitchell, William W. Cohen

    Abstract: One important challenge for probabilistic logics is reasoning with very large knowledge bases (KBs) of imperfect information, such as those produced by modern web-scale information extraction systems. One scalability problem shared by many probabilistic logics is that answering queries involves "grounding" the query---i.e., mapping it to a propositional representation---and the size of a "groundin… ▽ More

    Submitted 12 April, 2014; originally announced April 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1305.2254