Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–14 of 14 results for author: Feng, S Y

.
  1. arXiv:2408.03617  [pdf, other

    cs.CL cs.AI cs.LG

    Is Child-Directed Speech Effective Training Data for Language Models?

    Authors: Steven Y. Feng, Noah D. Goodman, Michael C. Frank

    Abstract: While high-performing language models are typically trained on hundreds of billions of words, human children become fluent language users with a much smaller amount of data. What are the features of the data they receive, and how do these features support language modeling objectives? To investigate this question, we train GPT-2 models on 29M words of English-language child-directed speech and a n… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Preprint. Code and data will be released soon

  2. arXiv:2406.10447  [pdf, other

    cs.CV

    The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

    Authors: Bria Long, Violet Xiang, Stefan Stojanov, Robert Z. Sparks, Zi Yin, Grace E. Keene, Alvin W. M. Tan, Steven Y. Feng, Chengxu Zhuang, Virginia A. Marchman, Daniel L. K. Yamins, Michael C. Frank

    Abstract: Human children far exceed modern machine learning algorithms in their sample efficiency, achieving high performance in key domains with much less data than current models. This ''data gap'' is a key challenge both for building intelligent artificial systems and for understanding human development. Egocentric video capturing children's experience -- their ''training data'' -- is a key ingredient fo… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 4 tables and SI. Submitted to NeurIPS Datasets and Benchmarks

  3. arXiv:2210.04191  [pdf, other

    cs.CL cs.AI cs.LG

    CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models

    Authors: Steven Y. Feng, Vivek Khetan, Bogdan Sacaleanu, Anatole Gershman, Eduard Hovy

    Abstract: We motivate and introduce CHARD: Clinical Health-Aware Reasoning across Dimensions, to investigate the capability of text generation models to act as implicit clinical knowledge bases and generate free-flow textual explanations about various health-related conditions across several dimensions. We collect and present an associated dataset, CHARDat, consisting of explanations about 52 health conditi… ▽ More

    Submitted 12 February, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: EACL 2023. Code available at https://github.com/styfeng/CHARD

  4. arXiv:2209.08950  [pdf, other

    physics.optics physics.bio-ph

    Using fluorescent beads to emulate single flurophores

    Authors: Luis A. Aleman-Castaneda, Sherry Yi-Ting Feng, Rodrigo Gutierrez-Cuevas, Isael Herrera, Thomas G. Brown, Sophie Brasselet, Miguel A. Alonso

    Abstract: In this work, we study the conditions under which fluorescent beads can be used to emulate single fluorescent molecules in the calibration of optical microscopes. Although beads are widely used due to their brightness and easy manipulation, there can be notable differences between the point spread functions (PSFs) they produce and those for single-molecule fluorophores, caused by their different e… ▽ More

    Submitted 6 December, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Journal ref: J. Opt. Soc. Am. A 39, C167-C178 (2022)

  5. arXiv:2209.07752  [pdf, other

    cs.CL cs.AI cs.LG

    PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation

    Authors: Sedrick Scott Keh, Kevin Lu, Varun Gangal, Steven Y. Feng, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy

    Abstract: A personification is a figure of speech that endows inanimate entities with properties and actions typically seen as requiring animacy. In this paper, we explore the task of personification generation. To this end, we propose PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation. We curate a corpus of personifications called Personif… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022; official Github repo at https://github.com/sedrickkeh/PINEAPPLE

  6. arXiv:2209.06275  [pdf, other

    cs.CL cs.AI cs.LG

    PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically

    Authors: Sedrick Scott Keh, Steven Y. Feng, Varun Gangal, Malihe Alikhani, Eduard Hovy

    Abstract: Tongue twisters are meaningful sentences that are difficult to pronounce. The process of automatically generating tongue twisters is challenging since the generated utterance must satisfy two conditions at once: phonetic difficulty and semantic meaning. Furthermore, phonetic difficulty is itself hard to characterize and is expressed in natural tongue twisters through a heterogeneous mix of phenome… ▽ More

    Submitted 14 February, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: EACL 2023. Code at https://github.com/sedrickkeh/PANCETTA

  7. arXiv:2109.03892  [pdf, other

    cs.CL cs.AI cs.LG

    Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models

    Authors: Steven Y. Feng, Kevin Lu, Zhuofu Tao, Malihe Alikhani, Teruko Mitamura, Eduard Hovy, Varun Gangal

    Abstract: We investigate the use of multimodal information contained in images as an effective method for enhancing the commonsense of Transformer models for text generation. We perform experiments using BART and T5 on concept-to-text generation, specifically the task of generative commonsense reasoning, or CommonGen. We call our approach VisCTG: Visually Grounded Concept-to-Text Generation. VisCTG involves… ▽ More

    Submitted 25 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to AAAI 2022. Code at https://github.com/styfeng/VisCTG

  8. arXiv:2108.06643  [pdf, other

    cs.CL cs.AI cs.LG

    SAPPHIRE: Approaches for Enhanced Concept-to-Text Generation

    Authors: Steven Y. Feng, Jessica Huynh, Chaitanya Narisetty, Eduard Hovy, Varun Gangal

    Abstract: We motivate and propose a suite of simple but effective improvements for concept-to-text generation called SAPPHIRE: Set Augmentation and Post-hoc PHrase Infilling and REcombination. We demonstrate their effectiveness on generative commonsense reasoning, a.k.a. the CommonGen task, through experiments using both BART and T5 models. Through extensive automatic and human evaluation, we show that SAPP… ▽ More

    Submitted 1 December, 2021; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: INLG 2021 [Best Long Paper]. Code available at https://github.com/styfeng/SAPPHIRE

  9. arXiv:2105.03075  [pdf, other

    cs.CL cs.AI cs.LG

    A Survey of Data Augmentation Approaches for NLP

    Authors: Steven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura, Eduard Hovy

    Abstract: Data augmentation has recently seen increased interest in NLP due to more work in low-resource domains, new tasks, and the popularity of large-scale neural networks that require large amounts of training data. Despite this recent upsurge, this area is still relatively underexplored, perhaps due to the challenges posed by the discrete nature of language data. In this paper, we present a comprehensi… ▽ More

    Submitted 1 December, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted to ACL 2021 Findings. GitHub repo with paper list at https://github.com/styfeng/DataAug4NLP ; Talk at https://www.youtube.com/watch?v=kNBVesKUZCk&ab_channel=StevenFeng ; Podcast at https://www.youtube.com/watch?v=qmqyT_97Poc&ab_channel=GradientFlow and https://thedataexchange.media/data-augmentation-in-natural-language-processing

  10. arXiv:2104.06669  [pdf, other

    cs.CL cs.AI

    NAREOR: The Narrative Reordering Problem

    Authors: Varun Gangal, Steven Y. Feng, Malihe Alikhani, Teruko Mitamura, Eduard Hovy

    Abstract: Many implicit inferences exist in text depending on how it is structured that can critically impact the text's interpretation and meaning. One such structural aspect present in text with chronology is the order of its presentation. For narratives or stories, this is known as the narrative order. Reordering a narrative can impact the temporal, causal, event-based, and other inferences readers draw… ▽ More

    Submitted 27 March, 2022; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to AAAI 2022; Code at https://github.com/vgtomahawk/NAREORCamReady

  11. arXiv:2010.01794  [pdf, other

    cs.CL cs.AI cs.LG

    GenAug: Data Augmentation for Finetuning Text Generators

    Authors: Steven Y. Feng, Varun Gangal, Dongyeop Kang, Teruko Mitamura, Eduard Hovy

    Abstract: In this paper, we investigate data augmentation for text generation, which we call GenAug. Text generation and language modeling are important tasks within natural language processing, and are especially challenging for low-data regimes. We propose and evaluate various augmentation methods, including some that incorporate external knowledge, for finetuning GPT-2 on a subset of Yelp Reviews. We als… ▽ More

    Submitted 10 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020 Deep Learning Inside Out (DeeLIO) Workshop; Code available at https://github.com/styfeng/GenAug

  12. arXiv:1910.08293  [pdf, other

    cs.CL cs.AI cs.LG

    ALOHA: Artificial Learning of Human Attributes for Dialogue Agents

    Authors: Aaron W. Li, Veronica Jiang, Steven Y. Feng, Julia Sprague, Wei Zhou, Jesse Hoey

    Abstract: For conversational AI and virtual assistants to communicate with humans in a realistic way, they must exhibit human characteristics such as expression of emotion and personality. Current attempts toward constructing human-like dialogue agents have presented significant difficulties. We propose Human Level Attributes (HLAs) based on tropes as the basis of a method for learning dialogue agents that… ▽ More

    Submitted 1 December, 2021; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: AAAI 2020. Code available at https://github.com/newpro/aloha-chatbot Talk at https://www.youtube.com/watch?v=TtomrolC4Dc&ab_channel=StevenFeng

  13. arXiv:1909.00088  [pdf, other

    cs.CL cs.IR cs.LG

    Keep Calm and Switch On! Preserving Sentiment and Fluency in Semantic Text Exchange

    Authors: Steven Y. Feng, Aaron W. Li, Jesse Hoey

    Abstract: In this paper, we present a novel method for measurably adjusting the semantics of text while preserving its sentiment and fluency, a task we call semantic text exchange. This is useful for text data augmentation and the semantic correction of text generated by chatbots and virtual assistants. We introduce a pipeline called SMERTI that combines entity replacement, similarity masking, and text infi… ▽ More

    Submitted 21 September, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: EMNLP-IJCNLP 2019; Code available at https://github.com/styfeng/SMERTI

  14. Multiband Fitting to Three Long GRBs with Fermi/LAT Data: Structured Ejecta Sweeping up a Density-Jump Medium

    Authors: S. Y. Feng, Z. G. Dai

    Abstract: We present broadband (radio, optical, X-ray and GeV) fits to the afterglow light curves and spectra of three long-duration gamma-ray bursts (GRBs 080916C, 090902B, and 090926A) detected by the Gamma-Ray Burst Monitor (GBM) and Large Area Telescope (LAT) instruments on the Fermi satellite. Using the observed broadband data, we study the origin of the high energy emission, and suggest that the early… ▽ More

    Submitted 4 June, 2011; v1 submitted 13 November, 2010; originally announced November 2010.

    Comments: 32 pages, 4 figures, 2 tables; some minor typo corrected, optical depth does not have significant contribution to the result, major conclusions unchanged

    Journal ref: Research in Astron. Astrophys. 2011, Vol .11 No. 9