
Showing 1–50 of 87 results for author: Ramakrishnan, N

Searching in archive cs.
  1. arXiv:2406.14541  [pdf, other]

    cs.LG

    Are LLMs Naturally Good at Synthetic Tabular Data Generation?

    Authors: Shengzhe Xu, Cho-Ting Lee, Mandar Sharma, Raquib Bin Yousuf, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Large language models (LLMs) have demonstrated their prowess in generating synthetic text and images; however, their potential for generating tabular data -- arguably the most common data type in business and scientific applications -- is largely underexplored. This paper demonstrates that LLMs, used as-is, or after traditional fine-tuning, are severely inadequate as synthetic table generators. Du…

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.14005  [pdf, other]

    cs.CL cs.AI cs.LG

    Information Guided Regularization for Fine-tuning Language Models

    Authors: Mandar Sharma, Nikhil Muralidhar, Shengzhe Xu, Raquib Bin Yousuf, Naren Ramakrishnan

    Abstract: The pretraining-fine-tuning paradigm has been the de facto strategy for transfer learning in modern language modeling. With the understanding that task adaptation in LMs is often a function of parameters shared across tasks, we argue that a more surgical approach to regularization needs to exist for smoother transfer learning. Towards this end, we investigate how the pretraining loss landscape is…

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2404.01536  [pdf, other]

    cs.CL cs.AI cs.LG

    Laying Anchors: Semantically Priming Numerals in Language Modeling

    Authors: Mandar Sharma, Rutuja Murlidhar Taware, Pravesh Koirala, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Off-the-shelf pre-trained language models have become the de facto standard in NLP pipelines for a multitude of downstream tasks. However, the inability of these models to properly encode numerals limits their performance on tasks requiring numeric comprehension. We introduce strategies to semantically prime numerals in any corpus by generating anchors governed by the distribution of numerals in s…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to the findings of NAACL 2024

  4. arXiv:2402.01748  [pdf, other]

    cs.NI cs.AI cs.CL cs.LG

    Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems

    Authors: Shengzhe Xu, Christo Kurisummoottil Thomas, Omar Hashash, Nikhil Muralidhar, Walid Saad, Naren Ramakrishnan

    Abstract: Large language models (LLMs) and foundation models have been recently touted as a game-changer for 6G systems. However, recent efforts on LLMs for wireless networks are limited to a direct application of existing language models that were designed for natural language processing (NLP) applications. To address this challenge and create wireless-centric foundation models, this paper presents a compr…

    Submitted 7 February, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

  5. arXiv:2305.08246  [pdf, other]

    cs.CL cs.AI cs.LG

    Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency

    Authors: Mandar Sharma, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: The field of Math-NLP has witnessed significant growth in recent years, motivated by the desire to expand LLM performance to the learning of non-linguistic notions (numerals, and subsequently, arithmetic reasoning). However, non-linguistic skill injection typically comes at a cost for LLMs: it leads to catastrophic forgetting of core linguistic skills, a consequence that often remains unaddressed…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023's main conference

  6. arXiv:2212.11666  [pdf, other]

    cs.IT quant-ph

    Channel Simulation: Finite Blocklengths and Broadcast Channels

    Authors: Michael X. Cao, Navneeth Ramakrishnan, Mario Berta, Marco Tomamichel

    Abstract: We study channel simulation under common randomness-assistance in the finite-blocklength regime and identify the smooth channel max-information as a linear program one-shot converse on the minimal simulation cost for fixed error tolerance. We show that this one-shot converse can be achieved exactly using no-signaling assisted codes, and approximately achieved using common randomness-assisted codes…

    Submitted 8 June, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 32 pages, 10 figures

  7. arXiv:2211.02098  [pdf, other]

    cs.CL cs.AI cs.LG

    Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic

    Authors: Mandar Sharma, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Through their transfer learning abilities, highly-parameterized large pre-trained language models have dominated the NLP landscape for a multitude of downstream language tasks. Though linguistically proficient, the inability of these models to incorporate the learning of non-linguistic entities (numerals and arithmetic reasoning) limits their usage for tasks that require numeric comprehension or s…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022: Math-AI Workshop

  8. arXiv:2210.13994  [pdf, other]

    cs.CV

    Minutiae-Guided Fingerprint Embeddings via Vision Transformers

    Authors: Steven A. Grosz, Joshua J. Engelsma, Rajeev Ranjan, Naveen Ramakrishnan, Manoj Aggarwal, Gerard G. Medioni, Anil K. Jain

    Abstract: Minutiae matching has long dominated the field of fingerprint recognition. However, deep networks can be used to extract fixed-length embeddings from fingerprints. To date, the few studies that have explored the use of CNN architectures to extract such embeddings have shown extreme promise. Inspired by these early works, we propose the first use of a Vision Transformer (ViT) to learn a discriminat…

    Submitted 25 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  9. arXiv:2210.02841  [pdf, other]

    cs.CR cs.LG

    Detecting Irregular Network Activity with Adversarial Learning and Expert Feedback

    Authors: Gopikrishna Rathinavel, Nikhil Muralidhar, Timothy O'Shea, Naren Ramakrishnan

    Abstract: Anomaly detection is a ubiquitous and challenging task relevant across many disciplines. With the vital role communication networks play in our daily lives, the security of these networks is imperative for smooth functioning of society. To this end, we propose a novel self-supervised deep learning framework CAAD for anomaly detection in wireless communication systems. Specifically, CAAD employs co…

    Submitted 15 October, 2022; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures

  10. arXiv:2208.02867  [pdf, other]

    math.OC cs.AI cs.NE

    Memetic algorithms for Spatial Partitioning problems

    Authors: Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Spatial optimization problems (SOPs) are characterized by spatial relationships governing the decision variables, objectives, and/or constraint functions. In this article, we focus on a specific type of SOP called spatial partitioning, which is a combinatorial problem due to the presence of discrete spatial units. Exact optimization methods do not scale with the size of the problem, especially wit…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 32 pages, accepted at ACM Transactions on Spatial Algorithms and Systems: Special issue on the Best Papers from the 2020 ACM SIGSPATIAL Conference

    ACM Class: G.1.6; I.2.8

  11. arXiv:2208.00493  [pdf, other]

    cs.LG cs.AI

    Scrutinizing Shipment Records To Thwart Illegal Timber Trade

    Authors: Debanjan Datta, Sathappan Muthiah, John Simeone, Amelia Meadows, Naren Ramakrishnan

    Abstract: Timber and forest products made from wood, like furniture, are valuable commodities, and like the global trade of many highly-valued natural resources, face challenges of corruption, fraud, and illegal harvesting. These grey and black market activities in the wood and forest products sector are not limited to the countries where the wood was harvested, but extend throughout the global supply chain…

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: Accepted in Proceedings of 6th Outlier Detection and Description Workshop, ACM SigKDD 2021 https://oddworkshop.github.io/assets/papers/7.pdf. arXiv admin note: substantial text overlap with arXiv:2104.01156

  12. arXiv:2207.12571  [pdf, other]

    cs.CL

    Innovations in Neural Data-to-text Generation: A Survey

    Authors: Mandar Sharma, Ajay Gogineni, Naren Ramakrishnan

    Abstract: The neural boom that has sparked natural language processing (NLP) research through the last decade has similarly led to significant innovations in data-to-text generation (DTG). This survey offers a consolidated view into the neural DTG paradigm with a structured examination of the approaches, benchmark datasets, and evaluation protocols. This survey draws boundaries separating DTG from the rest…

    Submitted 1 April, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted to ACM Transactions on Intelligent Systems and Technology 2024

  13. arXiv:2207.04029  [pdf, other]

    cs.IR cs.AI

    Lessons from Deep Learning applied to Scholarly Information Extraction: What Works, What Doesn't, and Future Directions

    Authors: Raquib Bin Yousuf, Subhodip Biswas, Kulendra Kumar Kaushal, James Dunham, Rebecca Gelles, Sathappan Muthiah, Nathan Self, Patrick Butler, Naren Ramakrishnan

    Abstract: Understanding key insights from full-text scholarly articles is essential as it enables us to determine interesting trends, give insight into the research and development, and build knowledge graphs. However, some of the interesting key insights are only available when considering full-text. Although researchers have made significant progress in information extraction from short documents, extract…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: ACM KDD 2022 Workshop on Data-driven Science of Science

    ACM Class: I.2; I.2.7; H.3

  14. arXiv:2206.14384  [pdf, other]

    cs.LG cs.AI stat.ME

    Framing Algorithmic Recourse for Anomaly Detection

    Authors: Debanjan Datta, Feng Chen, Naren Ramakrishnan

    Abstract: The problem of algorithmic recourse has been explored for supervised machine learning models, to provide more interpretable, transparent and robust outcomes from decision support systems. An unexplored area is that of algorithmic recourse for anomaly detection, specifically for tabular data with only discrete feature values. Here the problem is to present a set of counterfactuals that are deemed n…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: ACM SigKDD 2022, Research Track

  15. arXiv:2206.03703  [pdf, other]

    cs.AI math.NA stat.AP

    Sampling-based techniques for designing school boundaries

    Authors: Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Recently, an increasing number of researchers, especially in the realm of political redistricting, have proposed sampling-based techniques to generate a subset of plans from the vast space of districting plans. These techniques have been increasingly adopted by U.S. courts of law and independent commissions as a tool for identifying partisan gerrymanders. Motivated by these recent developments, we…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: 11 pages, 4 figures

    ACM Class: I.2.1; I.5.3; I.2.8; G.3; G.1.6

  16. arXiv:2204.02531  [pdf, other]

    cs.CL cs.AI

    Improving Zero-Shot Event Extraction via Sentence Simplification

    Authors: Sneha Mehta, Huzefa Rangwala, Naren Ramakrishnan

    Abstract: The success of sites such as ACLED and Our World in Data has demonstrated the massive utility of extracting events in structured formats from large volumes of textual data in the form of news, social media, blogs and discussion forums. Event extraction can provide a window into ongoing geopolitical crises and yield actionable intelligence. With the proliferation of large pretrained language model…

    Submitted 5 April, 2022; originally announced April 2022.

  17. arXiv:2203.05983  [pdf, other]

    cs.CV

    PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems

    Authors: Shu Hu, Chun-Hao Liu, Jayanta Dutta, Ming-Ching Chang, Siwei Lyu, Naveen Ramakrishnan

    Abstract: Semi-supervised object detection methods are widely used in autonomous driving systems, where only a fraction of objects are labeled. To propagate information from the labeled objects to the unlabeled ones, pseudo-labels for unlabeled objects must be generated. Although pseudo-labels have proven to improve the performance of semi-supervised object detection significantly, the applications of image…

    Submitted 16 April, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted by the Workshop on Autonomous Driving (WAD) at CVPR 2022

  18. arXiv:2203.02095  [pdf, other]

    cs.LG cs.AR cs.CR eess.SP

    Contrastive Graph Convolutional Networks for Hardware Trojan Detection in Third Party IP Cores

    Authors: Nikhil Muralidhar, Abdullah Zubair, Nathanael Weidler, Ryan Gerdes, Naren Ramakrishnan

    Abstract: The availability of wide-ranging third-party intellectual property (3PIP) cores enables integrated circuit (IC) designers to focus on designing high-level features in ASICs/SoCs. The massive proliferation of ICs brings with it an increased number of bad actors seeking to exploit those circuits for various nefarious reasons. This is not surprising as integrated circuits affect every aspect of socie…

    Submitted 3 March, 2022; originally announced March 2022.

    Journal ref: IEEE International Symposium on Hardware Oriented Security and Trust (HOST), 2021, pp. 181-191

  19. arXiv:2202.10446  [pdf, other]

    cs.LG physics.soc-ph q-bio.PE stat.AP

    EINNs: Epidemiologically-informed Neural Networks

    Authors: Alexander Rodríguez, Jiaming Cui, Naren Ramakrishnan, Bijaya Adhikari, B. Aditya Prakash

    Abstract: We introduce EINNs, a framework crafted for epidemic forecasting that builds upon the theoretical grounds provided by mechanistic models as well as the data-driven expressibility afforded by AI models, and their capabilities to ingest heterogeneous information. Although neural forecasting models have been successful in multiple tasks, predictions well-correlated with epidemic trends and long-term…

    Submitted 10 January, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Appears in AAAI 2023

  20. Moderate deviation expansion for fully quantum tasks

    Authors: Navneeth Ramakrishnan, Marco Tomamichel, Mario Berta

    Abstract: The moderate deviation regime is concerned with the finite block length trade-off between communication cost and error for information processing tasks in the asymptotic regime, where the communication cost approaches a capacity-like quantity and the error vanishes at the same time. We find exact characterisations of these trade-offs for a variety of fully quantum communication tasks, including qu…

    Submitted 8 October, 2023; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 32 pages

    Journal ref: IEEE Transactions on Information Theory 69(8), 5041-5059 (2023)

  21. arXiv:2111.05199  [pdf, other]

    cs.LG

    Deep diffusion-based forecasting of COVID-19 by incorporating network-level mobility information

    Authors: Padmaksha Roy, Shailik Sarkar, Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Naren Ramakrishnan, Chang-Tien Lu

    Abstract: Modeling the spatiotemporal nature of the spread of infectious diseases can provide useful intuition in understanding the time-varying aspect of the disease spread and the underlying complex spatial dependency observed in people's mobility patterns. Besides, the county level multiple related time series information can be leveraged to make a forecast on an individual time series. Adding to this ch…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: 8 pages

    ACM Class: K.5

    Journal ref: Published as conference paper at ASONAM 2021, Research Track

  22. arXiv:2110.05633  [pdf, other]

    cs.CL cs.AI

    TCube: Domain-Agnostic Neural Time-series Narration

    Authors: Mandar Sharma, John S. Brownstein, Naren Ramakrishnan

    Abstract: The task of generating rich and fluent narratives that aptly describe the characteristics, trends, and anomalies of time-series data is invaluable to the sciences (geology, meteorology, epidemiology) or finance (trades, stocks, or sales and inventory). The efforts for time-series narration hitherto are domain-specific and use predefined templates that offer consistency but lead to mechanical narra…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: To be published in IEEE ICDM 2021

  23. arXiv:2107.00079  [pdf, other]

    cs.LG

    Using AntiPatterns to avoid MLOps Mistakes

    Authors: Nikhil Muralidhar, Sathappan Muthiah, Patrick Butler, Manish Jain, Yu Yu, Katy Burne, Weipeng Li, David Jones, Prakash Arunachalam, Hays 'Skip' McCormick, Naren Ramakrishnan

    Abstract: We describe lessons learned from developing and deploying machine learning models at scale across the enterprise in a range of financial analytics applications. These lessons are presented in the form of antipatterns. Just as design patterns codify best software engineering practices, antipatterns provide a vocabulary to describe defective practices and methodologies. Here we catalog and document…

    Submitted 30 June, 2021; originally announced July 2021.

  24. arXiv:2104.01156  [pdf, other]

    cs.LG

    Detecting Anomalies Through Contrast in Heterogeneous Data

    Authors: Debanjan Datta, Sathappan Muthiah, Naren Ramakrishnan

    Abstract: Detecting anomalies has been a fundamental approach in detecting potentially fraudulent activities. Tasked with detection of illegal timber trade that threatens ecosystems and economies and association with other illegal activities, we formulate our problem as one of anomaly detection. Among other challenges, annotations are unavailable for our large-scale trade data with heterogeneous features (ca…

    Submitted 2 April, 2021; originally announced April 2021.

  25. arXiv:2101.10247  [pdf, other]

    cs.LG

    Incorporating Expert Guidance in Epidemic Forecasting

    Authors: Alexander Rodríguez, Bijaya Adhikari, Naren Ramakrishnan, B. Aditya Prakash

    Abstract: Forecasting influenza-like illnesses (ILI) has rapidly progressed in recent years from an art to a science with a plethora of data-driven methods. While these methods have achieved qualified success, their applicability is limited due to their inability to incorporate expert feedback and guidance systematically into the forecasting framework. We propose a new approach leveraging the Seldonian opti…

    Submitted 24 December, 2020; originally announced January 2021.

    Comments: Appears in SIGKDD 2020 epiDAMIK

  26. arXiv:2012.06453  [pdf, other]

    cs.NE cs.AI

    Better call Surrogates: A hybrid Evolutionary Algorithm for Hyperparameter optimization

    Authors: Subhodip Biswas, Adam D Cobb, Andreea Sistrunk, Naren Ramakrishnan, Brian Jalaian

    Abstract: In this paper, we propose a surrogate-assisted evolutionary algorithm (EA) for hyperparameter optimization of machine learning (ML) models. The proposed STEADE model initially estimates the objective function landscape using Radial Basis Function interpolation, and then transfers the knowledge to an EA technique called Differential Evolution that is used to evolve new solutions guided by a Bayesian…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted at the black box optimization challenge at NeurIPS 2020

  27. arXiv:2009.12740  [pdf, other]

    cs.LG cs.CR

    STAN: Synthetic Network Traffic Generation with Generative Neural Models

    Authors: Shengzhe Xu, Manish Marwah, Martin Arlitt, Naren Ramakrishnan

    Abstract: Deep learning models have achieved great success in recent years but progress in some domains like cybersecurity is stymied due to a paucity of realistic datasets. Organizations are reluctant to share such data, even internally, due to privacy reasons. An alternative is to use synthetically generated data but existing methods are limited in their ability to capture complex dependency structures, b…

    Submitted 2 August, 2021; v1 submitted 27 September, 2020; originally announced September 2020.

  28. arXiv:2009.11407  [pdf, other]

    cs.LG stat.AP

    Steering a Historical Disease Forecasting Model Under a Pandemic: Case of Flu and COVID-19

    Authors: Alexander Rodríguez, Nikhil Muralidhar, Bijaya Adhikari, Anika Tabassum, Naren Ramakrishnan, B. Aditya Prakash

    Abstract: Forecasting influenza in a timely manner aids health organizations and policymakers in adequate preparation and decision making. However, effective influenza forecasting still remains a challenge despite increasing research interest. It is even more challenging amidst the COVID pandemic, when the influenza-like illness (ILI) counts are affected by various factors such as symptomatic similarities w…

    Submitted 23 December, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Appears in AAAI-21

  29. arXiv:2009.02649  [pdf, other]

    cs.CL

    Once Upon A Time In Visualization: Understanding the Use of Textual Narratives for Causality

    Authors: Arjun Choudhry, Mandar Sharma, Pramod Chundury, Thomas Kapler, Derek W. S. Gray, Naren Ramakrishnan, Niklas Elmqvist

    Abstract: Causality visualization can help people understand temporal chains of events, such as messages sent in a distributed system, cause and effect in a historical conflict, or the interplay between political actors over time. However, as the scale and complexity of these event sequences grow, even these visualizations can become overwhelming to use. In this paper, we propose the use of textual narrati…

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 9 pages + 2 references, 8 figures, 2 tables, IEEE VIS 2020 VAST Paper

  30. arXiv:2005.12423  [pdf, other]

    cs.SI cs.CL cs.CY cs.IR physics.soc-ph

    Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis

    Authors: Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar

    Abstract: The spread of COVID-19 has sparked racism and hate on social media targeted towards Asian communities. However, little is known about how racial hate spreads during a pandemic and the role of counterspeech in mitigating this spread. In this work, we study the evolution and spread of anti-Asian hate speech through the lens of Twitter. We create COVID-HATE, the largest dataset of anti-Asian hate and…

    Submitted 10 November, 2021; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: ASONAM 2021. The COVID-HATE dataset, annotations, and code are at http://claws.cc.gatech.edu/covid

  31. arXiv:1912.00835  [pdf, other]

    cs.CL cs.LG

    Low Rank Factorization for Compact Multi-Head Self-Attention

    Authors: Sneha Mehta, Huzefa Rangwala, Naren Ramakrishnan

    Abstract: Effective representation learning from text has been an active area of research in the fields of NLP and text mining. Attention mechanisms have been at the forefront in order to learn contextual sentence representations. Current state-of-the-art approaches for many NLP tasks use large pre-trained language models such as BERT, XLNet and so on for learning representations. These models are based on…

    Submitted 9 August, 2020; v1 submitted 26 November, 2019; originally announced December 2019.

    Comments: 9 pages, 5 figures

  32. arXiv:1911.04240  [pdf, other]

    cs.LG physics.comp-ph stat.ML

    Physics-guided Design and Learning of Neural Networks for Predicting Drag Force on Particle Suspensions in Moving Fluids

    Authors: Nikhil Muralidhar, Jie Bu, Ze Cao, Long He, Naren Ramakrishnan, Danesh Tafti, Anuj Karpatne

    Abstract: Physics-based simulations are often used to model and understand complex physical systems and processes in domains like fluid dynamics. Such simulations, although used frequently, have many limitations which could arise either due to the inability to accurately model a physical process owing to incomplete knowledge about certain facets of the process or due to the underlying process being too comp…

    Submitted 5 November, 2019; originally announced November 2019.

    MSC Class: 68T99; 76T20

  33. arXiv:1907.07590  [pdf, other]

    cs.LG stat.ML

    Mitigating Uncertainty in Document Classification

    Authors: Xuchao Zhang, Fanglan Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: The uncertainty measurement of classifiers' predictions is especially important in applications such as medical diagnoses that need to ensure limited human resources can focus on the most uncertain predictions returned by machine learning models. However, few existing uncertainty models attempt to improve overall prediction accuracy where human resources are involved in the text classification tas…

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Accepted by NAACL19

  34. arXiv:1905.10022  [pdf, other]

    cs.DL cs.LG stat.ML

    Patent Citation Dynamics Modeling via Multi-Attention Recurrent Networks

    Authors: Taoran Ji, Zhiqian Chen, Nathan Self, Kaiqun Fu, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Modeling and forecasting forward citations to a patent is a central task for the discovery of emerging technologies and for measuring the pulse of inventive progress. Conventional methods for forecasting these forward citations cast the problem as analysis of temporal point processes which rely on the conditional intensity of previously received citations. Recent approaches model the conditional i…

    Submitted 22 May, 2019; originally announced May 2019.

    Journal ref: IJCAI 2019

  35. arXiv:1812.02303  [pdf, other]

    cs.CL cs.LG stat.ML

    Neural Abstractive Text Summarization with Sequence-to-Sequence Models

    Authors: Tian Shi, Yaser Keneshloo, Naren Ramakrishnan, Chandan K. Reddy

    Abstract: In the past few years, neural abstractive text summarization with sequence-to-sequence (seq2seq) models has gained a lot of popularity. Many interesting techniques have been proposed to improve seq2seq models, making them capable of handling different challenges, such as saliency, fluency and human readability, and generate high-quality summaries. Generally speaking, most of these techniques diff…

    Submitted 18 September, 2020; v1 submitted 4 December, 2018; originally announced December 2018.

  36. arXiv:1810.06667  [pdf, other]

    cs.LG cs.CL stat.ML

    Deep Transfer Reinforcement Learning for Text Summarization

    Authors: Yaser Keneshloo, Naren Ramakrishnan, Chandan K. Reddy

    Abstract: Deep neural networks are data hungry models and thus face difficulties when attempting to train on small text datasets. Transfer learning is a potential solution but its effectiveness in the text domain is not as explored as in areas such as image analysis. In this paper, we study the problem of transfer learning for text summarization and discuss why existing state-of-the-art models fail to gen…

    Submitted 24 January, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    ACM Class: I.2.6; I.2.7; I.2.10

  37. arXiv:1805.09461  [pdf, other]

    cs.LG stat.ML

    Deep Reinforcement Learning For Sequence to Sequence Models

    Authors: Yaser Keneshloo, Tian Shi, Naren Ramakrishnan, Chandan K. Reddy

    Abstract: In recent times, sequence-to-sequence (seq2seq) models have gained a lot of popularity and provide state-of-the-art performance in a wide variety of tasks such as machine translation, headline generation, text summarization, speech to text conversion, and image caption generation. The underlying framework for all these models is usually a deep neural network comprising an encoder and a decoder. Al…

    Submitted 15 April, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

    ACM Class: I.2.6; I.2.7; I.2.10

  38. arXiv:1705.08473  [pdf, other]

    cs.SI

    New methods to generate massive synthetic networks

    Authors: Malay Chakrabarti, Lenwood Heath, Naren Ramakrishnan

    Abstract: One of the biggest needs in network science research is access to large realistic datasets. As data analytics methods permeate a range of diverse disciplines (e.g., computational epidemiology, sustainability, social media analytics, biology, and transportation), network datasets that can exhibit characteristics encountered in each of these disciplines become paramount. The key technical issue i…

    Submitted 23 May, 2017; originally announced May 2017.

  39. arXiv:1703.09828  [pdf, ps, other]

    cs.SI physics.soc-ph q-bio.PE

    A Framework for Evaluating Epidemic Forecasts

    Authors: Farzaneh Sadat Tabataba, Prithwish Chakraborty, Naren Ramakrishnan, Srinivasan Venkatramanan, Jiangzhuo Chen, Bryan Lewis, Madhav Marathe

    Abstract: Background: Over the past few decades, numerous forecasting methods have been proposed in the field of epidemic forecasting. Such methods can be classified into different categories such as deterministic vs. probabilistic, comparative methods vs. generative methods, and so on. In some of the more popular comparative methods, researchers compare observed epidemiological data from early stages of an…

    Submitted 28 March, 2017; originally announced March 2017.

    Comments: Submitted to BMC Infectious Diseases, 2016. Accepted in 2017

    Journal ref: BMC Infectious Diseases, 2017

  40. arXiv:1702.07745  [pdf, other]

    cs.CR cs.HC cs.IR cs.SI

    Crowdsourcing Cybersecurity: Cyber Attack Detection using Social Media

    Authors: Rupinder Paul Khandpur, Taoran Ji, Steve Jan, Gang Wang, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Social media is often viewed as a sensor into various societal events such as disease outbreaks, protests, and elections. We describe the use of social media as a crowdsourced sensor to gain insight into ongoing cyber-attacks. Our approach detects a broad range of cyber-attacks (e.g., distributed denial of service (DDOS) attacks, data breaches, and account hijacking) in an unsupervised manner usin…

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: 13 single column pages, 5 figures, submitted to KDD 2017

    ACM Class: I.2.7; H.3.3

  41. arXiv:1702.06921  [pdf, other]

    cs.SI cs.LG stat.ML

    Distributed Representation of Subgraphs

    Authors: Bijaya Adhikari, Yao Zhang, Naren Ramakrishnan, B. Aditya Prakash

    Abstract: Network embeddings have become very popular in learning effective feature representations of networks. Motivated by the recent successes of embeddings in natural language processing, researchers have tried to find network embeddings in order to exploit machine learning algorithms for mining tasks like node classification and edge prediction. However, most of the work focuses on finding distributed…

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: 9 pages, 7 figures

  42. Distributed Representations of Signed Networks

    Authors: Mohammad Raihanul Islam, B. Aditya Prakash, Naren Ramakrishnan

    Abstract: Recent successes in word embedding and document embedding have motivated researchers to explore similar representations for networks and to use such representations for tasks such as edge prediction, node label prediction, and community detection. Such network embedding methods are largely focused on finding distributed representations for unsigned networks and are unable to discover embeddings th…

    Submitted 7 April, 2019; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: Published in PAKDD 2018

  43. arXiv:1702.06663  [pdf, other]

    cs.CL cs.IR

    Guided Deep List: Automating the Generation of Epidemiological Line Lists from Open Sources

    Authors: Saurav Ghosh, Prithwish Chakraborty, Bryan L. Lewis, Maimuna S. Majumder, Emily Cohn, John S. Brownstein, Madhav V. Marathe, Naren Ramakrishnan

    Abstract: Real-time monitoring and responses to emerging public health threats rely on the availability of timely surveillance data. During the early stages of an epidemic, the ready availability of line lists with detailed tabular information about laboratory-confirmed cases can assist epidemiologists in making reliable inferences and forecasts. Such inferences are crucial to understand the epidemiology of…

    Submitted 21 February, 2017; originally announced February 2017.

    Comments: This paper has been submitted to a conference

  44. arXiv:1612.01254  [pdf, other]

    cs.LG stat.ML

    Deep Symbolic Representation Learning for Heterogeneous Time-series Classification

    Authors: Shengdong Zhang, Soheil Bahrampour, Naveen Ramakrishnan, Mohak Shah

    Abstract: In this paper, we consider the problem of event classification with multi-variate time series data consisting of heterogeneous (continuous and categorical) variables. The complex temporal dependencies between the variables combined with sparsity of the data make the event classification problem particularly challenging. Most state-of-the-art approaches address this either by designing hand-engineered…

    Submitted 5 December, 2016; originally announced December 2016.

  45. arXiv:1611.06947  [pdf, ps, other]

    cs.SI

    Can Self-Censorship in News Media be Detected Algorithmically? A Case Study in Latin America

    Authors: Rongrong Tao, Baojian Zhou, Feng Chen, Naifeng Liu, David Mares, Patrick Butler, Naren Ramakrishnan

    Abstract: Censorship in social media has been well studied and provides insight into how governments stifle freedom of expression online. Comparatively less (or no) attention has been paid to detecting (self) censorship in traditional media (e.g., news) using social media as a bellwether. We present a novel unsupervised approach that views social media as a sensor to detect censorship in news media wherein…

    Submitted 17 March, 2017; v1 submitted 21 November, 2016; originally announced November 2016.

  46. arXiv:1609.09162  [pdf, other]

    cs.LG

    Universum Learning for Multiclass SVM

    Authors: Sauptik Dhar, Naveen Ramakrishnan, Vladimir Cherkassky, Mohak Shah

    Abstract: We introduce Universum learning for multiclass problems and propose a novel formulation for multiclass universum SVM (MU-SVM). We also propose a span bound for MU-SVM that can be used for model selection thereby avoiding resampling. Empirical results demonstrate the effectiveness of MU-SVM and the proposed bound.

    Submitted 28 September, 2016; originally announced September 2016.

    Comments: 14 pages, 12 figures

  47. arXiv:1608.03889  [pdf, other]

    cs.SI cs.DB

    Interactive and Iterative Discovery of Entity Network Subgraphs

    Authors: Hao Wu, Maoyuan Sun, Jilles Vreeken, Nikolaj Tatti, Chris North, Naren Ramakrishnan

    Abstract: Graph mining to extract interesting components has been studied in various guises, e.g., communities, dense subgraphs, cliques. However, most existing works are based on notions of frequency and connectivity and do not capture subjective interestingness from a user's viewpoint. Furthermore, existing approaches to mine graphs are not interactive and cannot incorporate user feedback in any natural…

    Submitted 12 August, 2016; originally announced August 2016.

  48. arXiv:1606.00411  [pdf, ps, other]

    cs.SI cs.CL cs.IR stat.ML

    Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks

    Authors: Saurav Ghosh, Prithwish Chakraborty, Elaine O. Nsoesie, Emily Cohn, Sumiko R. Mekaru, John S. Brownstein, Naren Ramakrishnan

    Abstract: In retrospective assessments, internet news reports have been shown to capture early reports of unknown infectious disease transmission prior to official laboratory confirmation. In general, media interest and reporting peak and wane during the course of an outbreak. In this study, we quantify the extent to which media interest during infectious disease outbreaks is indicative of trends of repor…

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: This paper has been submitted to a journal

  49. arXiv:1604.00033  [pdf, other]

    cs.CY cs.SI

    EMBERS at 4 years: Experiences operating an Open Source Indicators Forecasting System

    Authors: Sathappan Muthiah, Patrick Butler, Rupinder Paul Khandpur, Parang Saraf, Nathan Self, Alla Rozovskaya, Liang Zhao, Jose Cadena, Chang-Tien Lu, Anil Vullikanti, Achla Marathe, Kristen Summers, Graham Katz, Andy Doyle, Jaime Arredondo, Dipak K. Gupta, David Mares, Naren Ramakrishnan

    Abstract: EMBERS is an anticipatory intelligence system forecasting population-level events in multiple countries of Latin America. A deployed system from 2012, EMBERS has been generating alerts 24x7 by ingesting a broad range of data sources including news, blogs, tweets, machine coded events, currency rates, and food prices. In this paper, we describe our experiences operating EMBERS continuously for near…

    Submitted 31 March, 2016; originally announced April 2016.

    Comments: Submitted to a conference

  50. arXiv:1603.09739  [pdf, other]

    cs.LG cs.IT stat.ML

    Hierarchical Quickest Change Detection via Surrogates

    Authors: Prithwish Chakraborty, Sathappan Muthiah, Ravi Tandon, Naren Ramakrishnan

    Abstract: Change detection (CD) in time series data is a critical problem as it reveals changes in the underlying generative processes driving the time series. Despite having received significant attention, one important unexplored aspect is how to efficiently utilize additional correlated information to improve the detection and the understanding of changepoints. We propose hierarchical quickest change dete…

    Submitted 31 March, 2016; originally announced March 2016.

    Comments: Submitted to a journal. See demo at https://prithwi.github.io/hqcd_supplementary