Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 88 results for author: Majumdar, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12946  [pdf

    eess.AS cs.AI cs.CL cs.LG

    Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

    Authors: Vahid Noroozi, Zhehuai Chen, Somshubra Majumdar, Steve Huang, Jagadeesh Balam, Boris Ginsburg

    Abstract: In this paper, we propose three methods for generating synthetic samples to train and evaluate multimodal large language models capable of processing both text and speech inputs. Addressing the scarcity of samples containing both modalities, synthetic data generation emerges as a crucial strategy to enhance the performance of such systems and facilitate the modeling of cross-modal relationships be… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted for Interspeech 2024

  2. arXiv:2406.11871  [pdf, other

    cs.AI

    Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies

    Authors: Srijoni Majumdar, Edith Elkind, Evangelos Pournaras

    Abstract: Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) provide unprecedented opportunities, but also alerting risks for digital democracy. AI personal assistants can overcome cognitive bandwidth limitations of… ▽ More

    Submitted 30 May, 2024; originally announced June 2024.

    Comments: 35 pages, 10 figures

  3. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2406.11036  [pdf, other

    cs.CL cs.CR

    garak: A Framework for Security Probing Large Language Models

    Authors: Leon Derczynski, Erick Galinkin, Jeffrey Martin, Subho Majumdar, Nanna Inie

    Abstract: As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natura… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: https://garak.ai

  5. arXiv:2405.14577  [pdf, other

    cs.CL cs.LG

    Representation noising effectively prevents harmful fine-tuning on LLMs

    Authors: Domenic Rosati, Jan Wehner, Kai Williams, Ɓukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz

    Abstract: Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such me… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2405.05495  [pdf, other

    cs.OH

    PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints

    Authors: Hesham Mostafa, Uday Mallappa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in indus… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures

  7. arXiv:2405.05480  [pdf, other

    cs.AR cs.AI cs.LG

    FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs

    Authors: Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 10 pages, 11 figures

  8. arXiv:2405.05085  [pdf, other

    cs.MA

    Fair Voting Outcomes with Impact and Novelty Compromises? Unraveling Biases of Equal Shares in Participatory Budgeting

    Authors: Sajan Maharjan, Srijoni Majumdar, Evangelos Pournaras

    Abstract: Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been devised and studied in social choice literature to elect projects that are popular, while others prioritize on a proportional representation of voters' preferences, for instance, e… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 23 pages, 9 figures

  9. arXiv:2404.05482  [pdf, other

    cs.LG

    WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data

    Authors: Jintu Borah, Tanujit Chakraborty, Md. Shahrul Md. Nadzir, Mylene G. Cayetano, Shubhankar Majumdar

    Abstract: Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlapping discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approa… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  10. arXiv:2401.05947  [pdf, other

    cs.CR

    Send Message to the Future? Blockchain-based Time Machines for Decentralized Reveal of Locked Information

    Authors: Zhuolun Li, Srijoni Majumdar, Evangelos Pournaras

    Abstract: Conditional information reveal systems automate the release of information upon meeting specific predefined conditions, such as time or location. This paper introduces a breakthrough in the understanding, design and application of conditional information reveal systems that are highly secure and decentralized. By designing a new practical timed-release cryptography system and a verifiable secret s… ▽ More

    Submitted 24 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  11. arXiv:2312.17279  [pdf, other

    cs.CL eess.AS

    Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

    Authors: Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

    Abstract: In this paper, we propose an efficient and accurate streaming speech recognition model based on the FastConformer architecture. We adapted the FastConformer architecture for streaming applications through: (1) constraining both the look-ahead and past contexts in the encoder, and (2) introducing an activation caching mechanism to enable the non-autoregressive encoder to operate autoregressively du… ▽ More

    Submitted 2 May, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: Shorter version accepted to ICASSP 2024

  12. arXiv:2311.03374  [pdf, other

    cs.SE cs.AI cs.IR

    Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

    Authors: Srijoni Majumdar, Soumen Paul, Debjyoti Paul, Ayan Bandyopadhyay, Samiran Chattopadhyay, Partha Pratim Das, Paul D Clough, Prasenjit Majumder

    Abstract: The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs e… ▽ More

    Submitted 27 October, 2023; originally announced November 2023.

    Comments: Overview Paper of the Information Retrieval of Software Engineering Track at the Forum for Information Retrieval, 2023

  13. arXiv:2310.17152  [pdf

    cs.CV cs.AI cs.LG q-bio.QM

    Technical Note: Feasibility of translating 3.0T-trained Deep-Learning Segmentation Models Out-of-the-Box on Low-Field MRI 0.55T Knee-MRI of Healthy Controls

    Authors: Rupsa Bhattacharjee, Zehra Akkaya, Johanna Luitjens, Pan Su, Yang Yang, Valentina Pedoia, Sharmila Majumdar

    Abstract: In the current study, our purpose is to evaluate the feasibility of applying deep learning (DL) enabled algorithms to quantify bilateral knee biomarkers in healthy controls scanned at 0.55T and compared with 3.0T. The current study assesses the performance of standard in-practice bone, and cartilage segmentation algorithms at 0.55T, both qualitatively and quantitatively, in terms of comparing segm… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 11 Pages, 3 Figures, 2 Tables

  14. arXiv:2309.09950  [pdf, other

    eess.AS cs.SD

    Investigating End-to-End ASR Architectures for Long Form Audio Transcription

    Authors: Nithin Rao Koluguri, Samuel Kriman, Georgy Zelenfroind, Somshubra Majumdar, Dima Rekesh, Vahid Noroozi, Jagadeesh Balam, Boris Ginsburg

    Abstract: This paper presents an overview and evaluation of some of the end-to-end ASR models on long-form audios. We study three categories of Automatic Speech Recognition(ASR) models based on their core architecture: (1) convolutional, (2) convolutional with squeeze-and-excitation and (3) convolutional models with attention. We selected one ASR model from each category and evaluated Word Error Rate, maxim… ▽ More

    Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: PrePrint. Submitted to ICASSP 2024

  15. arXiv:2308.09138  [pdf, other

    cs.CL cs.AI cs.CY

    Semantic Consistency for Assuring Reliability of Large Language Models

    Authors: Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar

    Abstract: Large Language Models (LLMs) exhibit remarkable fluency and competence across various natural language tasks. However, recent research has highlighted their sensitivity to variations in input prompts. To deploy LLMs in a safe and reliable manner, it is crucial for their outputs to be consistent when prompted with expressions that carry the same meaning or intent. While some existing work has explo… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  16. arXiv:2308.06653  [pdf, other

    cs.SE cs.AI

    Smart Knowledge Transfer using Google-like Search

    Authors: Srijoni Majumdar, Partha Pratim Das

    Abstract: To address the issue of rising software maintenance cost due to program comprehension challenges, we propose SMARTKT (Smart Knowledge Transfer), a search framework, which extracts and integrates knowledge related to various aspects of an application in form of a semantic graph. This graph supports syntax and semantic queries and converts the process of program comprehension into a {\em google-like… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 3 pages, 2 figures, accepted in the NDLI-UNESCO International Symposium on Knowledge Engineering for Digital Library Design 2019 (KEDL) as an extended abstract and poster

  17. arXiv:2307.12915  [pdf, other

    cs.MA cs.AI

    Consensus-based Participatory Budgeting for Legitimacy: Decision Support via Multi-agent Reinforcement Learning

    Authors: Srijoni Majumdar, Evangelos Pournaras

    Abstract: The legitimacy of bottom-up democratic processes for the distribution of public funds by policy-makers is challenging and complex. Participatory budgeting is such a process, where voting outcomes may not always be fair or inclusive. Deliberation for which project ideas to put for voting and choose for implementation lack systematization and do not scale. This paper addresses these grand challenges… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 13 Pages, 8 Figures, 3 Tables, Accepted in International Conference on Machine Learning, Optimization, and Data Science, 2023

    Journal ref: International Conference on Machine Learning, Optimization, and Data Science, 2023

  18. arXiv:2307.08412  [pdf, other

    cs.CR cs.DC

    A Privacy-Preserving Blockchain-based E-voting System

    Authors: Arnab Mukherjee, Souvik Majumdar, Anup Kumar Kolya, Saborni Nandi

    Abstract: Within a modern democratic nation, elections play a significant role in the nation's functioning. However, with the existing infrastructure for conducting elections using Electronic Voting Systems (EVMs), many loopholes exist, which illegitimate entities might leverage to cast false votes or even tamper with the EVMs after the voting session is complete. The need of the hour is to introduce a robu… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  19. Improving City Life via Legitimate and Participatory Policy-making: A Data-driven Approach in Switzerland

    Authors: Thomas Wellings, Srijoni Majumdar, Regula HĂ€nggli Fricker, Evangelos Pournaras

    Abstract: This paper introduces a novel data-driven approach to address challenges faced by city policymakers concerning the distribution of public funds. Providing budgeting processes for improving quality of life based on objective (data-driven) evidence has been so far a missing element in policy-making. This paper focuses on a case study of 1,204 citizens in the city of Aarau, Switzerland, and analyzes… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 18 pages, 15 figures

    Journal ref: 24th Annual International Conference on Digital Government Research (dg.o 2023)

  20. arXiv:2306.06283  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon

    Authors: Kevin Maik Jablonka, Qianxiang Ai, Alexander Al-Feghali, Shruti Badhwar, Joshua D. Bocarsly, Andres M Bran, Stefan Bringuier, L. Catherine Brinson, Kamal Choudhary, Defne Circi, Sam Cox, Wibe A. de Jong, Matthew L. Evans, Nicolas Gastellu, Jerome Genzling, MarĂ­a Victoria Gil, Ankur K. Gupta, Zhi Hong, Alishba Imran, Sabine Kruschwitz, Anne Labarre, Jakub LĂĄla, Tao Liu, Steven Ma, Sauradeep Majumdar , et al. (28 additional authors not shown)

    Abstract: Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of mole… ▽ More

    Submitted 14 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  21. arXiv:2305.16993  [pdf, ps, other

    cs.DC cs.MA

    Discrete-choice Multi-agent Optimization: Decentralized Hard Constraint Satisfaction for Smart Cities

    Authors: Srijoni Majumdar, Chuhao Qin, Evangelos Pournaras

    Abstract: Making Smart Cities more sustainable, resilient and democratic is emerging as an endeavor of satisfying hard constraints, for instance meeting net-zero targets. Decentralized multi-agent methods for socio-technical optimization of large-scale complex infrastructures such as energy and transport networks are scalable and more privacy-preserving by design. However, they mainly focus on satisfying so… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 8 pages, 7 figures, Accepted for MSDM@AAMAS 2023

  22. arXiv:2305.05084  [pdf, other

    eess.AS cs.SD

    Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

    Authors: Dima Rekesh, Nithin Rao Koluguri, Samuel Kriman, Somshubra Majumdar, Vahid Noroozi, He Huang, Oleksii Hrinchuk, Krishna Puvvada, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

    Abstract: Conformer-based models have become the dominant end-to-end architecture for speech processing tasks. With the objective of enhancing the conformer architecture for efficient training and inference, we carefully redesigned Conformer with a novel downsampling schema. The proposed model, named Fast Conformer(FC), is 2.8x faster than the original Conformer, supports scaling to Billion parameters witho… ▽ More

    Submitted 30 September, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted at ASRU 2023

  23. Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar

    Authors: Jamie Tolan, Hung-I Yang, Ben Nosarzewski, Guillaume Couairon, Huy Vo, John Brandt, Justine Spore, Sayantan Majumdar, Daniel Haziza, Janaki Vamaraju, Theo Moutakanni, Piotr Bojanowski, Tracy Johns, Brian White, Tobias Tiecke, Camille Couprie

    Abstract: Vegetation structure mapping is critical for understanding the global carbon cycle and monitoring nature-based approaches to climate adaptation and mitigation. Repeated measurements of these data allow for the observation of deforestation or degradation of existing forests, natural forest regeneration, and the implementation of sustainable agricultural practices like agroforestry. Assessments of t… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Journal ref: Remote Sensing of Environment 300, 113888, 2024

  24. arXiv:2304.06795  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Efficient Sequence Transduction by Jointly Predicting Tokens and Durations

    Authors: Hainan Xu, Fei Jia, Somshubra Majumdar, He Huang, Shinji Watanabe, Boris Ginsburg

    Abstract: This paper introduces a novel Token-and-Duration Transducer (TDT) architecture for sequence-to-sequence tasks. TDT extends conventional RNN-Transducer architectures by jointly predicting both a token and its duration, i.e. the number of input frames covered by the emitted token. This is achieved by using a joint network with two outputs which are independently normalized to generate distributions… ▽ More

    Submitted 29 May, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  25. arXiv:2303.08535  [pdf, other

    cond-mat.stat-mech cs.LG physics.data-an

    Singular relaxation of a random walk in a box with a Metropolis Monte Carlo dynamics

    Authors: Alexei D. Chepelianskii, Satya N. Majumdar, Hendrik Schawe, Emmanuel Trizac

    Abstract: We study analytically the relaxation eigenmodes of a simple Monte Carlo algorithm, corresponding to a particle in a box which moves by uniform random jumps. Moves outside of the box are rejected. At long times, the system approaches the equilibrium probability density, which is uniform inside the box. We show that the relaxation towards this equilibrium is unusual: for a jump length comparable to… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  26. arXiv:2211.13419  [pdf, other

    cs.CR cs.LG stat.AP

    Network Security Modelling with Distributional Data

    Authors: Subhabrata Majumdar, Ganesh Subramaniam

    Abstract: We investigate the detection of botnet command and control (C2) hosts in massive IP traffic using machine learning methods. To this end, we use NetFlow data -- the industry standard for monitoring of IP traffic -- and ML models using two sets of features: conventional NetFlow variables and distributional features based on NetFlow variables. In addition to using static summaries of NetFlow features… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted and presented in CAMLIS 2022, https://www.camlis.org/2022-conference. arXiv admin note: text overlap with arXiv:2108.08924

  27. arXiv:2211.05853  [pdf, ps, other

    cs.CL cs.AI cs.CY

    Measuring Reliability of Large Language Models through Semantic Consistency

    Authors: Harsh Raj, Domenic Rosati, Subhabrata Majumdar

    Abstract: While large pretrained language models (PLMs) demonstrate incredible fluency and performance on many natural language tasks, recent work has shown that well-performing PLMs are very sensitive to what prompts are feed into them. Even when prompts are semantically identical, language models may give very different answers. When considering safe and trustworthy deployments of PLMs we would like their… ▽ More

    Submitted 11 April, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 ML Safety Workshop, https://neurips2022.mlsafety.org

  28. arXiv:2211.04568  [pdf, ps, other

    stat.AP cs.CY cs.LG

    Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

    Authors: Cheryl Flynn, Aritra Guha, Subhabrata Majumdar, Divesh Srivastava, Zhengyi Zhou

    Abstract: New technologies and the availability of geospatial data have drawn attention to spatio-temporal biases present in society. For example: the COVID-19 pandemic highlighted disparities in the availability of broadband service and its role in the digital divide; the environmental justice movement in the United States has raised awareness to health implications for minority populations stemming from h… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

  29. arXiv:2211.03541  [pdf, other

    eess.AS cs.LG cs.SD

    Multi-blank Transducers for Speech Recognition

    Authors: Hainan Xu, Fei Jia, Somshubra Majumdar, Shinji Watanabe, Boris Ginsburg

    Abstract: This paper proposes a modification to RNN-Transducer (RNN-T) models for automatic speech recognition (ASR). In standard RNN-T, the emission of a blank symbol consumes exactly one input frame; in our proposed method, we introduce additional blank symbols, which consume two or more input frames when emitted. We refer to the added symbols as big blanks, and the method multi-blank RNN-T. For training… ▽ More

    Submitted 11 April, 2024; v1 submitted 4 November, 2022; originally announced November 2022.

    Journal ref: ICASSP 2023

  30. arXiv:2210.16780  [pdf

    cs.CV

    Recognizing Handwriting Styles in a Historical Scanned Document Using Unsupervised Fuzzy Clustering

    Authors: Sriparna Majumdar, Aaron Brick

    Abstract: The forensic attribution of the handwriting in a digitized document to multiple scribes is a challenging problem of high dimensionality. Unique handwriting styles may be dissimilar in a blend of several factors including character size, stroke width, loops, ductus, slant angles, and cursive ligatures. Previous work on labeled data with Hidden Markov models, support vector machines, and semi-superv… ▽ More

    Submitted 28 June, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

    Comments: 26 pages in total, 5 figures and 2 tables

  31. arXiv:2210.03255  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition

    Authors: Somshubra Majumdar, Shantanu Acharya, Vitaly Lavrukhin, Boris Ginsburg

    Abstract: Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A potential drawback of model adaptation to new domains is catastrophic forgetting, where the Word Error Rate on the original domain is significantly degraded. This paper addresses the situation when we want to simultaneously adapt automatic speech recognition models to a new domain and limit the degra… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: To appear in Proc. SLT 2022, Jan 09-12, 2023, Doha, Qatar

  32. arXiv:2208.00498  [pdf, other

    cs.CR cs.AR cs.LG

    DNNShield: Dynamic Randomized Model Sparsification, A Defense Against Adversarial Machine Learning

    Authors: Mohammad Hossein Samavatian, Saikat Majumdar, Kristin Barber, Radu Teodorescu

    Abstract: DNNs are known to be vulnerable to so-called adversarial attacks that manipulate inputs to cause incorrect results that can be beneficial to an attacker or damaging to the victim. Recent works have proposed approximate computation as a defense mechanism against machine learning attacks. We show that these approaches, while successful for a range of inputs, are insufficient to address stronger, hig… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

  33. arXiv:2207.10488  [pdf, other

    cond-mat.stat-mech cs.LG stat.CO

    Metropolis Monte Carlo sampling: convergence, localization transition and optimality

    Authors: Alexei D. Chepelianskii, Satya N. Majumdar, Hendrik Schawe, Emmanuel Trizac

    Abstract: Among random sampling methods, Markov Chain Monte Carlo algorithms are foremost. Using a combination of analytical and numerical approaches, we study their convergence properties towards the steady state, within a random walk Metropolis scheme. Analysing the relaxation properties of some model algorithms sufficiently simple to enable analytic progress, we show that the deviations from the target s… ▽ More

    Submitted 15 April, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Journal ref: Journal of Statistical Mechanics 123205, (2023)

  34. arXiv:2207.07783  [pdf, other

    cs.CV

    Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection

    Authors: Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar

    Abstract: Active speaker detection (ASD) in videos with multiple speakers is a challenging task as it requires learning effective audiovisual features and spatial-temporal correlations over long temporal windows. In this paper, we present SPELL, a novel spatial-temporal graph learning framework that can solve complex tasks such as ASD. To this end, each person in a video frame is first encoded in a unique n… ▽ More

    Submitted 12 October, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 camera ready (Supplementary videos: on ECVA soon). This paper supersedes arXiv:2112.01479

  35. arXiv:2206.13046  [pdf, other

    cs.CR cs.LG

    DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

    Authors: Meisam Mohammady, Han Wang, Lingyu Wang, Mengyuan Zhang, Yosr Jarraya, Suryadipta Majumdar, Makan Pourzandi, Mourad Debbabi, Yuan Hong

    Abstract: Outsourcing anomaly detection to third-parties can allow data owners to overcome resource constraints (e.g., in lightweight IoT devices), facilitate collaborative analysis (e.g., under distributed or multi-party scenarios), and benefit from lower costs and specialized expertise (e.g., of Managed Security Service Providers). Despite such benefits, a data owner may feel reluctant to outsource anomal… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  36. arXiv:2206.05391  [pdf, other

    stat.ML cs.LG stat.ME

    Feature Selection using e-values

    Authors: Subhabrata Majumdar, Snigdhansu Chatterjee

    Abstract: In the context of supervised parametric models, we introduce the concept of e-values. An e-value is a scalar quantity that represents the proximity of the sampling distribution of parameter estimates in a model trained on a subset of features to that of the model trained on all features (i.e. the full model). Under general conditions, a rank ordering of e-values separates models that contain all e… ▽ More

    Submitted 16 June, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: accepted in ICML-2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:14753-14773, 2022, https://proceedings.mlr.press/v162/majumdar22a.html

  37. arXiv:2204.06386  [pdf

    cs.ET physics.app-ph

    Efficient Deep Neural Network Accelerator Using Controlled Ferroelectric Domain Dynamics

    Authors: Sayani Majumdar

    Abstract: The current work reports an efficient deep neural network (DNN) accelerator where synaptic weight elements are controlled by ferroelectric domain dynamics. An integrated device-to-algorithm framework for benchmarking novel synaptic devices is used. In P(VDF-TrFE) based ferroelectric tunnel junctions, analog conductance states are measured using a custom pulsing protocol and associated custom circu… ▽ More

    Submitted 13 October, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

  38. arXiv:2204.01696  [pdf, other

    cs.CV cs.LG

    Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos

    Authors: Shaowei Liu, Subarna Tripathi, Somdeb Majumdar, Xiaolong Wang

    Abstract: We propose to forecast future hand-object interactions given an egocentric video. Instead of predicting action labels or pixels, we directly predict the hand motion trajectory and the future contact points on the next active object (i.e., interaction hotspots). This relatively low-dimensional representation provides a concrete description of future interactions. To tackle this task, we first provi… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: CVPR 2022, Project page: https://stevenlsw.github.io/hoi-forecast

  39. arXiv:2112.09828  [pdf, other

    cs.CV

    Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

    Authors: Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

    Abstract: Dynamic scene graph generation from a video is challenging due to the temporal dynamics of the scene and the inherent temporal fluctuations of predictions. We hypothesize that capturing long-term temporal dependencies is the key to effective generation of dynamic scene graphs. We propose to learn the long-term dependencies in a video by capturing the object-level consistency and inter-object relat… ▽ More

    Submitted 19 October, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: WACV 2023

  40. arXiv:2112.01479  [pdf, other

    cs.CV

    Learning Spatial-Temporal Graphs for Active Speaker Detection

    Authors: Sourya Roy, Kyle Min, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar

    Abstract: We address the problem of active speaker detection through a new framework, called SPELL, that learns long-range multimodal graphs to encode the inter-modal relationship between audio and visual data. We cast active speaker detection as a node classification task that is aware of longer-term dependencies. We first construct a graph from a video so that each node corresponds to one person. Nodes re… ▽ More

    Submitted 3 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 10 pages

  41. CTC Variations Through New WFST Topologies

    Authors: Aleksandr Laptev, Somshubra Majumdar, Boris Ginsburg

    Abstract: This paper presents novel Weighted Finite-State Transducer (WFST) topologies to implement Connectionist Temporal Classification (CTC)-like algorithms for automatic speech recognition. Three new CTC variants are proposed: (1) the "compact-CTC", in which direct transitions between units are replaced with <epsilon> back-off transitions; (2) the "minimal-CTC", that only adds <blank> self-loops when us… ▽ More

    Submitted 26 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted to Interspeech 2022, 5 pages, 2 figures, 7 tables

  42. arXiv:2108.04187  [pdf, other

    cs.MM cs.LG cs.MA

    Scaling New Peaks: A Viewership-centric Approach to Automated Content Curation

    Authors: Subhabrata Majumdar, Deirdre Paul, Eric Zavesky

    Abstract: Summarizing video content is important for video streaming services to engage the user in a limited time span. To this end, current methods involve manual curation or using passive interest cues to annotate potential high-interest segments to form the basis of summarized videos, and are costly and unreliable. We propose a viewership-driven, automated method that accommodates a range of segment ide… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  43. arXiv:2107.13268  [pdf, ps, other

    cs.NI cs.LG

    A Distributed Intelligence Architecture for B5G Network Automation

    Authors: Sayantini Majumdar, Riccardo Trivisonno, Georg Carle

    Abstract: The management of networks is automated by closed loops. Concurrent closed loops aiming for individual optimization cause conflicts which, left unresolved, leads to significant degradation in performance indicators, resulting in sub-optimal network performance. Centralized optimization avoids conflicts, but impractical in large-scale networks for time-critical applications. Distributed, pervasive… ▽ More

    Submitted 7 October, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 6 pages, 4 figures. This work has been submitted to the IEEE Networking Letters for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  44. arXiv:2107.10708  [pdf, other

    eess.AS cs.SD

    CarneliNet: Neural Mixture Model for Automatic Speech Recognition

    Authors: Aleksei Kalinov, Somshubra Majumdar, Jagadeesh Balam, Boris Ginsburg

    Abstract: End-to-end automatic speech recognition systems have achieved great accuracy by using deeper and deeper models. However, the increased depth comes with a larger receptive field that can negatively impact model performance in streaming scenarios. We propose an alternative approach that we call Neural Mixture Model. The basic idea is to introduce a parallel mixture of shallow networks instead of a v… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to ASRU 2021

  45. Using Undervolting as an On-Device Defense Against Adversarial Machine Learning Attacks

    Authors: Saikat Majumdar, Mohammad Hossein Samavatian, Kristin Barber, Radu Teodorescu

    Abstract: Deep neural network (DNN) classifiers are powerful tools that drive a broad spectrum of important applications, from image recognition to autonomous vehicles. Unfortunately, DNNs are known to be vulnerable to adversarial attacks that affect virtually all state-of-the-art models. These attacks make small imperceptible modifications to inputs that are sufficient to induce the DNNs to produce the wro… ▽ More

    Submitted 6 August, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Journal ref: 2021 IEEE International Symposium on Hardware Oriented Security and Trust (HOST)

  46. arXiv:2107.07341  [pdf

    cs.HC cs.AI cs.DC cs.LG cs.NE cs.SI

    Utilizing a digital swarm intelligence platform to improve consensus among radiologists and exploring its applications

    Authors: Rutwik Shah, Bruno Astuto, Tyler Gleason, Will Fletcher, Justin Banaga, Kevin Sweetwood, Allen Ye, Rina Patel, Kevin McGill, Thomas Link, Jason Crane, Valentina Pedoia, Sharmila Majumdar

    Abstract: Radiologists today play a key role in making diagnostic decisions and labeling images for training A.I. algorithms. Low inter-reader reliability (IRR) can be seen between experts when interpreting challenging cases. While teams-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit non-dominant participants from expressi… ▽ More

    Submitted 6 September, 2021; v1 submitted 26 June, 2021; originally announced July 2021.

    Comments: 29 pages, 3 tables, 7 figures

  47. arXiv:2107.01103  [pdf, other

    stat.ME cs.LG stat.ML

    Generalized Multivariate Signs for Nonparametric Hypothesis Testing in High Dimensions

    Authors: Subhabrata Majumdar, Snigdhansu Chatterjee

    Abstract: High-dimensional data, where the dimension of the feature space is much larger than sample size, arise in a number of statistical applications. In this context, we construct the generalized multivariate sign transformation, defined as a vector divided by its norm. For different choices of the norm function, the resulting transformed vector adapts to certain geometrical features of the data distrib… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  48. arXiv:2106.08482  [pdf, other

    cs.AI

    Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning

    Authors: Varun Kumar Vijay, Hassam Sheikh, Somdeb Majumdar, Mariano Phielipp

    Abstract: Inter-agent communication can significantly increase performance in multi-agent tasks that require co-ordination to achieve a shared goal. Prior work has shown that it is possible to learn inter-agent communication protocols using multi-agent reinforcement learning and message-passing network architectures. However, these models use an unconstrained broadcast communication model, in which an agent… ▽ More

    Submitted 8 December, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  49. arXiv:2106.07611  [pdf

    cs.NE cs.AI

    Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization

    Authors: Santiago Miret, Vui Seng Chua, Mattias Marder, Mariano Phielipp, Nilesh Jain, Somdeb Majumdar

    Abstract: Mixed-precision quantization is a powerful tool to enable memory and compute savings of neural network workloads by deploying different sets of bit-width precisions on separate compute operations. In this work, we present a flexible and scalable framework for automated mixed-precision quantization that concurrently optimizes task performance, memory compression, and compute savings through multi-o… ▽ More

    Submitted 1 April, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

  50. arXiv:2106.05825  [pdf, other

    cs.CR cs.AR cs.LG

    HASI: Hardware-Accelerated Stochastic Inference, A Defense Against Adversarial Machine Learning Attacks

    Authors: Mohammad Hossein Samavatian, Saikat Majumdar, Kristin Barber, Radu Teodorescu

    Abstract: Deep Neural Networks (DNNs) are employed in an increasing number of applications, some of which are safety critical. Unfortunately, DNNs are known to be vulnerable to so-called adversarial attacks that manipulate inputs to cause incorrect results that can be beneficial to an attacker or damaging to the victim. Multiple defenses have been proposed to increase the robustness of DNNs. In general, the… ▽ More

    Submitted 6 August, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Journal ref: Secure and Private Systems for Machine Learning Workshop 2021