Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 296 results for author: Joshi, R

.
  1. A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations

    Authors: Nidhi Kowtal, Tejas Deshpande, Raviraj Joshi

    Abstract: Machine translation in low-resource language pairs faces significant challenges due to the scarcity of parallel corpora and linguistic resources. This study focuses on the case of English-Marathi language pairs, where existing datasets are notably noisy, impeding the performance of machine translation models. To mitigate the impact of data quality issues, we propose a data filtering approach based… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Accepted at I2CT 2024

  2. arXiv:2409.02392  [pdf, other

    cs.LG stat.ML

    Building Math Agents with Multi-Turn Iterative Preference Learning

    Authors: Wei Xiong, Chengshuai Shi, Jiaming Shen, Aviv Rosenberg, Zhen Qin, Daniele Calandriello, Misha Khalman, Rishabh Joshi, Bilal Piot, Mohammad Saleh, Chi Jin, Tong Zhang, Tianqi Liu

    Abstract: Recent studies have shown that large language models' (LLMs) mathematical problem-solving capabilities can be enhanced by integrating external tools, such as code interpreters, and employing multi-turn Chain-of-Thought (CoT) reasoning. While current methods focus on synthetic data generation and Supervised Fine-Tuning (SFT), this paper studies the complementary direct preference learning approach… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: A multi-turn direct preference learning framework for tool-integrated reasoning tasks

  3. arXiv:2408.17254  [pdf, other

    astro-ph.SR

    High-resolution observations of recurrent jets from an arch filament system

    Authors: Reetika Joshi, Luc Rouppe van der Voort, Brigitte Schmieder, Fernando Moreno-Insertis, Avijeet Prasad, Guillaume Aulanier, Daniel Nóbrega-Siverio

    Abstract: Solar jets are collimated plasma ejections along magnetic field lines observed in hot (EUV jets) and cool (chromospheric surges) temperature diagnostics. Their trigger mechanisms and the relationship between hot and cool jets are still not completely understood. We aim to investigate the generation of a sequence of active region solar jets and their evolution from the photospheric to the coronal h… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 14 pages, 11 figures, accepted for publication in A&A

  4. arXiv:2408.13558  [pdf, ps, other

    math.CO

    Combinatorial invariants for certain classes of non-abelian groups

    Authors: Naveen K. Godara, Renu Joshi, Eshita Mazumdar

    Abstract: This article focuses on the study of zero-sum invariants of finite non-abelian groups. We address two main problems: the first centers on the ordered Davenport constant and the second on Gao's constant. We establish a connection between the ordered Davenport constant and the small Davenport constant for a finite non-abelian group of even order, which in turn gives a relation with the Noether numbe… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 15 pages

    MSC Class: 11B75; 11P70

  5. arXiv:2408.11796  [pdf, other

    cs.CL cs.AI cs.LG

    LLM Pruning and Distillation in Practice: The Minitron Approach

    Authors: Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov

    Abstract: We present a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B parameters, respectively, using pruning and distillation. We explore two distinct pruning strategies: (1) depth pruning and (2) joint hidden/attention/MLP (width) pruning, and evaluate the results on common benchmarks from the LM Evaluation Harness. The models are then aligned with NeMo Align… ▽ More

    Submitted 26 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: v2: Added missing references. Cleaned up runtime performance section

  6. Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi

    Authors: Pranita Deshmukh, Nikita Kulkarni, Sanhita Kulkarni, Kareena Manghani, Raviraj Joshi

    Abstract: With the surge in digital content in low-resource languages, there is an escalating demand for advanced Natural Language Processing (NLP) techniques tailored to these languages. BERT (Bidirectional Encoder Representations from Transformers), serving as the foundational framework for numerous NLP architectures and language models, is increasingly employed for the development of low-resource NLP mod… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Accepted at I2CT 2024

  7. arXiv:2407.15816  [pdf

    cs.CV

    Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images

    Authors: Kshitij Ingale, Sun Hae Hong, Qiyuan Hu, Renyu Zhang, Bo Osinski, Mina Khoshdeli, Josh Och, Kunal Nagpal, Martin C. Stumpe, Rohan P. Joshi

    Abstract: Molecular testing of tumor samples for targetable biomarkers is restricted by a lack of standardization, turnaround-time, cost, and tissue availability across cancer types. Additionally, targetable alterations of low prevalence may not be tested in routine workflows. Algorithms that predict DNA alterations from routinely generated hematoxylin and eosin (H&E)-stained images could prioritize samples… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  8. arXiv:2407.14679  [pdf, other

    cs.CL cs.AI cs.LG

    Compact Language Models via Pruning and Knowledge Distillation

    Authors: Saurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov

    Abstract: Large language models (LLMs) targeting different deployment scales and sizes are currently produced by training each variant from scratch; this is extremely compute-intensive. In this paper, we investigate if pruning an existing LLM and then re-training it with a fraction (<3%) of the original training data can be a suitable alternative to repeated, full retraining. To this end, we develop a set o… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  9. arXiv:2407.12869  [pdf, ps, other

    cs.CL cs.AI

    Bilingual Adaptation of Monolingual Foundation Models

    Authors: Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming, Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Onkar Pandit, Satheesh Katipomu, Samta Kamboj, Samujjwal Ghosh, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami, Preslav Nakov

    Abstract: We present an efficient method for adapting a monolingual Large Language Model (LLM) to another language, addressing challenges of catastrophic forgetting and tokenizer limitations. We focus this study on adapting Llama 2 to Arabic. Our two-stage approach begins with expanding the vocabulary and training only the embeddings matrix, followed by full model continual pre-training on a bilingual corpu… ▽ More

    Submitted 25 July, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

  10. arXiv:2407.10940  [pdf, other

    quant-ph

    Quantum Control of an Oscillator with a Kerr-cat Qubit

    Authors: Andy Z. Ding, Benjamin L. Brock, Alec Eickbusch, Akshay Koottandavida, Nicholas E. Frattini, Rodrigo G. Cortinas, Vidul R. Joshi, Stijn J. de Graaf, Benjamin J. Chapman, Suhas Ganjam, Luigi Frunzio, Robert J. Schoelkopf, Michel H. Devoret

    Abstract: Bosonic codes offer a hardware-efficient strategy for quantum error correction by redundantly encoding quantum information in the large Hilbert space of a harmonic oscillator. However, experimental realizations of these codes are often limited by ancilla errors propagating to the encoded logical qubit during syndrome measurements. The Kerr-cat qubit has been proposed as an ancilla for these codes… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  11. arXiv:2407.05493  [pdf, other

    cond-mat.supr-con

    Conventional s-wave superconductivity and hidden peak effect in single crystals of Mo$_8$Ga$_41$ superconductor

    Authors: Sunil Ghimire, Kyuil Cho, Kamal R. Joshi, Makariy A. Tanatar, Zhixiang Hu, Cedomir Petrovic, Ruslan Prozorov

    Abstract: London and Campbell penetration depths were measured in single crystals of the endohedral gallide cluster superconductor, Mo$_{8}$Ga$_{41}$. The full temperature range superfluid density is consistent with the clean isotropic $s-$wave weak-coupling BCS theory without any signs of the second gap or strong coupling. The temperature dependence of the Campbell length is hysteretic between zero-field c… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  12. arXiv:2407.05264  [pdf, ps, other

    math.CO cs.DM

    $θ$-free matching covered graphs

    Authors: Rohinee Joshi, Nishad Kothari

    Abstract: A nontrivial connected graph is matching covered if each edge belongs to some perfect matching. For most problems pertaining to perfect matchings, one may restrict attention to matching covered graphs; thus, there is extensive literature on them. A cornerstone of this theory is an ear decomposition result due to Lovász and Plummer. Their theorem is a fundamental problem-solving tool, and also yiel… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Submitted to a journal

  13. arXiv:2407.03587  [pdf, other

    cond-mat.supr-con

    Single-gap Isotropic $s-$wave Superconductivity in Single Crystals $\text{AuSn}_4$

    Authors: Sunil Ghimire, Kamal R. Joshi, Elizabeth H. Krenkel, Makariy A. Tanatar, Marcin Konczykowski, Romain Grasset, Paul C. Canfield, Ruslan Prozorov

    Abstract: London, $λ_L (T)$, and Campbell, $λ_{C} (T)$, penetration depths were measured in single crystals of a topological superconductor candidate $\text{AuSn}_4$. At low temperatures, $λ_L (T)$ is exponentially attenuated and, if fitted with the power law, $λ(T) \sim T^n$, gives exponents $n>4$, indistinguishable from the isotropic single $s-$wave gap Bardeen-Cooper-Schrieffer (BCS) asymptotic. The supe… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  14. arXiv:2407.01148  [pdf, ps, other

    math.CO

    On a conjecture related to the Davenport constant

    Authors: Naveen K. Godara, Renu Joshi, Eshita Mazumdar

    Abstract: For a finite group $G,$ $D(G)$ is defined as the least positive integer $k$ such that for every sequence $S=g_1 g_2\cdots g_k$ of length $k$ over $G$, there exist $1 \le i_1 < i_2 <\cdots < i_m \le k $ such that $\prod_{j=1}^{m} g_{i_{σ(j)}}=1$ holds for $σ= id,$ identity element of $S_m.$ For a finite abelian group, this group invariant, known as the Davenport constant, is crucial in the theory o… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  15. Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval

    Authors: Rohan Chavan, Gaurav Patil, Vishal Madle, Raviraj Joshi

    Abstract: Stopwords are commonly used words in a language that are often considered to be of little value in determining the meaning or significance of a document. These words occur frequently in most texts and don't provide much useful information for tasks like sentiment analysis and text classification. English, which is a high-resource language, takes advantage of the availability of stopwords, whereas… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at I2CT 2024

  16. Universal Cross-Lingual Text Classification

    Authors: Riya Savant, Anushka Shelke, Sakshi Todmal, Sanskruti Kanphade, Ananya Joshi, Raviraj Joshi

    Abstract: Text classification, an integral task in natural language processing, involves the automatic categorization of text into predefined classes. Creating supervised labeled datasets for low-resource languages poses a considerable challenge. Unlocking the language potential of low-resource languages requires robust datasets with supervised labels. However, such datasets are scarce, and the label space… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at I2CT 2024

  17. arXiv:2406.03989  [pdf, other

    astro-ph.HE

    Numerical Simulation of Radiatively driven Transonic Relativistic Jets

    Authors: Raj Kishor Joshi, Indranil Chattopadhyay, Antonios Tsokaros, Priyesh Kumar Tripathi

    Abstract: We perform the numerical simulations of axisymmetric, relativistic, optically thin jets under the influence of the radiation field of an accretion disk. We show that starting from a very low injection velocity at the base, jets can be accelerated to relativistic terminal speeds when traveling through the radiation field. The jet gains momentum through the interaction with the radiation field. We u… ▽ More

    Submitted 30 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 19 pages, 11 figures, accepted for publication in ApJ

  18. arXiv:2405.19107  [pdf, ps, other

    cs.LG cs.AI

    Offline Regularised Reinforcement Learning for Large Language Models Alignment

    Authors: Pierre Harvey Richemond, Yunhao Tang, Daniel Guo, Daniele Calandriello, Mohammad Gheshlaghi Azar, Rafael Rafailov, Bernardo Avila Pires, Eugene Tarassov, Lucas Spangher, Will Ellsworth, Aliaksei Severyn, Jonathan Mallinson, Lior Shani, Gil Shamir, Rishabh Joshi, Tianqi Liu, Remi Munos, Bilal Piot

    Abstract: The dominant framework for alignment of large language models (LLM), whether through reinforcement learning from human feedback or direct preference optimisation, is to learn from preference data. This involves building datasets where each element is a quadruplet composed of a prompt, two independent responses (completions of the prompt) and a human preference between the two independent responses… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  19. arXiv:2405.07933  [pdf, other

    cs.CV

    Authentic Hand Avatar from a Phone Scan via Universal Hand Model

    Authors: Gyeongsik Moon, Weipeng Xu, Rohan Joshi, Chenglei Wu, Takaaki Shiratori

    Abstract: The authentic 3D hand avatar with every identifiable information, such as hand shapes and textures, is necessary for immersive experiences in AR/VR. In this paper, we present a universal hand model (UHM), which 1) can universally represent high-fidelity 3D hand meshes of arbitrary identities (IDs) and 2) can be adapted to each person with a short phone scan for the authentic hand avatar. For effec… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  20. TextGram: Towards a better domain-adaptive pretraining

    Authors: Sharayu Hiwarkhedkar, Saloni Mittal, Vidula Magdum, Omkar Dhekane, Raviraj Joshi, Geetanjali Kale, Arnav Ladkat

    Abstract: For green AI, it is crucial to measure and reduce the carbon footprint emitted during the training of large language models. In NLP, performing pre-training on Transformer models requires significant computational resources. This pre-training involves using a large amount of text data to gain prior knowledge for performing downstream tasks. Thus, it is important that we select the correct data in… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted at SPELLL 2023

  21. L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi

    Authors: Saloni Mittal, Vidula Magdum, Omkar Dhekane, Sharayu Hiwarkhedkar, Raviraj Joshi

    Abstract: The availability of text or topic classification datasets in the low-resource Marathi language is limited, typically consisting of fewer than 4 target labels, with some achieving nearly perfect accuracy. In this work, we introduce L3Cube-MahaNews, a Marathi text classification corpus that focuses on News headlines and articles. This corpus stands out as the largest supervised Marathi Corpus, conta… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted at SPELLL 2023

  22. arXiv:2404.13364  [pdf, other

    cs.CL cs.LG

    MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering

    Authors: Ruturaj Ghatage, Aditya Kulkarni, Rajlaxmi Patil, Sharvi Endait, Raviraj Joshi

    Abstract: Question-answering systems have revolutionized information retrieval, but linguistic and cultural boundaries limit their widespread accessibility. This research endeavors to bridge the gap of the absence of efficient QnA datasets in low-resource languages by translating the English Question Answering Dataset (SQuAD) using a robust data curation approach. We introduce MahaSQuAD, the first-ever full… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Accepted at the International Conference on Natural Language Processing (ICON 2023)

  23. Generic low-atmosphere signatures of swirled-anemone jets

    Authors: Reetika Joshi, Guillaume Aulanier, Alice Radcliffe, Luc Rouppe van der Voort, Etienne Pariat, Daniel Nóbrega-Siverio, Brigitte Schmieder

    Abstract: Solar jets are collimated plasma flows moving along magnetic field lines and accelerated at low altitude following magnetic reconnection. Several of them originate from anemone-shaped low-lying arcades and the most impulsive ones tend to be relatively wider and display untwisting motions. We aim to establish typical behaviours and observational signatures in the low atmosphere that can occur in re… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages, 8 figures, Accepted for publication in Astronomy and Astrophysics

    Journal ref: A&A 687, A172 (2024)

  24. arXiv:2403.16796  [pdf

    physics.app-ph

    Development and Assessment of a Miniaturized Thermocouple for Precise Temperature Measurement in Biological Tissues and Cells

    Authors: Onnop Srivannavit, Rakesh Joshi, Weibin Zhu, Bin Gong, Stuart C. Sealfon, Theodorian Borca-Tasciuc, Angelo Gaitas

    Abstract: This study presents a novel thermocouple instrument designed for precise temperature monitoring within biological tissues and cells, addressing a significant gap in biological research. Constructed on a Silicon-On-Insulator (SOI) substrate, the instrument employs doped silicon and chromium/gold junctions, achieving a Seebeck coefficient of up to 447 uV/K, rapid response times, high temperature acc… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  25. arXiv:2403.16279  [pdf, other

    cond-mat.supr-con

    The nontrivial effects of annealing on superconducting properties of Nb single crystals

    Authors: Amlan Datta, Kamal R. Joshi, Giulia Berti, Sunil Ghimire, Aidan Goerdt, Makariy A. Tanatar, Deborah L. Schlagel, Matthew F. Besser, Dapeng Jing, Matthew Kramer, Maria Iavarone, Ruslan Prozorov

    Abstract: The effect of annealing on the superconducting properties of niobium single crystals cut from the same master boule was studied by local and global magnetic measurements, as well as scanning tunneling microscopy (STM). The formation of large hydride precipitates was observed in unannealed samples. The variation in structural and magnetic properties was studied after annealing under high vacuum at… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  26. arXiv:2403.14891  [pdf, other

    cond-mat.supr-con

    Creep-enhanced vortex pinning revealed through nonmonotonic relaxation of the Campbell length

    Authors: Sunil Ghimire, Filippo Gaggioli, Kamal R. Joshi, Marcin Konczykowski, Romain Grasset, Elizabeth H. Krenkel, Amlan Datta, Makariy A. Tanatar, Shuzhang Chen, Cedomir Petrovic, Vadim B. Geshkenbein, Ruslan Prozorov

    Abstract: We study the effects of flux creep on the linear AC response of the vortex lattice in single crystals Ca$_3$Ir$_4$Sn$_{13}$ by measuring the Campbell penetration depth, $λ_{\rm \scriptscriptstyle C}(T,H,t)$. Thermal fluctuations release vortices from shallow pinning sites, only for them to become re-trapped by deeper potential wells, causing an initial increase of the effective Labusch parameter,… ▽ More

    Submitted 31 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  27. Small-scale magnetic flux emergence preceding a chain of energetic solar atmospheric events

    Authors: D. Nóbrega-Siverio, I. Cabello, S. Bose, L. H. M. Rouppe van der Voort, R. Joshi, C. Froment, V. M. J. Henriques

    Abstract: Advancements in instrumentation have revealed a multitude of small-scale EUV events in the solar atmosphere. Our aim is to employ high-resolution magnetograms to gain a detailed understanding of the magnetic origin of such phenomena. We have used coordinated observations from SST, IRIS, and SDO to analyze an ephemeral magnetic flux emergence episode and the following chain of small-scale energetic… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted in A&A, 11 pages, 7 figures, 5 movies

    Journal ref: A&A 686, A218 (2024)

  28. arXiv:2403.11290  [pdf, other

    astro-ph.GA

    A sample of 25 radio galaxies with highly unusual radio morphologies, selected from the LoTSS-DR2 survey at 144 MHz

    Authors: Gopal-Krishna, Dusmanta Patra, Ravi Joshi

    Abstract: From a careful visual scrutiny of the radio structures of a well-defined sample of 2428 sources in the LoTSS DR2 survey made at 144 MHz with a 6" beam, we have selected a subset of 25 (i.e., 1%) sources showing highly unusual radio structures, not conforming to the prevalent radio morphological classification. Here we present and briefly discuss the basic properties of these rare morphological out… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 28 pages, 25 figures, submitted to Journal of Astrophysics and Astronomy, Comments welcome

  29. arXiv:2403.08635  [pdf, other

    cs.LG cs.AI stat.ML

    Human Alignment of Large Language Models through Online Preference Optimisation

    Authors: Daniele Calandriello, Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot

    Abstract: Ensuring alignment of language models' outputs with human preferences is critical to guarantee a useful, safe, and pleasant user experience. Thus, human alignment has been extensively studied recently and several methods such as Reinforcement Learning from Human Feedback (RLHF), Direct Policy Optimisation (DPO) and Sequence Likelihood Calibration (SLiC) have emerged. In this paper, our contributio… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  30. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  31. arXiv:2403.04981  [pdf, other

    cs.ET

    Paving the Way for Pass Disturb Free Vertical NAND Storage via A Dedicated and String-Compatible Pass Gate

    Authors: Zijian Zhao, Sola Woo, Khandker Akif Aabrar, Sharadindu Gopal Kirtania, Zhouhang Jiang, Shan Deng, Yi Xiao, Halid Mulaosmanovic, Stefan Duenkel, Dominik Kleimaier, Steven Soss, Sven Beyer, Rajiv Joshi, Scott Meninger, Mohamed Mohamed, Kijoon Kim, Jongho Woo, Suhwan Lim, Kwangsoo Kim, Wanki Kim, Daewon Ha, Vijaykrishnan Narayanan, Suman Datta, Shimeng Yu, Kai Ni

    Abstract: In this work, we propose a dual-port cell design to address the pass disturb in vertical NAND storage, which can pass signals through a dedicated and string-compatible pass gate. We demonstrate that: i) the pass disturb-free feature originates from weakening of the depolarization field by the pass bias at the high-${V}_{TH}$ (HVT) state and the screening of the applied field by channel at the low-… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 29 pages, 7 figures

  32. arXiv:2402.09545  [pdf

    cs.CR cs.ET

    A 3D Memristor Architecture for In-Memory Computing Demonstrated with SHA3

    Authors: Muayad J. Aljafar, Rasika Joshi, John M. Acken

    Abstract: Security is a growing problem that needs hardware support. Memristors provide an alternative technology for hardware-supported security implementation. This paper presents a specific technique that utilizes the benefits of hybrid CMOS-memristors technology demonstrated with SHA3 over implementations that use only memristor technology. In the proposed technique, SHA3 is implemented in a set of perp… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 14 pages, 4 tables, 12 figures

  33. arXiv:2402.06185  [pdf, other

    cs.CV cs.AI cs.LG

    Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters

    Authors: Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, Todd C. Hollon, Paul Park

    Abstract: Objective. Achieving appropriate spinopelvic alignment has been shown to be associated with improved clinical symptoms. However, measurement of spinopelvic radiographic parameters is time-intensive and interobserver reliability is a concern. Automated measurement tools have the promise of rapid and consistent measurements, but existing tools are still limited by some degree of manual user-entry re… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, to appear in Journal of Neurosurgery: Spine

  34. arXiv:2402.01878  [pdf, other

    cs.CL cs.LG

    LiPO: Listwise Preference Optimization through Learning-to-Rank

    Authors: Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen, Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J. Liu, Xuanhui Wang

    Abstract: Aligning language models (LMs) with curated human feedback is critical to control their behaviors in real-world applications. Several recent policy optimization methods, such as DPO and SLiC, serve as promising alternatives to the traditional Reinforcement Learning from Human Feedback (RLHF) approach. In practice, human feedback often comes in a format of a ranked list over multiple responses to a… ▽ More

    Submitted 22 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  35. arXiv:2401.12518  [pdf, ps, other

    physics.flu-dyn physics.bio-ph physics.ed-ph

    Existence of three distinct scaling regimes in self-propelled rigid pitching airfoil

    Authors: Rakshita Joshi, Jaywant Arakeri

    Abstract: Oscillating foils in self-propelled mode are the simplest model for investigating oscillatory locomotion in cruising fishes. In this investigation, we explore the self-propulsion characterisitics of a NACA0015 section airfoil, with chord length $C$, subjected to sinusoidal pitching using a rotary apparatus. A power-spring-based crank-rocker mechanism actuates the airfoil. We examine the effect of… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  36. arXiv:2401.07786  [pdf, other

    astro-ph.HE

    Oscillating shocks in the transonic, viscous, variable $Γ$, accretion flows around black holes

    Authors: Sanjit Debnath, Indranil Chattopadhyay, Raj Kishor Joshi

    Abstract: We investigate the time evolution of the transonic-viscous accretion flow around a non-rotating black hole. The input parameters used for the simulation are obtained from semi-analytical solutions. This code is based on the TVD routine and correctly handles the angular momentum transport due to viscosity. The thermodynamic properties of the flow are described by a variable adiabatic index equation… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in MNRAS; 18 pages, 16 figures

  37. arXiv:2401.07094  [pdf, ps, other

    math.GR

    The Bogomolov multiplier of a multiplicative Lie algebra

    Authors: Amit Kumar, Renu Joshi, Mani Shankar Pandey, Sumit Kumar Upadhyay

    Abstract: In this paper, we develop the concept of the Bogomolov multiplier for a multiplicative Lie algebra and establish a Hopf-type formula. Consequently, we see that the Bogomolov multipliers of two isoclinic multiplicative Lie algebras are isomorphic.

    Submitted 13 January, 2024; originally announced January 2024.

  38. arXiv:2401.05334  [pdf, other

    cs.CV cs.GR

    URHand: Universal Relightable Hands

    Authors: Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito

    Abstract: Existing photorealistic relightable hand models require extensive identity-specific observations in different views, poses, and illuminations, and face challenges in generalizing to natural illuminations and novel identities. To bridge this gap, we present URHand, the first universal relightable hand model that generalizes across viewpoints, poses, illuminations, and identities. Our model allows f… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Project Page https://frozenburning.github.io/projects/urhand/

  39. arXiv:2401.02254  [pdf, other

    cs.CL cs.LG

    L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages

    Authors: Aishwarya Mirashi, Srushti Sonavane, Purva Lingayat, Tejas Padhiyar, Raviraj Joshi

    Abstract: In this work, we introduce L3Cube-IndicNews, a multilingual text classification corpus aimed at curating a high-quality dataset for Indian regional languages, with a specific focus on news headlines and articles. We have centered our work on 10 prominent Indic languages, including Hindi, Bengali, Marathi, Telugu, Tamil, Gujarati, Kannada, Odia, Malayalam, and Punjabi. Each of these news datasets c… ▽ More

    Submitted 26 April, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted at the International Conference on Natural Language Processing (ICON 2023)

  40. L3Cube-MahaSocialNER: A Social Media based Marathi NER Dataset and BERT models

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi

    Abstract: This work introduces the L3Cube-MahaSocialNER dataset, the first and largest social media dataset specifically designed for Named Entity Recognition (NER) in the Marathi language. The dataset comprises 18,000 manually labeled sentences covering eight entity classes, addressing challenges posed by social media data, including non-standard language and informal idioms. Deep learning models, includin… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted at Forum for Information Retrieval Evaluation (FIRE 2023)

  41. arXiv:2312.17166  [pdf, other

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Signatures of quantum phases in a dissipative system

    Authors: Rohan Joshi, Saikat Mondal, Souvik Bandyopadhyay, Sourav Bhattacharjee, Adhip Agarwala

    Abstract: Lindbladian formalism, as tuned to dissipative and open systems, has been all-pervasive to interpret non-equilibrium steady states of quantum many-body systems. We study the fate of free fermionic and superconducting phases in a dissipative one-dimensional Kitaev model - where the bath acts both as a source and a sink of fermionic particles with different coupling rates. As a function of these two… ▽ More

    Submitted 16 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 14 pages, 10 figures

    Journal ref: Journal of Physics: Condensed Matter 36 (2024) 275601

  42. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  43. IndoorGNN: A Graph Neural Network based approach for Indoor Localization using WiFi RSSI

    Authors: Rahul Vishwakarma, Rucha Bhalchandra Joshi, Subhankar Mishra

    Abstract: Indoor localization is the process of determining the location of a person or object inside a building. Potential usage of indoor localization includes navigation, personalization, safety and security, and asset tracking. Commonly used technologies for indoor localization include WiFi, Bluetooth, RFID, and Ultra-wideband. Among these, WiFi's Received Signal Strength Indicator (RSSI)-based localiza… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Journal ref: Lecture Notes in Computer Science, vol 14418, year 2023. Springer, Cham

  44. arXiv:2312.07418  [pdf

    cs.CV

    Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023)

    Authors: Kabita Parajuli, Shashidhar Ram Joshi

    Abstract: Video captioning in Nepali, a language written in the Devanagari script, presents a unique challenge due to the lack of existing academic work in this domain. This work develops a novel encoder-decoder paradigm for Nepali video captioning to tackle this difficulty. LSTM and GRU sequence-to-sequence models are used in the model to produce related textual descriptions based on features retrieved fro… ▽ More

    Submitted 19 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  45. On Significance of Subword tokenization for Low Resource and Efficient Named Entity Recognition: A case study in Marathi

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi, Sachin Pande

    Abstract: Named Entity Recognition (NER) systems play a vital role in NLP applications such as machine translation, summarization, and question-answering. These systems identify named entities, which encompass real-world concepts like locations, persons, and organizations. Despite extensive research on NER systems for the English language, they have not received adequate attention in the context of low reso… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at ICDAM 2023

  46. arXiv:2312.01206  [pdf, other

    astro-ph.GA

    The Formation of Bars and Warps in Rotating Halos

    Authors: Robin Joshi, Lawrence M. Widrow

    Abstract: We investigate the effects of halo kinematics on the dynamics of stellar discs by simulating the evolution of isolated disc-halo systems from equilibrium initial conditions. Our main results come from four simulations where the initial disc is identical and the halo is either treated as a rigid potential or is live with isotropic orbits or orbits that preferentially rotate with or counter to the d… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 14 pages, 13 figures. Accepted for publication in Monthly Notices of the Royal Astronomical Society

  47. arXiv:2312.01107  [pdf, other

    cs.LG

    Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are being built using end-to-end deep learning approaches. However, these systems require huge amounts of training data. We present our approach to built production quality TTS and perform speaker adaptation in extremely low resource settings. We propose a transfer learning approach using high-resource language data and synthetically generated data. We transfer the lea… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at PACLIC 2023

  48. Code-Mixed Text to Speech Synthesis under Low-Resource Constraints

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are an important component in voice-based e-commerce applications. These applications include end-to-end voice assistant and customer experience (CX) voice bot. Code-mixed TTS is also relevant in these applications since the product names are commonly described in English while the surrounding text is in a regional language. In this work, we describe our approaches for… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at SPECOM 2023

  49. arXiv:2311.17722  [pdf, other

    cs.CL cs.LG

    SenTest: Evaluating Robustness of Sentence Encoders

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

    Abstract: Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due to their rich and effective sentence representations. Having effective sentence representations is paramount in multiple tasks, such as information retrieval, re… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  50. arXiv:2311.14335  [pdf, other

    cs.LG cs.AI

    Comparative Analysis of Transformers for Modeling Tabular Data: A Casestudy using Industry Scale Dataset

    Authors: Usneek Singh, Piyush Arora, Shamika Ganesan, Mohit Kumar, Siddhant Kulkarni, Salil R. Joshi

    Abstract: We perform a comparative analysis of transformer-based models designed for modeling tabular data, specifically on an industry-scale dataset. While earlier studies demonstrated promising outcomes on smaller public or synthetic datasets, the effectiveness did not extend to larger industry-scale datasets. The challenges identified include handling high-dimensional data, the necessity for efficient pr… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted at 7th Joint International Conference on Data Science & Management of Data (11th ACMIKDD CODS and 29th COMAD)