Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 99 results for author: Srinivasan, P

.
  1. arXiv:2407.01849  [pdf, other

    math.CT

    What kind of linearly distributive category do polynomial functors form?

    Authors: David I. Spivak, Priyaa Varshinee Srinivasan

    Abstract: This paper has two purposes. The first is to extend the theory of linearly distributive categories by considering the structures that emerge in a special case: the normal duoidal category $(\mathsf{Poly} ,\mathcal{y}, \otimes, \triangleleft )$ of polynomial functors under Dirichlet and substitution product. This is an isomix LDC which is neither $*$-autonomous nor fully symmetric. The additional s… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 32 pages

    MSC Class: 18B99 ACM Class: F.4.1

  2. arXiv:2406.06527  [pdf, other

    cs.CV cs.AI cs.GR

    IllumiNeRF: 3D Relighting without Inverse Rendering

    Authors: Xiaoming Zhao, Pratul P. Srinivasan, Dor Verbin, Keunhong Park, Ricardo Martin Brualla, Philipp Henzler

    Abstract: Existing methods for relightable view synthesis -- using a set of images of an object under unknown lighting to recover a 3D representation that can be rendered from novel viewpoints under a target illumination -- are based on inverse rendering, and attempt to disentangle the object geometry, materials, and lighting that explain the input images. Furthermore, this typically involves optimization t… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://illuminerf.github.io/

  3. arXiv:2405.14871  [pdf, other

    cs.CV cs.GR

    NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections

    Authors: Dor Verbin, Pratul P. Srinivasan, Peter Hedman, Ben Mildenhall, Benjamin Attal, Richard Szeliski, Jonathan T. Barron

    Abstract: Neural Radiance Fields (NeRFs) typically struggle to reconstruct and render highly specular objects, whose appearance varies quickly with changes in viewpoint. Recent works have improved NeRF's ability to render detailed specular appearance of distant environment illumination, but are unable to synthesize consistent reflections of closer content. Moreover, these techniques rely on large computatio… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Project page: http://nerf-casting.github.io

  4. arXiv:2405.13181  [pdf, other

    cs.CL cs.LG

    Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting

    Authors: Krishna Prasad Varadarajan Srinivasan, Prasanth Gumpena, Madhusudhana Yattapu, Vishal H. Brahmbhatt

    Abstract: In the domain of large language models (LLMs), arXiv:2305.16938 showed that few-shot full-model fine-tuning -- namely Vanilla Fine Tuning (FT) and Pattern-Based Fine Tuning (PBFT) --, and In-Context Learning (ICL) generalize similarly on Out-Of-Domain (OOD) datasets, but vary in terms of task adaptation. However, they both pose challenges, especially in term of memory requirements. In this paper,… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages of main paper, 1 page of references, 6 appendix pages, 11 figures, 18 tables

  5. arXiv:2405.10314  [pdf, other

    cs.CV

    CAT3D: Create Anything in 3D with Multi-View Diffusion Models

    Authors: Ruiqi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron, Ben Poole

    Abstract: Advances in 3D reconstruction have enabled high-quality 3D capture, but require a user to collect hundreds to thousands of images to create a 3D scene. We present CAT3D, a method for creating anything in 3D by simulating this real-world capture process with a multi-view diffusion model. Given any number of input images and a set of target novel viewpoints, our model generates highly consistent nov… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Project page: https://cat3d.github.io

  6. arXiv:2405.05938  [pdf, other

    cs.CL

    DOLOMITES: Domain-Specific Long-Form Methodical Tasks

    Authors: Chaitanya Malaviya, Priyanka Agrawal, Kuzman Ganchev, Pranesh Srinivasan, Fantine Huot, Jonathan Berant, Mark Yatskar, Dipanjan Das, Mirella Lapata, Chris Alberti

    Abstract: Experts in various fields routinely perform methodical writing tasks to plan, organize, and report their work. From a clinician writing a differential diagnosis for a patient, to a teacher writing a lesson plan for students, these tasks are pervasive, requiring to methodically generate structured long-form output for a given input. We develop a typology of methodical tasks structured in the form o… ▽ More

    Submitted 28 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Dataset now available at https://dolomites-benchmark.github.io

  7. arXiv:2404.16399  [pdf, other

    cs.LG cs.AI

    Offline Reinforcement Learning with Behavioral Supervisor Tuning

    Authors: Padmanaba Srinivasan, William Knottenbelt

    Abstract: Offline reinforcement learning (RL) algorithms are applied to learn performant, well-generalizing policies when provided with a static dataset of interactions. Many recent approaches to offline RL have seen substantial success, but with one key caveat: they demand substantial per-dataset hyperparameter tuning to achieve reported performance, which requires policy rollouts in the environment to eva… ▽ More

    Submitted 27 July, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  8. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  9. arXiv:2402.18226  [pdf, ps, other

    math.CT

    Drazin Inverses in Categories

    Authors: Robin Cockett, Jean-Simon Pacaud Lemay, Priyaa Varshinee Srinivasan

    Abstract: Drazin inverses are a fundamental algebraic structure which have been extensively deployed in semigroup theory and ring theory. Drazin inverses can also be defined for endomorphisms in any category. However, beyond a paper by Puystjens and Robinson from 1987, not much has been done with Drazin inverses in category theory. As such, here we provide a survey of the theory of Drazin inverses from a ca… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  10. arXiv:2402.12377  [pdf, other

    cs.CV

    Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis

    Authors: Christian Reiser, Stephan Garbin, Pratul P. Srinivasan, Dor Verbin, Richard Szeliski, Ben Mildenhall, Jonathan T. Barron, Peter Hedman, Andreas Geiger

    Abstract: While surface-based view synthesis algorithms are appealing due to their low computational requirements, they often struggle to reproduce thin structures. In contrast, more expensive methods that model the scene's geometry as a volumetric density field (e.g. NeRF) excel at reconstructing fine geometric detail. However, density fields often represent geometry in a "fuzzy" manner, which hinders exac… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Project page at https://binary-opacity-grid.github.io

  11. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  12. arXiv:2312.10003  [pdf, other

    cs.CL

    ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

    Authors: Renat Aksitov, Sobhan Miryoosefi, Zonglin Li, Daliang Li, Sheila Babayan, Kavya Kopparapu, Zachary Fisher, Ruiqi Guo, Sushant Prakash, Pranesh Srinivasan, Manzil Zaheer, Felix Yu, Sanjiv Kumar

    Abstract: Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 19 pages, 4 figures, 4 tables, 8 listings

  13. arXiv:2312.05283  [pdf, other

    cs.CV cs.GR

    Nuvo: Neural UV Mapping for Unruly 3D Representations

    Authors: Pratul P. Srinivasan, Stephan J. Garbin, Dor Verbin, Jonathan T. Barron, Ben Mildenhall

    Abstract: Existing UV mapping algorithms are designed to operate on well-behaved meshes, instead of the geometry representations produced by state-of-the-art 3D reconstruction and generation techniques. As such, applying these methods to the volume densities recovered by neural radiance fields and related techniques (or meshes triangulated from such fields) results in texture atlases that are too fragmented… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project page at https://pratulsrinivasan.github.io/nuvo

  14. arXiv:2312.02981  [pdf, other

    cs.CV

    ReconFusion: 3D Reconstruction with Diffusion Priors

    Authors: Rundi Wu, Ben Mildenhall, Philipp Henzler, Keunhong Park, Ruiqi Gao, Daniel Watson, Pratul P. Srinivasan, Dor Verbin, Jonathan T. Barron, Ben Poole, Aleksander Holynski

    Abstract: 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for nove… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://reconfusion.github.io/

  15. arXiv:2312.02149  [pdf, other

    cs.CV cs.AI cs.CL cs.GR

    Generative Powers of Ten

    Authors: Xiaojuan Wang, Janne Kontkanen, Brian Curless, Steve Seitz, Ira Kemelmacher, Ben Mildenhall, Pratul Srinivasan, Dor Verbin, Aleksander Holynski

    Abstract: We present a method that uses a text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene, e.g., ranging from a wide-angle landscape view of a forest to a macro shot of an insect sitting on one of the tree branches. We achieve this through a joint multi-scale diffusion sampling approach that encourages consistency across different… ▽ More

    Submitted 21 May, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Project page: https://powers-of-10.github.io/

  16. arXiv:2311.03128  [pdf

    cs.NE quant-ph

    Benchmarking Differential Evolution on a Quantum Simulator

    Authors: Parthasarathy Srinivasan

    Abstract: The use of Evolutionary Algorithms (EA) for solving Mathematical/Computational Optimization Problems is inspired by the biological processes of Evolution. Few of the primitives involved in the Evolutionary process/paradigm are selection of 'Fit' individuals (from a population sample) for retention, cloning, mutation, discarding, breeding, crossover etc. In the Evolutionary Algorithm abstraction, t… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  17. arXiv:2310.07687  [pdf, other

    astro-ph.HE astro-ph.IM cs.CV

    Orbital Polarimetric Tomography of a Flare Near the Sagittarius A* Supermassive Black Hole

    Authors: Aviad Levis, Andrew A. Chael, Katherine L. Bouman, Maciek Wielgus, Pratul P. Srinivasan

    Abstract: The interaction between the supermassive black hole at the center of the Milky Way, Sagittarius A*, and its accretion disk occasionally produces high-energy flares seen in X-ray, infrared, and radio. One proposed mechanism that produces flares is the formation of compact, bright regions that appear within the accretion disk and close to the event horizon. Understanding these flares provides a wind… ▽ More

    Submitted 16 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  18. arXiv:2309.04437  [pdf, other

    cs.CV astro-ph.CO

    Single View Refractive Index Tomography with Neural Fields

    Authors: Brandon Zhao, Aviad Levis, Liam Connor, Pratul P. Srinivasan, Katherine L. Bouman

    Abstract: Refractive Index Tomography is the inverse problem of reconstructing the continuously-varying 3D refractive index in a scene using 2D projected image measurements. Although a purely refractive field is not directly visible, it bends light rays as they travel through space, thus providing a signal for reconstruction. The effects of such fields appear in many scientific computer vision settings, ran… ▽ More

    Submitted 1 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  19. arXiv:2305.16321  [pdf, other

    cs.CV cs.GR

    Eclipse: Disambiguating Illumination and Materials using Unintended Shadows

    Authors: Dor Verbin, Ben Mildenhall, Peter Hedman, Jonathan T. Barron, Todd Zickler, Pratul P. Srinivasan

    Abstract: Decomposing an object's appearance into representations of its materials and the surrounding illumination is difficult, even when the object's 3D shape is known beforehand. This problem is especially challenging for diffuse objects: it is ill-conditioned because diffuse materials severely blur incoming light, and it is ill-posed because diffuse materials under high-frequency lighting can be indist… ▽ More

    Submitted 13 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Project page: https://dorverbin.github.io/eclipse/

  20. arXiv:2304.06706  [pdf, other

    cs.CV cs.GR cs.LG

    Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman

    Abstract: Neural Radiance Field training can be accelerated through the use of grid-based representations in NeRF's learned mapping from spatial coordinates to colors and volumetric density. However, these grid-based approaches lack an explicit understanding of scale and therefore often introduce aliasing, usually in the form of jaggies or missing scene content. Anti-aliasing has previously been addressed b… ▽ More

    Submitted 26 October, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Project page: https://jonbarron.info/zipnerf/

  21. arXiv:2303.14231  [pdf, other

    math.CT quant-ph

    Dagger linear logic and categorical quantum mechanics

    Authors: Priyaa Varshinee Srinivasan

    Abstract: This thesis develops the categorical proof theory for the non-compact multiplicative dagger linear logic, and investigates its applications to Categorical Quantum Mechanics (CQM). The existing frameworks of CQM are categorical proof theories of compact dagger linear logic, and are motivated by the interpretation of quantum systems in the category of finite dimensional Hilbert spaces. This thesis d… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Thesis submitted for the degree of Doctor of Philosophy, University of Calgary, Fall 2021 (250 pages)

  22. arXiv:2303.11839  [pdf, other

    math.CT quant-ph

    Normalizing Resistor Networks

    Authors: Robin Cockett, Amolak Ratan Kalra, Priyaa Varshinee Srinivasan

    Abstract: Star to mesh transformations are well-known in electrical engineering, and are reminiscent of local complementation for graph states in qudit stabilizer quantum mechanics. This paper describes a rewriting system for resistor circuits over any positive division rig using general star to mesh transformations. We show how these transformations can be organized into a confluent and terminating rewriti… ▽ More

    Submitted 14 December, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: In Proceedings ACT 2023, arXiv:2312.08138

    Journal ref: EPTCS 397, 2023, pp. 70-83

  23. arXiv:2302.14859  [pdf, other

    cs.CV

    BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis

    Authors: Lior Yariv, Peter Hedman, Christian Reiser, Dor Verbin, Pratul P. Srinivasan, Richard Szeliski, Jonathan T. Barron, Ben Mildenhall

    Abstract: We present a method for reconstructing high-quality meshes of large unbounded real-world scenes suitable for photorealistic novel view synthesis. We first optimize a hybrid neural volume-surface scene representation designed to have well-behaved level sets that correspond to surfaces in the scene. We then bake this representation into a high-quality triangle mesh, which we equip with a simple and… ▽ More

    Submitted 16 May, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Video and interactive web demo available at https://bakedsdf.github.io/

  24. arXiv:2302.12249  [pdf, other

    cs.CV cs.GR

    MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes

    Authors: Christian Reiser, Richard Szeliski, Dor Verbin, Pratul P. Srinivasan, Ben Mildenhall, Andreas Geiger, Jonathan T. Barron, Peter Hedman

    Abstract: Neural radiance fields enable state-of-the-art photorealistic view synthesis. However, existing radiance field representations are either too compute-intensive for real-time rendering or require too much memory to scale to large scenes. We present a Memory-Efficient Radiance Field (MERF) representation that achieves real-time rendering of large-scale scenes in a browser. MERF reduces the memory co… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Video and interactive web demo available at https://merf42.github.io

  25. arXiv:2302.08504  [pdf, other

    cs.CV cs.GR

    PersonNeRF: Personalized Reconstruction from Photo Collections

    Authors: Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman

    Abstract: We present PersonNeRF, a method that takes a collection of photos of a subject (e.g. Roger Federer) captured across multiple years with arbitrary body poses and appearances, and enables rendering the subject with arbitrary novel combinations of viewpoint, body pose, and appearance. PersonNeRF builds a customized neural volumetric 3D model of the subject that is able to render an entire space spann… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: Project Page: https://grail.cs.washington.edu/projects/personnerf/

  26. arXiv:2302.06833  [pdf, other

    cs.CV

    VQ3D: Learning a 3D-Aware Generative Model on ImageNet

    Authors: Kyle Sargent, Jing Yu Koh, Han Zhang, Huiwen Chang, Charles Herrmann, Pratul Srinivasan, Jiajun Wu, Deqing Sun

    Abstract: Recent work has shown the possibility of training generative models of 3D content from 2D image collections on small datasets corresponding to a single object class, such as human faces, animal faces, or cars. However, these models struggle on larger, more complex datasets. To model diverse and unconstrained image collections such as ImageNet, we present VQ3D, which introduces a NeRF-based decoder… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 15 pages. For visual results, please visit the project webpage at http://kylesargent.github.io/vq3d

  27. arXiv:2301.02222  [pdf, other

    math.NT

    Computing nonsurjective primes associated to Galois representations of genus $2$ curves

    Authors: Barinder S. Banwait, Armand Brumer, Hyun Jong Kim, Zev Klagsbrun, Jacob Mayle, Padmavathi Srinivasan, Isabel Vogt

    Abstract: For a genus $2$ curve $C$ over $\mathbb{Q}$ whose Jacobian $A$ admits only trivial geometric endomorphisms, Serre's open image theorem for abelian surfaces asserts that there are only finitely many primes $\ell$ for which the Galois action on $\ell$-torsion points of $A$ is not maximal. Building on work of Dieulefait, we give a practical algorithm to compute this finite set. The key inputs are Mit… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    MSC Class: 11F80(primary); 11G10; 11Y16

  28. High-accuracy thermodynamic properties to the melting point from ab initio calculations aided by machine-learning potentials

    Authors: Jong Hyun Jung, Prashanth Srinivasan, Axel Forslund, Blazej Grabowski

    Abstract: Accurate prediction of thermodynamic properties requires an extremely accurate representation of the free energy surface. Requirements are twofold -- first, the inclusion of the relevant finite-temperature mechanisms, and second, a dense volume-temperature grid on which the calculations are performed. A systematic workflow for such calculations requires computational efficiency and reliability, an… ▽ More

    Submitted 26 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 6 figures, supplementary information, typos corrected

    Journal ref: npj Comput. Mater. 9, 3 (2023)

  29. arXiv:2210.03035  [pdf, ps, other

    math.AG math.AT math.KT math.NT

    Quadratic enrichment of the logarithmic derivative of the zeta function

    Authors: Margaret Bilu, Wei Ho, Padmavathi Srinivasan, Isabel Vogt, Kirsten Wickelgren

    Abstract: We define an enrichment of the logarithmic derivative of the zeta function of a variety over a finite field to a power series with coefficients in the Grothendieck--Witt group. We show that this enrichment is related to the topology of the real points of a lift. For cellular schemes over a field, we prove a rationality result for this enriched logarithmic derivative of the zeta function as an anal… ▽ More

    Submitted 30 June, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 41 pages, to appear in Transactions of the AMS

    MSC Class: Primary 14G10; 14F42; Secondary 19D45; 55P25; 11G25

  30. arXiv:2209.13114  [pdf, other

    cs.CL

    Style Matters! Investigating Linguistic Style in Online Communities

    Authors: Osama Khalid, Padmini Srinivasan

    Abstract: Content has historically been the primary lens used to study language in online communities. This paper instead focuses on the linguistic style of communities. While we know that individuals have distinguishable styles, here we ask whether communities have distinguishable styles. Additionally, while prior work has relied on a narrow definition of style, we employ a broad definition involving 262 f… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  31. arXiv:2209.12352  [pdf, other

    cs.CL

    Smells like Teen Spirit: An Exploration of Sensorial Style in Literary Genres

    Authors: Osama Khalid, Padmini Srinivasan

    Abstract: It is well recognized that sensory perceptions and language have interconnections through numerous studies in psychology, neuroscience, and sensorial linguistics. Set in this rich context we ask whether the use of sensorial language in writings is part of linguistic style? This question is important from the view of stylometrics research where a rich set of language features have been explored, bu… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

  32. arXiv:2206.09784  [pdf, other

    quant-ph math.CT

    Extending Resource Monotones using Kan Extensions

    Authors: Robin Cockett, Isabelle Jianing Geng, Carlo Maria Scandolo, Priyaa Varshinee Srinivasan

    Abstract: In this paper we generalize the framework proposed by Gour and Tomamichel regarding extensions of monotones for resource theories. A monotone for a resource theory assigns a real number to each resource in the theory signifying the utility or the value of the resource. Gour and Tomamichel studied the problem of extending monotones using set-theoretical framework when a resource theory embeds fully… ▽ More

    Submitted 31 July, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: In Proceedings ACT 2022, arXiv:2307.15519

    Journal ref: EPTCS 380, 2023, pp. 203-223

  33. arXiv:2205.11164  [pdf, other

    cs.LG

    Time-series Transformer Generative Adversarial Networks

    Authors: Padmanaba Srinivasan, William J. Knottenbelt

    Abstract: Many real-world tasks are plagued by limitations on data: in some instances very little data is available and in others, data is protected by privacy enforcing regulations (e.g. GDPR). We consider limitations posed specifically on time-series data and present a model that can generate synthetic time-series which can be used in place of real data. A model that generates synthetic time-series data h… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  34. arXiv:2205.01714  [pdf, other

    cs.CL cs.CR cs.LG

    Don't sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks

    Authors: Jonathan Rusert, Padmini Srinivasan

    Abstract: Deep learning (DL) is being used extensively for text classification. However, researchers have demonstrated the vulnerability of such classifiers to adversarial attacks. Attackers modify the text in a way which misleads the classifier while keeping the original meaning close to intact. State-of-the-art (SOTA) attack algorithms follow the general principle of making minimal changes to the text so… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 9 pages, 8 figures, Accepted to NAACL 2022

    ACM Class: I.2.7

  35. arXiv:2204.03715  [pdf, other

    cs.CV astro-ph.IM

    Gravitationally Lensed Black Hole Emission Tomography

    Authors: Aviad Levis, Pratul P. Srinivasan, Andrew A. Chael, Ren Ng, Katherine L. Bouman

    Abstract: Measurements from the Event Horizon Telescope enabled the visualization of light emission around a black hole for the first time. So far, these measurements have been used to recover a 2D image under the assumption that the emission field is static over the period of acquisition. In this work, we propose BH-NeRF, a novel tomography approach that leverages gravitational lensing to recover the conti… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in the IEEE Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2022. Supplemental material including accompanying pdf, code, and video highlight can be found in the project page: http://imaging.cms.caltech.edu/bhnerf/

  36. arXiv:2203.11849  [pdf, other

    cs.CL cs.CR cs.LG

    A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

    Authors: Wanyue Zhai, Jonathan Rusert, Zubair Shafiq, Padmini Srinivasan

    Abstract: Recent advances in natural language processing have enabled powerful privacy-invasive authorship attribution. To counter authorship attribution, researchers have proposed a variety of rule-based and learning-based text obfuscation approaches. However, existing authorship obfuscation approaches do not consider the adversarial threat model. Specifically, they are not evaluated against adversarially… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 9 pages, 7 figures, 3 tables, ACL 2022

  37. arXiv:2203.11401  [pdf, other

    cs.CL cs.LG

    Suum Cuique: Studying Bias in Taboo Detection with a Community Perspective

    Authors: Osama Khalid, Jonathan Rusert, Padmini Srinivasan

    Abstract: Prior research has discussed and illustrated the need to consider linguistic norms at the community level when studying taboo (hateful/offensive/toxic etc.) language. However, a methodology for doing so, that is firmly founded on community language norms is still largely absent. This can lead both to biases in taboo text classification and limitations in our understanding of the causes of bias. We… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 9 pages, 3 figures, Accepted to the Findings of ACL 2022

    ACM Class: I.2.7

  38. arXiv:2203.11331  [pdf, other

    cs.CL cs.LG

    On The Robustness of Offensive Language Classifiers

    Authors: Jonathan Rusert, Zubair Shafiq, Padmini Srinivasan

    Abstract: Social media platforms are deploying machine learning based offensive language classification systems to combat hateful, racist, and other forms of offensive speech at scale. However, despite their real-world deployment, we do not yet comprehensively understand the extent to which offensive language classifiers are robust against adversarial attacks. Prior work in this space is limited to studying… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 9 pages, 2 figures, Accepted at ACL 2022

    ACM Class: I.2.7

  39. An Automated Pipeline for Ultra-Violet Imaging Telescope (UVIT)

    Authors: S. K. Ghosh, S. N. Tandon, S. K. Singh, D. S. Shelat, P. Tahlani, A. K. Singh, T. P. Srinivasan, P. Joseph, A. Devaraj, K. George, R. Mohan, J. Postma, C. S. Stalin

    Abstract: We describe a versatile pipeline for processing the data collected by the Ultra-Violet Imaging Telescope (UVIT) on board Indian Multi-wavelength astronomical satellite AstroSat.The UVIT instrument carries out simultaneous astronomical imaging through selected filters / gratings in Far-Ultra-Violet (FUV), Near-Ultra-Violet & visible (VIS) bands of the targeted circular sky field (~ 0.5 deg dia). Th… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in Journal of Astrophysics & Astronomy, 50 pages, 16 figures

  40. Making a Radical Misogynist: How online social engagement with the Manosphere influences traits of radicalization

    Authors: Hussam Habib, Padmini Srinivasan, Rishab Nithyanand

    Abstract: The algorithms and the interactions facilitated by online platforms have been used by radical groups to recruit vulnerable individuals to their cause. This has resulted in the sharp growth of violent events and deteriorating online discourse. The Manosphere, a collection of radical anti-feminist communities, is one such group which has attracted attention due to their rapid growth and increasingly… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  41. arXiv:2202.06212  [pdf, other

    cs.IR cs.CL

    Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search

    Authors: Jianjin Zhang, Zheng Liu, Weihao Han, Shitao Xiao, Ruicheng Zheng, Yingxia Shao, Hao Sun, Hanqing Zhu, Premkumar Srinivasan, Denvy Deng, Qi Zhang, Xing Xie

    Abstract: Embedding based retrieval (EBR) is a fundamental building block in many web applications. However, EBR in sponsored search is distinguished from other generic scenarios and technically challenging due to the need of serving multiple retrieval purposes: firstly, it has to retrieve high-relevance ads, which may exactly serve user's search intent; secondly, it needs to retrieve high-CTR ads so as to… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

  42. arXiv:2202.05263  [pdf, other

    cs.CV cs.GR

    Block-NeRF: Scalable Large Scene Neural View Synthesis

    Authors: Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jonathan T. Barron, Henrik Kretzschmar

    Abstract: We present Block-NeRF, a variant of Neural Radiance Fields that can represent large-scale environments. Specifically, we demonstrate that when scaling NeRF to render city-scale scenes spanning multiple blocks, it is vital to decompose the scene into individually trained NeRFs. This decomposition decouples rendering time from scene size, enables rendering to scale to arbitrarily large environments,… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Project page: https://waymo.com/research/block-nerf/

  43. arXiv:2201.08239  [pdf, other

    cs.CL cs.AI

    LaMDA: Language Models for Dialog Applications

    Authors: Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao , et al. (35 additional authors not shown)

    Abstract: We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotat… ▽ More

    Submitted 10 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  44. arXiv:2201.04127  [pdf, other

    cs.CV cs.GR

    HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video

    Authors: Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman

    Abstract: We introduce a free-viewpoint rendering method -- HumanNeRF -- that works on a given monocular video of a human performing complex body motions, e.g. a video from YouTube. Our method enables pausing the video at any frame and rendering the subject from arbitrary new camera viewpoints or even a full 360-degree camera path for that particular frame and body pose. This task is particularly challengin… ▽ More

    Submitted 14 June, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: CVPR 2022 (oral). Project page with videos: https://grail.cs.washington.edu/projects/humannerf/

  45. arXiv:2112.03907  [pdf, other

    cs.CV cs.GR

    Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields

    Authors: Dor Verbin, Peter Hedman, Ben Mildenhall, Todd Zickler, Jonathan T. Barron, Pratul P. Srinivasan

    Abstract: Neural Radiance Fields (NeRF) is a popular view synthesis technique that represents a scene as a continuous volumetric function, parameterized by multilayer perceptrons that provide the volume density and view-dependent emitted radiance at each location. While NeRF-based techniques excel at representing fine geometric structures with smoothly varying view-dependent appearance, they often fail to a… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Project page: https://dorverbin.github.io/refnerf/

  46. arXiv:2112.03873  [pdf, ps, other

    math.NT math.AG

    p-adic adelic metrics and Quadratic Chabauty I

    Authors: Amnon Besser, J. Steffen Müller, Padmavathi Srinivasan

    Abstract: We give a new construction of $p$-adic heights on varieties over number fields using $p$-adic Arakelov theory. In analogy with Zhang's construction of real-valued heights in terms of adelic metrics, these heights are given in terms of $p$-adic adelic metrics on line bundles. In particular, we describe a construction of canonical $p$-adic heights on abelian varieties and we show that we recover the… ▽ More

    Submitted 23 November, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Updates include a p-adic Arakelov-theory proof that the method extends when $r>g$ and $r<g+\mathrm{rank}(\mathrm{NS}(J))-1$ (Sections 3.4 and 7.2), comparison to Colmez's work (Sections 4.5 and 9.3), an updated comparison to Balakrishnan and Dogra's work (Section 8), and a new unified q-analytic construction of local heights (p-adic and real) on abelian varieties at a finite prime q (Section 9.2)

  47. arXiv:2112.03288  [pdf, other

    cs.CV

    Dense Depth Priors for Neural Radiance Fields from Sparse Input Views

    Authors: Barbara Roessle, Jonathan T. Barron, Ben Mildenhall, Pratul P. Srinivasan, Matthias Nießner

    Abstract: Neural radiance fields (NeRF) encode a scene into a neural representation that enables photo-realistic rendering of novel views. However, a successful reconstruction from RGB images requires a large number of input views taken under static conditions - typically up to a few hundred images for room-size scenes. Our method aims to synthesize novel views of whole rooms from an order of magnitude fewe… ▽ More

    Submitted 7 April, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: CVPR 2022, project page: https://barbararoessle.github.io/dense_depth_priors_nerf/ , video: https://youtu.be/zzkvvdcvksc

  48. arXiv:2111.14643  [pdf, other

    cs.CV cs.GR

    Urban Radiance Fields

    Authors: Konstantinos Rematas, Andrew Liu, Pratul P. Srinivasan, Jonathan T. Barron, Andrea Tagliasacchi, Thomas Funkhouser, Vittorio Ferrari

    Abstract: The goal of this work is to perform 3D reconstruction and novel view synthesis from data captured by scanning platforms commonly deployed for world mapping in urban outdoor environments (e.g., Street View). Given a sequence of posed RGB images and lidar sweeps acquired by cameras and scanners moving through an outdoor scene, we produce a model from which 3D surfaces can be extracted and novel RGB… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Project: https://urban-radiance-fields.github.io/

  49. arXiv:2111.13679  [pdf, other

    cs.CV cs.GR eess.IV

    NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images

    Authors: Ben Mildenhall, Peter Hedman, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron

    Abstract: Neural Radiance Fields (NeRF) is a technique for high quality novel view synthesis from a collection of posed input images. Like most view synthesis methods, NeRF uses tonemapped low dynamic range (LDR) as input; these images have been processed by a lossy camera pipeline that smooths detail, clips highlights, and distorts the simple noise distribution of raw sensor data. We modify NeRF to instead… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: Project page: https://bmild.github.io/rawnerf/

  50. arXiv:2111.12077  [pdf, other

    cs.CV cs.GR

    Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields

    Authors: Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman

    Abstract: Though neural radiance fields (NeRF) have demonstrated impressive view synthesis results on objects and small bounded regions of space, they struggle on "unbounded" scenes, where the camera may point in any direction and content may exist at any distance. In this setting, existing NeRF-like models often produce blurry or low-resolution renderings (due to the unbalanced detail and scale of nearby a… ▽ More

    Submitted 25 March, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: https://jonbarron.info/mipnerf360/