Showing 1–16 of 16 results for author: Witte, P

  1. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  2. arXiv:2312.13480  [pdf, other]

    cs.LG

    InvertibleNetworks.jl: A Julia package for scalable normalizing flows

    Authors: Rafael Orozco, Philipp Witte, Mathias Louboutin, Ali Siahkoohi, Gabrio Rizzuti, Bas Peters, Felix J. Herrmann

    Abstract: InvertibleNetworks.jl is a Julia package designed for the scalable implementation of normalizing flows, a method for density estimation and sampling in high-dimensional distributions. This package excels in memory efficiency by leveraging the inherent invertibility of normalizing flows, which significantly reduces memory requirements during backpropagation compared to existing normalizing flow pac…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Submitted to Journal of Open Source Software (JOSS)

  3. arXiv:2304.05592  [pdf, other]

    cs.MS cs.DC cs.LG physics.comp-ph physics.geo-ph

    Learned multiphysics inversion with differentiable programming and machine learning

    Authors: Mathias Louboutin, Ziyi Yin, Rafael Orozco, Thomas J. Grady II, Ali Siahkoohi, Gabrio Rizzuti, Philipp A. Witte, Olav Møyner, Gerard J. Gorman, Felix J. Herrmann

    Abstract: We present the Seismic Laboratory for Imaging and Modeling/Monitoring (SLIM) open-source software framework for computational geophysics and, more generally, inverse problems involving the wave-equation (e.g., seismic and medical ultrasound), regularization with learned priors, and learned neural surrogates for multiphase flow simulations. By integrating multiple layers of abstraction, our softwar…

    Submitted 11 April, 2023; originally announced April 2023.

  4. arXiv:2211.12709  [pdf, other]

    cs.DC cs.AI physics.comp-ph

    SciAI4Industry -- Solving PDEs for industry-scale problems with deep learning

    Authors: Philipp A. Witte, Russell J. Hewett, Kumar Saurabh, AmirHossein Sojoodi, Ranveer Chandra

    Abstract: Solving partial differential equations with deep learning makes it possible to reduce simulation times by multiple orders of magnitude and unlock scientific methods that typically rely on large numbers of sequential simulations, such as optimization and uncertainty quantification. Two of the largest challenges of adopting scientific AI for industrial problem settings is that training datasets must…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Submitted to International Parallel and Distributed Processing Symposium (IPDPS) on October 5, 2022

  5. arXiv:2204.01205  [pdf, other]

    cs.LG cs.DC math.NA

    Model-Parallel Fourier Neural Operators as Learned Surrogates for Large-Scale Parametric PDEs

    Authors: Thomas J. Grady II, Rishi Khan, Mathias Louboutin, Ziyi Yin, Philipp A. Witte, Ranveer Chandra, Russell J. Hewett, Felix J. Herrmann

    Abstract: Fourier neural operators (FNOs) are a recently introduced neural network architecture for learning solution operators of partial differential equations (PDEs), which have been shown to perform significantly better than comparable deep learning approaches. Once trained, FNOs can achieve speed-ups of multiple orders of magnitude over conventional numerical PDE solvers. However, due to the high dimen…

    Submitted 1 February, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

  6. arXiv:2203.15038  [pdf, other]

    physics.comp-ph cs.MS physics.geo-ph

    Accelerating innovation with software abstractions for scalable computational geophysics

    Authors: Mathias Louboutin, Philipp A. Witte, Ali Siahkoohi, Gabrio Rizzuti, Ziyi Yin, Rafael Orozco, Felix J. Herrmann

    Abstract: We present the SLIM (https://github.com/slimgroup) open-source software framework for computational geophysics, and more generally, inverse problems based on the wave-equation (e.g., medical ultrasound). We developed a software environment aimed at scalable research and development by designing multiple layers of abstractions. This environment allows the researchers to easily formulate their probl…

    Submitted 28 March, 2022; originally announced March 2022.

  7. arXiv:2101.03709  [pdf, other]

    stat.ML cs.LG physics.geo-ph

    Preconditioned training of normalizing flows for variational inference in inverse problems

    Authors: Ali Siahkoohi, Gabrio Rizzuti, Mathias Louboutin, Philipp A. Witte, Felix J. Herrmann

    Abstract: Obtaining samples from the posterior distribution of inverse problems with expensive forward operators is challenging especially when the unknowns involve the strongly heterogeneous Earth. To meet these challenges, we propose a preconditioning scheme involving a conditional normalizing flow (NF) capable of sampling from a low-fidelity posterior distribution directly. This conditional NF is used to…

    Submitted 11 January, 2021; originally announced January 2021.

  8. arXiv:2007.07985  [pdf, other]

    stat.ML cs.LG physics.geo-ph

    Faster Uncertainty Quantification for Inverse Problems with Conditional Normalizing Flows

    Authors: Ali Siahkoohi, Gabrio Rizzuti, Philipp A. Witte, Felix J. Herrmann

    Abstract: In inverse problems, we often have access to data consisting of paired samples $(x,y)\sim p_{X,Y}(x,y)$ where $y$ are partial observations of a physical system, and $x$ represents the unknowns of the problem. Under these circumstances, we can employ supervised training to learn a solution $x$ and its uncertainty from the observations $y$. We refer to this problem as the "supervised" case. However,…

    Submitted 15 July, 2020; originally announced July 2020.

  9. arXiv:2004.10519  [pdf, other]

    physics.comp-ph cs.CL cs.PF physics.ao-ph

    Scaling through abstractions -- high-performance vectorial wave simulations for seismic inversion with Devito

    Authors: Mathias Louboutin, Fabio Luporini, Philipp Witte, Rhodri Nelson, George Bisbas, Jan Thorbecke, Felix J. Herrmann, Gerard Gorman

    Abstract: [Devito] is an open-source Python project based on domain-specific language and compiler technology. Driven by the requirements of rapid HPC applications development in exploration seismology, the language and compiler have evolved significantly since inception. Sophisticated boundary conditions, tensor contractions, sparse operations and features such as staggered grids and sub-domains are all su…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: 11 pages, 3 figures

  10. arXiv:2004.07871  [pdf, other]

    physics.geo-ph cs.LG stat.ML

    Parameterizing uncertainty by deep invertible networks, an application to reservoir characterization

    Authors: Gabrio Rizzuti, Ali Siahkoohi, Philipp A. Witte, Felix J. Herrmann

    Abstract: Uncertainty quantification for full-waveform inversion provides a probabilistic characterization of the ill-conditioning of the problem, comprising the sensitivity of the solution with respect to the starting model and data noise. This analysis allows to assess the confidence in the candidate solution and how it is reflected in the tasks that are typically performed after imaging (e.g., stratigrap…

    Submitted 16 April, 2020; originally announced April 2020.

  11. arXiv:2004.07389  [pdf, other]

    physics.geo-ph cs.CE eess.IV

    Extended source imaging, a unifying framework for seismic & medical imaging

    Authors: Ziyi Yin, Rafael Orozco, Philipp Witte, Mathias Louboutin, Gabrio Rizzuti, Felix J. Herrmann

    Abstract: We present three imaging modalities that live on the crossroads of seismic and medical imaging. Through the lens of extended source imaging, we can draw deep connections among the fields of wave-equation based seismic and medical imaging, despite first appearances. From the seismic perspective, we underline the importance to work with the correct physics and spatially varying velocity fields. Medi…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: Submitted to the Society of Exploration Geophysicists Annual Meeting 2020

  12. arXiv:2003.01159  [pdf, ps, other]

    physics.geo-ph eess.SP

    Time-domain sparsity promoting least-squares reverse time migration with source estimation

    Authors: Mengmeng Yang, Zhilong Fang, Philipp Witte, Felix J. Herrmann

    Abstract: Least-squares reverse time migration is well-known for its capability to generate artifact-free true-amplitude subsurface images through fitting observed data in the least-squares sense. However, when applied to realistic imaging problems, this approach is faced with issues related to overfitting and excessive computational costs induced by many wave-equation solves. The fact that the source funct…

    Submitted 2 March, 2020; originally announced March 2020.

  13. arXiv:1911.12447  [pdf, other]

    cs.DC physics.geo-ph

    Serverless seismic imaging in the cloud

    Authors: Philipp A. Witte, Mathias Louboutin, Charles Jones, Felix J. Herrmann

    Abstract: This abstract presents a serverless approach to seismic imaging in the cloud based on high-throughput containerized batch processing, event-driven computations and a domain-specific language compiler for solving the underlying wave equations. A 3D case study on Azure demonstrates that this approach allows reducing the operating cost of up to a factor of 6, making the cloud a viable alternative to…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: Submitted to the 2020 Rice Oil & Gas HPC Conference

  14. arXiv:1909.01279  [pdf, other]

    cs.DC physics.geo-ph

    An Event-Driven Approach to Serverless Seismic Imaging in the Cloud

    Authors: Philipp A. Witte, Mathias Louboutin, Henryk Modzelewski, Charles Jones, James Selvage, Felix J. Herrmann

    Abstract: Adapting the cloud for high-performance computing (HPC) is a challenging task, as software for HPC applications hinges on fast network connections and is sensitive to hardware failures. Using cloud infrastructure to recreate conventional HPC clusters is therefore in many cases an infeasible solution for migrating HPC applications to the cloud. As an alternative to the generic lift and shift approa…

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: Submitted to IEEE Transactions on Parallel and Distributed Systems. August 2019

  15. arXiv:1808.01995  [pdf, other]

    cs.DM physics.geo-ph

    Devito (v3.1.0): an embedded domain-specific language for finite differences and geophysical exploration

    Authors: Mathias Louboutin, Michael Lange, Fabio Luporini, Navjot Kukreja, Philipp A. Witte, Felix J. Herrmann, Paulius Velesko, Gerard J. Gorman

    Abstract: We introduce Devito, a new domain-specific language for implementing high-performance finite difference partial differential equation solvers. The motivating application is exploration seismology where methods such as Full-Waveform Inversion and Reverse-Time Migration are used to invert terabytes of seismic data to create images of the earth's subsurface. Even using modern supercomputers, it can t…

    Submitted 9 August, 2019; v1 submitted 6 August, 2018; originally announced August 2018.

    Journal ref: https://www.geosci-model-dev.net/12/1165/2019/

  16. arXiv:1807.03032  [pdf, other]

    cs.MS

    Architecture and performance of Devito, a system for automated stencil computation

    Authors: Fabio Luporini, Michael Lange, Mathias Louboutin, Navjot Kukreja, Jan Hückelheim, Charles Yount, Philipp Witte, Paul H. J. Kelly, Felix J. Herrmann, Gerard J. Gorman

    Abstract: Stencil computations are a key part of many high-performance computing applications, such as image processing, convolutional neural networks, and finite-difference solvers for partial differential equations. Devito is a framework capable of generating highly-optimized code given symbolic equations expressed in Python, specialized in, but not limited to, affine (stencil) codes. The lowering process…

    Submitted 7 February, 2020; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Submitted to ACM Transactions on Mathematical Software

    MSC Class: 65N06; 68N20