Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–13 of 13 results for author: Hewett, R J

.
  1. arXiv:2404.14219  [pdf, other

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra , et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  2. arXiv:2211.12709  [pdf, other

    cs.DC cs.AI physics.comp-ph

    SciAI4Industry -- Solving PDEs for industry-scale problems with deep learning

    Authors: Philipp A. Witte, Russell J. Hewett, Kumar Saurabh, AmirHossein Sojoodi, Ranveer Chandra

    Abstract: Solving partial differential equations with deep learning makes it possible to reduce simulation times by multiple orders of magnitude and unlock scientific methods that typically rely on large numbers of sequential simulations, such as optimization and uncertainty quantification. Two of the largest challenges of adopting scientific AI for industrial problem settings is that training datasets must… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Submitted to International Parallel and Distributed Processing Symposium (IPDPS) on October 5, 2022

  3. arXiv:2204.01205  [pdf, other

    cs.LG cs.DC math.NA

    Model-Parallel Fourier Neural Operators as Learned Surrogates for Large-Scale Parametric PDEs

    Authors: Thomas J. Grady II, Rishi Khan, Mathias Louboutin, Ziyi Yin, Philipp A. Witte, Ranveer Chandra, Russell J. Hewett, Felix J. Herrmann

    Abstract: Fourier neural operators (FNOs) are a recently introduced neural network architecture for learning solution operators of partial differential equations (PDEs), which have been shown to perform significantly better than comparable deep learning approaches. Once trained, FNOs can achieve speed-ups of multiple orders of magnitude over conventional numerical PDE solvers. However, due to the high dimen… ▽ More

    Submitted 1 February, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

  4. arXiv:2006.03108  [pdf, other

    cs.LG cs.DC stat.ML

    A Linear Algebraic Approach to Model Parallelism in Deep Learning

    Authors: Russell J. Hewett, Thomas J. Grady II

    Abstract: Training deep neural networks (DNNs) in large-cluster computing environments is increasingly necessary, as networks grow in size and complexity. Local memory and processing limitations require robust data and model parallelism for crossing compute node boundaries. We propose a linear-algebraic approach to model parallelism in deep learning, which allows parallel distribution of any tensor in the D… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

  5. arXiv:2003.14186  [pdf, other

    astro-ph.IM physics.ed-ph

    A Survey of Computational Tools in Solar Physics

    Authors: Monica G. Bobra, Stuart J. Mumford, Russell J. Hewett, Steven D. Christe, Kevin Reardon, Sabrina Savage, Jack Ireland, Tiago M. D. Pereira, Bin Chen, David Pérez-Suárez

    Abstract: The SunPy Project developed a 13-question survey to understand the software and hardware usage of the solar physics community. 364 members of the solar physics community, across 35 countries, responded to our survey. We found that 99$\pm$0.5% of respondents use software in their research and 66% use the Python scientific software stack. Students are twice as likely as faculty, staff scientists, an… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Journal ref: Solar Physics, 295, 57 (2020)

  6. L-Sweeps: A scalable, parallel preconditioner for the high-frequency Helmholtz equation

    Authors: Matthias Taus, Leonardo Zepeda-Núñez, Russell J Hewett, Laurent Demanet

    Abstract: We present the first fast solver for the high-frequency Helmholtz equation that scales optimally in parallel, for a single right-hand side. The L-sweeps approach achieves this scalability by departing from the usual propagation pattern, in which information flows in a 180 degree cone from interfaces in a layered decomposition. Instead, with L-sweeps, information propagates in 90 degree cones induc… ▽ More

    Submitted 15 October, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

  7. arXiv:1812.07167  [pdf, other

    math.NA

    A parallel shared-memory implementation of a high-order accurate solution technique for variable coefficient Helmholtz problems

    Authors: Natalie Beams, Adrianna Gillman, Russell J. Hewett

    Abstract: The recently developed Hierarchical Poincaré-Steklov (HPS) method is a high-order discretization technique that comes with a direct solver. Results from previous papers demonstrate the method's ability to solve Helmholtz problems to high accuracy without the so-called pollution effect. While the asymptotic scaling of the direct solver's computational cost is the same as the nested dissection metho… ▽ More

    Submitted 25 April, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: 25 pages

  8. arXiv:1801.08655  [pdf, other

    math.NA

    The method of polarized traces for the 3D Helmholtz equation

    Authors: Leonardo Zepeda-Núñez, Adrien Scheuer, Russell J. Hewett, Laurent Demanet

    Abstract: We present a fast solver for the 3D high-frequency Helmholtz equation in heterogeneous, constant density, acoustic media. The solver is based on the method of polarized traces, coupled with distributed linear algebra libraries and pipelining to obtain an empirical online runtime $ \mathcal{O}(\max(1,R/n) N \log N)$ where $N = n^3$ is the total number of degrees of freedom and $R$ is the number of… ▽ More

    Submitted 25 January, 2018; originally announced January 2018.

    Comments: 22 pages

  9. arXiv:1608.03836  [pdf, other

    math.NA

    Weight-adjusted discontinuous Galerkin methods: curvilinear meshes

    Authors: Jesse Chan, Russell J. Hewett, T. Warburton

    Abstract: Traditional time-domain discontinuous Galerkin (DG) methods result in large storage costs at high orders of approximation due to the storage of dense elemental matrices. In this work, we propose a weight-adjusted DG (WADG) methods for curvilinear meshes which reduce storage costs while retaining energy stability. A priori error estimates show that high order accuracy is preserved under sufficient… ▽ More

    Submitted 12 August, 2016; originally announced August 2016.

    Comments: Submitted to SISC

  10. arXiv:1608.01944  [pdf, other

    math.NA

    Weight-adjusted discontinuous Galerkin methods: wave propagation in heterogeneous media

    Authors: Jesse Chan, Russell J. Hewett, T. Warburton

    Abstract: Time-domain discontinuous Galerkin (DG) methods for wave propagation require accounting for the inversion of dense elemental mass matrices, where each mass matrix is computed with respect to a parameter-weighted L2 inner product. In applications where the wavespeed varies spatially at a sub-element scale, these matrices are distinct over each element, necessitating additional storage. In this work… ▽ More

    Submitted 1 January, 2017; v1 submitted 5 August, 2016; originally announced August 2016.

    Comments: Submitted to SISC

  11. arXiv:1607.03399  [pdf, other

    math.NA

    Reduced storage nodal discontinuous Galerkin methods on semi-structured prismatic meshes

    Authors: Jesse Chan, Zheng Wang, Russell J. Hewett, T. Warburton

    Abstract: We present a high order time-domain nodal discontinuous Galerkin method for wave problems on hybrid meshes consisting of both wedge and tetrahedral elements. We allow for vertically mapped wedges which can be deformed along the extruded coordinate, and present a simple method for producing quasi-uniform wedge meshes for layered domains. We show that standard mass lumping techniques result in a los… ▽ More

    Submitted 31 October, 2016; v1 submitted 12 July, 2016; originally announced July 2016.

    Comments: Submitted to CAMWA

  12. arXiv:1505.02563  [pdf, other

    astro-ph.IM astro-ph.SR

    SunPy - Python for Solar Physics

    Authors: The SunPy Community, Stuart J Mumford, Steven Christe, David Pérez-Suárez, Jack Ireland, Albert Y Shih, Andrew R Inglis, Simon Liedtke, Russell J Hewett, Florian Mayer, Keith Hughitt, Nabil Freij, Tomas Meszaros, Samuel M Bennett, Michael Malocha, John Evans, Ankit Agrawal, Andrew J Leonard, Thomas P Robitaille, Benjamin Mampaey, Jose Iván Campos-Rozo, Michael S Kirk

    Abstract: This paper presents SunPy (version 0.5), a community-developed Python package for solar physics. Python, a free, cross-platform, general-purpose, high-level programming language, has seen widespread adoption among the scientific community, resulting in the availability of a large number of software packages, from numerical computation (NumPy, SciPy) and machine learning (scikit-learn) to visualisa… ▽ More

    Submitted 11 May, 2015; originally announced May 2015.

  13. Multiresolution analysis of active region magnetic structure and its correlation with the Mt. Wilson classification and flaring activity

    Authors: J. Ireland, C. A. Young, R. T. J. McAteer, C. Whelan, R. J. Hewett, P. T. Gallagher

    Abstract: Two different multi-resolution analyses are used to decompose the structure of active region magnetic flux into concentrations of different size scales. Lines separating these opposite polarity regions of flux at each size scale are found. These lines are used as a mask on a map of the magnetic field gradient to sample the local gradient between opposite polarity regions of given scale sizes. It… ▽ More

    Submitted 1 May, 2008; originally announced May 2008.

    Comments: Accepted for publication in Solar Physics