Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–12 of 12 results for author: Vytiniotis, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.11202  [pdf, other

    cs.LG cs.DC cs.PL

    PartIR: Composing SPMD Partitioning Strategies for Machine Learning

    Authors: Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee

    Abstract: Training of modern large neural networks (NN) requires a combination of parallelization strategies encompassing data, model, or optimizer sharding. When strategies increase in complexity, it becomes necessary for partitioning tools to be 1) expressive, allowing the composition of simpler strategies, and 2) predictable to estimate performance analytically. We present PartIR, our design for a NN par… ▽ More

    Submitted 3 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  2. arXiv:2212.10307  [pdf, other

    cs.PL cs.LG cs.MS

    Efficient and Sound Differentiable Programming in a Functional Array-Processing Language

    Authors: Amir Shaikhha, Mathieu Huot, Shabnam Ghasemirad, Andrew Fitzgibbon, Simon Peyton Jones, Dimitrios Vytiniotis

    Abstract: Automatic differentiation (AD) is a technique for computing the derivative of a function represented by a program. This technique is considered as the de-facto standard for computing the differentiation in many machine learning and optimisation software tools. Despite the practicality of this technique, the performance of the differentiated programs, especially for functional languages and in the… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1806.02136

  3. arXiv:2210.06352  [pdf, other

    cs.DC cs.LG cs.NE

    Automatic Discovery of Composite SPMD Partitioning Strategies in PartIR

    Authors: Sami Alabed, Dominik Grewe, Juliana Franco, Bart Chrzaszcz, Tom Natan, Tamara Norman, Norman A. Rink, Dimitrios Vytiniotis, Michael Schaarschmidt

    Abstract: Large neural network models are commonly trained through a combination of advanced parallelism strategies in a single program, multiple data (SPMD) paradigm. For example, training large transformer models requires combining data, model, and pipeline partitioning; and optimizer sharding techniques. However, identifying efficient combinations for many model architectures and accelerator systems requ… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  4. arXiv:2112.02958  [pdf, other

    cs.LG cs.DC

    Automap: Towards Ergonomic Automated Parallelism for ML Models

    Authors: Michael Schaarschmidt, Dominik Grewe, Dimitrios Vytiniotis, Adam Paszke, Georg Stefan Schmid, Tamara Norman, James Molloy, Jonathan Godwin, Norman Alexander Rink, Vinod Nair, Dan Belov

    Abstract: The rapid rise in demand for training large neural network architectures has brought into focus the need for partitioning strategies, for example by using data, model, or pipeline parallelism. Implementing these methods is increasingly supported through program primitives, but identifying efficient partitioning strategies requires expensive experimentation and expertise. We present the prototype o… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Workshop on ML for Systems at NeurIPS 2021

  5. arXiv:2112.01075  [pdf, other

    cs.DC cs.LG cs.PL

    Memory-efficient array redistribution through portable collective communication

    Authors: Norman A. Rink, Adam Paszke, Dimitrios Vytiniotis, Georg Stefan Schmid

    Abstract: Modern large-scale deep learning workloads highlight the need for parallel execution across many devices in order to fit model data into hardware accelerator memories. In these settings, array redistribution may be required during a computation, but can also become a bottleneck if not done efficiently. In this paper we address the problem of redistributing multi-dimensional array data in SPMD comp… ▽ More

    Submitted 28 November, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: minor errata fixed

  6. arXiv:2110.10548  [pdf, other

    cs.PL cs.DC cs.LG

    Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning

    Authors: Ningning Xie, Tamara Norman, Dominik Grewe, Dimitrios Vytiniotis

    Abstract: We present a novel characterization of the mapping of multiple parallelism forms (e.g. data and model parallelism) onto hierarchical accelerator systems that is hierarchy-aware and greatly reduces the space of software-to-hardware mapping. We experimentally verify the substantial effect of these mappings on all-reduce performance (up to 448x). We offer a novel syntax-guided program synthesis frame… ▽ More

    Submitted 16 November, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

  7. arXiv:2106.08318  [pdf, other

    cs.CV cs.DC cs.LG eess.IV

    Gradient Forward-Propagation for Large-Scale Temporal Video Modelling

    Authors: Mateusz Malinowski, Dimitrios Vytiniotis, Grzegorz Swirszcz, Viorica Patraucean, Joao Carreira

    Abstract: How can neural networks be trained on large-volume temporal data efficiently? To compute the gradients required to update parameters, backpropagation blocks computations until the forward and backward passes are completed. For temporal signals, this introduces high latency and hinders real-time learning. It also creates a coupling between consecutive layers, which limits model parallelism and incr… ▽ More

    Submitted 12 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to CVPR 2021. arXiv admin note: text overlap with arXiv:2001.06232

  8. arXiv:2104.05372  [pdf, other

    cs.PL

    Getting to the Point. Index Sets and Parallelism-Preserving Autodiff for Pointful Array Programming

    Authors: Adam Paszke, Daniel Johnson, David Duvenaud, Dimitrios Vytiniotis, Alexey Radul, Matthew Johnson, Jonathan Ragan-Kelley, Dougal Maclaurin

    Abstract: We present a novel programming language design that attempts to combine the clarity and safety of high-level functional languages with the efficiency and parallelism of low-level numerical languages. We treat arrays as eagerly-memoized functions on typed index sets, allowing abstract function manipulations, such as currying, to work on arrays. In contrast to composing primitive bulk-array operatio… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 31 pages with appendix, 11 figures. A conference submission is still under review

  9. arXiv:1806.02136  [pdf, other

    cs.MS cs.LG cs.PL cs.SC stat.ML

    Efficient Differentiable Programming in a Functional Array-Processing Language

    Authors: Amir Shaikhha, Andrew Fitzgibbon, Dimitrios Vytiniotis, Simon Peyton Jones, Christoph Koch

    Abstract: We present a system for the automatic differentiation of a higher-order functional array-processing language. The core functional language underlying this system simultaneously supports both source-to-source automatic differentiation and global optimizations such as loop transformations. Thanks to this feature, we demonstrate how for some real-world machine learning and computer vision benchmarks,… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

  10. arXiv:1705.09786  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    AMPNet: Asynchronous Model-Parallel Training for Dynamic Neural Networks

    Authors: Alexander L. Gaunt, Matthew A. Johnson, Maik Riechert, Daniel Tarlow, Ryota Tomioka, Dimitrios Vytiniotis, Sam Webster

    Abstract: New types of machine learning hardware in development and entering the market hold the promise of revolutionizing deep learning in a manner as profound as GPUs. However, existing software frameworks and training algorithms for deep learning have yet to evolve to fully leverage the capability of the new wave of silicon. We already see the limitations of existing algorithms for models that exploit s… ▽ More

    Submitted 22 June, 2017; v1 submitted 27 May, 2017; originally announced May 2017.

    Comments: 17 pages, 13 figures

  11. arXiv:1605.07262  [pdf, other

    cs.LG cs.CV cs.NE

    Measuring Neural Net Robustness with Constraints

    Authors: Osbert Bastani, Yani Ioannou, Leonidas Lampropoulos, Dimitrios Vytiniotis, Aditya Nori, Antonio Criminisi

    Abstract: Despite having high accuracy, neural nets have been shown to be susceptible to adversarial examples, where a small perturbation to an input can cause it to become mislabeled. We propose metrics for measuring the robustness of a neural net and devise a novel algorithm for approximating these metrics based on an encoding of robustness as a linear program. We show how our metrics can be used to evalu… ▽ More

    Submitted 16 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  12. Testing Noninterference, Quickly

    Authors: Catalin Hritcu, Leonidas Lampropoulos, Antal Spector-Zabusky, Arthur Azevedo de Amorim, Maxime Dénès, John Hughes, Benjamin C. Pierce, Dimitrios Vytiniotis

    Abstract: Information-flow control mechanisms are difficult both to design and to prove correct. To reduce the time wasted on doomed proof attempts due to broken definitions, we advocate modern random testing techniques for finding counterexamples during the design process. We show how to use QuickCheck, a property-based random-testing tool, to guide the design of increasingly complex information-flow abstr… ▽ More

    Submitted 25 July, 2015; v1 submitted 1 September, 2014; originally announced September 2014.

    Journal ref: J. Funct. Prog. 26 (2016) e4