
Showing 1–19 of 19 results for author: Fursin, G

  1. arXiv:2406.16791  [pdf, other]

    cs.LG cs.ET cs.PF

    Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments

    Authors: Grigori Fursin

    Abstract: In this white paper, I present my community effort to automatically co-design cheaper, faster and more energy-efficient software and hardware for AI, ML and other popular workloads with the help of the Collective Mind framework (CM), virtualized MLOps, MLPerf benchmarks and reproducible optimization tournaments. I developed CM to modularize, automate and virtualize the tedious process of building,…

    Submitted 24 June, 2024; originally announced June 2024.

  2. A Community Roadmap for Scientific Workflows Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Ilkay Altintas, Rosa M Badia, Bartosz Balis, Tainã Coleman, Frederik Coppens, Frank Di Natale, Bjoern Enders, Thomas Fahringer, Rosa Filgueira, Grigori Fursin, Daniel Garijo, Carole Goble, Dorran Howell, Shantenu Jha, Daniel S. Katz, Daniel Laney, Ulf Leser, Maciej Malawski, Kshitij Mehta, Loïc Pottier, Jonathan Ozik, J. Luc Peterson, et al. (4 additional authors not shown)

    Abstract: The landscape of workflow systems for scientific applications is notoriously convoluted with hundreds of seemingly equivalent workflow systems, many isolated research claims, and a steep learning curve. To address some of these challenges and lay the groundwork for transforming workflows research and development, the WorkflowsRI and ExaWorks projects partnered to bring the international workflows…

    Submitted 8 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.09181

  3. arXiv:2110.01406  [pdf]

    cs.LG cs.DC cs.PF cs.SE

    MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation

    Authors: Alexandros Karargyris, Renato Umeton, Micah J. Sheller, Alejandro Aristizabal, Johnu George, Srini Bala, Daniel J. Beutel, Victor Bittorf, Akshay Chaudhari, Alexander Chowdhury, Cody Coleman, Bala Desinghu, Gregory Diamos, Debo Dutta, Diane Feddema, Grigori Fursin, Junyi Guo, Xinyuan Huang, David Kanter, Satyananda Kashyap, Nicholas Lane, Indranil Mallick, Pietro Mascagni, Virendra Mehta, Vivek Natarajan, et al. (17 additional authors not shown)

    Abstract: Medical AI has tremendous potential to advance healthcare by supporting the evidence-based practice of medicine, personalizing patient treatment, reducing costs, and improving provider and patient experience. We argue that unlocking this potential requires a systematic way to measure the performance of medical AI models on large-scale heterogeneous data. To meet this need, we are building MedPerf,…

    Submitted 28 December, 2021; v1 submitted 29 September, 2021; originally announced October 2021.

  4. Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Tainã Coleman, Dan Laney, Dong Ahn, Shantenu Jha, Dorran Howell, Stian Soiland-Reyes, Ilkay Altintas, Douglas Thain, Rosa Filgueira, Yadu Babuji, Rosa M. Badia, Bartosz Balis, Silvina Caino-Lores, Scott Callaghan, Frederik Coppens, Michael R. Crusoe, Kaushik De, Frank Di Natale, Tu M. A. Do, Bjoern Enders, Thomas Fahringer, Anne Fouilloux, et al. (33 additional authors not shown)

    Abstract: Scientific workflows are a cornerstone of modern scientific computing, and they have underpinned some of the most significant discoveries of the last decade. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale HPC platforms. Workflows will play a crucial role i…

    Submitted 9 June, 2021; originally announced June 2021.

  5. Workflows Community Summit: Bringing the Scientific Workflows Community Together

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Dan Laney, Dong Ahn, Shantenu Jha, Carole Goble, Lavanya Ramakrishnan, Luc Peterson, Bjoern Enders, Douglas Thain, Ilkay Altintas, Yadu Babuji, Rosa M. Badia, Vivien Bonazzi, Taina Coleman, Michael Crusoe, Ewa Deelman, Frank Di Natale, Paolo Di Tommaso, Thomas Fahringer, Rosa Filgueira, Grigori Fursin, Alex Ganose, Bjorn Gruning, et al. (20 additional authors not shown)

    Abstract: Scientific workflows have been used almost universally across scientific domains, and have underpinned some of the most significant discoveries of the past several decades. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale high-performance computing (HPC) pla…

    Submitted 16 March, 2021; originally announced March 2021.

  6. Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs

    Authors: Grigori Fursin

    Abstract: This article provides the motivation and overview of the Collective Knowledge framework (CK or cKnowledge). The CK concept is to decompose research projects into reusable components that encapsulate research artifacts and provide unified application programming interfaces (APIs), command-line interfaces (CLIs), meta descriptions and common automation actions for related artifacts. The CK framework…

    Submitted 30 January, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted for Philosophical Transactions of the Royal Society. arXiv admin note: text overlap with arXiv:2006.07161

  7. arXiv:2006.07161  [pdf, other]

    cs.LG cs.SE stat.ML

    The Collective Knowledge project: making ML models more portable and reproducible with open APIs, reusable best practices and MLOps

    Authors: Grigori Fursin

    Abstract: This article provides an overview of the Collective Knowledge technology (CK or cKnowledge). CK attempts to make it easier to reproduce ML&systems research, deploy ML models in production, and adapt them to continuously changing data sets, models, research techniques, software, and hardware. The CK concept is to decompose complex systems and ad-hoc research projects into reusable sub-components wi…

    Submitted 18 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:2001.07935

  8. arXiv:2001.07935  [pdf, other]

    cs.LG cs.SE stat.ML

    CodeReef: an open platform for portable MLOps, reusable automation actions and reproducible benchmarking

    Authors: Grigori Fursin, Herve Guillou, Nicolas Essayan

    Abstract: We present CodeReef - an open platform to share all the components necessary to enable cross-platform MLOps (MLSysOps), i.e. automating the deployment of ML models across diverse systems in the most efficient way. We also introduce the CodeReef solution - a way to package and share models as non-virtualized, portable, customizable and reproducible archive files. Such ML packages include JSON meta…

    Submitted 27 January, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: Presented at the 1st Workshop on MLOps Systems co-located with the 3rd Conference on Machine Learning and Systems (MLSys'20) in Austin, TX, USA: https://mlops-systems.github.io . A live interactive demo: https://CodeReef.ai/demo

  9. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  10. arXiv:1904.00324  [pdf, other]

    cs.LG

    SysML'19 demo: customizable and reusable Collective Knowledge pipelines to automate and reproduce machine learning experiments

    Authors: Grigori Fursin

    Abstract: Reproducing, comparing and reusing results from machine learning and systems papers is a very tedious, ad hoc and time-consuming process. I will demonstrate how to automate this process using open-source, portable, customizable and CLI-based Collective Knowledge workflows and pipelines developed by the community. I will help participants run several real-world non-virtualized CK workflows from the…

    Submitted 30 March, 2019; originally announced April 2019.

    Comments: Accepted demo at the Conference on Systems and Machine Learning (SysML'19)

  11. arXiv:1806.07060  [pdf, other]

    cs.PF cs.DC cs.MS cs.SE

    A model-driven approach for a new generation of adaptive libraries

    Authors: Marco Cianfriglia, Flavio Vella, Cedric Nugteren, Anton Lokhmotov, Grigori Fursin

    Abstract: Efficient high-performance libraries often expose multiple tunable parameters to provide highly optimized routines. These can range from simple loop unroll factors or vector sizes all the way to algorithmic changes, given that some implementations can be more suitable for certain devices by exploiting hardware characteristics such as local memories and vector units. Traditionally, such parameters…

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: New detailed analysis will be provided

    Report number: Volume 18 Issue 1 Pages 1-24

    Journal ref: ACM Transactions on Architecture and Code Optimization 2021

  12. arXiv:1801.08024  [pdf, other]

    cs.HC cs.CY

    A Collective Knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques

    Authors: Grigori Fursin, Anton Lokhmotov, Dmitry Savenko, Eben Upton

    Abstract: Developing efficient software and hardware has never been harder whether it is for a tiny IoT device or an Exascale supercomputer. Apart from the ever growing design and optimization complexity, there exist even more fundamental problems such as lack of interdisciplinary knowledge required for effective software/hardware co-design, and a growing technology transfer gap between academia and industr…

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: Interactive CK report: http://cKnowledge.org/rpi-crowd-tuning ; CK repository with artifacts: https://github.com/ctuning/ck-rpi-optimization-results ; FigShare data archive: https://doi.org/10.6084/m9.figshare.5789007.v2

  13. arXiv:1801.06378  [pdf, other]

    stat.ML cs.LG cs.SE

    Introducing ReQuEST: an Open Platform for Reproducible and Quality-Efficient Systems-ML Tournaments

    Authors: Thierry Moreau, Anton Lokhmotov, Grigori Fursin

    Abstract: Co-designing efficient machine learning based systems across the whole hardware/software stack to trade off speed, accuracy, energy and costs is becoming extremely complex and time consuming. Researchers often struggle to evaluate and compare different published works across rapidly evolving software frameworks, heterogeneous hardware platforms, compilers, libraries, algorithms, data sets, models,…

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: ReQuEST tournament website: http://cKnowledge.org/request

  14. arXiv:1506.06256  [pdf, other]

    cs.SE cs.LG cs.PF

    Collective Mind, Part II: Towards Performance- and Cost-Aware Software Engineering as a Natural Science

    Authors: Grigori Fursin, Abdul Memon, Christophe Guillon, Anton Lokhmotov

    Abstract: Nowadays, engineers have to develop software often without even knowing which hardware it will eventually run on in numerous mobile phones, tablets, desktops, laptops, data centers, supercomputers and cloud services. Unfortunately, optimizing compilers are not keeping pace with ever increasing complexity of computer systems anymore and may produce severely underperforming executable codes while wa…

    Submitted 20 June, 2015; originally announced June 2015.

    Comments: Presented at the 18th International Workshop on Compilers for Parallel Computing (CPC'15), London, UK

  15. arXiv:1412.2347   

    cs.PF

    Proceedings of the 5th International Workshop on Adaptive Self-tuning Computing Systems 2015 (ADAPT'15)

    Authors: Christophe Dubach, Grigori Fursin

    Abstract: This is the proceedings of the 5th International Workshop on Adaptive Self-tuning Computing Systems 2015 (ADAPT'15).

    Submitted 7 December, 2014; originally announced December 2014.

    Report number: ADAPT/2015/00

  16. arXiv:1407.4075  [pdf, other]

    cs.PL cs.LG

    Finding representative sets of optimizations for adaptive multiversioning applications

    Authors: Lianjie Luo, Yang Chen, Chengyong Wu, Shun Long, Grigori Fursin

    Abstract: Iterative compilation is a widely adopted technique to optimize programs for different constraints such as performance, code size and power consumption in rapidly evolving hardware and software environments. However, in case of statically compiled programs, it is often restricted to optimizations for a specific dataset and may not be applicable to applications that exhibit different run-time behav…

    Submitted 14 July, 2014; originally announced July 2014.

    Comments: 3rd Workshop on Statistical and Machine Learning Approaches Applied to Architectures and Compilation (SMART'09), co-located with HiPEAC'09 conference, Paphos, Cyprus, 2009

  17. arXiv:1407.3487  [pdf, other]

    cs.DC

    Collective Tuning Initiative

    Authors: Grigori Fursin

    Abstract: Computing systems rarely deliver best possible performance due to ever increasing hardware and software complexity and limitations of the current optimization technology. Additional code and architecture optimizations are often required to improve execution time, size, power consumption, reliability and other important characteristics of computing systems. However, it is often a tedious, repetitiv…

    Submitted 13 July, 2014; originally announced July 2014.

    Comments: GCC Developers' Summit'09, 14 June 2009, Montreal, Canada

  18. arXiv:1406.4020  [pdf, ps, other]

    cs.DL

    Community-driven reviewing and validation of publications

    Authors: Grigori Fursin, Christophe Dubach

    Abstract: In this report, we share our practical experience on crowdsourcing evaluation of research artifacts and reviewing of publications since 2008. We also briefly discuss encountered problems including reproducibility of experimental results and possible solutions.

    Submitted 16 June, 2014; originally announced June 2014.

  19. arXiv:1308.2410  [pdf, other]

    cs.SE cs.HC stat.ML

    Collective Mind: cleaning up the research and experimentation mess in computer engineering using crowdsourcing, big data and machine learning

    Authors: Grigori Fursin

    Abstract: Software and hardware co-design and optimization of HPC systems has become intolerably complex, ad-hoc, time consuming and error prone due to enormous number of available design and optimization choices, complex interactions between all software and hardware components, and multiple strict requirements placed on performance, power consumption, size, reliability and cost. We present our novel long-…

    Submitted 11 August, 2013; originally announced August 2013.

    Comments: I started drafting this document at the beginning of the development of the 3rd version of plugin-based cTuning infrastructure and repository (aka Collective Mind) to systematize and crowdsource program and architecture auto-tuning; (2013)