Overview

Authors:

Sergey I. Nikolenko ⁰

Sergey I. Nikolenko
1. Synthesis AI, San Francisco, USA
View author publications

You can also search for this author in PubMed Google Scholar

The first book about synthetic data, an important field which is rapidly rising in popularity throughout machine learning
Provides a wide survey of several different fields where synthetic data is or can potentially be useful, including domain adaptation and differential privacy
Contains a very extensive list of references, and in certain specific fields goes sufficiently in-depth to say that it discusses or at least mentions all relevant work

Part of the book series: Springer Optimization and Its Applications (SOIA, volume 174)

81k Accesses
42 Altmetric

This is a preview of subscription content, log in via an institution to check access.

Access this book

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

eBook USD 129.00

Price excludes VAT (USA)

Softcover Book USD 169.99

Price excludes VAT (USA)

Hardcover Book USD 169.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

This is the first book on synthetic data for deep learning, and its breadth of coverage may render this book as the default reference on synthetic data for years to come. The book can also serve as an introduction to several other important subfields of machine learning that are seldom touched upon in other books. Machine learning as a discipline would not be possible without the inner workings of optimization at hand. The book includes the necessary sinews of optimization though the crux of the discussion centers on the increasingly popular tool for training deep learning models, namely synthetic data. It is expected that the field of synthetic data will undergo exponential growth in the near future. This book serves as a comprehensive survey of the field.

In the simplest case, synthetic data refers to computer-generated graphics used to train computer vision models. There are many more facets of synthetic data to consider. In the section on basic computer vision, the book discusses fundamental computer vision problems, both low-level (e.g., optical flow estimation) and high-level (e.g., object detection and semantic segmentation), synthetic environments and datasets for outdoor and urban scenes (autonomous driving), indoor scenes (indoor navigation), aerial navigation, and simulation environments for robotics. Additionally, it touches upon applications of synthetic data outside computer vision (in neural programming, bioinformatics, NLP, and more). It also surveys the work on improving synthetic data development and alternative ways to produce it such as GANs.

The book introduces and reviews several different approaches to synthetic data in various domains of machine learning, most notably the following fields: domain adaptation for making synthetic data more realistic and/or adapting the models to be trained on synthetic data and differential privacy for generating synthetic data with privacy guarantees. This discussion is accompanied by an introduction into generative adversarial networks (GAN) and an introduction to differential privacy.

Review and analysis of synthetic dataset generation methods and techniques for application in computer vision

Article 30 January 2023

A Survey of Synthetic Data Augmentation Methods in Machine Vision

Article 20 March 2024

Synthetic Data for Video Surveillance Applications of Computer Vision: A Review

Article Open access 17 May 2024

Keywords

Table of contents (12 chapters)

Front Matter

Pages i-xii

Download chapter PDF
Introduction: The Data Problem
- Sergey I. Nikolenko
Pages 1-17
Deep Learning and Optimization
- Sergey I. Nikolenko
Pages 19-58
Deep Neural Networks for Computer Vision
- Sergey I. Nikolenko
Pages 59-95
Generative Models in Deep Learning
- Sergey I. Nikolenko
Pages 97-137
The Early Days of Synthetic Data
- Sergey I. Nikolenko
Pages 139-159
Synthetic Data for Basic Computer Vision Problems
- Sergey I. Nikolenko
Pages 161-194
Synthetic Simulated Environments
- Sergey I. Nikolenko
Pages 195-215
Synthetic Data Outside Computer Vision
- Sergey I. Nikolenko
Pages 217-226
Directions in Synthetic Data Development
- Sergey I. Nikolenko
Pages 227-234
Synthetic-to-Real Domain Adaptation and Refinement
- Sergey I. Nikolenko
Pages 235-268
Privacy Guarantees in Synthetic Data
- Sergey I. Nikolenko
Pages 269-283
Promising Directions for Future Work
- Sergey I. Nikolenko
Pages 285-294
Back Matter

Pages 295-348

Download chapter PDF

Authors and Affiliations

Synthesis AI, San Francisco, USA

Sergey I. Nikolenko

About the author

Sergey I. Nikolenko is a computer scientist specializing in machine learning and analysis of algorithms. He is the Head of AI at Synthesis AI, a San Francisco based company specializing on the generation and use of synthetic data for modern machine learning models, and also serves as the Head of the Artificial Intelligence Lab at the Steklov Mathematical Institute at St. Petersburg, Russia. Dr. Nikolenko's interests include synthetic data in machine learning, deep learning models for natural language processing, image manipulation, and computer vision, and algorithms for networking. His previous research includes works on cryptography, theoretical computer science, and algebra.

Bibliographic Information

Book Title: Synthetic Data for Deep Learning
Authors: Sergey I. Nikolenko
Series Title: Springer Optimization and Its Applications
DOI: https://doi.org/10.1007/978-3-030-75178-4
Publisher: Springer Cham
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2021
Hardcover ISBN: 978-3-030-75177-7Published: 27 June 2021
Softcover ISBN: 978-3-030-75180-7Published: 28 June 2022
eBook ISBN: 978-3-030-75178-4Published: 26 June 2021
Series ISSN: 1931-6828
Series E-ISSN: 1931-6836
Edition Number: 1
Number of Pages: XII, 348
Number of Illustrations: 25 b/w illustrations, 100 illustrations in colour
Topics: Machine Learning, Operations Research, Management Science, Image Processing and Computer Vision

Publish with us

Policies and ethics

Synthetic Data for Deep Learning

Overview

Access this book

Subscribe and save

Buy Now

Other ways to access

About this book

Similar content being viewed by others

Review and analysis of synthetic dataset generation methods and techniques for application in computer vision

A Survey of Synthetic Data Augmentation Methods in Machine Vision

Synthetic Data for Video Surveillance Applications of Computer Vision: A Review

Keywords

Table of contents (12 chapters)

Front Matter

Introduction: The Data Problem

Deep Learning and Optimization

Deep Neural Networks for Computer Vision

Generative Models in Deep Learning

The Early Days of Synthetic Data

Synthetic Data for Basic Computer Vision Problems

Synthetic Simulated Environments

Synthetic Data Outside Computer Vision

Directions in Synthetic Data Development

Synthetic-to-Real Domain Adaptation and Refinement

Privacy Guarantees in Synthetic Data

Promising Directions for Future Work

Back Matter

Authors and Affiliations

Synthesis AI, San Francisco, USA

About the author

Bibliographic Information

Publish with us

Navigation

Synthetic Data for Deep Learning

Overview

Access this book

Subscribe and save

Buy Now

Other ways to access

About this book

Similar content being viewed by others

Keywords

Table of contents (12 chapters)

Front Matter

Back Matter

Authors and Affiliations

Synthesis AI, San Francisco, USA

About the author

Bibliographic Information

Publish with us

Search

Navigation