Whitepaper 0.1: The Blockchain Platform To Collectively Build AI Apps
Whitepaper 0.1: The Blockchain Platform To Collectively Build AI Apps
Whitepaper 0.1: The Blockchain Platform To Collectively Build AI Apps
0.1
The blockchain platform to collectively
build AI Apps
dbrain.io
1
Introduction
Artificial Intelligence (AI) happens to be the next big thing. AI is all about data.
Datasets used for machine learning are still labeled by hand, which requires a lot
of effort. This creates a lot of friction: labeling quality is not guaranteed, and the
initial source data is not secured. Dbrain allows people to work together in se-
cure, seamless, integrated processes for buying, designing, and building AI apps,
from start to finish.
Dbrain is an open blockchain platform that links crowdworkers and data scien-
tists enabling them to transform raw data into real-world AI solutions. Crowd-
workers do simple tasks of data labelling and validation, and are paid instantly in
cryptocurrency for their work. Data scientists use the resulting datasets to train
Neural Networks (NN) and build AI apps. Businesses use existing AI solutions or
specify new ones to serve their particular needs. Dbrain automates AI produc-
tion and data workflow by providing efficient tools to all parties, including a web
application, a Telegram bot, and a mobile app.
2
Blockchain technology helps us meet many of AI’s current challenges. Using
the blockchain, we can confidently manage high-quality data labelling, security
concerns, intellectual property rights, and international micropayments. Using
existing commercial computation infrastructure allows us to build an affordable,
scalable toolkit for developing and deploying AI apps.
Anyone with a connected device can join Dbrain and get a role in building In-
dustry 4.0. Our platform connects exploding demand for hand-labeled AI data
with the abundant supply of global crowdworkers. In particular, we reach 2 billion
unbanked people in low-wage countries, offering them cryptocurrency income
in exchange for data labeling and validation. Integrating this global workforce
into its platform, Dbrain provides a secure, unified infrastructure to supercharge
businesses through accessible, high-quality AI products.
Right now, AI is off limits to all but the wealthiest and most powerful operations.
Dbrain makes AI affordable to more customers. We make AI buildable by more
developers. We make AI profitable for more workers. We democratize AI.
3
Contents
Introduction.............................................2 5. Team........................................................25
5.1. Founders.............................................25
Contents....................................................4 5.2. Advisors..............................................26
5.3. Core team..........................................27
1. Challenges...............................................5 5.4. Partnerships.....................................28
1.1. High-quality datasets.......................5 5.5. Background......................................29
1.2. Security and trust...............................6 5.6. Icon8....................................................29
1.3. Abundant crowdwork supply........6 5.7. Connectcome..................................30
1.4. Last-mile infrastructure..................8
6. Roadmap...............................................31
2. Platform....................................................9
2.1. AI production line...............................9 Disclaimer..............................................32
2.2. Blockchain and crypto...................11
2.3. SPOCK protocol...............................11
2.4. PICARD protocol.............................12
2.5. Dbraincoin (DBR)............................13
2.6. Product................................................14
2.7. Web application..............................15
2.8. Telegram bot.....................................16
2.9. Mobile application.........................17
2.10. Competitive advantages...........18
3. Use cases..............................................19
3.1. Image recognition............................19
3.2. Video surveillance.........................20
3.3. Medical data processing.............21
3.4. Natural language processing...22
3.5. You name it.......................................23
4. Revenue model...................................24
4
1. Challenges
Blockchain technology helps us meet many of AI’s current challenges. Using
existing commercial computation infrastructure allows us to build an affordable,
scalable toolkit for developing, integrating, and deploying AI apps.
Dbrain guarantees quality of datasets without any work duplication. To align the
incentives for crowdworkers, validators, AI developers, and data owners, Dbrain
implements the Subjective Proof of Crowdwork Protocol (SPOCK), which vali-
dates data quality automatically and guarantees real-time, fair, transparent billing
to workers and data owners.
5
1.2. Security and trust
Sharing sensitive data with third parties, and even in-house developers, poses
certain security risks. AI developers can replicate third-party software within a
very short time when given access to someone else’s data. Labeled data, rather
than software, are the defensible barrier for many businesses. Data owners lose
revenue when datasets are leaked to third parties.
Dbrain protects data owners’ interests and prevents leaks at all stages of AI app
development. No matter who uploads data on the platform, the Protocol for Indi-
rect Controlled Access to Repository Data (PICARD) protects datasets and AI
apps hosted on the platform. It also allows data scientists to train AI models using
datasets without downloading them, and to sell AI solutions to business clients
later on. The protocol guarantees security and trust in the Dbrain community
with regards to data access control and reward distribution.
In the 10 largest developing countries, the total number of internet users is close
to 2 billion; with nearly 50% technology penetration, the online population is
growing rapidly1. The number of internet users in these countries is greater than
in all other countries combined. At the same time, the World Bank estimates that
there are around 2 billion unbanked people in the world2. Clearly, internet con-
nectivity reaches the developing world much faster than the banking system, and
many people connected to the internet are still excluded from the global finan-
cial system. Cross-border payments via banks are expensive, slow, and location
dependent. Cryptocurrencies can solve this problem by reaching any person
connected to the internet.
The supply of online crowdwork is abundant globally. The World Bank estimates
that in 2013 the minimum total supply of crowdwork was $239B, while the market
demand in 2016 was $4.8B, or 50 times less than the work supply. Only demand
limits the market growth.
1 — (https://www.internetworldstats.com/)
2 — World Bank “Measuring Financial Inclusion around the World” (http://www.worldbank.org/en/programs/globalfindex)
3 — World Bank “The Global Opportunity in Online Outsourcing” (https://openknowledge.worldbank.org/handle/10986/22284)
6
$ 240 B+
CR OWDWOR K
MA R K ET*
D BR A IN
MA R K ET FIT
$200B+
A I M A R K ET *
Dbrain is the channel for AI data labeling tasks. We see this situation as a blue
ocean opportunity that allows us to build and control a substantial part of that
market instead of fighting for a market share. Our platform will satisfy the explod-
ing demand for human-labeled data and human-in-the-loop APIs. We will pro-
vide a better income for people who need it most and improve their quality of life,
while managing to reduce data labeling costs for our customers.
7
1.4. Last-mile infrastructure
Even the most sophisticated AI platform is useless without access to end users.
To use AI solutions in the real world, businesses need to find AI developers, the
scarcest resource on the market. Developers need access to scalable and afford-
able AI computation infrastructure to train and deploy their AI Apps. They also
need access to raw data and crowdworkers for data labeling and model output
validation. Labelers need simple, accessible interfaces, and micropayment chan-
nels to be paid for their work.
The Dbrain platform allows users to deploy and share AI solutions. Anyone can
use AI models via convenient on-demand APIs without incurring development
and infrastructure costs.
8
2. Platform
Dbrain is an open blockchain platform for turning raw data into real-world AI
solutions. We make AI accessible to businesses and allow anyone to earn money
for their effort.
Dbrain levels the playing field for all participants on the AI market.
9
AI production line
Data
R AW
Data Owners upload data and host
DATA datasets on the platform
DBR
SPOCK
Label
DATA
Labelers label data and validate
S ET datasets to get DBR in return
DBR
PICARD
Train
DBR
API
DBR
Profit
10
2.2. Blockchain and crypto
The Dbrain platform works on the Ethereum network and relies on its smart con-
tracts. We’re building a scalable permissioned blockchain anchored to the Ethe-
reum network via state channels. Our solution can securely process thousands of
transactions per second which all involved parties can verify independently. We
implement two blockchain protocols for decentralized access to our platform
and an in-house cryptocurrency.
All work tasks performed on the Dbrain platform require multiple validations by
other random labelers. Validators either do the same work for the simplest tasks
such as image classification, or confirm the correctness of complex tasks. When
the majority of validators agree on the task result quality, then the original worker
receives a payment and a higher rating. Workers get a lower rating and no pay-
ment for rejected tasks.
Validators who approve a bad result, if present, are punished with a significantly
reduced rating. Such a system does not discourage conscientious validators from
being suspicious, because when they are right, they receive a higher rating. How-
ever, it does strongly discourage validators from accepting wrong results.
With validators motivated to accept good results and even more motivated to
reject bad results, the best strategy for workers is to do their best and deliver cor-
rect results,while the best strategy for validators is to accept the correct results.
Such behavior is a Nash equilibrium in this contrived game and payoff matrix.
11
We have several requirements for our rating and task validation system to be able
to process task completion and validation in real time:
Transparency. All rating changes and billing events should be visible to task
requesters and workers online.
The Protocol for indirect controlled access to repository data (PICARD) pro-
tects datasets and AI applications hosted on the Dbrain platform and allows data
scientists to train AI models using the datasets without downloading them, and
to sell AI solutions to business clients later. The protocol allows data scientists to
work on a contract basis as well as to contribute to community owned datasets
and public kernels. It also allows participation in Kaggle-like competitions on
openly listed challenges.
12
We log every access to the data and model APIs as a transaction in our private
blockchain. The entire access history is available to dataset and model owners.
The owners can completely restrict access to their data. However, a much more
interesting scenario is indirect controlled access to datasets and models. Any
developer can build a new better model or mix an existing one with some secret
sauce and sell the results as a new API on the platform. Owners can set a public
price on their data and models and list them publicly on the Dbrain platform.
Alternatively, the owners of existing data and models and new developers can
agree on revenue sharing from the resulting product. Our very granular access
control records history into immutable ledgers audited by third parties, guaran-
teeing fair and precise distribution of revenues from AI Apps.
13
2.6. Product
The Dbrain web application integrated with Ethereum (DApp) allows every
Internet user to perform tasks, earn Crypto and withdraw it with a single click.
Telegram bot for simple data labeling and task validation gives us access to
crowdworker audience with the least imaginable friction. Our upcoming mobile
app will provide user interface for complex tasks on smartphones and tablets and
allow to collect custom data from crowdworkers.
Later in 2018, we will release our data science computation tool and present it
to the public as a Jupyter notebook with all modern ML libraries and tools and a
connection to our data.
14
2.7. Web application
The Dbrain web application integrated with Ethereum (DApp) provides an intu-
itive tool for data labeling and validation tasks for crowdworkers. The complex
user interface allows crowdworkers to perform advanced tasks, such as image
labeling for classification and regression, object annotation with bounding boxes
and segmentation masks.
15
2.8. Telegram bot
The Telegram Bot is ideal for simple image labeling and validation tasks.3 Anyone
with the internet connected device can label data and get paid instantly with
Dbraincoins. Smartphones are more accessible and widespread in developing
countries than laptops, while internet penetration is high, which gives us an edge
in accessing workers in those regions.
Easy way
to label data
With the Telegram bot, we reach
millions of unbanked people to give
them an income stream in Crypto.
16
2.9. Mobile application
Mobile apps are a great tool to create new data — video, audio, photos, acceler-
ation, GPS coordinates and touch input. Our app will allow any platform user to
become a data provider and earn additional Dbraincoins.
Coming soon
App Store
Google Play
17
2.10. Competitive advantages
There are a few large existing crowdwork platforms that are mostly used for
AI-related tasks (e.g., Amazon Mechanical Turk, Yandex.Toloka). However, they
fail to meet most of AI developers’ needs . Developers need to find raw data and
upload tasks or validate data themselves. Then developers need access to AI
compute infrastructure in order to train models and build AI Apps. Finally, devel-
opers need to deploy AI Apps somewhere to make them available to business cli-
ents. The Dbrain platform covers all stages of the AI production cycle and offers a
comprehensive solution.
Upload data
Label data
Validation
Payments in Crypto
Instant payments
Chatbot integration
Low cost
Data security
Revenue distribution
AI marketplace
Global reach
AI is by far the major case for online crowdwork that could be formalized with a
decentralized protocol at scale. Our crowdwork solution as part of our AI plat-
form adds much more value to all AI users, and allows for human work integration
with AI Apps.
18
3. Use cases
The Dbrain platform provides a scalable and accessible infrastructure to super-
charge businesses with high quality AI, integrated via a convenient API. We offer
a wide range of turnkey and custom AI solutions, integration, and customization
for our clients’ particular needs. Static image recognition, video surveillance and
action detection, medical data processing, and content analysis of text streams,
which currently lack working solutions would benefit from business-ready AI
solutions. These areas account for almost half of the future AI market; they are
our target.
Image recognition (including classification and tagging) is one of the most com-
monly applied AI use cases today. Image recognition is an area that is developing
rapidly and that will have a major impact on the consumer, automotive, adver-
tising, healthcare, defense, media, and entertainment industries. People com-
municate in images, and images are essential for product discovery nowadays.
Businesses spend billions every year on repetitive graphic design tasks.
19
Pizza is OK!
20
Hands washing
21
Cancer tumor
One of the most important AI use cases is cancer detection. Deep learning
models can assist pathologists in this task. A pathologist’s report serves as a basis
for diagnosing many diseases. Despite the fact that pathologists study for many
years in order to improve their cancer prediction skills, even today AI can detect
cancer with higher accuracy than doctors can. Like in all AI models, more and
better data are needed in order to improve its performance.
Pathologists, patients and medical organizations can leverage the Dbrain pro-
tocol and contribute their datasets to AI models without sacrificing patients’
privacy. They will help to create an AI detection algorithm that can complement
pathologists’ workflow naturally. Such a platform could help predict cancer at
early stages, thus saving millions of cancer patients’ lives .
22
3.4. Natural language processing
NLP is an AI application that recognizes not only formal content of texts, but also
their sentiment and meaning. AI can also detect messages that signal danger-
ous situations, for example, a terrorist threat or suicide intention. Telegram is one
of the leading messengers worldwide. It has more than 100 million active users
and delivers over 15 billion messages daily. Telegram has recently been blocked
in Indonesia by the government, which said that the messenger is “full of radical
and terrorist propaganda”. The developers of Telegram do not provide access
to users’ messages to any governments or officials. Therefore, AI in combination
with a human feedback loop is the only possible solution for content moderation.
Another use case for AI in NLP is chatbots that streamline consumer experience
in online services. Such chatbots could replace people for simple queries, and
could be of great assistance to people with complex queries. Dbrain will give
access to distributed workers who will perform text analysis, train AI models and
validate ambiguous AI results in near real-time.
We offer a wide range of turnkey and custom AI solutions, integration, and cus-
tomization for our clients’ particular needs. Become one!
Request AI App
23
4. Revenue model
We charge a 10% commission from every transaction on our platform to com-
pensate our costs of running the infrastructure and maintaining a healthy plat-
form. Our commission is much lower than those charged by the existing crowd-
work platforms. We believe that zero commissions are unsustainable for a large
crowdwork platform. Those who promise to never charge any money for the
value they add either aren’t going to build a sustainable business, or aren’t telling
the whole story, or don’t add any real value in the long run.
Up to 40%
10%
Dbrain Amazon Mechanical Turk
Commission comparision
Our AI platform will save our clients much more than the commission we charge,
because they do not have to set up any infrastructure for data labeling, AI devel-
opment, training, and deployment.
24
5. Team
5.1. Founders
25
5.2. Advisors
26
Dr. Antti Saarnio Ilya Glazyrin
Matthew Graham
27
5.3. Core team
Sergey Kananykhin
Backend developer
Dmitry Dubsky
Frontend developer
David Kuryakin
Frontend developer
Danila Makarov
Designer
28
5.4. Partnerships
nvidia.com
chronobank.io
bizspark.microsoft.com
29
5.5. Background
5.6. Icon8
The Dbrain team created the Icon8 AI chatbot, which was #1 on Telegram, #3
globally on Facebook, and ranked as the #1 bot in 2016 by VentureBeat. The bot
applies an artistic style to any submitted picture and returns the result almost
immediately. The bot received a grant personally from Telegram CEO Pavel
Durov for cloud infrastructure financing . VentureBeat included Dmitry Matskev-
ich within the list of 100 people to watch in the chatbot space, right next to Pavel
Durov.
30
The Icon8 chatbot was mentioned as one of the best examples of good user ex-
perience in a book by Slack director Amir Shevat.
5.7. Connectome
31
6. Roadmap
Q3 2017 Q4
Proof of concept and research phase Product development and MVP
to define the need for a collective AI testing on the first business clients
development tool
Private seed round for $2.5m
Assembling the team
Q1 2018 Q2
Public Alpha version of web appli- Public Beta for training neural net-
cation and Telegram bot to label works on labeled data, mobile app
and validate data for crowdworkers for data labeling
(SPOCK)
Dbraincoin (DBR) issue
Coin sale
Q3 Q4
Launch of the fully-running block- Scaling platform to meet new mar-
chain platform for building AI Apps kets with a focus on ever-growing AI
with API integration for businesses community and client base
32
Disclaimer
This document is for informational purposes only and does not constitute an
offer or solicitation to sell any securities in any jurisdiction. Any such offer or solic-
itation will be made only by means that are in compliance with the applicable se-
curities and other laws of the relevant jurisdiction(s). No information or opinions
presented herein are intended to form the basis for any purchase decision, and
this document does not constitute investment advice or counsel. This document
is not part of, and may not be relied on in connection with, any contract or com-
mitment whatsoever. Any purchase or sale involving Dbrain will be set forth and
governed exclusively by other documents. Dbrain expressly disclaims any and all
responsibility for any direct or consequential loss or damage of any kind whatso-
ever arising directly or indirectly from: (i) reliance on any information contained in
this document; (ii) any error, omission or inaccuracy in any such information; and
(iii) any action resulting therefrom.
33