Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
SphereEx
Overview
Linking Data & Services Simply
Company
Overview
About Us
Core Team
01
Open Source
Ecosystem
Open Source Projects
Open Source Ecosystem
03
Technology
Deep Dive
Industry Snapshot
02
Product &
Solution
Product Concept
Functions & Properties
Solutions
04
Industry
Recognition
Honors
Media Mentions
05
Contents
About Us|Founder Team
01 Company Overview
About Us
SphereEx -Linking Data & Services Simply
SphereEx was established in April 2021, received millions of dollars in investment in May 2021 for its Seed round, and
completed a Pre-A round of financing worth millions of dollars on January 4, 2022.
Headquartered in Beijing, with R&D centers in Hangzhou, Chengdu, Nanjing, SphereEx was founded by the core team of
Apache ShardingSphere.
SphereEx provides enterprises with new generation distributed data service infrastructures, and builds products &
solutions enabling actual enterprise implementation scenarios to thrive.
Our core competence is improving users’ data service capabilities by creating upper-level standards and ecosystem layer
above fragmented databases’ basic services.
To allow enterprises easily deal with the challenges of complex data application scenarios due to increasing databases
fragmentation, we built the pioneering "Database Plus" concept - designed to unlock innovative data usage.
Zhang Liang
SphereEx Founder & CEO
• Notable personality in the open source and
database architecture fields.
• Over 10 years of development and practical
experience in the database field.
• Served in leadership positions at many
enterprises including JD.com and DangDang.
• Apache Member, Microsoft MVP and Tencent
Cloud Valuable Professional.
• Apache ShardingSphere Founder & PMC Chair
• Author of the book "Future Architecture: from
SOA to Cloud Native".
Pan Juan
SphereEx Co-Founder & CTO
• Served as Senior DBA at JD Technology.
• “2020 China Open Source Pioneer” award
recipient.
• 2021 China Open Source Cloud League
Outstanding Contributor.
• China Mulan Open Source Community Mentor
& AWS DATA Hero.
• China’ s Only Woman Apache Member.
• Notable personality in the open source and
database architecture fields.
Core Team
Zhang Lei
SphereEx Co-Founder & CTO
• Founded three companies.
• Over 20 years of solution and sales
management experience.
• Served as Vice President at several start-
ups.
• Served as Business Department Executive
at leading domestic IT companies.
• Served as Sales Director at leading
multinational IT company.
• Familiar and experienced in finance, public,
Internet and other sectors.
Xu Ying
SphereEx Co-Founder & CTO
• Expert in Internet verticals’ business
operation and maintenance.
• 10+ years of experience in network,
applications, architecture, IT and data.
• Served as Senior Technical Director at
Dangdang Dangdang Information
Technology.
• Tencent Cloud Valuable Professional and
iTech Member.
• Specially-appointed expert at China General
Chamber of Commerce (CGCC)
• Served as CLS.cn think tank expert.
SphereEx Development Milestones
From Middleware to Data Service Ecosystem
2015
Sharding-JDBC
Project Initiation
2016
Sharding-JDBC
Officially Open
Sourced
2018
ShardingSphere
Project Enters the
Apache Incubator
2020
ElasticJob Becomes
a Sub-Project
ShardingSphere
Crowned Apache
Top-Level Project
The Pluggable
Architecture is
Complete
2021
SphereEx is
Founded
Open Source→
Commercial
The Database Plus
Concept is Born
SphereEx
Enterprise Edition
1.0
2022
To Cloud
ShardingSphere in
the Cloud
Database Mesh
Release
2023
To Ecosystem
Build an Ecosystem
with a Pluggable
Platform and
Features
Industry Snapshot
02 Technology Deep Dive
Diversified Scenarios
Mass storage computing, multi-module
heterogeneous query, database high
availability, etc.
Vertical Data Application Plan
Data Security (Data Masking, Encryption and
Audit)
Data Access (Multiple Sources &
Heterogeneous)
Data Access Management To
Be Improved
Mostly, fined-grained database access control
is self-developed, inefficient and costly (circuit
breaker, traffic control, black and white lists,
etc.).
Ever-Expanding Data
Rapidly increasing business data bring pressure to
database storage and querying.
Diversified Data Application
Scenarios
TP, AP, HTAP…
Diversified Database Architectures
Commercial centralized DB, open source
standalone DB, distributed DB…
Diversified Database Protocols
MySQL ecosystem + Postgres ecosystem +
vendor ecosystem
Lack of DBs Collaboration
Long-term co-existence of heterogeneous DBs and lack
of upper-level DB standard result in absence of DBs
collaboration.
Data Service Platform
Unified, enhanced data services platform
(distributed, heterogeneous, enhanced computing,
standardized, multi-access model, etc.).
Increasing O&M Costs Due to Low
Database Management Efficiency
The co-existence of relational & non-relational databases, open
source & commercial databases leads to inefficiency, steeper
learning curve and higher cost of database management.
Technical Challenges
Business architectures are evolving into microservices,
resulting in surging data access. Mass data lead to
ever-changing demand and more frequent iteration.
Industry Snapshot
Increasing databases fragmentation creates the need for a unified data services platform, and poses greater challenges to
traditional database management models.
Status Quo Pain Points Trends
Open Source Projects|Open Source Ecosystem
03 Open Source Ecosystem
Apache ShardingSphere Community Overview
• ShardingSphere is one of the Top-Level Apache Projects.
• Top Chinese Led Apache Project, 2020
• Top 10 Apache Project by Number of Commits, 2021
15000+ Stars
8000+ Pull Requests
300+ Contributors
5000+ Forks
• The World's Largest Open Source
Foundation
• 227M+ Lines of Code in Stewardship
• 300+ Top-Level Projects
Projects by Number of Commits, 2021
• As a part of the Linux Foundation,
CNCF’s mission is to make cloud
native computing ubiquitous.
• So far, 1,029 cards with a total of
2,984,816 stars, market cap of $16.3T
and funding of $27.2 billion.
CNCF: Cloud Native Computing Foundation
Apache ShardingSphere’s Global Development
Stargazers Over Time
Contributors Over Time
Product Concept|Functions & Properties|Solutions
04 Product & Solution
Product Design Concept: Database Plus
• Database Plus is our design concept of distributed database
system.
• It aims to build a standard layer and ecosystem above fragmented
databases and minimize or eliminate the challenges caused by
underlying databases. Guided by this concept, ShardingSphere
not only links all applications and databases, but also provides
enhanced capabilities such as data sharding and data encryption.
Connect Enhance Pluggable
Connecting data &
applications, focusing on
collaboration between
heterogeneous databases
Provide functional
enhancement services for
database data calculation
Adopting a pluggable architecture
design, with modules that are
independent of each other, to be
used independently or in
combination.
Core Value
Core Product Value
Industry-leading cost-
benefit ratio.
One-stop management for
heterogeneous tech stack.
Higher data performance than
distributed databases.
One-Stop Low Cost
High
Performance
Functional Architecture Diagram
Deployment Diagram
l High Performance & Multi-Language
Support
l Zero Intrusion / Low Coupling
l Multi-End Access, Unified Management
Provide both efficient access to Java and support
for heterogeneous languages, making it easier to
create, transfer and expand.
Proxy is deployed independently, without intrusion
into the database, and low coupling. Sharding-
JDBC is used in the same way as traditional JDBC.
Users can quickly use the enhanced capabilities
brought by ShardingSphere.
The driver end and the agent end are online at
the same time, sharing the source data
information of the cluster, achieving multi-end
access, and facilitating the cluster operation and
maintenance management.
SphereEx-Boot
SphereEx-Boot tool is a command line tool based on Python to facilitate the management of ShardingSphere-Proxy clusters.
The main functions of SphereEx-Boot are to install, uninstall, start, stop, view the running status, and other operations on
ShardingSphere-Proxy.
• Quick & Easy Implementation
With the SphereEx-Boot tool, you can run any
ShardingSphere-Proxy cluster component with just one
command line.
• Simple Operation and Maintenance
SphereEx-Boot tool can quickly install and deploy
ShardingSphere-Proxy cluster, as well as manage
ShardingSphere-Proxy cluster to reduce operation and
maintenance costs.
• Easy to Expand
The provided standardized horizontal expansion
function, can dynamically expand the cluster anytime
and anywhere by increasing the number of data servers.
SphereEx-Console
SphereEx-Console is a powerful dashboard applied to the management and control of the SphereEx Enterprise Data Service Platform. It
provides enhanced data service capabilities, such as data sharding, data security etc.
• Easy to Use
Improves user experience and eliminates
configuration errors. Users do not need to operate
the SphereEx Enterprise Data Service Platform
through configuration commands, greatly improving
user experience and achieving "zero" platform
bottlenecks.
• Comprehensive
Evolving the management capabilities of the open
source version, it provides a unified solution for the
management and control of the SphereEx
Enterprise Data Service Platform from basic
resources to plugin capabilities.
• Visual Monitoring
With its data visualization & management
dashboard, SphereEx Enterprise Data Service
Platform’s clusters, instances and hosts monitoring
data are visible online and in real time.
Kernel
SQL
Compatibility
MySQL ✔ ✔
PostgreSQL ✔ ✔
Oracle ✔* ✔
SQL Server ✔* ✔
openGauss ✔ ✔
SQL 92/2003 ✔ ✔
SQL
Extension
SQL Hint ✔ ✔
SQL Dialect Convert ✔ ✔
DistSQL ✔ ✔
Distributed
Transaction
XA Transaction ✔ ✔
Base Transaction ✔ ✔
Global Transaction ❌ ✔
Query
Optimization
Native Query Optimizer ✔ ✔
Federated Query
Optimizer
✔* ✔
Plugin
Data Sharding
Basic Sharding Strategy ✔ ✔
Customized Sharding Strategy ❌ ✔
Auto Sharding Management ❌ ✔
Pre-sharding ❌ ✔
Auto Shard Rebalance ❌ ✔
Distributed Object ✔* ✔
Data
Security- Encryption
Data Storage Encryption ✔ ✔
Open Source Encryptor ✔ ✔
Commercial Encryptor ❌ ✔
Simple Encryption Calculate ✔ ✔
Complex Encryption Calculate ❌ ✔
Data
Security - Authority
User Management ✔ ✔
Role Management ❌ ✔
Schema & Table Level Authority ❌ ✔
Column & Row Level Authority ❌ ✔
LDAP Integration ❌ ✔
3rd
Party System Integration ❌ ✔
Operation
Security- Audit
User Operation Audit (After) ❌ ✔
Rule Based Audit (Before) ❌ ✔
Elastic Scaling
Basic Migration ✔ ✔
Distributed Migration ❌ ✔
Data Consistency Verification ✔ ✔
Advanced Request Management ❌ ✔
Data Isolation
Test & Production data Isolation ✔ ✔
Test Data Event Source ✔ ❌
Open Source Version VS Commercial Version
Module Primary
Function
Secondary Function
Open Source
Version
Commercial
Version
Module Primary Function Secondary Function
Open Source
Version
Commercial
Version
Open Source Version VS Commercial Version
Management
Metadata
Management
Consistent Change
Online
❌ ✔
DDL Multiple Version ❌ ✔
High
Availability
HA for Compute Node ✔ ✔
HA for Storage Node ❌ ✔
Detected & Auth Health ✔ ✔
Monitoring &
Diagnosis
Machine Monitoring ❌ ✔
Component Monitoring ✔ ✔
Statement
Monitoring(Slow Query)
✔ ✔
Advanced Metrics ❌ ✔
Monitoring Alarm
Interfacing
❌ ✔
Link Tracking (APM) ✔ ✔
Delivery
Production
ShardingSphere-JDBC ✔ ✔
ShardingSphere-Proxy ✔ ✔
spex boot (Manage Command
Line Tool)
✔* ✔
spex portal (Manage Console) ❌ ✔
Compatible Estimation Tool ❌ ✔
Support
Community Support ✔ ✔
Commercial Support ❌ ✔
Support
Delivery
QA ✔ ✔
Architecture Consulting ❌ ✔
Training ❌ ✔
Remote Support ❌ ✔
On-site Support ❌ ✔
Install & Upgrade Service ❌ ✔
On-site Service ❌ ✔
Module Primary
Function
Secondary Function Open Source
Version
Commercial
Version
Module Primary
Function
Secondary Function Open Source
Version
Commercial
Version
What Is SphereEx Used For?
Massive Data Storage & Access
When relational database capabilities hit the
bottleneck, Apache ShardingSphere can
horizontally scale out the capabilities of storage
database.
Faced with relational and non-relational databases,
open source and commercial databases, to lower the
management and learning cost of heterogeneous
database, and improve data computing efficiency,
SphereEx provides heterogeneous data computing
services to enterprises, slashing their development cost
and increasing their efficiency.
Heterogeneous Data Computing
Due to large user base and frequent marketing
activities, sectors such as retailing are facing
slower core transaction system response.
SphereEx provides scale-out capability, improving
database efficiency to tackle high-concurrent
scenarios.
Online High Concurrency for Massive Data
To prevent data leak of plaintext storage, which is
resulted from both data theft and hackers and to
ensure the security of production data in application
scenarios, we provide a series of data masking rules
to conceal or obscure sensitive information.
Data Security
Distributed Database Solution
Distributed Database Solution: To solve existing solutions’ technological bottlenecks, reduce architecture replacement complexity and
risk, a distributed database solution based on data sharding, distributed transactions, and elastic scale-out is offered. It has both the
stability of standalone transaction databases, and the scalability of distributed databases.
Functional Features
• Multiple Database Dialects Parsing
• Lightweight JDBC Driver
• MySQL / PostgreSQL Protocol Support
• Horizontal Partitioning / Vertical
Partitioning
• Built-in Algorithm / Customization
• Distributed Transaction
• Elastic Migration
• Read/write splitting
• Distributed Management
Data Security Solution
Data Security Solution: To prevent data leaks and ensure data security, product-based data encryption and data desensitization
features provide enterprise users with a cross-platform data security solution for heterogeneous environments.
Data Security
• Multiple Database Dialects
Parsing
• MySQL / PostgreSQL Protocol
Support
• Data Encryption & Decryption/
Data Desensitization
• Built-in Desensitization
Algorithm/ Customization
• Online Encryption/ Conversion
of Plaintext Data to Encrypted
State
• Permission Control
Production Database’s Online Stress Testing Solution
Distributed database solution: The capability to parse and route SQL and command data, allows the creation of a shadow database
(testing database) for production. Production and testing data could be routed online to the testing and production according to the
configuration. Integrated with APM and gateway system, a complete solution from data marking, mark data tracking to data isolation for
a distributed system can be provided.
Online Data Stress Testing
• Multiple Database Dialects Parsing
• MySQL / PostgreSQL Protocol
Support
• Online Routing Strategy
• Built-in Desensitization Algorithm /
Customization
• Metric / Tracing Observability
• Linked Gateway Data Marking
Single Node Performance Comparison
0
100,000
200,000
300,000
400,000
500,000
thread1 thread20 thread100 thread200
OLTP Point Select
MySQL ShardingSphere-JDBC ShardingSphere-Proxy
thread1 7,195 8,309 2,745
thread20 102,685 105,258 42,387
thread100 315,075 318,691 108,571
thread200 440,372 451,842 148,815
[1]APP:56Core/192G,Proxy:48Core/96G,DB:48Core/96G
[2]1,000,000records
MySQL features with a single node, storing 1,00 0,000 data in one table. ShardingSphere-JDBC and ShardingSphere-Proxy are associated with one physical database, with no
splitting database or table strategies. One physical table has 1,000,000 data.
Thread MySQL ShardingSphere-JDBC ShardingSphere-Proxy
[1]APP:56Core/192G,Proxy:48Core/96G,DB:48Core/96G
OLTP Point Select Thread 200 48,685 261,778 207,779
Multi-node Performance Comparison
[2]40,000,000records
MySQL features with a single node, storing 4,000,000 data in one table. ShardingSphere-JDBC and ShardingSphere-Proxy are associated with 5 physical databases– each
database has 10 tables and 8,000,000 data.
OLTP Read Write Thread 200 1,820 43,257 21,811
0
10,000
20,000
30,000
40,000
50,000
OLTP Read Write Thread 200
OLTP Point Select(QPS)
MySQL ShardingSphere-JDBC ShardingSphere-Proxy
0
10,000
20,000
30,000
40,000
50,000
OLTP Read Write Thread 200
OLTP Read Write(TPS)
MySQL ShardingSphere-JDBC ShardingSphere-Proxy
QPS MySQL ShardingSphere-JDBC ShardingSphere-Proxy
TPS MySQL ShardingSphere-JDBC ShardingSphere-Proxy
ShardingSphere x JD Baitiao
ShardingSphere
Business Code
DBRep Components
MySQL
1
MySQL
2
MySQL
3
MySQL
N
Binlog
Zookeeper Flume Kafka
Baitiao
System
Database Cluster
Distributed Computing
HBase
Cluster
ES
Cluster
API
Big Data Platform
Operations System
ShardingSphere’s Multiple Advantages
• Simplified Upgrade Path
• Save R&D Cost
• Improve Architecture Extensibility
[1] https://www.mysql.com/customers/view/?id=1461
( 6,400 Shards / Over 100 Billion Records )
Honors|Strengths | Partnerships
05 Industry Recognition
Partnerships
SphereEx Honors
ShardingSphere Community Honors
• "Zhongguancun High-Tech Enterprise" Certification
• "Outstanding Open Source Innovation Enterprise" Honor
• “China’s Cutting-Edge Technology Pioneer Enterprise of 2021”
Honor
• “Outstanding Chinese Open Source Original Startups of 2021”
awarded by OSC
• Apache Top-Level Project
• CNCF Landscape
• Kunpeng Technology Certified Project
• China Academy of Information and Communications Technology Trusted
Open Source Community Project
• China Open Source Cloud League "Outstanding Open Source Project "
• Most Active Chinese-led Apache Project of the Year
Certifications & Honors
Product Capability Open Source Software
A plugin oriented
architecture allows both
standalone and mixed
deployments of functions
such as data sharding
and RW splitting.
Innovation Global
The Apache ShardingSphere
community includes lots of
users and contributors, and is
successfully used in many
application cases.
Our Strengths
Apache ShardingSphere is now
widely used by over 170
businesses from all walks of life.
Apache ShardingSphere technical
architecture is flexible and mature
enough to develop innovative
solutions for tomorrow.
Leveraging Cloud Native &
distributed technologies, while
being compatible with public Cloud,
Apache ShardingSphere builds a
flexible, scalable database cluster,
providing users with convenient
data application and connection
services.
Misson Mission Values
Customer Centric
Win-win cooperation
Growth & Achievements
Driven by Excellence
Corporate Culture
Build heterogeneous database
standards and ecosystems,
based on open source.
Serve the world and become the
new engine for data-based
ecosystems.
ICT Forum 2021 Apache Con
2021 AWS Summit QCon
2021 DTCC
Marketing Activities & Media Mentions
openGauss Summit 2021
Thank You
@ShardingSphere
linkedin.com/company/sphere-ex
@sphereextech
@shardingsphere
@ShardingSphere @sphereextech
@SphereEx @linkedin.com/company/sphere-ex
@ShardingSphere
linkedin.com/company/sphere-ex
@sphereextech
@shardingsphere
@ShardingSphere
linkedin.com/company/sphere-ex
@sphereextech
@shardingsphere

More Related Content

SphereEx pitch deck

  • 2. Company Overview About Us Core Team 01 Open Source Ecosystem Open Source Projects Open Source Ecosystem 03 Technology Deep Dive Industry Snapshot 02 Product & Solution Product Concept Functions & Properties Solutions 04 Industry Recognition Honors Media Mentions 05 Contents
  • 3. About Us|Founder Team 01 Company Overview
  • 4. About Us SphereEx -Linking Data & Services Simply SphereEx was established in April 2021, received millions of dollars in investment in May 2021 for its Seed round, and completed a Pre-A round of financing worth millions of dollars on January 4, 2022. Headquartered in Beijing, with R&D centers in Hangzhou, Chengdu, Nanjing, SphereEx was founded by the core team of Apache ShardingSphere. SphereEx provides enterprises with new generation distributed data service infrastructures, and builds products & solutions enabling actual enterprise implementation scenarios to thrive. Our core competence is improving users’ data service capabilities by creating upper-level standards and ecosystem layer above fragmented databases’ basic services. To allow enterprises easily deal with the challenges of complex data application scenarios due to increasing databases fragmentation, we built the pioneering "Database Plus" concept - designed to unlock innovative data usage.
  • 5. Zhang Liang SphereEx Founder & CEO • Notable personality in the open source and database architecture fields. • Over 10 years of development and practical experience in the database field. • Served in leadership positions at many enterprises including JD.com and DangDang. • Apache Member, Microsoft MVP and Tencent Cloud Valuable Professional. • Apache ShardingSphere Founder & PMC Chair • Author of the book "Future Architecture: from SOA to Cloud Native". Pan Juan SphereEx Co-Founder & CTO • Served as Senior DBA at JD Technology. • “2020 China Open Source Pioneer” award recipient. • 2021 China Open Source Cloud League Outstanding Contributor. • China Mulan Open Source Community Mentor & AWS DATA Hero. • China’ s Only Woman Apache Member. • Notable personality in the open source and database architecture fields. Core Team Zhang Lei SphereEx Co-Founder & CTO • Founded three companies. • Over 20 years of solution and sales management experience. • Served as Vice President at several start- ups. • Served as Business Department Executive at leading domestic IT companies. • Served as Sales Director at leading multinational IT company. • Familiar and experienced in finance, public, Internet and other sectors. Xu Ying SphereEx Co-Founder & CTO • Expert in Internet verticals’ business operation and maintenance. • 10+ years of experience in network, applications, architecture, IT and data. • Served as Senior Technical Director at Dangdang Dangdang Information Technology. • Tencent Cloud Valuable Professional and iTech Member. • Specially-appointed expert at China General Chamber of Commerce (CGCC) • Served as CLS.cn think tank expert.
  • 6. SphereEx Development Milestones From Middleware to Data Service Ecosystem 2015 Sharding-JDBC Project Initiation 2016 Sharding-JDBC Officially Open Sourced 2018 ShardingSphere Project Enters the Apache Incubator 2020 ElasticJob Becomes a Sub-Project ShardingSphere Crowned Apache Top-Level Project The Pluggable Architecture is Complete 2021 SphereEx is Founded Open Source→ Commercial The Database Plus Concept is Born SphereEx Enterprise Edition 1.0 2022 To Cloud ShardingSphere in the Cloud Database Mesh Release 2023 To Ecosystem Build an Ecosystem with a Pluggable Platform and Features
  • 8. Diversified Scenarios Mass storage computing, multi-module heterogeneous query, database high availability, etc. Vertical Data Application Plan Data Security (Data Masking, Encryption and Audit) Data Access (Multiple Sources & Heterogeneous) Data Access Management To Be Improved Mostly, fined-grained database access control is self-developed, inefficient and costly (circuit breaker, traffic control, black and white lists, etc.). Ever-Expanding Data Rapidly increasing business data bring pressure to database storage and querying. Diversified Data Application Scenarios TP, AP, HTAP… Diversified Database Architectures Commercial centralized DB, open source standalone DB, distributed DB… Diversified Database Protocols MySQL ecosystem + Postgres ecosystem + vendor ecosystem Lack of DBs Collaboration Long-term co-existence of heterogeneous DBs and lack of upper-level DB standard result in absence of DBs collaboration. Data Service Platform Unified, enhanced data services platform (distributed, heterogeneous, enhanced computing, standardized, multi-access model, etc.). Increasing O&M Costs Due to Low Database Management Efficiency The co-existence of relational & non-relational databases, open source & commercial databases leads to inefficiency, steeper learning curve and higher cost of database management. Technical Challenges Business architectures are evolving into microservices, resulting in surging data access. Mass data lead to ever-changing demand and more frequent iteration. Industry Snapshot Increasing databases fragmentation creates the need for a unified data services platform, and poses greater challenges to traditional database management models. Status Quo Pain Points Trends
  • 9. Open Source Projects|Open Source Ecosystem 03 Open Source Ecosystem
  • 10. Apache ShardingSphere Community Overview • ShardingSphere is one of the Top-Level Apache Projects. • Top Chinese Led Apache Project, 2020 • Top 10 Apache Project by Number of Commits, 2021 15000+ Stars 8000+ Pull Requests 300+ Contributors 5000+ Forks • The World's Largest Open Source Foundation • 227M+ Lines of Code in Stewardship • 300+ Top-Level Projects Projects by Number of Commits, 2021
  • 11. • As a part of the Linux Foundation, CNCF’s mission is to make cloud native computing ubiquitous. • So far, 1,029 cards with a total of 2,984,816 stars, market cap of $16.3T and funding of $27.2 billion. CNCF: Cloud Native Computing Foundation
  • 12. Apache ShardingSphere’s Global Development Stargazers Over Time Contributors Over Time
  • 13. Product Concept|Functions & Properties|Solutions 04 Product & Solution
  • 14. Product Design Concept: Database Plus • Database Plus is our design concept of distributed database system. • It aims to build a standard layer and ecosystem above fragmented databases and minimize or eliminate the challenges caused by underlying databases. Guided by this concept, ShardingSphere not only links all applications and databases, but also provides enhanced capabilities such as data sharding and data encryption. Connect Enhance Pluggable Connecting data & applications, focusing on collaboration between heterogeneous databases Provide functional enhancement services for database data calculation Adopting a pluggable architecture design, with modules that are independent of each other, to be used independently or in combination.
  • 15. Core Value Core Product Value Industry-leading cost- benefit ratio. One-stop management for heterogeneous tech stack. Higher data performance than distributed databases. One-Stop Low Cost High Performance
  • 17. Deployment Diagram l High Performance & Multi-Language Support l Zero Intrusion / Low Coupling l Multi-End Access, Unified Management Provide both efficient access to Java and support for heterogeneous languages, making it easier to create, transfer and expand. Proxy is deployed independently, without intrusion into the database, and low coupling. Sharding- JDBC is used in the same way as traditional JDBC. Users can quickly use the enhanced capabilities brought by ShardingSphere. The driver end and the agent end are online at the same time, sharing the source data information of the cluster, achieving multi-end access, and facilitating the cluster operation and maintenance management.
  • 18. SphereEx-Boot SphereEx-Boot tool is a command line tool based on Python to facilitate the management of ShardingSphere-Proxy clusters. The main functions of SphereEx-Boot are to install, uninstall, start, stop, view the running status, and other operations on ShardingSphere-Proxy. • Quick & Easy Implementation With the SphereEx-Boot tool, you can run any ShardingSphere-Proxy cluster component with just one command line. • Simple Operation and Maintenance SphereEx-Boot tool can quickly install and deploy ShardingSphere-Proxy cluster, as well as manage ShardingSphere-Proxy cluster to reduce operation and maintenance costs. • Easy to Expand The provided standardized horizontal expansion function, can dynamically expand the cluster anytime and anywhere by increasing the number of data servers.
  • 19. SphereEx-Console SphereEx-Console is a powerful dashboard applied to the management and control of the SphereEx Enterprise Data Service Platform. It provides enhanced data service capabilities, such as data sharding, data security etc. • Easy to Use Improves user experience and eliminates configuration errors. Users do not need to operate the SphereEx Enterprise Data Service Platform through configuration commands, greatly improving user experience and achieving "zero" platform bottlenecks. • Comprehensive Evolving the management capabilities of the open source version, it provides a unified solution for the management and control of the SphereEx Enterprise Data Service Platform from basic resources to plugin capabilities. • Visual Monitoring With its data visualization & management dashboard, SphereEx Enterprise Data Service Platform’s clusters, instances and hosts monitoring data are visible online and in real time.
  • 20. Kernel SQL Compatibility MySQL ✔ ✔ PostgreSQL ✔ ✔ Oracle ✔* ✔ SQL Server ✔* ✔ openGauss ✔ ✔ SQL 92/2003 ✔ ✔ SQL Extension SQL Hint ✔ ✔ SQL Dialect Convert ✔ ✔ DistSQL ✔ ✔ Distributed Transaction XA Transaction ✔ ✔ Base Transaction ✔ ✔ Global Transaction ❌ ✔ Query Optimization Native Query Optimizer ✔ ✔ Federated Query Optimizer ✔* ✔ Plugin Data Sharding Basic Sharding Strategy ✔ ✔ Customized Sharding Strategy ❌ ✔ Auto Sharding Management ❌ ✔ Pre-sharding ❌ ✔ Auto Shard Rebalance ❌ ✔ Distributed Object ✔* ✔ Data Security- Encryption Data Storage Encryption ✔ ✔ Open Source Encryptor ✔ ✔ Commercial Encryptor ❌ ✔ Simple Encryption Calculate ✔ ✔ Complex Encryption Calculate ❌ ✔ Data Security - Authority User Management ✔ ✔ Role Management ❌ ✔ Schema & Table Level Authority ❌ ✔ Column & Row Level Authority ❌ ✔ LDAP Integration ❌ ✔ 3rd Party System Integration ❌ ✔ Operation Security- Audit User Operation Audit (After) ❌ ✔ Rule Based Audit (Before) ❌ ✔ Elastic Scaling Basic Migration ✔ ✔ Distributed Migration ❌ ✔ Data Consistency Verification ✔ ✔ Advanced Request Management ❌ ✔ Data Isolation Test & Production data Isolation ✔ ✔ Test Data Event Source ✔ ❌ Open Source Version VS Commercial Version Module Primary Function Secondary Function Open Source Version Commercial Version Module Primary Function Secondary Function Open Source Version Commercial Version
  • 21. Open Source Version VS Commercial Version Management Metadata Management Consistent Change Online ❌ ✔ DDL Multiple Version ❌ ✔ High Availability HA for Compute Node ✔ ✔ HA for Storage Node ❌ ✔ Detected & Auth Health ✔ ✔ Monitoring & Diagnosis Machine Monitoring ❌ ✔ Component Monitoring ✔ ✔ Statement Monitoring(Slow Query) ✔ ✔ Advanced Metrics ❌ ✔ Monitoring Alarm Interfacing ❌ ✔ Link Tracking (APM) ✔ ✔ Delivery Production ShardingSphere-JDBC ✔ ✔ ShardingSphere-Proxy ✔ ✔ spex boot (Manage Command Line Tool) ✔* ✔ spex portal (Manage Console) ❌ ✔ Compatible Estimation Tool ❌ ✔ Support Community Support ✔ ✔ Commercial Support ❌ ✔ Support Delivery QA ✔ ✔ Architecture Consulting ❌ ✔ Training ❌ ✔ Remote Support ❌ ✔ On-site Support ❌ ✔ Install & Upgrade Service ❌ ✔ On-site Service ❌ ✔ Module Primary Function Secondary Function Open Source Version Commercial Version Module Primary Function Secondary Function Open Source Version Commercial Version
  • 22. What Is SphereEx Used For? Massive Data Storage & Access When relational database capabilities hit the bottleneck, Apache ShardingSphere can horizontally scale out the capabilities of storage database. Faced with relational and non-relational databases, open source and commercial databases, to lower the management and learning cost of heterogeneous database, and improve data computing efficiency, SphereEx provides heterogeneous data computing services to enterprises, slashing their development cost and increasing their efficiency. Heterogeneous Data Computing Due to large user base and frequent marketing activities, sectors such as retailing are facing slower core transaction system response. SphereEx provides scale-out capability, improving database efficiency to tackle high-concurrent scenarios. Online High Concurrency for Massive Data To prevent data leak of plaintext storage, which is resulted from both data theft and hackers and to ensure the security of production data in application scenarios, we provide a series of data masking rules to conceal or obscure sensitive information. Data Security
  • 23. Distributed Database Solution Distributed Database Solution: To solve existing solutions’ technological bottlenecks, reduce architecture replacement complexity and risk, a distributed database solution based on data sharding, distributed transactions, and elastic scale-out is offered. It has both the stability of standalone transaction databases, and the scalability of distributed databases. Functional Features • Multiple Database Dialects Parsing • Lightweight JDBC Driver • MySQL / PostgreSQL Protocol Support • Horizontal Partitioning / Vertical Partitioning • Built-in Algorithm / Customization • Distributed Transaction • Elastic Migration • Read/write splitting • Distributed Management
  • 24. Data Security Solution Data Security Solution: To prevent data leaks and ensure data security, product-based data encryption and data desensitization features provide enterprise users with a cross-platform data security solution for heterogeneous environments. Data Security • Multiple Database Dialects Parsing • MySQL / PostgreSQL Protocol Support • Data Encryption & Decryption/ Data Desensitization • Built-in Desensitization Algorithm/ Customization • Online Encryption/ Conversion of Plaintext Data to Encrypted State • Permission Control
  • 25. Production Database’s Online Stress Testing Solution Distributed database solution: The capability to parse and route SQL and command data, allows the creation of a shadow database (testing database) for production. Production and testing data could be routed online to the testing and production according to the configuration. Integrated with APM and gateway system, a complete solution from data marking, mark data tracking to data isolation for a distributed system can be provided. Online Data Stress Testing • Multiple Database Dialects Parsing • MySQL / PostgreSQL Protocol Support • Online Routing Strategy • Built-in Desensitization Algorithm / Customization • Metric / Tracing Observability • Linked Gateway Data Marking
  • 26. Single Node Performance Comparison 0 100,000 200,000 300,000 400,000 500,000 thread1 thread20 thread100 thread200 OLTP Point Select MySQL ShardingSphere-JDBC ShardingSphere-Proxy thread1 7,195 8,309 2,745 thread20 102,685 105,258 42,387 thread100 315,075 318,691 108,571 thread200 440,372 451,842 148,815 [1]APP:56Core/192G,Proxy:48Core/96G,DB:48Core/96G [2]1,000,000records MySQL features with a single node, storing 1,00 0,000 data in one table. ShardingSphere-JDBC and ShardingSphere-Proxy are associated with one physical database, with no splitting database or table strategies. One physical table has 1,000,000 data. Thread MySQL ShardingSphere-JDBC ShardingSphere-Proxy
  • 27. [1]APP:56Core/192G,Proxy:48Core/96G,DB:48Core/96G OLTP Point Select Thread 200 48,685 261,778 207,779 Multi-node Performance Comparison [2]40,000,000records MySQL features with a single node, storing 4,000,000 data in one table. ShardingSphere-JDBC and ShardingSphere-Proxy are associated with 5 physical databases– each database has 10 tables and 8,000,000 data. OLTP Read Write Thread 200 1,820 43,257 21,811 0 10,000 20,000 30,000 40,000 50,000 OLTP Read Write Thread 200 OLTP Point Select(QPS) MySQL ShardingSphere-JDBC ShardingSphere-Proxy 0 10,000 20,000 30,000 40,000 50,000 OLTP Read Write Thread 200 OLTP Read Write(TPS) MySQL ShardingSphere-JDBC ShardingSphere-Proxy QPS MySQL ShardingSphere-JDBC ShardingSphere-Proxy TPS MySQL ShardingSphere-JDBC ShardingSphere-Proxy
  • 28. ShardingSphere x JD Baitiao ShardingSphere Business Code DBRep Components MySQL 1 MySQL 2 MySQL 3 MySQL N Binlog Zookeeper Flume Kafka Baitiao System Database Cluster Distributed Computing HBase Cluster ES Cluster API Big Data Platform Operations System ShardingSphere’s Multiple Advantages • Simplified Upgrade Path • Save R&D Cost • Improve Architecture Extensibility [1] https://www.mysql.com/customers/view/?id=1461 ( 6,400 Shards / Over 100 Billion Records )
  • 31. SphereEx Honors ShardingSphere Community Honors • "Zhongguancun High-Tech Enterprise" Certification • "Outstanding Open Source Innovation Enterprise" Honor • “China’s Cutting-Edge Technology Pioneer Enterprise of 2021” Honor • “Outstanding Chinese Open Source Original Startups of 2021” awarded by OSC • Apache Top-Level Project • CNCF Landscape • Kunpeng Technology Certified Project • China Academy of Information and Communications Technology Trusted Open Source Community Project • China Open Source Cloud League "Outstanding Open Source Project " • Most Active Chinese-led Apache Project of the Year Certifications & Honors
  • 32. Product Capability Open Source Software A plugin oriented architecture allows both standalone and mixed deployments of functions such as data sharding and RW splitting. Innovation Global The Apache ShardingSphere community includes lots of users and contributors, and is successfully used in many application cases. Our Strengths Apache ShardingSphere is now widely used by over 170 businesses from all walks of life. Apache ShardingSphere technical architecture is flexible and mature enough to develop innovative solutions for tomorrow. Leveraging Cloud Native & distributed technologies, while being compatible with public Cloud, Apache ShardingSphere builds a flexible, scalable database cluster, providing users with convenient data application and connection services.
  • 33. Misson Mission Values Customer Centric Win-win cooperation Growth & Achievements Driven by Excellence Corporate Culture Build heterogeneous database standards and ecosystems, based on open source. Serve the world and become the new engine for data-based ecosystems.
  • 34. ICT Forum 2021 Apache Con 2021 AWS Summit QCon 2021 DTCC Marketing Activities & Media Mentions openGauss Summit 2021
  • 35. Thank You @ShardingSphere linkedin.com/company/sphere-ex @sphereextech @shardingsphere @ShardingSphere @sphereextech @SphereEx @linkedin.com/company/sphere-ex @ShardingSphere linkedin.com/company/sphere-ex @sphereextech @shardingsphere @ShardingSphere linkedin.com/company/sphere-ex @sphereextech @shardingsphere