PhonePe Case Study - Detailed
(>) Open-source, HDP binaries
acceldata.io 2
PhonePe is a Walmart subsidiary that provides more than 350 million consumers across India
with the ability to send and receive money, make payments at more than ten million physical
and online retail stores, use ATMs and invest in mutual funds and other securities.
PhonePe’s Challenge:
PhonePe uses a variety of open source data technologies, such as
Apache HBase, HDFS, Kafka, Spark, and Spark Streaming, to run its
high-volume, real-time payments and cash transfer platform. With
hundreds of millions of customers and millions of merchants on
the system, PhonePe’s Data Warehouse cluster must be highly
performant, reliable and transparent, which includes the ability to
accurately report on system and business performance to internal
and external stakeholders 24/7.

Scaling to Meet The Needs of Growing Data Infrastructure
As PhonePe’s business grew explosively in 2018-19, the company
embarked on a massive data infrastructure expansion in terms of
both scale and new technologies. The company needed to increase
the size of its Hadoop infrastructure to support tens of millions of
new consumers and millions of new merchants who were rapidly
adopting the service, all while adding Hive LLAP, Spark 3.x and Druid
to the platform, technologies that were needed to support new
products and business requirements.

Even in the early stages of this infrastructure expansion, the
technology team experienced tremendous pressure on system
performance and reliability. Key engineers spent the majority of their
time firefighting problems and searching for causes behind data
application issues and infrastructure failures instead of focusing on
the increased scale and new capabilities the business required.

PhonePe’s Chief Reliability Officer, Burzin Engineer, quickly realized
that his team needed tools to improve visibility into every aspect of
the company’s data operations. Without more advanced tools that
matched the sophistication of its core open source technologies,
PhonePe’s critical data initiative would fail, jeopardizing the
company’s growth prospects and business success.
The Acceldata Solution:
After gaining an understanding of PhonePe’s objectives and challenges
with Burzin Engineer and the PhonePe team, Acceldata demonstrated
how its Pulse data observability tool could provide real-time monitoring
of HBase, Hive, and Spark data pipelines.
Results:
In the first 18 months of using Acceldata Pulse, PhonePe has
been able to realize the following benefits, among others:
(>) Scale data infrastructure rapidly from 70 to more than 1,500
Hadoop nodes; more than 2000% growth
(>) Deliver 99.97% availability across its Hadoop infrastructure
(>) Eliminate day-to-day engineering involvement and firefighting
on outages and performance degradation issues
(>) Support multi-cluster data and workload management with
uniform configurations
(>) Upgrade systems and migrate to new applications and nodes
with no performance degradation

Multi-Dimensional Data Observability
Enterprises are overwhelmed with the challenges
of observing, operating, and optimizing large-scale
data systems.

Multi-dimensional data observability can simplify
modern data pipelines by monitoring and correlating
data workload events across application, data, and
infrastructure layers to resolve issues that break
production analytics and AI workloads.

The right data observability tools can significantly
improve enterprise data system performance, cost,
and agility.