0% found this document useful (0 votes)

160 views

Chapter 1 - Introduction To KAFKA: Objectives

This chapter introduces Kafka and provides an overview of key concepts. It discusses microservices architecture and how messaging fits into it. Kafka is an open-source messaging system that allows publishing and subscribing to streams of records. It is highly scalable, fault-tolerant, and very fast. The chapter covers Kafka's architecture, including topics that messages are published to, producers that publish messages, consumers that subscribe to topics, and brokers that manage the data.

Uploaded by

Suchismita Sahu

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

160 views

Chapter 1 - Introduction To KAFKA: Objectives

Uploaded by

Suchismita Sahu

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Chapter 1 - Introduction to KAFKA

Objectives
Key objectives of this chapter
 What is Microservices?
 Messaging Architectures
 What is Kafka?
 Need for Kafka
 Where is Kafka useful?
 Architecture
 Core concepts in Kafka
 Overview of ZooKeeper
 Cluster, Kafka Brokers, Producer, Consumer, Topic

1.1 Microservices
 Small, autonomous services which work well together.
 Being able to change individual components independently.
 Independent processes
 Communicate over APIs, rather than using databases directly
 High degree of autonomy
 Small, focused on doing one thing well
 A form of SOA. Typical SOA-based applications used to be monolithic.
 Microservices concept facilitates in adopting Agile Software Development.

1.2 Microservices vs Classic SOA

SOA Microservices
XML JSON
Complex to integrate Easy to integrate
Chapter 1 - Introduction to KAFKA

SOA Microservices
Heavy Lightweight
HTTP/SOAP HTTP/REST

1.3 Traditional Enterprise Application Architecture

 Classical architecture
 Typical 3 layers:
◊ client-side UI (Browser, HTML + JS)
◊ a database (RDBMS, NoSQL …)
◊ server-side application (Java, .NET, PHP, …)
 Any changes to the system involve building and deploying a new version
of the application. Changes are expensive.
 Scaling requires scaling of the entire application, rather than parts of it that
require greater resource.
 Long release cycles.

2
Chapter 1 - Introduction to KAFKA

1.4 Sample Microservices Architecture

 Applications naturally start as Monoliths, they scale and evolve to
Microservice architecture
 Applications are decomposed to components – smaller independent
service applications.
 Components are loosely coupled.

1.5 Microservices Architecture – Pros

 Multiple developers and teams can deliver relatively independently of each
other
 Can be written in different programming languages
 Can be managed by different teams
 Can use different data storage technologies
 Centralized management is minimal
 Independently deployable by fully automated deployment machinery
 Works well with Continuous Delivery
 Allows frequent releases while keeping the rest of the system available
and stable

3
Chapter 1 - Introduction to KAFKA

1.6 Messaging Architectures – What is Messaging?

 Application-to-application communication
 Supports asynchronous operations.
 Message:
 A message is a self-contained package of business data and network
routing headers.

1.7 Messaging Architectures – Steps to Messaging

 Messaging connects multiple applications in an exchange of data.
 Messaging uses an encapsulated asynchronous approach to exchange
data through a network.
 A traditional messaging system has two models of abstraction:
◊ Queue – a message channel where a single message is received
exactly by one consumer in a point-to-point message-queue pattern. If
there are no consumers available, the message is retained until a
consumer processes the message.
◊ Topic - a message feed that implements the publish-subscribe pattern
and broadcasts messages to consumers that subscribe to that topic.
 A single message is transmitted in five steps:
◊ Create
◊ Send

4
Chapter 1 - Introduction to KAFKA

◊ Deliver
◊ Receive
◊ Process

1.8 Messaging Architectures – Messaging Models

 1. Point to Point
 2. Publish and Subscribe

1.9 What is Kafka?

 In modern applications, real-time information is continuously generated by
applications (publishers/producers) and routed to other applications
(subscribers/consumers)
 Apache Kafka is an open source, distributed publish-subscribe messaging
system.
 Kafka allows integration of information of producers and consumers to
avoid any kind of rewriting of an application at either end.
 Kafka provides overcomes the challenges of real-time data usage for
consumption of data volumes that may grow in order of magnitude, larger
than the real data.

5
Chapter 1 - Introduction to KAFKA

 Kafka also supports parallel data loading in the Hadoop systems.

1.10 What is Kafka? (Contd.)

 Kafka is a unique distributed publish-subscribe messaging system written
in the Scala language with multi-language support and runs on the Java
Virtual Machine (JVM).
 Kafka relies on another service named Zookeeper – a distributed
coordination system – to function.
 Kafka has high-throughput and is built to scale-out in a distributed model
on multiple servers.
 Kafka persists messages on disk and can be used for batched
consumption as well as real-time applications.

1.11 Kafka Overview

 When used in the right way and for the right use case, Kafka has unique
attributes that make it a highly attractive option for data integration.

 Data Integration is the combination of technical and business processes

used to combine data from disparate sources into meaningful and valuable
information.
 A complete data integration solution encompasses discovery, cleansing,
monitoring, transforming and delivery of data from a variety of sources
 Messaging is a key data integration strategy employed in many distributed
environments such as the cloud.

6
Chapter 1 - Introduction to KAFKA

 Messaging supports asynchronous operations, enabling you to decouple a

process that consumes a service from the process that implements the
service.

1.12 Kafka Overview (Contd.)

1.13 Need for Kafka

 High Throughput
◊ Provides support for hundreds of thousands of messages with modest
hardware
 Scalability
◊ Highly scalable distributed systems with no downtime
 Replication

7
Chapter 1 - Introduction to KAFKA

◊ Messages can be replicated across a cluster, which provides support

for multiple subscribers and also in case of failure balances the
consumers
 Durability
◊ Provides support for persistence of messages to disk which can be
further used for batch consumption
 Stream Processing
◊ Kafka can be used along with real-time streaming applications like
spark, flink, and storm
 Data Loss
◊ Kafka with proper configurations can ensure zero data loss

1.14 Kafka Architecture

1.15 Core concepts in Kafka

 Topic

8
Chapter 1 - Introduction to KAFKA

◊ A category or feed to which messages are published

 Producer
◊ Publishes messages to Kafka Topic
 Consumer
◊ Subscribes and consumes messages from Kafka Topic
 Broker
◊ Handles hundreds of megabytes of reads and writes

1.16 Kafka Topic

 User defined category where the messages are published
 For each topic, a partition log is maintained
 Each partition basically contains an ordered, immutable sequences of
messages where each message assigned a sequential ID number called
offset
 Writes to a partition are generally sequential thereby reducing the number
of hard disk seeks
 Reading messages from partition can either be from the beginning and
also can rewind or skip to any point in a partition by supplying an offset
value

9
Chapter 1 - Introduction to KAFKA

1.17 Kafka Producer

 Application publishes messages to the topic in Kafka Cluster
 Can be of any kind like Front End, Streaming etc.
 While writing messages, it is also possible to attach a key to the message
 By attaching key the producers basically provide a guarantee that all
messages with the same key will arrive in the same partition
 Supports both async and sync modes
 Publishes as many messages as fast as the broker in a cluster can handle

10
Chapter 1 - Introduction to KAFKA

1.18 Kafka Consumer

 Application subscribes and consumes messages from brokers in Kafka
Cluster
 Can be of any kind like real-time consumers, NoSQL consumers etc.
 During consumption of messages from a topic a consumer group can be
configured with multiple consumers.
 Each consumer of consumer group reads messages from a unique subset
of partitions in each topic they subscribe to
 Messages with the same key arrive at the same consumer
 Supports both Queuing and Publish-Subscribe
 Consumers have to maintain the number of messages consumed

11
Chapter 1 - Introduction to KAFKA

1.19 Kafka Broker

 Kafka cluster basically comprised of one or more servers
 Each of the servers in the cluster is called a broker
 Handles hundreds of megabytes of writes from producers and reads from
consumers
 Retains all the published messages irrespective of whether it is consumed
or not
 If retention is configured for n days, then messages once published, it is
available for consumption for configured n days and thereafter it is
discarded

12
Chapter 1 - Introduction to KAFKA

1.20 Kafka Cluster

 A Kafka Cluster is generally fast, highly scalable messaging system
 A publish-subscribe messaging system
 Can be used effectively in place of ActiveMQ, RabbitMQ, Java Messaging
System (JMS), and Advanced Messaging Queuing Protocol (AMQP)
 Can be integrated with Hadoop Ecosystem
 Expanding of the cluster can be done with ease
 Effective for applications which involve large-scale message processing

13
Chapter 1 - Introduction to KAFKA

1.21 Why Kafka Cluster?

 Kafka is preferred in place of more traditional brokers like JMS and
AMQP?
◊ With Kafka, we can easily handle hundreds of thousands of messages
in a second, which makes Kafka a high throughput messaging system
◊ The cluster can be expanded with no downtime, making Kafka highly
scalable
◊ Messages are replicated, which provides reliability and durability
◊ Fault-tolerant

1.22 Sample Multi-Broker Cluster

14
Chapter 1 - Introduction to KAFKA

1.23 Overview of ZooKeeper

 An open source Apache project
 Provides a centralized infrastructure and services that enable
synchronization across a cluster
 Common objects used across the large cluster environments are
maintained in Zookeeper
 Objects such as configuration, hierarchical naming space etc. are
maintained in Zookeeper
 Zookeeper services are used by large scale applications to coordinate
distributed processing across large clusters

1.24 Kafka Cluster & ZooKeeper

1.25 Kafka Integration

 Databases: MongoDB/CosmosDB/CouchDB/Oracle
 Big Data: Hadoop, Spark

15
Chapter 1 - Introduction to KAFKA

 Logging: Logstash (ELK stack)

 IoT

1.26 Who Uses Kafka?

16
Chapter 1 - Introduction to KAFKA

1.27 Courses
 WA2708 – Kafka for Application Modernization
 WA2684 – Developing Microservices

1.28 Summary
 Kafka is a unique distributed publish-subscribe messaging system written
in the Scala language with multi-language support and runs on the Java
Virtual Machine (JVM).
 Kafka relies on another service named Zookeeper – a distributed
coordination system – to function.
 Kafka has high-throughput and is built to scale-out in a distributed model
on multiple servers.
 Kafka persists messages on disk and can be used for batched
consumption as well as real-time applications.

Solid Starts - First 100 Days
94% (18)
Solid Starts - First 100 Days
287 pages
Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
89% (45)
12 Week Program: Summer Body Starts Now
70 pages
The Hold Me Tight Workbook - Dr. Sue Johnson
100% (16)
The Hold Me Tight Workbook - Dr. Sue Johnson
187 pages
Read People Like A Book by Patrick King-Edited
62% (66)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Cheat Code To The Universe
94% (77)
Cheat Code To The Universe
34 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
94% (212)
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
212 pages
The Secret Language of Attraction
86% (107)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (541)
How To Develop and Write A Grant Proposal
17 pages
Workbook For The Body Keeps The Score
88% (52)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (28)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
75% (12)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
36 Questions To Fall in Love 1
97% (31)
36 Questions To Fall in Love 1
2 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
80% (35)
100 Questions To Ask Your Partner
2 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
ALCHEMIST
64% (14)
ALCHEMIST
4 pages
1001 Songs
71% (69)
1001 Songs
1,798 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
Quarkus 4
No ratings yet
Quarkus 4
10 pages
Sample Toastmaster Script
No ratings yet
Sample Toastmaster Script
3 pages
Q1.An Animal Feed Company Must Produce at Least 200 Kgs of A Mixture Consisting of Ingredients
No ratings yet
Q1.An Animal Feed Company Must Produce at Least 200 Kgs of A Mixture Consisting of Ingredients
13 pages
Documentation
No ratings yet
Documentation
105 pages
Mastering Kafka Streams: From Basics to Expert Proficiency
From Everand
Mastering Kafka Streams: From Basics to Expert Proficiency
William Smith
No ratings yet
Apache Kafka
No ratings yet
Apache Kafka
17 pages
Spring Transactions
No ratings yet
Spring Transactions
22 pages
Multithreading in Java (Unit 4)
100% (1)
Multithreading in Java (Unit 4)
19 pages
Hibernate Interview Question
No ratings yet
Hibernate Interview Question
119 pages
MicroService - Introduction
100% (1)
MicroService - Introduction
45 pages
Microservices CXF Karaf PDF
No ratings yet
Microservices CXF Karaf PDF
47 pages
JVM Architecture
No ratings yet
JVM Architecture
23 pages
Containerized Microservices Architecture: WWW - Ijecs.in
No ratings yet
Containerized Microservices Architecture: WWW - Ijecs.in
10 pages
Spring Cloud
No ratings yet
Spring Cloud
44 pages
Microservices with Spring Boot - Day5
No ratings yet
Microservices with Spring Boot - Day5
30 pages
Design Patterns
No ratings yet
Design Patterns
10 pages
Pattern Saga
No ratings yet
Pattern Saga
5 pages
Microservice Patterns
No ratings yet
Microservice Patterns
8 pages
SudheerKumar Ponnana Resume
No ratings yet
SudheerKumar Ponnana Resume
4 pages
Dzone Com Articles JVM Architecture Explained
No ratings yet
Dzone Com Articles JVM Architecture Explained
8 pages
Java and Caching: by Martin Nad
100% (1)
Java and Caching: by Martin Nad
20 pages
Exp Solid Questions 8
No ratings yet
Exp Solid Questions 8
29 pages
Microservices Architecture
No ratings yet
Microservices Architecture
60 pages
Microservices Raghu
0% (1)
Microservices Raghu
107 pages
Design Patterns
No ratings yet
Design Patterns
13 pages
Java Spring Boot Microservices Training Content - Manish Singh
No ratings yet
Java Spring Boot Microservices Training Content - Manish Singh
1 page
Java Spring Questions
No ratings yet
Java Spring Questions
9 pages
Intellij Idea Ide
No ratings yet
Intellij Idea Ide
12 pages
Dhruba Jyoti Saha - Java Architect
No ratings yet
Dhruba Jyoti Saha - Java Architect
15 pages
Microservices Notes and Practise
No ratings yet
Microservices Notes and Practise
176 pages
Microservices Patterns Dia03 DecoderWeek
No ratings yet
Microservices Patterns Dia03 DecoderWeek
53 pages
Node - Js - Interview Questions - Tutorialspoint
No ratings yet
Node - Js - Interview Questions - Tutorialspoint
12 pages
RESTful Web Service Composition With BPEL For REST
No ratings yet
RESTful Web Service Composition With BPEL For REST
28 pages
Micro Service
No ratings yet
Micro Service
38 pages
Kotlin Vs Java - Which Is The Best Option For Android App Development?
No ratings yet
Kotlin Vs Java - Which Is The Best Option For Android App Development?
16 pages
Getting Started Microservices PDF
No ratings yet
Getting Started Microservices PDF
6 pages
Kubernetes
No ratings yet
Kubernetes
61 pages
Introducing Spring Boot
No ratings yet
Introducing Spring Boot
18 pages
Camel Microservices With Spring Boot and Kubernetes
No ratings yet
Camel Microservices With Spring Boot and Kubernetes
67 pages
Spring and Hibernate
No ratings yet
Spring and Hibernate
4 pages
MICROSERVICES
No ratings yet
MICROSERVICES
16 pages
EJB Roseindia Notes
No ratings yet
EJB Roseindia Notes
126 pages
Docker Short Notes 1729397680
No ratings yet
Docker Short Notes 1729397680
17 pages
Spring Boot Annotations
No ratings yet
Spring Boot Annotations
12 pages
General Questions: 1. What Is Java?
100% (1)
General Questions: 1. What Is Java?
125 pages
Java Questionnaire
No ratings yet
Java Questionnaire
2 pages
AWS Certified Developer Associate Guide Your one stop solution to pass the AWS developer s certification 1st Edition Vipul Tankariya 2024 Scribd Download
100% (2)
AWS Certified Developer Associate Guide Your one stop solution to pass the AWS developer s certification 1st Edition Vipul Tankariya 2024 Scribd Download
65 pages
45 Tips To Improve Programming Information
100% (1)
45 Tips To Improve Programming Information
5 pages
Java 8 Stream Practice
No ratings yet
Java 8 Stream Practice
3 pages
Mastering Concurrency Programming Java 8 Ebook B012o8s89k PDF
No ratings yet
Mastering Concurrency Programming Java 8 Ebook B012o8s89k PDF
5 pages
Spring Cloud
No ratings yet
Spring Cloud
195 pages
Struts Interview
No ratings yet
Struts Interview
11 pages
SHIVA KUMARA - JavaArchitect
No ratings yet
SHIVA KUMARA - JavaArchitect
9 pages
New Features in JDK 8: Ivan St. Ivanov Dmitry Alexandrov Martin Toshev
No ratings yet
New Features in JDK 8: Ivan St. Ivanov Dmitry Alexandrov Martin Toshev
58 pages
Junit Interview Questions
No ratings yet
Junit Interview Questions
6 pages
Microservices Interview Questions
No ratings yet
Microservices Interview Questions
5 pages
Hibernate
No ratings yet
Hibernate
161 pages
12 Microservices Design Patterns 1696645895
No ratings yet
12 Microservices Design Patterns 1696645895
14 pages
Java servlet Second Edition
From Everand
Java servlet Second Edition
Gerardus Blokdyk
No ratings yet
Java SE 21 Developer Study Guide
From Everand
Java SE 21 Developer Study Guide
Esteban Herrera
5/5 (1)
Ultimate AWS Certified Cloud Practitioner’s Exam Guide: Master the Concepts, Services, Security, and Architectural Best Practices of AWS, EC2, S3, and RDS, and Crack AWS CLF-C02 Certification (English Edition)
From Everand
Ultimate AWS Certified Cloud Practitioner’s Exam Guide: Master the Concepts, Services, Security, and Architectural Best Practices of AWS, EC2, S3, and RDS, and Crack AWS CLF-C02 Certification (English Edition)
Gaurav H Kankaria
No ratings yet
Cloud Native Applications with Jakarta EE: Build, Design, and Deploy Cloud-Native Applications and Microservices with Jakarta EE (English Edition)
From Everand
Cloud Native Applications with Jakarta EE: Build, Design, and Deploy Cloud-Native Applications and Microservices with Jakarta EE (English Edition)
Kamalmeet Singh
No ratings yet
Apitestcases 200822060622
No ratings yet
Apitestcases 200822060622
1 page
02 SAFe Foundations (v4.5.0)
No ratings yet
02 SAFe Foundations (v4.5.0)
37 pages
Touronen Ville Pro Gradu 2019
No ratings yet
Touronen Ville Pro Gradu 2019
60 pages
Individual Case Safety Report (Icsr) Form
No ratings yet
Individual Case Safety Report (Icsr) Form
2 pages
Draft: I. Reaction Information
No ratings yet
Draft: I. Reaction Information
2 pages
MW Master Portfolio 2018
No ratings yet
MW Master Portfolio 2018
318 pages
Fujitsu Lifebook LH520 (Quanta FK1) Laptop Schematics - Quanta - fk1
No ratings yet
Fujitsu Lifebook LH520 (Quanta FK1) Laptop Schematics - Quanta - fk1
37 pages
Diode Clippers: 1. Positive Clipper and Negative Clipper
No ratings yet
Diode Clippers: 1. Positive Clipper and Negative Clipper
6 pages
Ghost in The Shell 2017 Bluray 1080P Truehd Atmos 7 1 Avc Remux-Framestor
No ratings yet
Ghost in The Shell 2017 Bluray 1080P Truehd Atmos 7 1 Avc Remux-Framestor
2 pages
Primer Número Revista BIM y Puentes
No ratings yet
Primer Número Revista BIM y Puentes
45 pages
HP Data Center Networking Solutions Brochure
No ratings yet
HP Data Center Networking Solutions Brochure
12 pages
Discover Arest Sample
No ratings yet
Discover Arest Sample
7 pages
Gmail - [GCash Help Center] I requested for my transaction history and did not receive it - (174249574)
No ratings yet
Gmail - [GCash Help Center] I requested for my transaction history and did not receive it - (174249574)
2 pages
MetaCAM Enterprise
No ratings yet
MetaCAM Enterprise
8 pages
Cradle Design - 00
No ratings yet
Cradle Design - 00
47 pages
d102277 Ac Stag LPG Qmax Basic
No ratings yet
d102277 Ac Stag LPG Qmax Basic
28 pages
Zoom+H6+Mini+Manual
No ratings yet
Zoom+H6+Mini+Manual
4 pages
RA CIVILENG LEGAZPI May2018 PDF
No ratings yet
RA CIVILENG LEGAZPI May2018 PDF
11 pages
gp-435g CHG 1
No ratings yet
gp-435g CHG 1
41 pages
Format Disk Structure Corrupted !
No ratings yet
Format Disk Structure Corrupted !
59 pages
Specs Fortnite
No ratings yet
Specs Fortnite
61 pages
Detailed Lesson Plan in Arts 3
No ratings yet
Detailed Lesson Plan in Arts 3
5 pages
InDesign CC21 Student Packet - P2 Conference Poster
No ratings yet
InDesign CC21 Student Packet - P2 Conference Poster
9 pages
Class 1st Math
No ratings yet
Class 1st Math
4 pages
NotOnly Studios
No ratings yet
NotOnly Studios
13 pages
Fibonacci Time Lines
No ratings yet
Fibonacci Time Lines
3 pages
DATAKOM D500 Ethernet Configuration
No ratings yet
DATAKOM D500 Ethernet Configuration
14 pages
Aa - Req - 000131 - Quality Requirements Third Party Design Verification
No ratings yet
Aa - Req - 000131 - Quality Requirements Third Party Design Verification
11 pages