This document describes building a REST job server for interactive Spark as a service using Livy. It discusses the history and challenges of running Spark jobs in Hue, introduces Livy as a Spark server, and details its local and YARN-cluster modes as well as session creation, execution flows, and interpreter support for Scala, Python, R and more. Magic commands are also described for JSON, table, and plot output.
Building a REST Job Server for Interactive Spark as a Service
1. BUILDING A REST JOB SERVER FOR INTERACTIVE SPARK AS A SERVICE
Romain Rigaux - Cloudera
Erick Tryzelaar - Cloudera
7. HISTORY
V1: OOZIE
THE GOOD
• It works
• Code snippet
THE BAD
• Submit through Oozie
• Shell action
• Very slow
• Batch
[Diagram labels: workflow.xml, snippet.py, stdout]
8. HISTORY
V2: SPARK IGNITER
THE GOOD
• It works better
THE BAD
• Compiler Jar
• Batch only, no shell
• No Python, R
• Security
• Single point of failure
[Diagram labels: Compile, Implement, Upload, json output, Batch, Scala, jar, Ooyala]