Oracle Big Data Appliance and Solutions
Jean-Pierre Dijcks
Hadoop World – Nov 8th, 2012
The following is intended to outline our general product
direction. It is intended for information purposes only, and
may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality,
and should not be relied upon in making purchasing
decisions.
The development, release, and timing of any features or
functionality described for Oracle’s products remain at the
sole discretion of Oracle.
Case: On-line Ads and Content

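[Diagram: data flow for on-line ad and content serving]
• Low latency: look up the user profile in NoSQL DB; add the user if not present
• Real-time: the expert system determines the best ad to place on the page for this user
• HDFS holds the web logs and the actual ads served
• Batch: high-scale data reductions feed BI and analytics, and billing
• Predictions on browsing are input into the expert system
• NoSQL DB stores the profiles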
Agenda


• Big Data Technology
• Oracle Big Data Appliance
• Big Data Applications
• Summary
• Q&A
Big Data Technology
Big Data: Infrastructure Requirements


Acquire
• Low, predictable Latency
• High Transaction Volume
• Flexible Data Structures

Organize
• High Throughput
• In-Place Preparation
• All Data Sources/Structures

Analyze
• Deep Analytics
• Agile Development
• Massive Scalability
• Real Time Results
Divided Solution Spectrum
[Diagram: solutions plotted by data variety (vertical axis) against the Acquire, Organize, and Analyze phases (horizontal axis)]

Dynamic schema (NoSQL: flexible, specialized, developer-centric)
• Distributed File Systems
• Transaction (Key-Value) Stores
• MapReduce Solutions

Schema (SQL: trusted, secure, administered)
• DBMS (OLTP)
• ETL
• DBMS (DW)
• Advanced Analytics
Oracle Integrated Software Solution Stack

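[Diagram: Oracle components plotted by data variety (dynamic schema vs. schema) against the Acquire, Organize, and Analyze phases]

Acquire
• Dynamic schema: HDFS, Oracle NoSQL DB
• Schema: Oracle Database (OLTP)

Organize
• Hadoop
• Oracle Loader for Hadoop
• Oracle Data Integrator

Analyze
• In-DB Analytics: "R", Mining, Text, Graph, Spatial
• Oracle Database (DW)
• Oracle BI EE
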
Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
Oracle Engineered Solutions
[Diagram: the same software stack grouped into engineered systems, plotted by data variety against the Acquire, Organize, and Analyze phases]

Big Data Appliance
• Hadoop
• NoSQL Database
• Oracle Loader for Hadoop
• Oracle Data Integrator

Oracle Exadata
• OLTP & DW
• Data Mining & Oracle R
• Semantics
• Spatial

Exalytics
• Speed of Thought Analytics

Big Data Appliance
Batch Usage Model



[Diagram: Oracle Big Data Appliance, Oracle Exadata, and Oracle Exalytics shown side by side and linked by InfiniBand, above the Acquire, Organize, and Analyze phases]
Why build a Hadoop Appliance?




                                             • Time to Build?
                                             • Required Expertise?
                                             • Cost and Difficulty Maintaining?

Oracle Big Data Appliance Hardware


• 18 Sun X4270 M2 Servers
  – 48 GB memory per node = 864 GB memory
  – 12 Intel cores per node = 216 cores
  – 24 TB storage per node = 432 TB storage
• 40 Gb/s InfiniBand
• 10 Gb/s Ethernet
Big Data Appliance
  Cluster of industry standard servers for Hadoop and NoSQL Database
  • Focus on Scalability and Availability at low cost


InfiniBand Network
• Redundant 40 Gb/s switches
• IB connectivity to Exadata

10GigE Network
• 8 10GigE ports
• Datacenter connectivity

Compute and Storage
• 18 high-performance low-cost servers acting as Hadoop nodes
• 24 TB capacity per node
• 2 6-core CPUs per node
• Hadoop triple replication
• NoSQL Database triple replication
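
The per-rack totals quoted on the hardware slide can be rechecked with a few lines of arithmetic. The sketch below uses only the per-node figures from the slides. The replicated-capacity estimate is an assumption added for illustration: it simply divides raw storage by the replication factor and ignores space used by the operating system, temporary data, and NoSQL Database.

```python
# Per-node figures taken from the hardware slide.
NODES = 18
MEMORY_GB_PER_NODE = 48
CORES_PER_NODE = 12           # 2 six-core CPUs per node
STORAGE_TB_PER_NODE = 24
REPLICATION_FACTOR = 3        # Hadoop triple replication


def rack_totals(nodes: int = NODES) -> dict:
    """Return the aggregate raw resources of one rack."""
    return {
        "memory_gb": nodes * MEMORY_GB_PER_NODE,
        "cores": nodes * CORES_PER_NODE,
        "raw_storage_tb": nodes * STORAGE_TB_PER_NODE,
    }


def replicated_capacity_tb(raw_tb: float, replication: int = REPLICATION_FACTOR) -> float:
    """Rough upper bound on unique data when every block is stored `replication` times."""
    return raw_tb / replication


totals = rack_totals()
print(totals)
print(replicated_capacity_tb(totals["raw_storage_tb"]))
```

Under these assumptions a full rack holds at most about one third of its raw storage as unique data.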
Scale Out to Infinity




Scale out by connecting racks to each other using InfiniBand
• Expand up to eight racks without additional switches
• Scale beyond eight racks by adding an additional switch
Oracle Big Data Appliance Software

• Oracle Linux 5.6
• Java HotSpot VM
• Apache Hadoop Distribution v0.20.x
• R Distribution
• Oracle NoSQL Database Enterprise Edition
• Oracle Data Integrator Application Adapter for Hadoop
• Oracle Loader for Hadoop
Why Open-Source Apache Hadoop?


• Fast evolution in critical features
  • Built by the Hadoop experts in the community
  • Practical instead of esoteric
  • Focus on what is needed for large clusters
• Proven at very large scale
  • In production at all the large consumers of Hadoop
  • Extremely stable in those environments
  • Well-understood by practitioners
Software Layout
• Node 1:
  • M: Name Node, Balancer & HBase Master
  • S: HDFS Data Node, NoSQL DB Storage Node
• Node 2:
  • M: Secondary Name Node, Management, Zookeeper, MySQL Slave
  • S: HDFS Data Node, NoSQL DB Storage Node
• Node 3:
  • M: JobTracker, MySQL Master, ODI Agent, Hive Server
  • S: HDFS Data Node, NoSQL DB Storage Node
• Node 4 – 18:
  • S: HDFS Data Nodes, Task Tracker, HBase Region Server, NoSQL DB Storage Nodes
  • Your MapReduce runs here!
Big Data Appliance
  Big Data for the Enterprise


• Optimized and Complete
  • Everything you need to store and integrate
    your lower information density data
• Integrated with Oracle Exadata
  • Analyze all your data
• Easy to Deploy
  • Risk Free, Quick Installation and Setup
• Single Vendor Support
  • Full Oracle support for the entire system and
    software set
Oracle NoSQL Database
Key-Value Store Workloads

• Large dynamic schema based data repositories

• Data capture
  • Web applications
  • Online retail
  • Sensor/statistics/network capture/Mobile Devices
• Data services
  •   Scalable authentication
  •   Real-time communication (MMS, SMS, routing)
  •   Personalization / Localization
  •   Social Networks
Oracle NoSQL DB
  A distributed, scalable key-value database

• Simple Data Model
   • Key-value pair with major+sub-key paradigm
   • Read/insert/update/delete operations
• Scalability
   • Dynamic data partitioning and distribution
   • Optimized data access via intelligent driver
• High availability
   • One or more replicas
   • Disaster recovery through location of replicas
   • Resilient to partition master failures
   • No single point of failure
• Transparent load balancing
   • Reads from master or replicas
   • Driver is network topology & latency aware

[Diagram: applications, each with a NoSQL DB driver, connecting to storage nodes in Data Center A and Data Center B]
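
To make the major+sub-key paradigm concrete, here is a minimal in-memory sketch. It is not the Oracle NoSQL Database API: the class `MajorMinorStore` and its method names are hypothetical, and keys are modeled as plain tuples of strings. It only illustrates that records sharing a major key are kept together while the sub-key selects one record within that group.

```python
from collections import defaultdict


class MajorMinorStore:
    """Toy key-value store: records that share a major key are grouped together."""

    def __init__(self):
        # major key -> {sub-key -> value}
        self._data = defaultdict(dict)

    def put(self, major, minor, value):
        """Insert a new record or update an existing one."""
        self._data[major][minor] = value

    def get(self, major, minor):
        """Return the stored value, or None if the record does not exist."""
        return self._data.get(major, {}).get(minor)

    def delete(self, major, minor):
        """Remove one record; return True if it existed."""
        records = self._data.get(major, {})
        if minor in records:
            del records[minor]
            return True
        return False

    def multi_get(self, major):
        """Return every sub-key record stored under one major key."""
        return dict(self._data.get(major, {}))


store = MajorMinorStore()
store.put(("user", "42"), ("profile",), "name=Ann")
store.put(("user", "42"), ("last_ad",), "ad-17")
print(store.multi_get(("user", "42")))
```

Grouping by major key is what lets a real store place all records for one entity on the same partition, as the next slide describes.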
Resolving a Request
Client request: Operation + Key[M,m] + Value + Transaction Policy

• Hash Major Key to determine Partition id
• Use Partition Map to map Partition id to a Rep Group
• Use State Table to determine eligible Storage Node(s) within Rep Group
• Use Load Balancer to select best eligible Rep Node
• Contact Rep Node directly

Returned to the client:
• Operation result
• New Partition Map
• RepNodeStorageTable information
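
The resolution steps can be sketched in a few lines. This is an illustration under stated assumptions, not the driver's actual implementation: the slides do not say which hash function is used (MD5 is a stand-in here), the partition map and state table are modeled as plain dictionaries, and the load balancer is reduced to "pick the lowest-latency eligible node". All function and field names are hypothetical.

```python
import hashlib


def partition_id(major_key: str, n_partitions: int) -> int:
    """Hash the major key to a partition id (MD5 is an assumed stand-in)."""
    digest = hashlib.md5(major_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % n_partitions


def resolve(major_key, n_partitions, partition_map, state_table, need_master):
    """Return the name of the rep node that should receive the request."""
    pid = partition_id(major_key, n_partitions)       # hash major key
    group = partition_map[pid]                        # partition id -> rep group
    eligible = [
        node for node in state_table[group]           # rep group -> node states
        if node["up"] and (node["master"] or not need_master)
    ]
    if not eligible:
        raise RuntimeError(f"no eligible rep node in {group}")
    # Simplified load balancer: choose the lowest-latency eligible node.
    return min(eligible, key=lambda node: node["latency_ms"])["name"]


partition_map = {0: "rg1"}
state_table = {
    "rg1": [
        {"name": "rn-a", "master": True, "up": True, "latency_ms": 5.0},
        {"name": "rn-b", "master": False, "up": True, "latency_ms": 1.0},
        {"name": "rn-c", "master": False, "up": False, "latency_ms": 0.5},
    ]
}
print(resolve("user/42", 1, partition_map, state_table, need_master=False))
print(resolve("user/42", 1, partition_map, state_table, need_master=True))
```

Because the client holds the partition map and state table itself, it can contact the chosen node directly without an intermediate router.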
ACID Transactions
Transaction Policy: Write Durability
• Configurable per-operation, application can set defaults
• Write Transaction Durability consists of both
    a) Sync policy (on Master and Replica)
      • Sync – force to disk
      • Write No Sync – force to OS buffer
      • No Sync – write to local log buffer, flush when convenient
    b) Replica Acknowledgement Policy
      • All
      • Simple Majority
      • None

Transaction Policy: Read Consistency
• Configurable per-operation, application can set defaults
• Read Consistency specified as Absolute, Time-based, Version or None
  • Absolute → Read from the master
  • Time-based → Read from any replica that is within <time-interval> of master or better
  • Version → Read from any replica that is current with <transaction-token> or higher
  • None → Read from any replica
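
The two policy families can be illustrated with a small sketch. It is not the product API: the function names, the string policy names, and the node fields (`is_master`, `lag_s`, `version`) are hypothetical. One assumption is worth calling out: for Simple Majority the sketch counts the master as part of the replication group, so the number of replica acknowledgements is the group majority minus one.

```python
def replica_acks_required(policy: str, n_replicas: int) -> int:
    """Replica acknowledgements to wait for before reporting a write as committed."""
    if policy == "ALL":
        return n_replicas
    if policy == "SIMPLE_MAJORITY":
        group_size = n_replicas + 1          # assumption: the master counts toward the majority
        majority = group_size // 2 + 1
        return majority - 1                  # the master already holds the write
    if policy == "NONE":
        return 0
    raise ValueError(f"unknown acknowledgement policy: {policy}")


def can_serve_read(policy: str, node: dict, max_lag_s=None, min_version=None) -> bool:
    """Decide whether a node may answer a read under the given consistency policy."""
    if policy == "ABSOLUTE":
        return node["is_master"]
    if policy == "TIME":
        return node["is_master"] or node["lag_s"] <= max_lag_s
    if policy == "VERSION":
        return node["version"] >= min_version
    if policy == "NONE":
        return True
    raise ValueError(f"unknown consistency policy: {policy}")


replica = {"is_master": False, "lag_s": 2.0, "version": 107}
print(replica_acks_required("SIMPLE_MAJORITY", n_replicas=2))
print(can_serve_read("TIME", replica, max_lag_s=5.0))
print(can_serve_read("VERSION", replica, min_version=110))
```

Stricter settings trade latency for stronger guarantees, which is why the slides make both policies configurable per operation.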
Oracle NoSQL DB Differentiation

• Commercial Grade Software and Support
  • General-purpose
  • Reliable – Based on proven Berkeley DB JE HA
  • Easy to install and configure
• Scalable throughput, bounded latency
• Simple Programming and Operational Model
  • Simple Major + Sub key and Value data structure
  • ACID transactions
  • Configurable consistency & durability
• Easy Management
  • Web-based console, API accessible
  • Manages and Monitors: Topology; Load; Performance; Events; Alerts
• Completes Oracle large scale data storage offerings
Try NoSQL Database on OTN




 Oracle NoSQL Database:
 • Community Edition is available as a software
   only distribution
 • Enterprise Edition is available as a separately
   licensable product or as part of Big Data Appliance
Oracle Loader for Hadoop
Oracle Loader for Hadoop Features

     • Load data into a partitioned or non-partitioned table
           – Single level, composite or interval partitioned table
           – Support for scalar datatypes of Oracle Database
           – Load into Oracle Database 11g Release 2


     • Runs as a Hadoop job and supports standard options

     • Pre-partitions and sorts data on Hadoop

     • Online and offline load modes




Oracle Loader for Hadoop

[Diagram: two inputs (INPUT 1 and INPUT 2) flow through chained MapReduce jobs, each with MAP, SHUFFLE/SORT, and REDUCE stages; the last stage is labeled ORACLE LOADER FOR HADOOP]
Oracle Loader for Hadoop: Online Option

[Diagram: Oracle Loader for Hadoop running as a MapReduce job with MAP, SHUFFLE/SORT, and REDUCE stages]

• Read target table metadata from the database
• Perform partitioning, sorting, and data conversion
• Connect to the database from reducer nodes, load into database partitions in parallel
Oracle Loader for Hadoop: Offline Option

[Diagram: Oracle Loader for Hadoop running as a MapReduce job with MAP, SHUFFLE/SORT, and REDUCE stages]

• Read target table metadata from the database
• Perform partitioning, sorting, and data conversion
• Write from reducer nodes to Oracle Data Pump files
• Import into the database in parallel using external table mechanism
Oracle Loader for Hadoop Advantages


     • Offload database server processing to Hadoop:
           – Convert input data to final database format
           – Compute table partition for row
           – Sort rows by primary key within a table partition
     • Generate binary Data Pump files
     • Balance partition groups across reducers
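
The work offloaded to Hadoop can be pictured with a small sketch: group rows by target table partition, sort each group by primary key, and spread partitions across reducers so that no reducer is overloaded. The slides do not describe how the loader balances partitions, so the greedy "largest partition to the least-loaded reducer" heuristic below is an assumed stand-in, and all function names are hypothetical.

```python
import heapq
from collections import defaultdict


def group_and_sort(rows, partition_of, primary_key):
    """Group rows by table partition, then sort each group by primary key."""
    parts = defaultdict(list)
    for row in rows:
        parts[partition_of(row)].append(row)
    return {p: sorted(rs, key=primary_key) for p, rs in parts.items()}


def balance(partition_sizes, n_reducers):
    """Greedy balancing: give the largest remaining partition to the least-loaded reducer."""
    heap = [(0, r) for r in range(n_reducers)]        # (current load, reducer id)
    heapq.heapify(heap)
    assignment = {r: [] for r in range(n_reducers)}
    ordered = sorted(partition_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for part, size in ordered:
        load, reducer = heapq.heappop(heap)
        assignment[reducer].append(part)
        heapq.heappush(heap, (load + size, reducer))
    return assignment


rows = [
    {"id": 3, "region": "EU"},
    {"id": 1, "region": "US"},
    {"id": 2, "region": "EU"},
]
print(group_and_sort(rows, lambda r: r["region"], lambda r: r["id"]))
print(balance({"p1": 8, "p2": 5, "p3": 4, "p4": 3}, n_reducers=2))
```

Doing the partitioning and sorting on the Hadoop side means the database receives rows that are already in the order and grouping it needs.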




Input and Output Formats

Input Formats
• Delimited text
• Hive tables
  – Managed and external tables
  – Native and non-native tables
• Write your own input format

Output Formats
Online Mode
• Load directly from Hadoop nodes to Oracle database
  – JDBC
  – Parallel direct path
Offline Mode
• Data Pump format
  – Create binary files for external tables
  – Import data into the database from the external table with a SQL statement
• CSV, delimited text
  – Load through SQL*Loader or external table mechanism
Selecting an Output Option for a Use Case

Oracle Loader for Hadoop Output Option        Use Case Characteristics
Online load with JDBC                         The simplest use case for non-partitioned tables
Online load with Direct Path                  Fast online load for partitioned tables
Offline load with Data Pump files             Fastest load method for external tables
Direct HDFS (on Oracle Big Data Appliance)    Leave data on HDFS; parallel access from database; import into database when needed
Invoking Oracle Loader for Hadoop


     • Command line
           $ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader \
                      -libjars <library jar files> \
                      -D <configuration properties>

     • Example
           $ $HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader \
               -libjars avro-1.4.1.jar,commons-math-2.2.jar \
               -conf connection.xml \
               -D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat \
               -D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat




Automate Usage of Oracle Loader for Hadoop
     Oracle Data Integrator (ODI)

     • ODI has knowledge modules to
           – Generate data transformation code to run on Hive/Hadoop
           – Invoke Oracle Loader for Hadoop


     • Use the drag-and-drop interface in ODI to
           – Include invocation of Oracle Loader for Hadoop in any ODI
             packaged flow




Summary
Big Data Appliance
  Big Data for the Enterprise


• Optimized and Complete
  • Everything you need to store and integrate your lower
    information density data
• Integrated with Oracle Exadata
  • Analyze all your data
• Easy to Deploy
  • Risk Free, Quick Installation and Setup
• Single Vendor Support
  • Full Oracle support for the entire system and software
    set
Big Data Appliance and Exadata
Big Data for the Enterprise


• NoSQL DB ✓
• HDFS ✓
• Hadoop ✓
• RDBMS ✓
Questions

More Related Content

What's hot

Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
divjeev
 
Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and Beyond
Teradata Aster
 
Enterprise Data Workflows with Cascading
Enterprise Data Workflows with CascadingEnterprise Data Workflows with Cascading
Enterprise Data Workflows with Cascading
Paco Nathan
 
Golam Md. Enamul Haque
Golam Md. Enamul HaqueGolam Md. Enamul Haque
Golam Md. Enamul Haque
memasum13
 
#dbhouseparty - Using Oracle’s Converged “AI” Database to Pick a Good but Ine...
#dbhouseparty - Using Oracle’s Converged “AI” Database to Pick a Good but Ine...#dbhouseparty - Using Oracle’s Converged “AI” Database to Pick a Good but Ine...
#dbhouseparty - Using Oracle’s Converged “AI” Database to Pick a Good but Ine...
Tammy Bednar
 
Ibm pure data system for analytics n200x
Ibm pure data system for analytics n200xIbm pure data system for analytics n200x
Ibm pure data system for analytics n200x
IBM Sverige
 
Cascading: Enterprise Data Workflows based on Functional Programming
Cascading: Enterprise Data Workflows based on Functional ProgrammingCascading: Enterprise Data Workflows based on Functional Programming
Cascading: Enterprise Data Workflows based on Functional Programming
Paco Nathan
 
HBaseCon 2012 | Real-time Analytics with HBase - Sematext
HBaseCon 2012 | Real-time Analytics with HBase - SematextHBaseCon 2012 | Real-time Analytics with HBase - Sematext
HBaseCon 2012 | Real-time Analytics with HBase - Sematext
Cloudera, Inc.
 
Cool features 7.4
Cool features 7.4Cool features 7.4
Cool features 7.4
Mahesh Someshetty
 
Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001
Abhishek Satyam
 
Microsoft SQL Azure - Cloud Based Database Datasheet
Microsoft SQL Azure - Cloud Based Database DatasheetMicrosoft SQL Azure - Cloud Based Database Datasheet
Microsoft SQL Azure - Cloud Based Database Datasheet
Microsoft Private Cloud
 
Greenplum Database Overview
Greenplum Database Overview Greenplum Database Overview
Greenplum Database Overview
EMC
 
Oracle Data Warehouse
Oracle Data WarehouseOracle Data Warehouse
Oracle Data Warehouse
DataminingTools Inc
 
Oracle no sql database bigdata
Oracle no sql database   bigdataOracle no sql database   bigdata
Oracle no sql database bigdata
João Gabriel Lima
 
The IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse ApplianceThe IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse Appliance
IBM Sverige
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
DataminingTools Inc
 
BACK TO THE FUTURE: DATAFLOW FINALLY COMES OF AGE from Structure 2012
BACK TO THE FUTURE: DATAFLOW FINALLY COMES OF AGE from Structure 2012BACK TO THE FUTURE: DATAFLOW FINALLY COMES OF AGE from Structure 2012
BACK TO THE FUTURE: DATAFLOW FINALLY COMES OF AGE from Structure 2012
Gigaom
 
NetApp’s Open Solution for Hadoop
NetApp’s Open Solution for HadoopNetApp’s Open Solution for Hadoop
NetApp’s Open Solution for Hadoop
NetApp
 
Hadoop as data refinery
Hadoop as data refineryHadoop as data refinery
Hadoop as data refinery
Steve Loughran
 
Hadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranHadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve Loughran
JAX London
 

Big dataappliance hadoopworld_final

  • 1. Oracle Big Data Appliance and Solutions. Jean-Pierre Dijcks. Hadoop World, Nov 8th, 2012
  • 2. The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remain at the sole discretion of Oracle.
  • 3. Case: On-line Ads and Content [Diagram. Low latency / real-time path: Lookup user profile; Add user if not present; NoSQL DB; Input into Expert System; Determine best ad to place on page for this user; Actual ads served. Batch path: Web logs; HDFS; High scale data reductions; Predictions on browsing; BI and Analytics; Billing; NoSQL DB Profiles.]
  • 4. Agenda • Big Data Technology • Oracle Big Data Appliance • Big Data Applications • Summary • Q&A
  • 5. Big Data Technology
  • 6. Big Data: Infrastructure Requirements. Acquire: • Low, predictable Latency • High Transaction Volume • Flexible Data Structures. Organize: • High Throughput • In-Place Preparation • All Data Sources/Structures. Analyze: • Deep Analytics • Agile Development • Massive Scalability • Real Time Results
  • 7. Divided Solution Spectrum [Diagram. Vertical axis: Data Variety, from Schema to Dynamic Schema. NoSQL (Flexible, Specialized, Developer Centric): Distributed File Systems; Transaction (Key-Value) Stores; MapReduce Solutions. SQL (Trusted, Secure, Administered): DBMS (OLTP); ETL; DBMS (DW); Advanced Analytics. Stages: Acquire, Organize, Analyze.]
  • 8. Oracle Integrated Software Solution Stack [Diagram. Vertical axis: Data Variety, from Schema to Dynamic Schema. Components: HDFS; Oracle NoSQL DB; Hadoop; Oracle Loader for Hadoop; Oracle Data Integrator; In-DB Analytics ("R", Mining, Text, Graph, Spatial); Oracle Database (OLTP); Oracle Database (DW); Oracle BI EE. Stages: Acquire, Organize, Analyze.] Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 9. Oracle Engineered Solutions. Big Data Appliance: • Hadoop • NoSQL Database • Oracle Loader for Hadoop • Oracle Data Integrator. Exalytics: • Speed of Thought Analytics. Exadata: • OLTP & DW • Data Mining & Oracle R • Semantics • Spatial. [Diagram. Vertical axis: Data Variety, from Schema to Dynamic Schema. Components: HDFS; Oracle NoSQL DB; Hadoop; Oracle Loader for Hadoop; Oracle Data Integrator; In-DB Analytics ("R", Mining, Text, Graph, Spatial); Oracle Database (OLTP); Oracle Database (DW); Oracle BI EE. Stages: Acquire, Organize, Analyze.]
  • 10. Big Data Appliance Batch Usage Model [Diagram: Oracle Big Data Appliance, Oracle Exadata, and Oracle Exalytics, linked by InfiniBand, across the stages Acquire, Organize, Analyze.]
  • 11. Why build a Hadoop Appliance? • Time to Build? • Required Expertise? • Cost and Difficulty Maintaining?
  • 12. Oracle Big Data Appliance Hardware • 18 Sun X4270 M2 Servers – 48 GB memory per node = 864 GB memory – 12 Intel cores per node = 216 cores – 24 TB storage per node = 432 TB storage • 40 Gb/sec InfiniBand • 10 Gb/sec Ethernet
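The per-rack totals on slide 12 follow directly from the per-node figures. A minimal Python sketch of that arithmetic is below. The last line of the result is an illustration only: it assumes the triple replication mentioned on slide 13 applies uniformly to all raw storage, which the slides do not state, so the "replicated capacity" figure is an estimate rather than an Oracle specification.

```python
# Per-node figures are taken from slide 12.
NODES = 18
MEMORY_GB_PER_NODE = 48
CORES_PER_NODE = 12
STORAGE_TB_PER_NODE = 24

# Assumption for illustration: every block is stored three times
# ("triple replication" on slide 13) and nothing else uses the disks.
REPLICATION_FACTOR = 3


def rack_totals(nodes=NODES):
    """Return aggregate resources for one rack of `nodes` servers."""
    raw_storage_tb = nodes * STORAGE_TB_PER_NODE
    return {
        "memory_gb": nodes * MEMORY_GB_PER_NODE,
        "cores": nodes * CORES_PER_NODE,
        "raw_storage_tb": raw_storage_tb,
        "replicated_capacity_tb": raw_storage_tb / REPLICATION_FACTOR,
    }


totals = rack_totals()
print(totals)
```

The first three values reproduce the slide (864 GB, 216 cores, 432 TB); the fourth shows why raw capacity and usable capacity differ once replication is taken into account.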
  • 13. Big Data Appliance: Cluster of industry standard servers for Hadoop and NoSQL Database • Focus on Scalability and Availability at low cost. InfiniBand Network: • Redundant 40Gb/s switches • IB connectivity to Exadata. 10GigE Network: • 8 10GigE ports • Datacenter connectivity. Compute and Storage: • 18 High-performance low-cost servers acting as Hadoop nodes • 24 TB Capacity per node • 2 6-core CPUs per node • Hadoop triple replication • NoSQL Database triple replication
  • 14. Scale Out to Infinity: Scale out by connecting racks to each other using InfiniBand • Expand up to eight racks without additional switches • Scale beyond eight racks by adding an additional switch
  • 15. Oracle Big Data Appliance Software • Oracle Linux 5.6 • Java HotSpot VM • Apache Hadoop Distribution v0.20.x • R Distribution • Oracle NoSQL Database Enterprise Edition • Oracle Data Integrator Application Adapter for Hadoop • Oracle Loader for Hadoop
  • 16. Why Open-Source Apache Hadoop? • Fast evolution in critical features • Built by the Hadoop experts in the community • Practical instead of esoteric • Focus on what is needed for large clusters • Proven at very large scale • In production at all the large consumers of Hadoop • Extremely stable in those environments • Well-understood by practitioners
  • 17. Software Layout • Node 1: • M: Name Node, Balancer & HBase Master • S: HDFS Data Node, NoSQL DB Storage Node • Node 2: • M: Secondary Name Node, Management, Zookeeper, MySQL Slave • S: HDFS Data Node, NoSQL DB Storage Node • Node 3: • M: JobTracker, MySQL Master, ODI Agent, Hive Server • S: HDFS Data Node, NoSQL DB Storage Node • Node 4 – 18: • S: HDFS Data Nodes, Task Tracker, HBase Region Server, NoSQL DB Storage Nodes • Your MapReduce runs here!
  • 18. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 20. Key-Value Store Workloads • Large dynamic schema based data repositories • Data capture • Web applications • Online retail • Sensor/statistics/network capture/Mobile Devices • Data services • Scalable authentication • Real-time communication (MMS, SMS, routing) • Personalization / Localization • Social Networks
  • 21. Oracle NoSQL DB: A distributed, scalable key-value database • Simple Data Model • Key-value pair with major+sub-key paradigm • Read/insert/update/delete operations • Scalability • Dynamic data partitioning and distribution • Optimized data access via intelligent driver • High availability • One or more replicas • Disaster recovery through location of replicas • Resilient to partition master failures • No single point of failure • Transparent load balancing • Reads from master or replicas • Driver is network topology & latency aware [Diagram: Applications, each with a NoSQLDB Driver, connected to Storage Nodes in Data Center A and Data Center B.]
  • 22. Resolving a Request. Client sends: Operation + Key[M,m] + Value + Transaction Policy. Steps: 1) Hash Major Key to determine Partition id 2) Use Partition Map to map Partition id to a Rep Group 3) Use State Table to determine eligible Storage Node(s) within Rep Group 4) Use Load Balancer to select best eligible Rep Node 5) Contact Rep Node directly. Returned to client: • Operation result • New Partition Map • RepNodeStorageTable information
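The request-resolution steps on slide 22 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the driver's actual code: the hash function (MD5 here), the number of partitions, the shape of the partition map and state table, and the "lowest latency wins" load-balancing rule are all hypothetical. Routing writes to the master only is inferred from the "Reads from master or replicas" bullet on slide 21.

```python
import hashlib

NUM_PARTITIONS = 12  # hypothetical partition count

# Hypothetical partition map: partition id -> replication group name.
PARTITION_MAP = {p: f"rg{p % 3 + 1}" for p in range(NUM_PARTITIONS)}

# Hypothetical state table: replication group -> known state of each rep node.
STATE_TABLE = {
    f"rg{g}": [
        {"node": f"rg{g}-rn1", "master": True, "up": True, "latency_ms": 4.0},
        {"node": f"rg{g}-rn2", "master": False, "up": True, "latency_ms": 2.0},
        {"node": f"rg{g}-rn3", "master": False, "up": False, "latency_ms": 1.0},
    ]
    for g in (1, 2, 3)
}


def partition_id(major_key: str) -> int:
    """Step 1: hash the major key to a partition id (hash choice is assumed)."""
    digest = hashlib.md5(major_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % NUM_PARTITIONS


def resolve(major_key: str, is_write: bool):
    """Steps 2 to 4: map partition to rep group, filter nodes, pick the best."""
    pid = partition_id(major_key)
    group = PARTITION_MAP[pid]
    # Eligible nodes must be reachable; writes may only go to the master.
    eligible = [
        n for n in STATE_TABLE[group]
        if n["up"] and (n["master"] or not is_write)
    ]
    if not eligible:
        raise RuntimeError(f"no eligible rep node in {group}")
    # Stand-in load balancer: choose the lowest observed latency.
    best = min(eligible, key=lambda n: n["latency_ms"])
    return pid, group, best["node"]


print(resolve("user42", is_write=True))
print(resolve("user42", is_write=False))
```

Because only the major key is hashed, all records that share a major key land in the same partition, which is what lets the client contact a single rep node directly in step 5.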
  • 23. ACID Transactions. Transaction Policy, Write Durability: • Configurable per-operation, application can set defaults • Write Transaction Durability consists of both a) Sync policy (on Master and Replica): • Sync: force to disk • Write No Sync: force to OS buffer • No Sync: write to local log buffer, flush when convenient; and b) Replica Acknowledgement Policy: • All • Simple Majority • None. Transaction Policy, Read Consistency: • Configurable per-operation, application can set defaults • Read Consistency specified as Absolute, Time-based, Version or None • Absolute: read from the master • Time-based: read from any replica that is within <time-interval> of master or better • Version: read from any replica that is current with <transaction-token> or higher • None: read from any replica
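The four read-consistency options on slide 23 amount to different filters over the nodes of a replication group. The Python sketch below illustrates that idea only; the `Node` fields, the policy names as strings, and the use of an integer as the transaction token are assumptions made for the example and are not the Oracle NoSQL Database API.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    is_master: bool
    lag_seconds: float  # how far this node trails the master (assumed metric)
    version: int        # highest transaction applied (stand-in for a token)


def eligible_nodes(nodes, policy, max_lag_seconds=None, min_version=None):
    """Return the nodes allowed to serve a read under the given policy."""
    if policy == "absolute":
        # Absolute: only the master can answer.
        return [n for n in nodes if n.is_master]
    if policy == "time":
        # Time-based: any node within the allowed lag of the master.
        return [n for n in nodes
                if n.is_master or n.lag_seconds <= max_lag_seconds]
    if policy == "version":
        # Version: any node that has applied at least the given transaction.
        return [n for n in nodes if n.version >= min_version]
    if policy == "none":
        # None: any node will do.
        return list(nodes)
    raise ValueError(f"unknown policy: {policy}")


group = [
    Node("master", True, 0.0, 100),
    Node("replica-1", False, 0.5, 99),
    Node("replica-2", False, 5.0, 90),
]

for policy, kwargs in [
    ("absolute", {}),
    ("time", {"max_lag_seconds": 1.0}),
    ("version", {"min_version": 99}),
    ("none", {}),
]:
    print(policy, [n.name for n in eligible_nodes(group, policy, **kwargs)])
```

Looser policies enlarge the set of eligible nodes, which gives the load balancer more choices at the cost of possibly staler reads.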
  • 24. Oracle NoSQL DB Differentiation • Commercial Grade Software and Support • General-purpose • Reliable – Based on proven Berkeley DB JE HA • Easy to install and configure • Scalable throughput, bounded latency • Simple Programming and Operational Model • Simple Major + Sub key and Value data structure • ACID transactions • Configurable consistency & durability • Easy Management • Web-based console, API accessible • Manages and Monitors: Topology; Load; Performance; Events; Alerts • Completes Oracle large scale data storage offerings
  • 25. Try NoSQL Database on OTN Oracle NoSQL Database: • Community Edition is available as a software only distribution • Enterprise Edition is available as a separately licensable product or as part of Big Data Appliance
  • 26. Oracle Loader for Hadoop
  • 27. Oracle Loader for Hadoop Features • Load data into a partitioned or non-partitioned table – Single level, composite or interval partitioned table – Support for scalar datatypes of Oracle Database – Load into Oracle Database 11g Release 2 • Runs as a Hadoop job and supports standard options • Pre-partitions and sorts data on Hadoop • Online and offline load modes
  • 28. Oracle Loader for Hadoop [Diagram: INPUT 1 and INPUT 2 feed MAP tasks, followed by SHUFFLE/SORT and REDUCE tasks, with Oracle Loader for Hadoop as the final stage.]
  • 29. Oracle Loader for Hadoop: Online Option. 1) Read target table metadata from the database 2) Perform partitioning, sorting, and data conversion 3) Connect to the database from reducer nodes, load into database partitions in parallel [Diagram: MAP, SHUFFLE/SORT, and REDUCE tasks inside Oracle Loader for Hadoop.]
  • 30. Oracle Loader for Hadoop: Offline Option. 1) Read target table metadata from the database 2) Perform partitioning, sorting, and data conversion 3) Write from reducer nodes to Oracle Data Pump files 4) Import into the database in parallel using external table mechanism [Diagram: MAP, SHUFFLE/SORT, and REDUCE tasks inside Oracle Loader for Hadoop.]
  • 31. Oracle Loader for Hadoop Advantages • Offload database server processing to Hadoop: – Convert input data to final database format – Compute table partition for row – Sort rows by primary key within a table partition • Generate binary datapump files • Balance partition groups across reducers
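The work that slide 31 says is offloaded to Hadoop (compute the table partition for each row, sort rows by primary key within a partition, balance partitions across reducers) can be illustrated with a small Python sketch. Everything here is a simplified stand-in: the range-partitioning rule on an `id` column, the row layout, and the greedy balancing heuristic are assumptions for the example, and the slides do not describe how Oracle Loader for Hadoop actually implements these steps.

```python
import bisect
from collections import defaultdict


def partition_of(row, upper_bounds):
    """Range partition on row["id"]; partition i holds ids below upper_bounds[i].

    Ids at or above the last bound fall into a final overflow partition.
    """
    return bisect.bisect_right(upper_bounds, row["id"])


def prepare(rows, upper_bounds):
    """Group rows by target partition and sort each group by primary key."""
    groups = defaultdict(list)
    for row in rows:
        groups[partition_of(row, upper_bounds)].append(row)
    return {p: sorted(g, key=lambda r: r["id"]) for p, g in sorted(groups.items())}


def assign_to_reducers(partition_sizes, num_reducers):
    """Greedy balance: place the largest partition on the least loaded reducer."""
    loads = [0] * num_reducers
    assignment = {}
    for part, size in sorted(partition_sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        reducer = loads.index(min(loads))
        assignment[part] = reducer
        loads[reducer] += size
    return assignment


rows = [{"id": i} for i in (150, 20, 250, 100, 5)]
prepared = prepare(rows, upper_bounds=[100, 200])
sizes = {p: len(g) for p, g in prepared.items()}
plan = assign_to_reducers(sizes, num_reducers=2)
print(prepared)
print(plan)
```

Doing this on the Hadoop side means each reducer hands the database rows that are already grouped by partition and ordered by key, so the database itself does not have to spend CPU on that preparation.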
  • 32. Input and Output Formats Input Formats • Delimited text • Hive tables – Managed and external tables – Native and non-native tables • Write your own input format Output Formats Online Mode • Load directly from Hadoop nodes to Oracle database – JDBC – Parallel direct path Offline Mode • Datapump format – Create binary files for external tables – Import data into the database from the external table with a SQL statement • CSV, delimited text – Load through SQL*Loader or external table mechanism
  • 33. Selection Output Option for Use Case (Oracle Loader for Hadoop Output Option: Use Case Characteristics) • Online load with JDBC: The simplest use case for non-partitioned tables • Online load with Direct Path: Fast online load for partitioned tables • Offline load with datapump files: Fastest load method for external tables • On Oracle Big Data Appliance, Direct HDFS: Leave data on HDFS; parallel access from database; import into database when needed
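The selection guidance on this slide can be read as a small decision rule. The sketch below only restates the slide's three Oracle Loader for Hadoop rows; the function name `choose_output_option` and its two boolean parameters are hypothetical, and real deployments would weigh more factors than these.

```python
def choose_output_option(online, partitioned):
    """Map a use case to the output option named on the slide."""
    if not online:
        # Offline: write Data Pump files, then load via external tables.
        return "Offline load with datapump files"
    if partitioned:
        # Online and partitioned: the slide recommends direct path.
        return "Online load with Direct Path"
    # Online and non-partitioned: the simplest case.
    return "Online load with JDBC"
```

For example, `choose_output_option(online=True, partitioned=True)` selects the direct path option.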
  • 34. Invoking Oracle Loader for Hadoop • Command line: $ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader -libjars <library jar files> -D <configuration properties> • Example: $HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader -libjars avro-1.4.1.jar,commons-math-2.2.jar -conf connection.xml -D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat -D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat
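When the invocation is scripted, it can help to assemble the argument list programmatically so that the `-libjars` value stays a single comma-separated token and each property gets its own `-D`. The helper below is a minimal sketch: `build_olh_command` is a hypothetical name, and it uses only the jar, class, and options that appear on the slide.

```python
def build_olh_command(libjars, conf_file, properties, hadoop="hadoop"):
    """Return the argv list for an Oracle Loader for Hadoop run."""
    argv = [hadoop, "jar", "oraloader.jar", "oracle.hadoop.loader.OraLoader"]
    if libjars:
        # Hadoop expects -libjars as one comma-separated value, no spaces.
        argv += ["-libjars", ",".join(libjars)]
    if conf_file:
        argv += ["-conf", conf_file]
    for name, value in properties.items():
        # One -D per configuration property, in insertion order.
        argv += ["-D", f"{name}={value}"]
    return argv
```

The returned list can be passed to `subprocess.run` without a shell, which avoids quoting problems in jar lists and property values.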
  • 35. Automate Usage of Oracle Loader for Hadoop Oracle Data Integrator (ODI) • ODI has knowledge modules to – Generate data transformation code to run on Hive/Hadoop – Invoke Oracle Loader for Hadoop • Use the drag-and-drop interface in ODI to – Include invocation of Oracle Loader for Hadoop in any ODI packaged flow
  • 38. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 39. Big Data Appliance and Exadata Big Data for the Enterprise • NoSQL DB • HDFS • Hadoop • RDBMS

Editor's Notes

  1. Changed Count to Volume =>
  2. Is Developer Centric the right word? Should we hyphenate, or put commas?
  3. Benefits for Online Mode: No need to write to disk after Hadoop job; simpler management for use cases with lots of nodes generating output files. Benefits for Offline Mode (DP Files): Import operation can be parallelized in the database; fastest option for external tables.
  4. Direct HDFS: Access data on HDFS through the external table mechanism. Benefits: Data on HDFS can be queried from the database; import into the database as needed.