0% found this document useful (1 vote)

77 views

Data Analytics Using NoSQL

Data Analytics using NoSQL documents discusses NoSQL databases and MongoDB. It covers typical NoSQL architectures using hashing to map keys to servers. It summarizes the CAP theorem, explaining that it is impossible to satisfy all three properties (consistency, availability, and partition tolerance) simultaneously. The document also covers MongoDB specific topics like sharding of data, replica sets for redundancy and failover, and the flexible schema-less document model using BSON format.

Uploaded by

PREM KUMAR M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (1 vote)

77 views

Data Analytics Using NoSQL

Uploaded by

PREM KUMAR M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 50

Data Analytics using NoSQL

DHINESHKUMAR S K
Taxonomy of NoSQL
Key-value

Graph database

Document-oriented

Column family

3
Typical NoSQL architecture

Hashing
K
function
maps each
key to a
server (node)
CAP theorem for NoSQL

What the CAP theorem really says:

• If you cannot limit the number of faults and requests can be
directed to any server and you insist on serving every request you

receive then you cannot possibly be consistent

How it is interpreted:
• You must always give something up: consistency, availability or
tolerance to failure and reconfiguration
Theory of NOSQL: CAP
GIVEN:
C
• Many nodes
• Nodes contain replicas of partitions
of the data

• Consistency
• All replicas contain the same version
of data
• Client always has the same view of
the data (no matter what node)
• Availability A P
• System remains operationalon failing
nodes
• All clients can always read and write
• Partition tolerance CAP Theorem: satisfying
• multiple entry points
• System remains operationalon
all three at the same
system split (communication
malfunction) time is impossible
• System works well across physical
network partitions
Sharding of data

 Distributes a single logical database system across a cluster of machines

 Uses range-based partitioning to distribute documents based
 on a specific shard key
 Automatically balances the data associated with each shard
 Can be turned on and off per collection (table)
Replica Sets

 Redundancy and Failover

 Zero downtime for upgrades and
maintenance replica1

 Master-slave replication Client

 Strong Consistency
 Delayed Consistency

 Geospatial features
How does NoSQL vary from RDBMS?

 Looser schema definition

 Applications written to deal with specific documents/ data
 Applications aware of the schema definition as opposed to the data
 Designed to handle distributed, large databases
 Trade offs:
 No strong support for ad hoc queries but designed for speed and growth of database
 Query language through the API
 Relaxation of the ACID properties
Benefits of NoSQL

Elastic Scaling Big Data

• RDBMS scale up – bigger • Huge increase in data
load , bigger server RDMS: capacity and
• NO SQL scale out – constraints of data
distribute data across volumes at its limits
multiple hosts • NoSQL designed for big
seamlessly data
DBA Specialists
• RDMS require highly
trained expert to
monitor DB
• NoSQL require less
management, automatic
repair and simpler data
models
Benefits of NoSQL

Flexible data models Economics

• Change management to • RDMS rely on expensive
schema for RDMS have proprietary servers to
to be carefully managed manage data
• NoSQL databases more • No SQL: clusters of
relaxed in structure of cheap commodity
data servers to manage the
• Database schema data and transaction
changes do not have to volumes
be managed as one • Cost per gigabyte or
complicated change unit
transaction/second for
• Application already
written to address an
NoSQL can be lower
amorphous schema than the cost for a
RDBMS
Drawbacks of NoSQL

• Support • Maturity
• RDBMS vendors • RDMS mature
provide a high level of product: means stable
support to clients and dependable
• Stellar reputation • Also means old no
• NoSQL – are open longer cutting edge nor
interesting
source projects
with startups • NoSQL are still
supporting them implementing their
• Reputation not yet basic feature set
established
Drawbacks of NoSQL

• Administration • Analytics and Business

• RDMS administrator well Intelligence
defined role • RDMS designed to
• No SQL’s goal: no  address this niche
administrator necessary • NoSQL designed to meet
however NO SQL still the needs of an Web 2.0
requires effort to application - not
maintain designed for ad hoc
• Lack of Expertise query of the data
• Whole workforce of • Tools are being
trained and seasoned developed to address
RDMS developers this need
• Still recruiting
developers to the NoSQL
camp
RDB ACID to NoSQL BASE

Atomicity Basically

Consistency Available (CP)

Isolation Soft-state
(State of system may change
over time)

Durability Eventually
consistent
(Asynchronous propagation)
MongoDB
What is MongoDB?

 Developed by 10gen
 Founded in 2007
 A document-oriented, NoSQL database
 Written in C++
 Supports APIs (drivers) in many computer languages
 JavaScript, Python, Ruby, Perl, Java, Java Scala, C#, C++, Haskell, Erlang
Functionality of MongoDB

• Dynamic schema
• No DDL
• Document-based database
• Secondary indexes
• Query language via an API
• Atomic writes and fully-consistent reads
• If system configured that way
• Master-slave replication with automated failover (replica sets)
• Built-in horizontal scaling via automated range-based
• partitioning of data (sharding)
Why use MongoDB?

 Simple queries
 Functionality provided applicable to most web applications
 Easy and fast integration of data
 No ERD diagram
 Not well suited for heavy and complex transactions systems
MongoDB: CAP approach

C
Focus on Consistency and
Partition tolerance

• Consistency
• all replicas contain the same
version of the data
• Availability
• system remains operational on A P
failingnodes
• Partition tolarence
CAP Theorem:
• multiple entry points satisfying all three at the same time is
• system remains operational on impossible
system split
MongoDB: Hierarchical Objects

0 or more Databases
 A MongoDB instance may have
0 or more Collections
 zero or more ‘databases’
0 or more Documents
 A database may have
 zero or more ‘collections’.
 A collection may have
 zero or more ‘documents’.
0 or
more
 A document may have
Fields
 one or more ‘fields’.
 MongoDB ‘Indexes’ function much like their
RDBMS counterparts.
RDB Concepts to NO SQL
RDBMS MongoDB
Database Database
Table, View Collection

Row Document (BSON)

Column Field
Index Index
Join Embedded Document
Foreign Key Reference

Partition Shard
Choices made for Design of MongoDB

 Scale horizontally over commodity hardware

 Lots of relatively inexpensive servers
 Keep the functionality that works well in RDBMSs
 Ad hoc queries
 Fully featured indexes
 Secondary indexes
 What doesn’t distribute well in RDB?
 Long running multi-row transactions
 Joins
 Both artifacts of the relational data model (row x column)
BSON format

 Binary-encoded serialization of JSON-like documents

 Zero or more key/value pairs are stored as a single entity
 Each entry consists of a field name, a data type, and a value
 Large elements in a BSON document are prefixed with a
 length field to facilitate scanning
Schema Free

 MongoDB does not need any pre-defined data schema

 Every document in a collection could have different data
 Addresses NULL data fields

{name: “will”, name: “jeff”, {name: “brendan”,

eyes: “blue”, eyes: “blue”, aliases: [“el diablo”]}
birthplace: “NY”, loc: [40.7, 73.4],
aliases: [“bill”, “la ciacco”], boss: “ben”}
loc: [32.7, 63.4],
boss: ”ben”}
{name: “matt”,
pizza: “DiGiorno”,
height: 72,
name: “ben”, loc: [44.6, 71.3]}
hat: ”yes”}
JSON format

 Data is in name / value pairs

 A name/value pair consists of a field name followed by a colon, followed by a value:
 Example: “name”: “R2-D2”
 Data is separated by commas
 Example: “name”: “R2-D2”, race : “Droid”
 Curly braces hold objects
 Example: {“name”: “R2-D2”, race : “Droid”, affiliation:“rebels”}
 An array is stored in brackets []
 Example [ {“name”: “R2-D2”, race : “Droid”, affiliation: “rebels”}, {“name”: “Yoda”,
affiliation: “rebels”} ]
MongoDB Features

 Document-Oriented storage
 Full Index Support
 Replication & High Availability Agile
 Auto-Sharding
 Querying
 Fast In-Place Updates
 Map/Reduce functionality Scalable
Index Functionality

• B+ tree indexes
• An index is automatically created on the _id field (the primary key)
• Users can create other indexes to improve query performance or to enforce Unique values
for a particular field
• Supports single field index as well as Compound index
• Like SQL order of the fields in a compound index matters
• If you index a field that holds an array value, MongoDB creates
• separate index entries for every element of the array
• Sparse property of an index ensures that the index only contain entries for documents that
have the indexed field. (so ignore records that do not have the field defined)
• If an index is both unique and sparse – then the system will reject records that have a
duplicate key value but allow records that do not have the indexed field defined
Hands ON!!!!!
Example: Mongo Document

{
name: 'Brad Steve’,
address: {
street: 'Oak Terrace', city: 'Denton’
}
}
Example: Mongo Collection

{
"_id": ObjectId("4efa8d2b7d284dad101e4bc9"),
"Last Name": "DUMONT",
"First Name": "Jean",
"Date of Birth": "01-22-1963" Obligatory, and automatically generated
}, by MongoDB
{
"_id": ObjectId("4efa8d2b7d284dad101e4bc7"),
"Last Name": "PELLERIN",
"First Name": "Franck",
"Date of Birth": "09-19-1983",
"Address": "1 chemin des Loges",
"City": "VERSAILLES"
}
Sample!

 BLOG
 A blog post has an author, some text, and many comments
 The comments are unique per post, but one author has many posts

 How would you design this in SQL?

Blog – BAD Design

 Collections for posts, authors, and comments

 References by manually created ID

post = { author = {
id: 150, id: 100,
author: 100, name: 'Michael Arrington' posts: [150]
text: 'This is a pretty awesome post.’, }
comments: [100, 105, 112]
} comment = {
id: 105,
text: 'Whatever this is good comment’
}
Sample: Better Design

 Collection for posts

 Embed comments, author name

post = {
author: 'Michael Arrington’,
text: 'This is a pretty awesome post.’,
comments: [ 'Whatever this post sux.', 'I agree, lame!’ ]
}
Installation
CRUD Operations

• Create
• db.collection.insert( <document> )
• db.collection.save( <document> )
• db.collection.update( <query>, <update>, { upsert: true } )
Collection
• Read specifies the
• db.collection.find( <query>, <projection> )
• db.collection.findOne( <query>, <projection> )
collection or
• Update
the ‘table’ to
•
• db.collection.update( <query>, <update>, <options> )
Delete
store the
• db.collection.remove( <query>, <justOne> ) document
Create Operations

Db.collection specifies the collection or the ‘table’ to store the document

• db.collection_name.insert( <document> )
• Omit the _id field to have MongoDB generate a unique key
• Example db.parts.insert( {{type: “screwdriver”, quantity: 15 } )
• db.parts.insert({_id: 10, type: “hammer”, quantity: 1 })
• db.collection_name.update( <query>, <update>, { upsert: true } )
• Will update 1 or more records in a collection satisfying query
• db.collection_name.save( <document> )
• Updates an existing record or creates a new record
Read Operations

• db.collection.find( <query>, <projection> ).cursor modified

• Provides functionality similar to the SELECT command
• <query> where condition , <projection> fields in result set
• Example: var PartsCursor = db.parts.find({parts: “hammer”}).limit(5)
• Has cursors to handle a result set
• Can modify the query to impose limits, skips, and sort orders.
• Can specify to return the ‘top’ number of records from the result set
• db.collection.findOne( <query>, <projection> )
Query Operators

Name Description
$eq Matches value that are equal to a specified value
$gt, $gte Matches values that are greater than (or equal to a specified value

$lt, $lte Matches values less than or ( equal to ) a specified value

$ne Matches values that are not equal to a specified value

$in Matches any of the values specified in an array
$nin Matches none of the values specified in an array
$or Joins query clauses with a logical OR returns all
$and Join query clauses with a loginal AND
$not Inverts the effect of a query expression
$nor Join query clauses with a logical NOR
$exists Matches documents that have a specified field
Update Operations

• db.collection_name.insert( <document> )
• Omit the _id field to have MongoDB generate a unique key
• Example db.parts.insert( {{type: “screwdriver”, quantity: 15 } )
• db.parts.insert({_id: 10, type: “hammer”, quantity: 1 })
• db.collection_name.save( <document> )
• Updates an existing record or creates a new record
• db.collection_name.update( <query>, <update>, { upsert: true } )
• Will update 1 or more records in a collection satisfying query
• db.collection_name.findAndModify(<query>, <sort>, <update>,<new>,
<fields>,<upsert>)
• Modify existing record(s) – retrieve old or new version of the record
Delete Operations

• db.collection_name.remove(<query>, <justone>)
• Delete all records from a collection or matching a criterion
• <justone> - specifies to delete only 1 record matching the criterion
• Example: db.parts.remove(type: /^h/ } ) - remove all parts starting with h
• Db.parts.remove() – delete all documents in the parts collections
SQL vs. Mongo DB entities

My SQL Mongo DB
START TRANSACTION; db.contacts.save( { user
INSERT INTO contacts VALUES (NULL, Name: “joeblow”,
‘joeblow’);
emailAddresses:
INSERT INTO contact_emails
VALUES
[ “joe@blow.co
( NULL, ”joe@blow.com”, m”,
“joseph@blow.com” ] }
LAST_INSERT_ID() ),
( NULL,
“joseph@blow.com”,
LAST_INSERT_ID() ); COMMIT; MongoDB separates physical structure
from logical structure
Designed to deal with large &distributed
Aggregation
Aggregation Framework Operators

 $project
 $match
 $limit
 $skip
 $sort
 $unwind
 $group
 …….
$match

 Filter documents
 Uses existing query syntax
 If using $geoNear it has to be first in pipeline
 $where is not supported
Matching Field Values

{
"_id" : 271421,
"amenity" : "pub",
"name" : "Sir Walter Tyrrell",
"location" : {
"type" : "Point",
"coordinates" : [
-1.6192422,
50.9131996
]
}
} {
"_id" : 271466,
{ "amenity" : "pub",
"_id" : 271466,
"amenity" : "pub", "name" : "The Red Lion",
"name" : "The Red Lion", "location" : {
"location" : { "type" : "Point",
"type" : "Point",
"coordinates" : [ "coordinates" : [
-1.5494749, -1.5494749,
50.7837119 50.7837119
]
} ]}
$project

 Reshape documents
 Include, exclude or rename fields
 Inject computed fields
 Create sub-document fields
Including and Excluding Fields
{ { “$project”: {
"_id" : 271466,
“_id”: 0,
"amenity" : "pub", “amenity”: 1,
"name" : "The Red Lion", “name”: 1,

"location" : { }}
"type" : "Point",
"coordinates" : [
-1.5494749,
50.7837119 {
] “amenity” : “pub”,
“name” : “The Red Lion”
} }
}
Reformatting Documents
{ { “$project”: {
"_id" : 271466,
“_id”: 0,
"amenity" : "pub", “name”: 1,
"name" : "The Red Lion", “meta”: {
“type”: “$amenity”}
"location" : { }}
"type" : "Point",
"coordinates" : [
-1.5494749,
50.7837119 {
] “name” : “The Red Lion”
“meta” : {
} “type” : “pub”
} }}
$group

• Group documents by an ID

 Field reference, object, constant

• Other output fields are computed
$max, $min, $avg, $sum
$addToSet, $push $first, $last

• Processes all data in memory

Aggregation Framework Benefits

 Real-time
 Simple yet powerful interface
 Declared in JSON, executes in C++
 Runs inside MongoDB on local data
− Adds load to your DB
− Limited Operators
− Data output is limited

Ouvrir ALMiG AC B V1.80 Service Manual GB
0% (1)
Ouvrir ALMiG AC B V1.80 Service Manual GB
74 pages
Anurag Arwalkar CV
No ratings yet
Anurag Arwalkar CV
1 page
IBDP Math Applications & Interpretation HL COURSE OUTLINES
100% (2)
IBDP Math Applications & Interpretation HL COURSE OUTLINES
23 pages
WsCube Tech Online MERN Stack Course
No ratings yet
WsCube Tech Online MERN Stack Course
24 pages
Big Data Analytics TEXTBOOK
No ratings yet
Big Data Analytics TEXTBOOK
230 pages
ADF
No ratings yet
ADF
54 pages
E-Commerce Application - Angular Front-End and Spring Boot Back-End
No ratings yet
E-Commerce Application - Angular Front-End and Spring Boot Back-End
2 pages
Lab Manual
No ratings yet
Lab Manual
162 pages
MySQL Tutorial
No ratings yet
MySQL Tutorial
52 pages
Hybrid Resume ATS_TDS
No ratings yet
Hybrid Resume ATS_TDS
2 pages
What Is LINQ PDF
No ratings yet
What Is LINQ PDF
145 pages
Sourabh Bisht RESUME PDF
No ratings yet
Sourabh Bisht RESUME PDF
2 pages
Ignite Sample
0% (1)
Ignite Sample
88 pages
GCP Official Icons and Solution Architectures PDF
No ratings yet
GCP Official Icons and Solution Architectures PDF
95 pages
Janmejaya_Sahoo
No ratings yet
Janmejaya_Sahoo
2 pages
Angular 01052023
No ratings yet
Angular 01052023
62 pages
SQL Server Material Free PDF by MR Bangar Raju Part1
No ratings yet
SQL Server Material Free PDF by MR Bangar Raju Part1
137 pages
Lecture 07 - Key-Value Databases
No ratings yet
Lecture 07 - Key-Value Databases
75 pages
Introduction To Data Model L-1
No ratings yet
Introduction To Data Model L-1
17 pages
Udemy Course Access
No ratings yet
Udemy Course Access
27 pages
ETL Vs DB Testing
No ratings yet
ETL Vs DB Testing
13 pages
Building Single Page Applications Using Web API and AngularJS
67% (3)
Building Single Page Applications Using Web API and AngularJS
140 pages
Ethans Tech: AWS Curriculum
No ratings yet
Ethans Tech: AWS Curriculum
5 pages
Ebay Case Study
No ratings yet
Ebay Case Study
11 pages
Durga Core Java
50% (2)
Durga Core Java
2 pages
Submitted To: Efforts By:: Ms. Kareena Bhatia Rahul Kr. Gupta
No ratings yet
Submitted To: Efforts By:: Ms. Kareena Bhatia Rahul Kr. Gupta
18 pages
Lekcija09 - 04 NoSQL Redis
No ratings yet
Lekcija09 - 04 NoSQL Redis
40 pages
Git Workshop - PDF Version 1-Rotated
No ratings yet
Git Workshop - PDF Version 1-Rotated
36 pages
AWS Simple-Icons v2.0
No ratings yet
AWS Simple-Icons v2.0
10 pages
Analysis Node - Js Platform Web Application Security
No ratings yet
Analysis Node - Js Platform Web Application Security
60 pages
Big Data and Data Analysis: Offurum Paschal I Kunoch Education and Training College, Owerri
No ratings yet
Big Data and Data Analysis: Offurum Paschal I Kunoch Education and Training College, Owerri
35 pages
MongoDB Lab
No ratings yet
MongoDB Lab
41 pages
Intro DB JDBC JPA SpringData
No ratings yet
Intro DB JDBC JPA SpringData
136 pages
Unit 2 - Angular-Componenets
No ratings yet
Unit 2 - Angular-Componenets
30 pages
Bookstore E-Commerce Platform With MERN Stack
No ratings yet
Bookstore E-Commerce Platform With MERN Stack
66 pages
Hibernate Interview Question
No ratings yet
Hibernate Interview Question
119 pages
Very Important Example of Hibernate3.0
No ratings yet
Very Important Example of Hibernate3.0
161 pages
Billing Sytem
No ratings yet
Billing Sytem
14 pages
UK - Angular - Day 2 PDF
No ratings yet
UK - Angular - Day 2 PDF
12 pages
MSSQL Server 2008 Developer
No ratings yet
MSSQL Server 2008 Developer
240 pages
OnkarPramodKurle (3 0)
No ratings yet
OnkarPramodKurle (3 0)
7 pages
Suraj Resume
No ratings yet
Suraj Resume
2 pages
AWS Scenario Based Interview Questions On EC2, IAM & VPC
No ratings yet
AWS Scenario Based Interview Questions On EC2, IAM & VPC
14 pages
Database Study Guide
No ratings yet
Database Study Guide
67 pages
Resume 3
No ratings yet
Resume 3
4 pages
Chapter 7 Relational Database and SQL
No ratings yet
Chapter 7 Relational Database and SQL
12 pages
Flutter Developer Resume
No ratings yet
Flutter Developer Resume
2 pages
Angular 7 For Beginners
No ratings yet
Angular 7 For Beginners
65 pages
What Is Aws?: Saas (Software As A Service)
No ratings yet
What Is Aws?: Saas (Software As A Service)
16 pages
Javascript Notes 123
No ratings yet
Javascript Notes 123
15 pages
Pluralsight - Angular Reactive Forms
No ratings yet
Pluralsight - Angular Reactive Forms
190 pages
Typescript Notes
No ratings yet
Typescript Notes
23 pages
MongoBoulder - Schema Design
No ratings yet
MongoBoulder - Schema Design
59 pages
Alternative Databases: Scott Macvicar Dutch PHP Conference 2009
100% (2)
Alternative Databases: Scott Macvicar Dutch PHP Conference 2009
52 pages
Shubham Pande
No ratings yet
Shubham Pande
2 pages
SudheerKumar Ponnana Resume
No ratings yet
SudheerKumar Ponnana Resume
4 pages
Trust Decay
No ratings yet
Trust Decay
10 pages
CSRF Presentation
No ratings yet
CSRF Presentation
10 pages
DSML - Curriculum Brochure
No ratings yet
DSML - Curriculum Brochure
40 pages
No SQLMongo DB
No ratings yet
No SQLMongo DB
47 pages
Chapter 5-NoSQL PDF
No ratings yet
Chapter 5-NoSQL PDF
47 pages
NoSQL Database
No ratings yet
NoSQL Database
45 pages
Computer Vision: Models, Learning and Inference
No ratings yet
Computer Vision: Models, Learning and Inference
59 pages
Apache Spark: Dhineshkumar S K
No ratings yet
Apache Spark: Dhineshkumar S K
31 pages
Python Quick Guide
No ratings yet
Python Quick Guide
162 pages
CheatSheet Python 3 Complex Data Types
No ratings yet
CheatSheet Python 3 Complex Data Types
1 page
Notes On 9th Computer Science
78% (9)
Notes On 9th Computer Science
7 pages
Role of The Good Angel and The Bad
No ratings yet
Role of The Good Angel and The Bad
13 pages
Bethany Hamilton Thesis
100% (3)
Bethany Hamilton Thesis
4 pages
My Expectations in Filipino
100% (1)
My Expectations in Filipino
10 pages
Computer Skills - CPIT 100
No ratings yet
Computer Skills - CPIT 100
4 pages
Practical 17
No ratings yet
Practical 17
5 pages
The Reading / Tapescript: Advertisements
No ratings yet
The Reading / Tapescript: Advertisements
16 pages
Deskripsi Mata Kuliah: Teknik Informatika UNS
No ratings yet
Deskripsi Mata Kuliah: Teknik Informatika UNS
5 pages
RTU560 Remote Terminal Unit: Subdevice Communication Interface With Hitachi HDLC Protocol
No ratings yet
RTU560 Remote Terminal Unit: Subdevice Communication Interface With Hitachi HDLC Protocol
33 pages
06 Activity 1 ARG Data Structures Orienza Kenan M PDF
No ratings yet
06 Activity 1 ARG Data Structures Orienza Kenan M PDF
1 page
Gen Sec Ref Books
No ratings yet
Gen Sec Ref Books
516 pages
ACA - Memory
No ratings yet
ACA - Memory
26 pages
Az-204 2
No ratings yet
Az-204 2
51 pages
How Excellent Is Thy Loving Kindness
No ratings yet
How Excellent Is Thy Loving Kindness
2 pages
Elementary 10 Topics: Negative Questions: Question Tags
No ratings yet
Elementary 10 Topics: Negative Questions: Question Tags
7 pages
A Photograph
No ratings yet
A Photograph
4 pages
IBM CE - Industrial Skills Training - Programme Details
No ratings yet
IBM CE - Industrial Skills Training - Programme Details
2 pages
Vesica Piscis Key
No ratings yet
Vesica Piscis Key
36 pages
Criticism On Literature
No ratings yet
Criticism On Literature
4 pages
Community Service Reflection
No ratings yet
Community Service Reflection
2 pages
CoE3DJ4 Digital Systems Design Hardware Summary
No ratings yet
CoE3DJ4 Digital Systems Design Hardware Summary
164 pages
Milen_Dimitrov_HW2_Q2
No ratings yet
Milen_Dimitrov_HW2_Q2
28 pages
Wa0001.
No ratings yet
Wa0001.
4 pages
CS8494 Software Engineering MCQ
No ratings yet
CS8494 Software Engineering MCQ
34 pages
Sacred Heart Catholic College, Ijebu-Ode AOC For Third Term Entry Test JS1
No ratings yet
Sacred Heart Catholic College, Ijebu-Ode AOC For Third Term Entry Test JS1
6 pages
Introduction To Python
No ratings yet
Introduction To Python
13 pages
UNDS How Much Do You Know Yourself
No ratings yet
UNDS How Much Do You Know Yourself
2 pages
Semantika Engleskoga Jezika-Skripta Za Zic-Fuchs
No ratings yet
Semantika Engleskoga Jezika-Skripta Za Zic-Fuchs
14 pages

Data Analytics Using NoSQL

Uploaded by

Data Analytics Using NoSQL

Uploaded by

Data Analytics using NoSQL

What the CAP theorem really says:

receive then you cannot possibly be consistent

 Distributes a single logical database system across a cluster of machines

 Redundancy and Failover

 Master-slave replication Client

 Looser schema definition

Elastic Scaling Big Data

Flexible data models Economics

• Administration • Analytics and Business

Consistency Available (CP)

Row Document (BSON)

 Scale horizontally over commodity hardware

 Binary-encoded serialization of JSON-like documents

 MongoDB does not need any pre-defined data schema

{name: “will”, name: “jeff”, {name: “brendan”,

 Data is in name / value pairs

 How would you design this in SQL?

 Collections for posts, authors, and comments

 Collection for posts

Db.collection specifies the collection or the ‘table’ to store the document

• db.collection.find( <query>, <projection> ).cursor modified

$lt, $lte Matches values less than or ( equal to ) a specified value

$ne Matches values that are not equal to a specified value

 Field reference, object, constant

• Processes all data in memory

You might also like