0% found this document useful (0 votes)

6 views

M5_dbm_sql_notes

NoSQL databases are a diverse group of database management systems that do not rely on the traditional relational model, offering schema-less data storage, scalability, and support for various data models such as key-value, document, wide-column, and graph formats. They are particularly suited for big data applications and scenarios requiring flexible data models and horizontal scalability. While NoSQL databases provide advantages like flexibility and performance, they may also present challenges related to consistency, maturity, and complexity compared to traditional relational databases.

Uploaded by

ajith

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

M5_dbm_sql_notes

Uploaded by

ajith

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

NoSQL

NoSQL databases are a broad class of database management systems that differ from traditional
relational database management systems (RDBMS) in that they do not use the standard relational
database model. The term "NoSQL" originally stood for "non-SQL" to emphasize their departure
from SQL-based relational databases, but it has since been interpreted to mean "not only SQL,"
indicating that some NoSQL databases support query languages that are SQL-like.

Key Characteristics of NoSQL Databases:

 Schema-less: NoSQL databases do not require a fixed schema before data insertion,
allowing for more flexible data models that can evolve with the application's needs.
 Scalability: Scalability can generally be achieved in two ways vertical scaling and
horizontal scaling.
 Veertical Scaling : Involves adding more resources (CPU, RAM, storage) to a single
server or node.
 Horizontal Scaling: Involves adding more servers or nodes to a distributed system,
spreading out the data and workload across many machines rather than relying on a
single one.
 Variety of Data Models: NoSQL databases support a wide range of data models, including
key-value, document, wide-column, and graph formats, that is unstructured data.

Types of NoSQL Databases:

1. Key-Value Stores: The simplest form of NoSQL databases, storing data as a collection of
key-value pairs. Examples include Redis and Amazon DynamoDB.
2. Document Stores: These databases store data as documents, typically in JSON or BSON
(Binary JSON) formats. They are schema-less, which means the structure of these
documents can change over time. Examples include MongoDB and Couchbase.
3. Wide-Column Stores: Similar to relational databases in that they store data in tables with
rows and columns, but unlike RDBMS, the names and format of the columns can vary from
row to row in the same table. . Examples include Apache Cassandra and Google Bigtable.
4. Graph Databases: They represent and store data in terms of entities and the relationships
between these entities. Examples include Neo4j and Amazon Neptune.

Use Cases:
 Big Data and Real-Time Web Applications: NoSQL databases are well-suited for handling
large volumes of data with varying structures, or when an application requires real-time
access to data.
 Flexible Data Models: Applications that require a flexible data model with the ability to
change the data schema.
 Scalability: When an application needs to scale horizontally across many servers to handle
large loads or large volumes of data, NoSQL databases are often more cost-effective and
technically feasible than traditional RDBMS.
Advantages and Disadvantages:
 Advantages:
 Scalability: Designed to horizontal scale, that is involve across many servers.
 Flexibility: Schema-less models allow for flexible and rapid development.
 Variety: high volume, unstructured data.
 Disadvantages:
 Consistency: Many NoSQL databases sacrifice ACID (Atomicity, Consistency,
Isolation, Durability)
 Maturity: Relational databases benefit from decades of research, optimization, and
tooling, while NoSQL solutions are generally newer and may lack comprehensive
tools and best practices.
 Complexity: The variety of data models can introduce complexity in selecting and
managing the appropriate NoSQL database for specific use cases.

1. Document stores
1. “document" refers to the main unit of storage, which encapsulates data in formats like
JSON, BSON. These databases are designed to store, retrieve, and manage document-
oriented information, typically in JSON, BSON (Binary JSON), or XML formats.
2. Document stores provide a flexible schema approach, which means that the structure of the
data can change from one document to another within the same database.
3. This flexibility makes document stores particularly well-suited for applications dealing with
large volumes of unstructured data that may not fit neatly into the rows and columns of a
traditional relational database.

Key Features of NoSQL Document Stores:

 Schema-less Data Storage: Documents can contain different sets and types of attributes.
 Scalability: Many document stores offer horizontal scalability, that is adding more servers
in a distributed architecture. This makes them suitable for web-scale applications.
 High Performance: Document stores are optimized for fast data retrieval. Indexing on
document attributes can further enhance query performance.

Example
MongoDB, Apache CouchDB, Couchbase, DocumentDB

A data base for Books in MongoDB is represented as.

[
{
"isbn": "100-100-100",
"title": "Alchemist",
"authors": ["Paulo Coelho"],
},
{
"isbn": "100-100-101",
"title": "The Godfather",
"authors": ["Mario Puzo"],
"publisher": "Penguin Books",
"genres": ["Novel", "Psychological Fiction"],
},

Above we can see two records of books , Alchemist and God father having different schema. That
is the record for the book “God father“ got two more attributes (publisher and genres). This is
described as flexible schema

Insert operation
db.book.insert({
isbn: "100-100-103",
title: "Hamlet",
authors: ["William Shakespeare"],
publicationYear: 1599,
genres: [“play”, “Shakespearean tragedy”],
})

Query operation

db.book.find({title: "Hamlet"})
db.book.find({isbn: 100-100-103)}

Both query return same data

{
"_id" : ObjectId("someObjectId"),
"isbn": "100-100-104",
"title": "Hamlet",
"authors": ["William Shakespeare"],
"publicationYear": 1599,
"genres": ["play", "Shakespearean tragedy"]
}
2. Key-value stores
1. Organize data as a collection of key-value pairs, where a key serves as a unique identifier
and is mapped to a corresponding value.
2. This simple data model allows for highly efficient data retrieval, storage, and management,
particularly for use cases requiring fast access to large amounts of data.

Characteristics of Key-Value Stores:

Simplicity: The model is straightforward, with each key acting as a unique identifier through which
its associated value can be accessed.
Performance: Key-value stores are optimized for speed, particularly for read and write operations,
due to their simple data model and the ability to distribute data across multiple nodes.
Scalability: Many key-value databases are designed to scale out horizontally, making it easy to
increase capacity and throughput by adding more nodes to the system.
Flexibility: Values stored can be anything from simple data types (such as strings or numbers) to
more complex objects or binary data. The structure of the value is not enforced by the database,
providing flexibility in what can be stored.
Schema-less: There is no fixed schema required for the data, allowing the structure of values to
change without the need for database migrations.

Ecample
Redis, Amazon DynamoDB, Riak, Riak

To represent the given book records as key-value pairs suitable for a key-value store like Amazon
DynamoDB, you can structure each record with its ISBN as the key and the rest of the book's
information as the value.

{
"100-100-100": {
"title": "Alchemist",
"authors": ["Paulo Coelho"],
},
"100-100-101": {
"title": "The Godfather",
"authors": ["Mario Puzo"],
"publisher": "Penguin Books",
"genres": ["Novel", "Psychological Fiction"],
},
}

Insert operatio
aws dynamodb put-item \
--table-name Books \
--item '{
"isbn": {"S": "100-100-103"},
"title": {"S": "Hamlet"},
"authors": {"L": [{"S": "William Shakespeare"}]},
"publicationYear": {"N": "1599"},
"genres": {"L": [{"S": "play"}, {"S": "Shakespearean tragedy"}]}
}' \
Query operation
aws dynamodb get-item \
--table-name Books \
--key '{"isbn": {"S": "100-100-103"}}' \

This will return..

{
"Item": {
"isbn": {"S": "100-100-103"},
"title": {"S": "Hamlet"},
"authors": {"L": [{"S": "William Shakespeare"}]},
"publicationYear": {"N": "1599"},
"genres": {"L": [{"S": "play"}, {"S": "Shakespearean tragedy"}]}
}
}

3. Wide-column stores
Wide-column stores are a type of NoSQL database that organizes data into tables, rows, and
dynamic columns. They combine elements of both relational databases and key-value stores but are
unique in their approach to column management. Wide-column stores allow for each row to have
a different set of columns.

Key Features of Wide-Column Stores:

 Dynamic Columns: Unlike traditional relational databases that require a fixed schema with
a predetermined set of columns, wide-column stores allow each row to have a varying
number of columns. This flexibility is particularly useful for storing data with many
attributes that may not be uniform across all records.
 Efficient Storage and Retrieval: They are optimized for queries over large datasets and can
efficiently retrieve specific columns across a subset of rows.
 Scalability: Wide-column stores are designed to scale horizontally across many nodes,
making them suitable for applications that require high performance, availability, and
scalability.
 Composite Keys: They often support composite keys, consisting of a partition key and
clustering keys, allowing for efficient data access patterns and enabling data to be stored and
retrieved in sorted order.
Examples
Apache Cassandra, Google Cloud Bigtable, ScyllaDB

Cassandra encourages denormalization and duplication of data across multiple tables to optimize
read performance.

Creating Schema

CREATE TABLE books (

isbn TEXT PRIMARY KEY,
title TEXT,
authors LIST<TEXT>,
publicationYear INT,
publisher TEXT,
genres LIST<TEXT>,
);

// inseting three attributes for Book - Alcheist

INSERT INTO books (isbn, title, authors)
VALUES ('100-100-100', 'Alchemist', ['Paulo Coelho']);

// inseting three attributes for Book - Alcheist

INSERT INTO books (isbn, title, authors, publicationYear, publisher, genres)
VALUES ('100-100-101', “The God father”, ['Mario Puzo'], 1988, 'HarperCollins', ['Novel',
'Philosophical']);

Schema Flexibility of Cassandra

 Although Cassandra requires a schema to define tables and columns, it is more flexible
compared to traditional RDBMS.
 Each row in the same table can have a different set of columns with some columns being
present in one row but absent in others. Additionally,
 Cassandra supports adding new columns to existing tables without affecting existing rows

So the major difference are

1. cells can have more than one value
2. when new column is added, it will not be allocated in memory for older rows.

Modify the Table Schema

Now we want to insert a record with a new attribute “Language”
For eg:
isbn :'100-100-103',
title: '100 years of Solitude',
authors: ['Gabriel Garcia Marques'],
publicationYear: 1967
publisher: Green Books
genres : [‘Magical Realism’]
language: ‘spanish’

In Apache Cassandra, if you need to add a new attribute (like "language" in your example) to an
existing table, you would typically alter the table schema to include this new column.

This will not add memory overhead to existing rows whereas in RDBMS, scema updates will add
memory overhead to all exisitng records.

The process involved are

1. update existing schema using ALTER TABLE
2. insert new record

ALTER TABLE books ADD language text;

INSERT INTO books (isbn, title, authors, publicationYear, publisher, genres, language)
VALUES ('100-100-103', '100 years of Solitude', ['Gabriel Garcia Marques'], 1967, 'Green
Books', ['Magical Realism'], 'spanish');
Query Operation

SELECT * FROM books WHERE isbn = '100-100-103';

4. Graph stores
1. Graph stores, or graph databases, are a type of NoSQL database designed to treat
relationships between data as equally important to the data itself.
2. They are optimized for managing interconnected data and for queries that traverse complex
relationships with many hops in the data.
3. Graph databases excel at handling data whose relationships are as important or more
important than the data itself.

Key Features of Graph Stores

 Structure: Graph databases use graph structures for semantic queries, with nodes, edges,
and properties to represent and store data. Each node represents an entity (such as a person,
business, or any other item), and each edge represents a connection or relationship between
two nodes. Each node and edge can have properties associated with it.
 Schema-less: Similar to other NoSQL databases, graph databases are generally schema-less.
This means that nodes and edges can have arbitrary and varying sets of attributes.
 Relationship-First: In graph databases, focuses on/ relationships . This means they are
stored at the individual record level and can be navigated efficiently. This is in contrast to
relational databases where relationships are derived by joining tables which can be
computationally expensive.
Example
Neo4j, ArangoDB, Amazon Neptune, OrientDB

Designing the Graph Schema

Here's how you might design the graph model:
 Nodes: Each entity type (Book, Author, Publisher, Reviewer) becomes a node.
 Relationships: Define relationships such as AUTHORED_BY between Books and Authors,
PUBLISHED_BY between Books and Publishers, and REVIEWED_BY between Books and
Reviewers.
 Properties: Each node can have properties. For books, properties might include ISBN, title,
publication year, and genres. Authors might have names as properties, publishers could have
names and locations, and reviewers might have usernames.

Example Graph Representation

Let's take the books and their related data you mentioned and translate them into a Cypher query for
Neo4j:
// Create Authors
CREATE (a1:Author {name: "Paulo Coelho"})
CREATE (a2:Author {name: "Mario Puzo"})
CREATE (a3:Author {name: "Gabrierl Garcia Marques"})

// Create Publishers
CREATE (p1:Publisher {name: "Penguin Books"})
CREATE (p2:Publisher {name: "Green Bools"})

// Create Books
CREATE (b1:Book {isbn: "100-100-101", title: "The Alchemist"})
CREATE (b2:Book {isbn: "100-100-102", title: "The God Father", publisher: “Penguin Books”,
genres: ['Novel', 'Psychological Fiction']})
CREATE (b3:Book {isbn: "100-100-103", title: "100 years of Solitude", publicationYear: 1967,
publisher: “Green Books”, genres: ['Novel', 'Fiction', 'Coming-of-Age']})

// Establish Relationships
CREATE (b1)-[:AUTHORED_BY]->(a1)

CREATE (b2)-[:AUTHORED_BY]->(a2)
CREATE (b2)-[:PUBLISHED_BY]->(p2)

CREATE (b3)-[:AUTHORED_BY]->(a3)
CREATE (b3)-[:PUBLISHED_BY]->(p3)

Query Operations

To find a book by its ISBN, you can use the following query:
MATCH (b:Book {isbn: "100-100-103"})
RETURN b;

find all books authored by "Gabriel Garcia Marques", use a query like:
MATCH (a:Author {name: "Gabriel Garcia Marques"})-[:AUTHORED_BY]-(b:Book)
RETURN b;

Data Privacy Notice and Consent Form
No ratings yet
Data Privacy Notice and Consent Form
1 page
Veritas Netbackup™ Emergency Engineering Binary Guide: Release 8.2 and 8.2.X
No ratings yet
Veritas Netbackup™ Emergency Engineering Binary Guide: Release 8.2 and 8.2.X
41 pages
All Mcqs
No ratings yet
All Mcqs
35 pages
Unit 2
No ratings yet
Unit 2
26 pages
No SQL
No ratings yet
No SQL
38 pages
Unit 3 NoSQL
No ratings yet
Unit 3 NoSQL
98 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
Unit 2
No ratings yet
Unit 2
65 pages
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
No ratings yet
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
31 pages
Types of NoSQL Databases
No ratings yet
Types of NoSQL Databases
3 pages
What Is NoSQL
No ratings yet
What Is NoSQL
4 pages
Module 5_NoSQL databases
No ratings yet
Module 5_NoSQL databases
33 pages
Features of Nosql: Non-Relational
No ratings yet
Features of Nosql: Non-Relational
7 pages
01 NSQL
No ratings yet
01 NSQL
5 pages
CH.5 NOSQL database for Business Applications
No ratings yet
CH.5 NOSQL database for Business Applications
21 pages
NoSQL Database
No ratings yet
NoSQL Database
10 pages
NOsql Presentation
No ratings yet
NOsql Presentation
20 pages
Chapter14_BigData&NoSQLDatabases
No ratings yet
Chapter14_BigData&NoSQLDatabases
39 pages
NoSQL Big Data Management
No ratings yet
NoSQL Big Data Management
36 pages
NoSQL Tutorial - New
No ratings yet
NoSQL Tutorial - New
10 pages
Unit 5_230601_174540-1
No ratings yet
Unit 5_230601_174540-1
14 pages
Lecture 3.1.2
No ratings yet
Lecture 3.1.2
47 pages
No SQL Lecture Notes
No ratings yet
No SQL Lecture Notes
17 pages
Aimma Butt - 038 Assignment 2
No ratings yet
Aimma Butt - 038 Assignment 2
7 pages
DSA 4-Introduction To NoSQL
No ratings yet
DSA 4-Introduction To NoSQL
59 pages
Lec 15 Notes
No ratings yet
Lec 15 Notes
3 pages
No SQL
No ratings yet
No SQL
10 pages
MongoDB Slides Until ClassTest
No ratings yet
MongoDB Slides Until ClassTest
221 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
Introduction To Nosql: - Key Value Databases
No ratings yet
Introduction To Nosql: - Key Value Databases
14 pages
Dynamo DB
No ratings yet
Dynamo DB
20 pages
Bda Notes (Unit-2)
No ratings yet
Bda Notes (Unit-2)
26 pages
Full Stack-Unit-Iii
No ratings yet
Full Stack-Unit-Iii
56 pages
Unit 3
No ratings yet
Unit 3
10 pages
NoSQL_Notes
No ratings yet
NoSQL_Notes
11 pages
NOSQL
No ratings yet
NOSQL
25 pages
Unit Ii - Nosql Databases
No ratings yet
Unit Ii - Nosql Databases
112 pages
10gen Top 5 NoSQL Considerations
No ratings yet
10gen Top 5 NoSQL Considerations
10 pages
NoSQL (1)
No ratings yet
NoSQL (1)
12 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
43 pages
Dbms Presentation
No ratings yet
Dbms Presentation
22 pages
Bda Unit-5 PDF
No ratings yet
Bda Unit-5 PDF
83 pages
DOC-20250306-WA0001.
No ratings yet
DOC-20250306-WA0001.
34 pages
Nosql Databases
No ratings yet
Nosql Databases
2 pages
Unit 2 Handouts
No ratings yet
Unit 2 Handouts
11 pages
chap 4
No ratings yet
chap 4
18 pages
Comparison Between NoSQL and RDBMS
No ratings yet
Comparison Between NoSQL and RDBMS
6 pages
NOSQL Lecture 1 Notes
No ratings yet
NOSQL Lecture 1 Notes
31 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
What Is Nosql: Features of Nosql Databases
No ratings yet
What Is Nosql: Features of Nosql Databases
11 pages
HBase
No ratings yet
HBase
36 pages
NoSQL Databases Notes
No ratings yet
NoSQL Databases Notes
5 pages
What Is NoSQL
No ratings yet
What Is NoSQL
10 pages
Session 8 - NoSQL
No ratings yet
Session 8 - NoSQL
17 pages
Nosql Database
No ratings yet
Nosql Database
19 pages
1842-week6-NoSQL
No ratings yet
1842-week6-NoSQL
51 pages
Unit5_Notes_Short_DB
No ratings yet
Unit5_Notes_Short_DB
6 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
41 NoSQL Introduction.pptx
No ratings yet
41 NoSQL Introduction.pptx
18 pages
Chapter 6b - No SQL
No ratings yet
Chapter 6b - No SQL
27 pages
DBMS MASTER: Become Pro in Database Management System
From Everand
DBMS MASTER: Become Pro in Database Management System
Ummed Singh
No ratings yet
SQL Query Basics
From Everand
SQL Query Basics
Isabella Ramirez
No ratings yet
dbms qp series 1
No ratings yet
dbms qp series 1
2 pages
Ocean Currents - Factors and Impact On Climate
No ratings yet
Ocean Currents - Factors and Impact On Climate
7 pages
The Directive Principles Which Are Fundamental in The Governance of The Country Cannot Be Isolated From The Fundamental Rights Guaranteed
No ratings yet
The Directive Principles Which Are Fundamental in The Governance of The Country Cannot Be Isolated From The Fundamental Rights Guaranteed
10 pages
Context: 1. A Violation of Right Found, But No Remedy Given
No ratings yet
Context: 1. A Violation of Right Found, But No Remedy Given
8 pages
Lec 2 Data Modeling and Database Design
No ratings yet
Lec 2 Data Modeling and Database Design
10 pages
TOR - Program Management Consultant
100% (1)
TOR - Program Management Consultant
23 pages
Technical Aptitude
No ratings yet
Technical Aptitude
250 pages
An Analytical Paragraph
No ratings yet
An Analytical Paragraph
2 pages
PDF LP Mindmap Dyspnea - Compress
No ratings yet
PDF LP Mindmap Dyspnea - Compress
3 pages
Error DB
No ratings yet
Error DB
4 pages
SQL 20
100% (1)
SQL 20
4 pages
Linux FileSystem PDF
No ratings yet
Linux FileSystem PDF
8 pages
Checking The Data Using Extractor Checker in ECC Delta and Repea Delta
No ratings yet
Checking The Data Using Extractor Checker in ECC Delta and Repea Delta
21 pages
Mod5 Chapter3
No ratings yet
Mod5 Chapter3
25 pages
TalendOpenStudio BigData UG 5.5.2 en
No ratings yet
TalendOpenStudio BigData UG 5.5.2 en
248 pages
Data Science Create Teams That Ask the Right Questions and Deliver Real Value 1st Edition Doug Rose - The complete ebook version is now available for download
100% (3)
Data Science Create Teams That Ask the Right Questions and Deliver Real Value 1st Edition Doug Rose - The complete ebook version is now available for download
67 pages
Group 13 Reporter 2
No ratings yet
Group 13 Reporter 2
23 pages
CA Individual Assignments 2019 04
No ratings yet
CA Individual Assignments 2019 04
1 page
Juliet Remarks collections
No ratings yet
Juliet Remarks collections
58 pages
Mcse 70-290
No ratings yet
Mcse 70-290
46 pages
Hamdy's Oracle Ideas
No ratings yet
Hamdy's Oracle Ideas
73 pages
Machine Learning: Aigerim Bogyrbayeva
No ratings yet
Machine Learning: Aigerim Bogyrbayeva
85 pages
Vicat Test PDF
No ratings yet
Vicat Test PDF
6 pages
Distributed Database: GDC Thana Semester 6
No ratings yet
Distributed Database: GDC Thana Semester 6
10 pages
Oracle9i (통합)
No ratings yet
Oracle9i (통합)
349 pages
[FREE PDF sample] Foundations for Analytics with Python 1st Edition Clinton W. Brownley ebooks
100% (2)
[FREE PDF sample] Foundations for Analytics with Python 1st Edition Clinton W. Brownley ebooks
65 pages
Module 2-Data Science
No ratings yet
Module 2-Data Science
3 pages
MYSQL NOTES 2024 XII
No ratings yet
MYSQL NOTES 2024 XII
19 pages
The of Scientific: Nature Enquiry
No ratings yet
The of Scientific: Nature Enquiry
42 pages
Answer Key To Lab Exercises
No ratings yet
Answer Key To Lab Exercises
58 pages
23 Things You Should Know About VLOOKUP - Exceljet
100% (1)
23 Things You Should Know About VLOOKUP - Exceljet
16 pages

M5_dbm_sql_notes

Uploaded by

M5_dbm_sql_notes

Uploaded by

NoSQL

Key Characteristics of NoSQL Databases:

Types of NoSQL Databases:

Key Features of NoSQL Document Stores:

A data base for Books in MongoDB is represented as.

Both query return same data

Characteristics of Key-Value Stores:

This will return..

Key Features of Wide-Column Stores:

CREATE TABLE books (

// inseting three attributes for Book - Alcheist

// inseting three attributes for Book - Alcheist

Schema Flexibility of Cassandra

So the major difference are

Modify the Table Schema

The process involved are

ALTER TABLE books ADD language text;

SELECT * FROM books WHERE isbn = '100-100-103';

Key Features of Graph Stores

Designing the Graph Schema

Example Graph Representation

You might also like