Lecture 5- DataBase

Uploaded by

aletimanaswini

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Lecture 5- DataBase

Uploaded by

aletimanaswini

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 18

ISC 211 Introduction

to Bioinformatics
Lecture 5 – Bioinformatics DataBase
Dr. Athira B
Asst. Professor, CSE
IIIT Kottayam
Motivation
• Key concept in Molecular Biology is the information flow
DNA →RNA→ Protein
• From a data point of view: we have multiple omic data:
Genomics → Trancriptomics → Proteomic → Metabolomisc
• This vast amount of data needs to be stored and organized for easy
access around the globe
Motivation-Human Genome Project
• A landmark global scientific effort whose signature goal was to
generate the first sequence of the human genome (almost all genes in
human)
• Identified 1,00,000 genes in DNA
• more than 3 Billion base pairs were extracted
• The goals were:
• Alert patients that are at risk of certain diseases
• Reliably predict course of disease
• Precise diagnose and treatment
• Developing new treatments at molecular level
• Milestone in Biomedical Research
• https://www.genome.gov/about-genomics/educational-resources/
fact-sheets/human-genome-project.
Motivation-Biological Big Data
• Advancement in sequencing techniques generated good amount of
Biological data
• Similar to human, genetic data of other model organisms are also
generated:
• Yeast (Saccharomyces cerevisiae)
• Fruit fly (Drosophila melanogaster)
• Nematode worm (Caenorhabditis elegans)
• Western clawed frog (Xenopus tropicalis)
• Mouse (Mus musculus)
• Zebrafish (Danio rerio)
• How to store these data so that researchers can easily retrieve data
efficiently
Databases
• Database stores and organizes related data for easy retrieval
Eg: Your Phone contact book
• Most common form of Database is relational database (SQL)
• There are many other databases- column databases, graph databases,
etc
• Biological databases stores biological data and associated knowledge
• These knowledge bases are fundamentals to the survival of science
Biological Databases
• Store and handle the staggering volume of Biological information
through the establishment and use of computer databases
• Current biological databases use all three types of database
structures: flat files, relational, and object oriented
• Based on their contents, biological databases can be roughly divided
into three categories: primary databases, secondary databases, and
specialized databases.
Primary Databases
• Contain original biological data. They are archives of raw sequence or
structural data submitted by the scientific community
• GenBank, the European Molecular Biology Laboratory (EMBL)
database, Protein Data Bank (PDB) and the DNA Data Bank of
Japan (DDBJ)
Secondary Databases
• Secondary databases contain computationally processed or manually
curated information, based on original information from primary
databases.
• Translated protein sequence databases containing functional
annotation belong to this category
SWISS-PROT
Specialized Databases
• Specialized databases normally serve a specific research community
or focus on a particular organism
• The content of these databases may be sequences or other types of
information
• Examples include Flybase, WormBase, AceDB, Microarray gene
expression database, and TAIR
Composite Databases
• Variety of primary databases combined
• One place for different primary databases
Information Retrieval from Biological
Databases
• The most popular retrieval systems for biological databases are
Entrez and Sequence Retrieval Systems (SRS)
• Join a series of keywords using logical terms such as AND, OR, and
NOT to indicate relationships between the keywords used in a search
• Entrez3, a biological database retrieval system by NCBI
• For a complex search, a user can use the Boolean operators
• Online Mendelian Inheritance in Man (OMIM) accessible from Entrez,
which is a non-sequence-based database of human disease genes and
human genetic disorders
GenBank
• GenBank is the most complete collection of annotated nucleic acid
sequence data for almost every organism.
• The content includes genomic DNA, mRNA, cDNA, ESTs, high
throughput raw sequence data, and sequence polymorphisms
• There is also a GenPept database for protein sequences
GenBank: Sequence Format
Header
• origin of the sequence, identification of organism, unique identifiers
• Locus: unique database identifier
• Sequence length and molecule type(DNA or RNA)
• Three-letter code eg: PLN for plant, BCT for bacteria…
• Definition : name of the sequence, name and source of organism,
whether sequence is partial or complete
• Accession number : number cited in publications
• Version number : to identify the current version, if the sequence is
revised at a later stage
• Organism: source of organism with the scientific name of the species
• Reference : author and title information, contact information
Gene information
• Features : annotation information
• Source: length of sequence, scientific name of organism
• Gene : nucleotide coding sequence and its name
• CDS : information about boundaries of the sequence that can be
translated into amino acids. For eukaryotic, locaton of exons also
mentioned
DNA SEQUENCE
• ORIGIN: sequence itself; ends with two forward slashes (“//”)

• In retrieving the DNA sequence, search can be limited to “organism”,

“accession number”, “author”, “publication date”.
Fasta: Sequence Format
Reading Assignment
• Read more on Biological Databases:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411498/
[ZMYZ15]
• Practice: explore various databases
• Assessment
• Bring your laptops
• Explore Entrez: https://www.ncbi.nlm.nih.gov/search/
• Explore NCBI databases
• Read Chapter 2, Essential Bioinformatics by Jin Xiong[Xio06]

Biological Databases Lec 2,3
No ratings yet
Biological Databases Lec 2,3
49 pages
Ferrari 328 Microplex ECU Testing
100% (3)
Ferrari 328 Microplex ECU Testing
18 pages
Sec1 Introduction to Bioinformatics
No ratings yet
Sec1 Introduction to Bioinformatics
20 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Module 2 (Bioinformatics)
No ratings yet
Module 2 (Bioinformatics)
81 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Capture D'écran . 2023-03-14 À 00.15.22
No ratings yet
Capture D'écran . 2023-03-14 À 00.15.22
54 pages
BCH 516-1
No ratings yet
BCH 516-1
32 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
UNIT II
No ratings yet
UNIT II
23 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
2024.HF_BioInformatics_Lec3p
No ratings yet
2024.HF_BioInformatics_Lec3p
11 pages
Generating Structural Data Analysis
No ratings yet
Generating Structural Data Analysis
8 pages
Day 1
No ratings yet
Day 1
38 pages
Database
No ratings yet
Database
40 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
Databases in Bioinformatics - An Introduction
No ratings yet
Databases in Bioinformatics - An Introduction
11 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Online Biological Databases: A/Prof. Ly Le
No ratings yet
Online Biological Databases: A/Prof. Ly Le
64 pages
9. Biological Databases
No ratings yet
9. Biological Databases
17 pages
Essential Info Notes-1
No ratings yet
Essential Info Notes-1
57 pages
Database
No ratings yet
Database
16 pages
CH12
No ratings yet
CH12
8 pages
Bioinfo U2 KD 2
No ratings yet
Bioinfo U2 KD 2
3 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
#1 L1 BioDatabases
No ratings yet
#1 L1 BioDatabases
89 pages
Bio in For Matics
No ratings yet
Bio in For Matics
26 pages
"MBG1002 Biological Databases Week II
No ratings yet
"MBG1002 Biological Databases Week II
37 pages
Biological Database 1
No ratings yet
Biological Database 1
50 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
4Bioinformaticsdatabases
No ratings yet
4Bioinformaticsdatabases
71 pages
Bioinformatics Lab Notebook: Comsats University, Islamabad
No ratings yet
Bioinformatics Lab Notebook: Comsats University, Islamabad
27 pages
Biol BDs Singapore
No ratings yet
Biol BDs Singapore
24 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Bif501 Handouts PDF Bif
No ratings yet
Bif501 Handouts PDF Bif
197 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Biological Databases: DR Z Chikwambi Biotechnology
No ratings yet
Biological Databases: DR Z Chikwambi Biotechnology
47 pages
Lab 1
No ratings yet
Lab 1
39 pages
المحاضرة 2
No ratings yet
المحاضرة 2
16 pages
1. Databases
No ratings yet
1. Databases
34 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
A Review Article On Bioinformatics Tools and Software
No ratings yet
A Review Article On Bioinformatics Tools and Software
14 pages
Biological Databases (1)
No ratings yet
Biological Databases (1)
41 pages
Data Base in Bioinformatics
No ratings yet
Data Base in Bioinformatics
30 pages
CMSC 838T - Lecture 9: Bioinformatics Databases
No ratings yet
CMSC 838T - Lecture 9: Bioinformatics Databases
65 pages
Nucleic_Acid_Databases
No ratings yet
Nucleic_Acid_Databases
37 pages
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
No ratings yet
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
105 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Bioinformatics Unveiled
From Everand
Bioinformatics Unveiled
Joan Melody
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
12 Basco Cpet4101
100% (1)
12 Basco Cpet4101
4 pages
Indifference Curve Analysis
No ratings yet
Indifference Curve Analysis
36 pages
Mark Louies M. Villarosa
No ratings yet
Mark Louies M. Villarosa
1 page
Lindsay Anton - The Final Paper - 2956488
No ratings yet
Lindsay Anton - The Final Paper - 2956488
21 pages
PSL Help
100% (1)
PSL Help
58 pages
Service Ai-2301l 3010l
No ratings yet
Service Ai-2301l 3010l
478 pages
Poultry Industry in Moldova
No ratings yet
Poultry Industry in Moldova
5 pages
Characterization and Reuse of Kiln Rollers Waste in The Manufacture of Ceramic Floor Tiles
No ratings yet
Characterization and Reuse of Kiln Rollers Waste in The Manufacture of Ceramic Floor Tiles
7 pages
Green Architecture_ Designing a Sustainable Future
No ratings yet
Green Architecture_ Designing a Sustainable Future
2 pages
Instant ebooks textbook Financial Sector Development in Ghana: Exploring Bank Stability, Financing Models, and Development Challenges for Sustainable Financial Markets James Atta Peprah download all chapters
100% (4)
Instant ebooks textbook Financial Sector Development in Ghana: Exploring Bank Stability, Financing Models, and Development Challenges for Sustainable Financial Markets James Atta Peprah download all chapters
76 pages
KPMG UC How To Analyze A Case
No ratings yet
KPMG UC How To Analyze A Case
2 pages
Recruitment Services in Romania
No ratings yet
Recruitment Services in Romania
3 pages
Agri Market Brief 19 Organic Imports - en
No ratings yet
Agri Market Brief 19 Organic Imports - en
19 pages
FRM Notes
100% (3)
FRM Notes
76 pages
L6 Cuk Converter
No ratings yet
L6 Cuk Converter
20 pages
Total Portfolio Activation
No ratings yet
Total Portfolio Activation
26 pages
zastosowanie_metodologii_ue_do_zdefiniowania_obszarow_rynku_pracy_w_polsce
No ratings yet
zastosowanie_metodologii_ue_do_zdefiniowania_obszarow_rynku_pracy_w_polsce
196 pages
Minimum Wages Act Labour Law Project
50% (2)
Minimum Wages Act Labour Law Project
15 pages
Google Maps and Graph Theory
No ratings yet
Google Maps and Graph Theory
16 pages
Electronic Reservation Slip (ERS) : 8655867945 12961/AVANTIKA EXP Sleeper Class (SL)
No ratings yet
Electronic Reservation Slip (ERS) : 8655867945 12961/AVANTIKA EXP Sleeper Class (SL)
2 pages
Traditional Network Architecture and SDN
No ratings yet
Traditional Network Architecture and SDN
9 pages
Adult Male Shirt Decals - Google Search
No ratings yet
Adult Male Shirt Decals - Google Search
1 page
Lists_Sets
No ratings yet
Lists_Sets
2 pages
Orange and Violet Illustration Class Syllabus Education Presentation
No ratings yet
Orange and Violet Illustration Class Syllabus Education Presentation
5 pages
Time, The Protector From Crime. Police Men Get A Corporate Identity From The Uniform They
No ratings yet
Time, The Protector From Crime. Police Men Get A Corporate Identity From The Uniform They
1 page
Makalah Inggris
No ratings yet
Makalah Inggris
19 pages
Magnetic Piston Operated Engine: Sumit Dhangar, Ajinkya Korane, Durgesh Barve
No ratings yet
Magnetic Piston Operated Engine: Sumit Dhangar, Ajinkya Korane, Durgesh Barve
7 pages
14 1 22 Engineering Datasheet Rse75n A13 Revc
No ratings yet
14 1 22 Engineering Datasheet Rse75n A13 Revc
1 page
Stress Strain Diagram
No ratings yet
Stress Strain Diagram
8 pages