TQ Ebook3
TQ Ebook3
TQ Ebook3
1
Contents
1 Introduction
2 What is Data Science
3 Top 7 tools in Data
Analytics
4 Power BI
5 Python
6 Tableau
7 Excel
8 SQL
9 Jupiter Notebook
10 Apache Spark
TechQuest
STEM
Academy
4
Introduction
5
Data Analytics
Data analytics is the act of studying and analyzing massive datasets in order to
create predictions and improve data-driven decision-making. Data analytics
enables us to collect, clean, and manipulate data in order to generate relevant
insights. It aids in answering questions, testing hypotheses, and refuting theories.
6
Top 7 Data Analysis tools
.
There are plenty of tools used in the analyzing data, however due to the high
demand of the use of some of the tools, having the skillset and knowing how
to use them give you an advantage.
Below are the list of the seven top Data analysis tools
1 Power BI
2 Python
3 Tableau
4 Excel
5 SQL
6 Jupiter Notebook
7 Apache Spark
7
Power BI
SQL for data analysis refers to the database querying language's use of
relational databases and its capacity for simultaneous interaction with
various databases. The combination of a surprisingly low learning curve and
a deep complexity that enables users to build sophisticated tools and
dashboards for data analytics makes SQL one of the most widely used and
adaptable languages.
In order to quickly construct and interact with databases, SQL has been
converted into a variety of proprietary tools, each with a specific purpose
and target audience, such as the well-known MySQL, Microsoft Access, and
PostgreSQL.
SQL is widely used because it is a basic language capable of doing
surprisingly complicated data analysis, even though its major appeal is still
its speed in creating and interacting with databases. The logic of the
language itself and the way it interacts with data sets are highly comparable
to those of Excel and even the well-known Python library Pandas.
How can I Use SQL for Analysis
The most common application of SQL today (in all of its forms) may be as
the foundation for the creation of user-friendly dashboards and reporting
tools, or what is known as SQL for data analytics. SQL creates user-friendly
dashboards that may present data in a number of ways because it makes it so
simple to send complex commands to databases and change data in a matter
of seconds. In addition, SQL is a great tool for creating data warehouses due
to its simplicity of use, clarity of organization, and effectiveness of
interaction.
Because these languages can interact directly with databases, SQL can be
used as a bridge between simpler data storage systems and end users,
making them more accessible to specialists and data scientists.
Jupiter Notebook
Jupyter Notebook is an open-source online application that provides a
computing environment that is interactive. It generates documents (notebooks)
by combining inputs (code) and outputs into a single file. It provides a single
document that includes:
•Visualizations
•Mathematical equations
•Statistical modeling
•Narrative text
•Any other rich media
Users may create, display the results, and add data, charts, and formulae using
this one-document method, which improves the work’s comprehension,
reproducibility, and shareability.
Over 40 programming languages are supported by Jupyter notebooks, however,
Python is the primary focus. Anyone may utilize this tool for their data science
initiatives as it is free and open-source. Jupyter notebooks come in two different
styles:
Jupyter Classic Notebook, which has all the aforementioned features.
Advanced Analytics Spark does more than just support "Map" and
"Reduce." Additionally, it supports Graph algorithms, SQL queries,
streaming data, and machine learning (ML).