Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
100% found this document useful (1 vote)
26 views

Introduction Data Science

Uploaded by

Thendral
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
26 views

Introduction Data Science

Uploaded by

Thendral
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 23

DATA SCIENCE AND VISUALIZATION

MODULE 1
INTRODUCTION TO DATA SCIENCE
E xploring the
F ascinating World of
Data S cience
In a world awash with information, data science has emerged as a
powerful discipline that harnesses the insights hidden within vast troves of
data. From uncovering patterns in customer behavior to predicting global
trends, this field promises to transform how we understand and interact
with the world around us.
Unders tanding B ig
Data and Data
S cience
When it comes to big data and data science, there's a lot of However, it's
important for college students to know that not everything they hear is
true. This presentation aims to explain what big data and data science
really are. We will explore the challenges, opportunities, and real-world
uses of these fields, helping students make informed decisions as they
explore the world of data.
Big Data and Data Science Hype-
Why actually data science is hyped. So, what is eyebrow-raising about Big Data and data science?
Let’s count the ways:
• There’s a lack of definitions around the most basic terminology. What is “Big Data” anyway? What does “data
science” mean? What is the relationship between Big Data and data science? Is data science the science of Big
Data? Is data science only the stuff going on in companies like Google and Facebook and tech companies?
Why do many people refer to Big Data as crossing disciplines (astronomy, finance, tech, etc.) and to data
science as only taking place in tech? Just how big is big? Or is it just a relative term? These terms are so
ambiguous, they’re well-nigh meaningless.
• There’s a distinct lack of respect for the researchers in academia and industry labs who have been working on
this kind of stuff for years, and whose work is based on decades (in some cases, centuries) of work by
statisticians, computer scientists, mathematicians, engineers, and scientists of all types. From the way the
media describes it, machine learning algorithms were just invented last week and data was never “big” until
Google came along. This is simply not the case. Many of the methods and techniques we’re using—and the
challenges we’re facing now—are part of the evolution of everything that’s come before. This doesn’t mean
that there’s not new and exciting stuff going on, but we think it’s important to show some basic respect for
everything that came before.
Understanding Big Data and Data
Science
What is "Big Data"? What is "Data The Connection
Science"
The term "Big Data" is often "Data Science" is a broad term While big data and data science
used to describe large, that encompasses various are related, their relationship is not
complex datasets that are activities, including statistical well-defined. It is unclear whether
difficult to process using analysis and machine learning. It data science is solely focused on
traditional methods. However, is not always clear how it relates analyzing big data or includes
the exact "big" varies and can to other fields like statistics and other data-driven activities.
be subjective. computer science. Clarifying these definitions is
important for establishing a more
coherent and respected field.
Res pecting the Pas t

1 Honoring Contributions 2 Avoiding Confus ion


Researchers in statistics, computer Data science isn't just a new name for
science, mathematics, and other fields existing fields. While it shares some
have laid the foundation for big data and similarities with statistics and machine
data science. It's important to recognize learning, it has its own unique qualities.
and respect their contributions.

3 Combining Skills 4 Staying Grounded


Data science combines knowledge from It's important not to get too excited about
various fields, including math, computer data science and big data. Taking a
science, and other areas. This makes it practical approach helps us understand
stronger. what they can and cannot do.
What is Data S cience?
Getting Data 1
Data science starts by collecting and
organizing relevant data, whether it's
from traditional sources or new big 2 Exploring and Analyzing Data
data platforms. This involves
understanding data structures, Data scientists use statistical, mathematical,
formats, and working with large and and computational techniques to explore,
complex datasets. understand, and find insights in the data.
They apply methods like machine learning
and predictive modeling.
Interpreting and 3
Communic ating
The final step in data science is
interpreting the findings, effectively
communicating insights, and turning
them into actionable
recommendations or solutions. This
requires good communication skills
and bridging the gap between
technical analysis and business
needs.
Creating an Effective Data Science
Field

1 Collaboration 2 Ethics
Working together with experts from Following strong ethical guidelines is
different areas is important for solving real- crucial to address concerns about privacy,
world problems and making a positive bias, and responsible use of data and
impact with data-driven insights. algorithms. This builds trust and credibility.

3 Continuous Learning 4 Interdisciplinary Approach


As the field evolves quickly, ongoing By combining knowledge from different
learning and keeping up with the latest fields like statistics, computer science, and
tools and techniques are essential for domain expertise, data science can better
remaining relevant and effective as a data tackle complex, real-world problems.
scientist.
Why Now?
• Right now, we have more information than ever before because of
all the data we collect and the powerful computers we have.

• This data tells us a lot about how people behave and how society works.
• With new technology, we can use this data to learn new things and
come up with new ideas in many different industries.

• There's a huge amount of data out there, from what we do online


to how we move in the real world, and it gives us a clear picture of

• our lives.
At the same time, computers are getting cheaper and more
powerful, so we can process and analyze all this data on a large

• scale.
This perfect combination of lots of data and advanced technology
is creating a new way of making decisions based on data and
making data science a really important field.
Datafication: Turning Life into Data

1 Intentional 2 Passive 3 Transforming


Datafication Datafication Data into
• When we actively participate in social media, • However, the datafication of our
Value
Regardless of the level of intentionality, the
online shopping, or other digital platforms, we lives extends beyond our datafication of our lives has enabled the transformation
are intentionally sharing our data and allowing it conscious choices. Our offline of information into new forms of value. Businesses,
to be collected. behaviors, such as walking governments, and other organizations are leveraging
through a store or using a this data to drive decision-making, personalize
• This type of datafication is often seen as a fair fitness tracker, are also being products and services, and uncover insights that were
exchange, where we willingly trade personal captured and turned into data, previously inaccessible.
information for the convenience and benefits of often without our explicit
these services. knowledge or consent.
Understand Datafication by watching this video-
The Evolving Landscape of Data Science
R ebranding or Industry vs. Academia Emerging Specialties
R evolution?
The growth of data As data science
There is an ongoing science has been continues to evolve,
debate about whether driven primarily by new specialties and
data science is a industry, with sub-disciplines are
genuine new field or companies like emerging, such as
simply a rebranding of Google, Facebook, machine learning,
existing disciplines like and LinkedIn natural language
statistics and pioneering the field. In processing, and data
analytics. S ome argue contrast, the academic visualization. These
that data science is world has been slower specialized skills are
merely a collection of to embrace data in high demand,
well-established science, with few reflecting the diverse
techniques, while dedicated programs or and multifaceted
others see it as a professorships. This nature of the field.
transformative disconnect highlights
approach that the practical, real-
combines diverse world focus of data
skills and science compared to
The Data Scientist: A Rising Role

1 Unique Skill Set 2 Curiosity and Persistence


Data scientists have a special Successful data scientists are
mix of skills. They know curious. They want to find
statistics, computer science, hidden insights in data. They
and have knowledge in a never give up easily, even when
specific field. This helps them working with messy and
solve complex problems that unorganized data. They search
need both technical and for valuable information that can

3 analytical abilities.
Collaborative Mindset 4 bring meaningfulRole
An Emerging change.

Data science is a team effort. The term "data scientist"


Data scientists work with appeared in the late 2000s. It
stakeholders, experts in different describes the unique skills
fields, and other data needed to make sense of the
professionals. They should be growing amount of data. This
good at communicating their role has gained recognition and
findings and turning technical prestige. In fact, Harvard
insights into practical business Business Review called data
The Data Science Toolkit

Programming Statistics Machine Learning Data Visualization


Proficiency in A strong The ability to Effective data
programming foundation in apply machine visualization
languages like statistical learning skills are needed
Python, R, and methods and algorithms and to communicate
SQL is essential modeling techniques to complex findings
for data techniques is identify patterns, in a clear and
scientists to crucial for make compelling way.
extract, drawing predictions, and
manipulate, and meaningful automate
analyze data. insights from decision-
data. making.
In the class, Rachel handed out index cards and asked everyone to profile themselves (on a relative rather than
absolute scale) with respect to their skill levels in the following domains:
• Computer science
• Math
• Statistics
• Machine learning
• Domain expertise
• Communication and presentation skills
• Data visualization
Data Science in Finance: Credit
Ratings and Trading
Credit Ratings Trading Algorithms

Data-driven models analyze an Sophisticated algorithms analyze


individual's credit history, income, market data, news, and other
and other financial information to information in real-time to identify
determine their creditworthiness trading opportunities and execute
and assign a credit score. transactions automatically.
Conclusion of Today’s Class
• Introduction to Data Science
• What is Data Science?
• Big Data and Data Science hype
• And getting past the hype
• Why now?
• Datafication
• Current landscape of perspectives
• Skill sets.

You might also like