Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
10 views

Module 1 - What Is Data Science

Uploaded by

k61.2212550036
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views

Module 1 - What Is Data Science

Uploaded by

k61.2212550036
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 17

Module 1 - What is Data Science?

> Module Introduction > Module Introduction

Module Introduction
Bookmark this page
Welcome to the first module of the Introduction to data science course. The
purpose of this module is to define data science and what data scientist do.
We will explore the data science topics and data science in business through
examples.

Learning Objectives:

 Explain about data science, data scientists, and how each is defined.
 Understanding some statistics about the field of data science, the
demand for data scientists, and some of the qualities of excelling
data scientists.
 Explain topics of Data Science, algorithms used in Data Science.
 Understanding about hard skills are required for anyone interested
in.
 Learn about what companies need to do in order to start with data
science.
 Explain some of the qualities that differentiate data scientists from
other professionals.
 Explain about analytics and what important role data scientists play
in this process.
 Understand the popular tools used by data scientists and practice
with Jupyter Notebook.
Learning Objectives
Bookmark this page
In this lesson, you will learn what data science is, what data scientists do,
and what tools and algorithms data scientists use on a daily basis. You will
be required to complete a reading assignment to learn why data science
is considered the sexiest job in the 21st century. Also, you understand
data science, data scientists, and how each is defined.

o Lesson Objective

o What is Data Science?

o What is Scientists?

o The Many Paths to Data Science

o Advice for New Data Scientists

o A day in the Life of a Data Scientist

o Data Science Topics and Algorithms

o What is the cloud?

o Reading: Course Syllabus

o Reading: Data Science: The Sexiest Job in the 21st Century

o Reading: What Makes Someone a Data Scientist?

Lesson Objective

1. Explain what data science is.

2. Understanding some statistics about the field of data science, the


demand for data scientists, and some of the qualities of excelling data
scientists.

3. Comprehend about data science, data scientists, and how each is


defined

Learning Outcomes:

o DSP301x_O1: The student will be able to describe about data science,


data scientists, a day in life of a data scientist
Defining Data Science and What Data
Scientists Do
Bookmark this page
Please study online the “Defining Data Science and What Data
Scientists Do” lesson at www.coursera.org as the following order:

o Video: What is Data Science?

o Video1, Video 2, Video 3: What is Data Scientists?

o Video: The Many Paths to Data Science

o Video: Advice for New Data Scientists

o Video: A day in the Life of a Data Scientist

o Video: Data Science Topics and Algorithms

o Video: What is the cloud?

o Reading: Data Science: The Sexiest Job in the 21st Century (Translated
version)

o Reading: What Makes Someone a Data Scientist? (Translated version)

After finishing learning on Coursera.com, please:

 Review lesson key content with the Lesson Summary

 Finish Quiz with 100% passed

 Then move on to the next lesson on LMS: Lesson 2: Data Science Topics
Lesson Summaries
Bookmark this page
At the end of this lesson, you learned:

- Data science is the study of data as a process of using data to understand


different things, to understand the world. Data science is what data scientists do.

- Data scientist as someone who finds solutions to problems by analyzing big or


small data using appropriate tools for the relevant stakeholders.

- The advice for data scientists is to be curious, extremely argumentative and


judgmental.

- The main tools in data science: Jupyter notebooks, Python, some regular
expressions, relational databases, Python Pandas, R for Python, mathematical
and statistical calculations in Python.

- Data Science topics and algorithms are regression, neural networks, Nearest
neighbor, Data visualization with R, …

Next lesson: Lesson 2: Data Science Topics


Quiz 1
Bookmark this page

Quiz 1
5/5 points (graded)

Question 1. Harvard Business Review called data science the sexiest job in the
21st century.

TRUE correct

FALSE
Question 2. According to the report by the McKinsey Global Institute, by 2018, it
is projected that there will be a shortage of 140,000 – 190,000 people with deep
analytical skills in the world.

TRUE

FALSE correct
Question 3. How is Walmart reported to have addressed its analytical needs?

Code sharing

Crowdsourcing correct

Outsourcing

None of the options is correct

Social media
Question 4. What is the average base salary of a data scientist reported by the
New York Times?

$100,000

$150,000

$112,000 correct

$16 per hour

$85,000 + Bonus
Question 5. According to professor Haider, the three important qualities to
possess in order to succeed as a data scientist are curious, judgemental, and
proficient in programming.

TRUE
FALSE correct
Submit
Some problems have options such as save, reset, hints, or show answer. These
options follow the Submit button.
Course Module 1 - What is Data Science? Lesson 2 - Data Science Topics Learning
Objectives
Learning Objectives
Bookmark this page
You will learn about topics of Data Science, algorithms used in Data
Science and understand hard skills are required for anyone interested in
pursuing a career in this field. Beside, you also will learn about data
mining, and the steps the comprise the process of mining a given dataset,
regression and what questions can be put to regression analysis.

Lesson Objectives

1. Explain topics of Data Science, algorithms used in Data Science

2. Understanding about hard skills are required for anyone interested in

3. Explain about data mining, the steps the comprise the process of
mining a given dataset, regression, regression analysis..

Learning Outcomes:

o DSP301x_O2: List data science topics and algorithm.

o DSP301x_O3: Comprehends about Hard skills are required in data science.

Data Science Topics


Bookmark this page
Please study online the “Data Science Topics” lesson
at www.coursera.org as the following order:

o Video: Data Science Skills & Big Data

o Video1, Video 2: Basic Data Science Skills

o Video: What is Hadoop?

o Video: Neural Networks and Deep Learning

o Video: How Can Someone Become a Data Scientist?

o Video: Data Science Careers

o Video: Applications of Machine Learning

o Reading: Data Mining (Translated version)


o Reading: Regression (Translated version)

After finishing learning on Coursera.com, please:

 Review lesson key content with the Lesson Summary

 Finish Quiz with 100% passed

 Then move on to the next lesson on LMS: Lesson 3: Data Science in Business

Lesson Summaries
Bookmark this page

At the end of this lesson, you have learned:

 Skills are required for anyone interested in data science.

 The popular data science tools and algorithms: Python notebooks, Unix and Linux,
Python, some regular expressions, relational databases, Python Pandas,
mathematical and statistical calculations in Python, big data, Jupyter notebooks.

 Review knowledge about Big Data and Data Mining, Deep Learning and Machine
Learning.

 Learn about the process of mining a given dataset and about regression analysis.

Next lesson: Lesson 3 - Data Science in Business


Quiz 2
Bookmark this page

Quiz 2
10/10 points (graded)

Question 1. According to the reading, the output of a data mining exercise


largely depends on the skills of the data scientist carrying out the exercise.

TRUE

FALSE correct
Question 2. What should you do when data are missing in a systematic way?

Determine the impact of missing data on the results and whether missing
data can be excluded from the analysis. correct

Determine the average of the values around the missing data.

Determine who was managing the database.

Extrapolate the data.


Question 3. Prior Variable Analysis and Principal Component Analysis are both
examples of a data reduction algorithm.

FALSE correct

TRUE
Question 4. After the data are appropriately processed, transformed, and stored,
machine learning and non-parametric methods are a good starting point for data
mining.

FALSE correct

TRUE
Question 5. In-sample forecast is the process of formally evaluating the
predictive capabilities of the models developed using observed data to see how
effective the algorithms are in reproducing data.

TRUE correct

FALSE
Question 6. The real added value of the author's research on residential real
estate properties is quantifying people's preferences of different transport
services.

FALSE correct
TRUE
Question 7. Regression is a statistical technique developed by Blaise Pascal.

TRUE

FALSE correct
Question 8. What did the author's research discover about the impact of an
additional washroom on the price of a housing unit?

The author found that an additional bedroom adds the same to the housing
prices than an additional washroom. In other words, any additional room results
in an equal increase to the housing prices.

The author found that an additional washroom did not have any impact on
the pricing of a housing unit.

The author found that an additional washroom adds more to the housing
prices than an additional bedroom. correct

The author found that an additional bedroom adds more to the housing prices
than an additional washroom.
Question 9. The author discovered that houses located more than 2.5 kms to
shopping centres sold for less than the rest.

TRUE

FALSE correct
Question 10. "How much does a finished basement contribute to the price of a
housing unit?" is a question that can be put to regression analysis.

TRUE correct

FALSE
Course Module 1 - What is Data Science? Lesson 3 - Data Science in
Business Learning Objectives
Learning Objectives
Bookmark this page
In this lesson, you will learn about what companies need to do in order to
start with data science. You will also learn about some of the qualities that
differentiate data scientists from other professionals. In addition, you will
learn about analytics and what important role data scientists play in this
process, and about story-telling and the importance of an effective final
deliverable. Finally, you will be required to apply what you learned about
data science by answering open-ended questions.

Lesson Objectives

1. Learn about what companies need to do in order to start with data


science.

2. Explain some of the qualities that differentiate data scientists from


other professionals.

3. Explain about analytics and what important role data scientists play in
this process.

Learning Outcomes:

o DSP301x_O4: The student will list some applications of machine learning.

o DSP301x_O5: The student will list some applications of data science.

Data Science in Business


Bookmark this page
Please study online the “Data Science in Business” lesson
at www.coursera.org as the following order:

o Video: How Should Companies Get Started in Data Science?

o Video: Recruiting for Data Science

o Video: Applications of Data Science

o Reading: The Final Deliverable (Translated version)

o Reading: The Report Structure (Translated version)


After finishing learning on Coursera.com, please:

 Review lesson key content with the Lesson Summary

 Finish Quiz with 100% passed

 Then move on to the next lesson on LMS: Lesson 4: Tool for Data Science -
Jupyter Notebooks

Lesson Summaries
Bookmark this page

At the end of this lesson, you have learned:

 What companies need to do in order to start with data science: recording


information, capturing data, applying algorithms and analytics to data.

 The qualities that differentiate data scientists from other professionals:


curiosity, sense of humor, technical skills.

 Applications of Data Science such as in the medical field (drug delivery, cancer
treatment), Pokémons Go, Google Search.

 Structure of the report for applying what you learned about data science as a data
scientist.

Finally, you will be required to apply what you learned about data science by
answering open-ended questions.

Next lesson: Lesson 4 - Tool for Data Science - Jupyter Notebooks

Quiz 3
Bookmark this page

Quiz 3
10/10 points (graded)

Question 1. The discussion section is where you:


Introduce the research methods and data sources used for the analysis.

Refer the reader to the research question and the knowledge gaps you
identified earlier.

Highlight how your findings provide the ultimate missing piece to the puzzle.

Rely on the power of narrative to enable numbers to communicate your


important findings to the readers. correct
Question 2. According to the reading, what is the ultimate purpose of analytics?

To efficiently store big data with minimum storage requirements.

To communicate findings to stakeholders to formulate policy or


strategy. correct

To evangelize data science.

To facilitate meetings between sales and marketing.


Question 3. The reading mentions a common role of a data scientist is to use
analytics insights to build a narrative to communicate findings to stakeholders.

TRUE correct

FALSE
Question 4. The United States Economic Forecast is a publication by:

McKinsey Publication Inc.

Cambridge University Press.

McGraw-Hill Education.

Deloitte University Press. correct


Question 5. The report discussed in the reading successfully did the job of using
data and analytics to generate the likely economic scenarios.

TRUE correct

FALSE
Question 6. According to the reading, in order to produce a compelling narrative,
initial planning and conceptualizing of the final deliverable is of extreme
importance.

TRUE correct

FALSE
Question 7. Regardless of the length of the final deliverable, the author
recommends that it includes a cover page, table of contents, executive
summary, detailed contents, acknowledgments, and references.

TRUE correct

FALSE
Question 8. An introductory section is always helpful in introducing the research
methods and presenting the statistical calculations.

TRUE

FALSE correct
Question 9. The results section is where you present:

The conclusion.

The methods used.

The empirical findings. correct

R Squared.
Question 10. Adding a list of references and an acknowledgment section are
examples of housekeeping, according to the author.

TRUE correct

FALSE

Course Module 1 - What is Data Science? Lesson 4 - Tool for Data Science - Jupyter
Notebooks Learning Objectives
Learning Objectives
Bookmark this page

In this lesson, you will overview of the various data science tools available to you.
You also understand why they are so popular among data scientists today. You learn
how to host on Skills Network Labs, create an account and start exploring some of
the features.

Lesson Objectives:

1. Understand the popular tools used by data scientists.

2. Practice using Jupyter Notebook.

Learning Outcomes:

 DSP301x_O6: Understand a tool for data science: Jupyter Notebooks.

 DSP301x_O7: Practices create and share a Jupyter Notebook.

Introducing Jupyter Notebooks


Bookmark this page
Please study online the “Introducing Jupyter Notebooks” lesson
at www.coursera.org as the following order:

o Video: What are Jupyter Notebooks?

o Video: Getting started with Jupyter Notebooks

o Reading: Interesting Jupyter Notebooks on the Internet (Translated version)

 Practice with exercises on Coursera: Lab 1, Lab 2, Lab 3 to practice and ask mentor
if you need support

 Review lesson key content with the Lesson Summary

 Finish Quiz with 100% passed

 Then move on to the next lesson on LMS: Lesson 5: From Problem to


Approach
Exercise
Bookmark this page
In this lab, you will learn about a popular data science tool, Jupyter Notebooks, its
features, and why they are so popular among data scientists today. Please access to
below links to complete exercise:

o Lab 1: Jupyter Notebooks - The Basics (Other LINK to download)

o Lab 2: Jupyter Notebooks - More Features (Other LINK to download)

o Lab 3: Jupyter Notebooks - Advanced Features (Other LINK to download)

Lesson Summaries
Bookmark this page
At the end of this lesson, you have learned:

- Jupyter Notebooks: a popular data science tool.

- Practice using Jupyter Notebook with the basics, more features, advanced features in
Jupyter Notebook.

Next lesson: Lesson 5 - From Problem to Approach

Quiz 4
Bookmark this page

Quiz 4
5/5 points (graded)

Question 1. What can you write in Jupyter Notebooks? Select all that apply:

Code to be executed in one of the kernels (e.g. Python, R or Scala)

Stylized text in a format called "Markdown".

HTML code -- HTML can be written and rendered via Markdown cells.
correct
Question 2. Which of the following options are TRUE? Select all that apply.

You can download and save Jupyter Notebooks as .ipynb files.

You can change kernels within a Jupyter Notebook (e.g., to R or Python or


Scala).

You can document your code with stylized text using a formatting style called
"Markleft"

You can connect to databases from within Jupyter Notebooks.


correct
Question 3. What were Jupyter notebooks called before the name was changed
to “Jupyter”?

IPython notebooks correct

IScala notebooks

IR notebooks

Saturn notebooks

None of the above


Question 4. True or False? If you wrote "# Hi there" in Markdown, this would be
the equivalent of writing < h1> Hi there < /h1> in HTML. (If you're not sure, now
is a good time to search for a Markdown guide!)

TRUE correct

FALSE
Question 5. True or False? Although you can change the kernel of the Jupyter
Notebook between different programming languages (e.g., Python, R, Scala),
you cannot use multiple kernels within the same Jupyter notebook (e.g., running
Python, R and Scala within the same notebook).

TRUE correct

FALSE

You might also like