Module 1 - What Is Data Science
Module Introduction
Welcome to the first module of the Introduction to Data Science course. The
purpose of this module is to define data science and what data scientists do.
We will explore data science topics and data science in business through
examples.
Learning Objectives:
- Explain what data science is, what data scientists do, and how each is
defined.
- Understand some statistics about the field of data science, the demand for
data scientists, and some of the qualities of successful data scientists.
- Explain the topics of data science and the algorithms used in data science.
- Understand the hard skills required of anyone interested in pursuing a
career in this field.
- Learn what companies need to do in order to get started with data science.
- Explain some of the qualities that differentiate data scientists from other
professionals.
- Explain analytics and the important role data scientists play in this
process.
- Understand the popular tools used by data scientists and practice with
Jupyter Notebook.
Learning Objectives
In this lesson, you will learn what data science is, what data scientists do,
and what tools and algorithms data scientists use on a daily basis. You will
be required to complete a reading assignment to learn why data science
is considered the sexiest job in the 21st century. You will also see how
data science and data scientists are each defined.
o Lesson Objective
o What Is a Data Scientist?
Lesson Objective
Learning Outcomes:
o Reading: Data Science: The Sexiest Job in the 21st Century (Translated
version)
Then move on to the next lesson on LMS: Lesson 2: Data Science Topics
Lesson Summaries
At the end of this lesson, you have learned:
- The main tools in data science: Jupyter notebooks, Python, some regular
expressions, relational databases, Python Pandas, R for Python, mathematical
and statistical calculations in Python.
- Data science topics and algorithms, including regression, neural networks,
nearest neighbor, data visualization with R, …
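As a small illustration of two of the tools listed above (regular expressions
and statistical calculations in Python), here is a minimal sketch. The listing
strings, the "USD" price format, and all variable names are made up for this
example and do not come from the course material.

```python
import re
from statistics import mean, median

# Hypothetical free-text listings (invented for illustration only).
listings = [
    "3 bedrooms, 2 washrooms, 350000 USD",
    "2 bedrooms, 1 washroom, 275000 USD",
    "4 bedrooms, 3 washrooms, 425000 USD",
]

# Regular expression: capture the run of digits that precedes " USD".
price_pattern = re.compile(r"(\d+) USD")
prices = [int(price_pattern.search(text).group(1)) for text in listings]

# Basic statistical calculations on the extracted numbers.
average_price = mean(prices)
median_price = median(prices)
```

In practice the same extraction would usually be done on a table column (for
example with Pandas), but the idea is the same: a pattern pulls structured
values out of text, and summary statistics are computed on the result.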
Quiz 1
Question 1. Harvard Business Review called data science the sexiest job in the
21st century.
TRUE correct
FALSE
Question 2. According to the report by the McKinsey Global Institute, by 2018, it
is projected that there will be a shortage of 140,000 – 190,000 people with deep
analytical skills in the world.
TRUE
FALSE correct
Question 3. How is Walmart reported to have addressed its analytical needs?
Code sharing
Crowdsourcing correct
Outsourcing
Social media
Question 4. What is the average base salary of a data scientist reported by the
New York Times?
$100,000
$150,000
$112,000 correct
$85,000 + Bonus
Question 5. According to Professor Haider, the three important qualities needed
to succeed as a data scientist are being curious, being judgemental, and being
proficient in programming.
TRUE
FALSE correct
Lesson 2 - Data Science Topics
Learning Objectives
You will learn about the topics of data science and the algorithms used in
data science, and understand the hard skills required of anyone interested in
pursuing a career in this field. Besides, you will also learn about data
mining and the steps that comprise the process of mining a given dataset, as
well as regression and the questions that can be put to regression analysis.
Lesson Objectives
3. Explain data mining, the steps that comprise the process of
mining a given dataset, regression, and regression analysis.
Learning Outcomes:
Then move on to the next lesson on LMS: Lesson 3: Data Science in Business
Lesson Summaries
The popular data science tools and algorithms: Python notebooks, Unix and Linux,
Python, some regular expressions, relational databases, Python Pandas,
mathematical and statistical calculations in Python, big data, Jupyter notebooks.
A review of big data, data mining, deep learning, and machine learning.
The process of mining a given dataset, and regression analysis.
Quiz 2
TRUE
FALSE correct
Question 2. What should you do when data are missing in a systematic way?
Determine the impact of missing data on the results and whether missing
data can be excluded from the analysis. correct
FALSE correct
TRUE
Question 4. After the data are appropriately processed, transformed, and stored,
machine learning and non-parametric methods are a good starting point for data
mining.
FALSE correct
TRUE
Question 5. In-sample forecast is the process of formally evaluating the
predictive capabilities of the models developed using observed data to see how
effective the algorithms are in reproducing data.
TRUE correct
FALSE
Question 6. The real added value of the author's research on residential real
estate properties is quantifying people's preferences for different transport
services.
FALSE correct
TRUE
Question 7. Regression is a statistical technique developed by Blaise Pascal.
TRUE
FALSE correct
Question 8. What did the author's research discover about the impact of an
additional washroom on the price of a housing unit?
The author found that an additional bedroom adds the same to the housing
prices as an additional washroom. In other words, any additional room results
in an equal increase to the housing prices.
The author found that an additional washroom did not have any impact on
the pricing of a housing unit.
The author found that an additional washroom adds more to the housing
prices than an additional bedroom. correct
The author found that an additional bedroom adds more to the housing prices
than an additional washroom.
Question 9. The author discovered that houses located more than 2.5 km from
shopping centres sold for less than the rest.
TRUE
FALSE correct
Question 10. "How much does a finished basement contribute to the price of a
housing unit?" is a question that can be put to regression analysis.
TRUE correct
FALSE
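To make the idea of putting a question to regression analysis concrete, here
is a minimal sketch for the finished-basement question. It also includes an
in-sample check of the kind described in Question 5. The prices are
hypothetical values invented for this example, not the author's data, and the
model uses a single 0/1 predictor, whereas a real housing study would also
control for other attributes such as bedrooms and washrooms.

```python
from math import sqrt
from statistics import mean


def fit_simple_regression(x, y):
    """Ordinary least squares for the line y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Hypothetical data: 1 = finished basement, 0 = unfinished; prices in $1,000s.
finished_basement = [0, 0, 0, 1, 1, 1]
price_thousands = [300, 320, 310, 340, 350, 360]

intercept, slope = fit_simple_regression(finished_basement, price_thousands)

# In-sample check: predict the same observed prices the model was fitted on
# and measure the typical size of the error (root mean squared error).
predicted = [intercept + slope * x for x in finished_basement]
rmse = sqrt(mean((p - y) ** 2 for p, y in zip(predicted, price_thousands)))
```

With a 0/1 predictor, the slope is simply the difference between the average
price of the two groups, so in this made-up sample it is the estimated
contribution of a finished basement.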
Lesson 3 - Data Science in Business
Learning Objectives
In this lesson, you will learn what companies need to do in order to get
started with data science. You will also learn about some of the qualities that
differentiate data scientists from other professionals. In addition, you will
learn about analytics and the important role data scientists play in this
process, and about storytelling and the importance of an effective final
deliverable. Finally, you will be required to apply what you learned about
data science by answering open-ended questions.
Lesson Objectives
3. Explain analytics and the important role data scientists play in
this process.
Learning Outcomes:
Then move on to the next lesson on LMS: Lesson 4: Tool for Data Science -
Jupyter Notebooks
Lesson Summaries
Applications of data science, such as in the medical field (drug delivery,
cancer treatment), Pokémon Go, and Google Search.
The structure of the report in which you apply what you learned about data
science as a data scientist.
Finally, you will be required to apply what you learned about data science by
answering open-ended questions.
Quiz 3
Refer the reader to the research question and the knowledge gaps you
identified earlier.
Highlight how your findings provide the ultimate missing piece to the puzzle.
TRUE correct
FALSE
Question 4. The United States Economic Forecast is a publication by:
McGraw-Hill Education.
TRUE correct
FALSE
Question 6. According to the reading, in order to produce a compelling narrative,
initial planning and conceptualizing of the final deliverable is of extreme
importance.
TRUE correct
FALSE
Question 7. Regardless of the length of the final deliverable, the author
recommends that it includes a cover page, table of contents, executive
summary, detailed contents, acknowledgments, and references.
TRUE correct
FALSE
Question 8. An introductory section is always helpful in introducing the research
methods and presenting the statistical calculations.
TRUE
FALSE correct
Question 9. The results section is where you present:
The conclusion.
R Squared.
Question 10. Adding a list of references and an acknowledgment section are
examples of housekeeping, according to the author.
TRUE correct
FALSE
Lesson 4 - Tool for Data Science - Jupyter Notebooks
Learning Objectives
In this lesson, you will get an overview of the various data science tools
available to you. You will also understand why they are so popular among data
scientists today. You will learn how to get started on Skills Network Labs:
create an account and start exploring some of the features.
Lesson Objectives:
Learning Outcomes:
Practice with the exercises on Coursera (Lab 1, Lab 2, Lab 3), and ask a mentor
if you need support.
Lesson Summaries
At the end of this lesson, you have learned:
- How to use Jupyter Notebook, from the basics to more advanced features.
Quiz 4
Question 1. What can you write in Jupyter Notebooks? Select all that apply:
HTML code -- HTML can be written and rendered via Markdown cells.
correct
Question 2. Which of the following options are TRUE? Select all that apply.
You can document your code with stylized text using a formatting style called
"Markleft"
IScala notebooks
IR notebooks
Saturn notebooks
TRUE correct
FALSE
Question 5. True or False? Although you can change the kernel of the Jupyter
Notebook between different programming languages (e.g., Python, R, Scala),
you cannot use multiple kernels within the same Jupyter notebook (e.g., running
Python, R and Scala within the same notebook).
TRUE correct
FALSE
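Two points from this quiz (HTML written in Markdown cells, and a single kernel
per notebook) can be seen in how a notebook is stored on disk. The following is
a minimal sketch, assuming the JSON layout of notebook format version 4. The
cell contents and variable names are invented for illustration.

```python
import json

# Hypothetical minimal notebook, following the nbformat 4 JSON layout.
notebook = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {
        # A notebook records one kernelspec; all code cells run on that kernel.
        "kernelspec": {
            "name": "python3",
            "display_name": "Python 3",
            "language": "python",
        }
    },
    "cells": [
        {
            # Markdown cells hold stylized text, and may contain HTML.
            "cell_type": "markdown",
            "metadata": {},
            "source": ["# Notes\n", "<b>HTML</b> is rendered in Markdown cells."],
        },
        {
            # Code cells hold source code for the notebook's kernel.
            "cell_type": "code",
            "execution_count": None,
            "metadata": {},
            "outputs": [],
            "source": ["print('hello from the Python kernel')"],
        },
    ],
}

# Serialize to the text that would be saved in a .ipynb file.
notebook_json = json.dumps(notebook, indent=1)
cell_types = [cell["cell_type"] for cell in notebook["cells"]]
```

Saving `notebook_json` to a file with the `.ipynb` extension should produce a
notebook that Jupyter can open, though this sketch has not been validated
against every Jupyter version.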