Advanced Data Analytics Assignment
Advanced Data Analytics Assignment
2. Data preparation. This stage includes a set of steps to get the data ready to be
mined. It starts with data exploration, profiling and pre-processing, followed
by data cleansing work to fix errors and other data quality issues. Data
transformation is also done to make data sets consistent, unless a data
scientist is looking to analyze unfiltered raw data for a particular application.
3. Mining the data. Once the data is prepared, a data scientist chooses the
appropriate data mining technique and then implements one or more
algorithms to do the mining. In machine learning applications, the algorithms
typically must be trained on sample data sets to look for the information being
sought before they're run against the full set of data.
4. Data analysis and interpretation. The data mining results are used to create
analytical models that can help drive decision-making and other business
actions. The data scientist or another member of a data science team also
must communicate the findings to business executives and users, often
through data visualization and the use of data storytelling techniques.
These steps are part of the data mining process.
Types of data mining techniques
Various techniques can be used to mine data for different data science applications.
Pattern recognition is a common data mining use case that's enabled by multiple
techniques, as is anomaly detection, which aims to identify outlier values in data
sets. Popular data mining techniques include the following types:
Clustering. In this case, data elements that share particular characteristics are
grouped together into clusters as part of data mining applications. Examples
include k-means clustering, hierarchical clustering and Gaussian mixture
models.
Sequence and path analysis. Data can also be mined to look for patterns in
which a particular set of events or values leads to later ones.
Draw conclusions.
https://medium.com/@jw207427/how-to-apply-hypothesis-
test-in-marketing-data-fbe1e1ac2388
d. Predictive Modeling: Build a predictive model to forecast the sales for the next
quarter based on customer demographics, product purchases, and campaign responses.
[15 marks]
https://www.wallstreetmojo.com/predictive-modeling/
https://www.javatpoint.com/logistic-regression-in-machine-learning
e. Visualization: Create visualizations such as charts and graphs to present your findings
to the marketing team.
You can use the predictive models made in jupyternotebook. Or connect powerbi to
the notebook
Sites for extra help https://www.geeksforgeeks.org/predictive-analysis-in-data-mining/