Role of Business Analytics in Decision Making
Role of Business Analytics in Decision Making
Introduction:
The word analytics has come into the foreground in last decade or
so. The proliferation of the internet and information technology has made analytics very
relevant in the current age. Analytics is a field which combines data, information
technology, statistical analysis, quantitative methods and computer-based models into one.
This all are combined to provide decision makers all the possible scenarios to make a well
thought and researched decision. The computer-based model ensures that decision makers
are able to see performance of decision under various scenarios.
Application:
Business analytics has a wide range of application from customer
relationship management, financial management, and marketing, supply-chain
management, human-resource management, pricing and even in sports through team game
strategies.
Challenges:
Business analytics can be possible only on large
volume of data. It is sometime difficult obtain large volume of data and not question its
integrity.
The amalgamation of an increasingly complicated world,
the vast proliferation of data and the pressing desire to stay at the forefront of competition
has prompted organizations to focus on using analytics for driving strategic business
decisions. Business analytics is allowing managers to understand the dynamics of their
business, anticipate market shifts and manage risks. Rather than “going with gut” when
maintaining inventory, pricing solutions, or hiring talent, companies are embracing analytics
and systematic statistical reasoning to make decisions that improve efficiency, risk
management and profits.
Data and analytics are disrupting existing business
models and ecosystems. Proliferation of new data sets and introduction of massive data
migration capabilities are undermining existing information and technological silos. From
using granular data to personalize products and services to scaling digital platforms to
match buyers and sellers, companies are using business analytics to enable more faster and
facts-based decision making. In fact, studies show that data driven organizations not only
make better strategic decisions, but also enjoy high operational efficiency, improved
customer satisfaction, and robust profit and revenue levels. Recent research also shows that
data-centered organizations are twenty-three times more likely to acquire customers, six
times as likely to retain those customers, and nineteen times as likely to be profitable as a
result.
Analytical practitioners today have a vast array of
analytical capabilities and techniques at their disposal. These range from the most
fundamental techniques, “descriptive analytics”, which involve preparing the data for
subsequent analysis, to “predictive analytics” that provide advanced models to forecast and
predict future, to the top-notch of analytics called “prescriptive analytics” that utilize
machine-based learning algorithms and dynamic rule engines to provide interpretations and
recommendations. With their diverse use cases and applications, it is no longer a surprise
that these techniques are now finding way into customer, workforce, supply-chain, finance
and risk strategies at an organizational level.
Data is the new oil- and the best way for companies
to access and understand it is to digitize their processes. Digitizing customer interactions can
provide troves of information, which companies can feed into strategy, sales, marketing,
and product development. Detailed and granular data can enable companies to micro-target
their customers and to personalize their products and services. Further internal digitization
generates data that managers can use to improve their operations, including routing and
transportation, resource allocation and scheduling, capacity planning and manufacturing.
These trends are also causing many companies to converge their “Business Intelligence” and
“Operation Research” units on the common ground of predictive and advanced analytics.
Both communities are now using statistical and mathematical techniques to attack strategic
business problems and systemize decision making.
Data analytics, with its far-reaching use cases and
diverse applications, is now emerging as the keystone of strategic business decision making.
From enabling businesses to make consumer-oriented marketing decisions to helping them
address key operational inefficiencies, analytics is radically changing the perception towards
the importance of data. Advanced statistical models are furthering this cause by providing
valuable insights out of unconventional data sets and by enabling companies to explore new
business territories. The next few sections explore the vast and diverse opportunities that
data and analytics bring to businesses today.
The conclusion:
Marketing:
Studying buying patterns of consumer behaviour, analysing trends,
help in identifying the target audience, employing advertising techniques that can appeal to
the consumers, forecast supply requirements, etc.
For example:
Use BA to gauge the effectiveness and impact of a marketing
strategy on the customers. Data can be used to build loyal customers by giving them exactly
what they want as per their specifications.
HR Professionals:
HR professionals can make use of data to find information about
educational background of high performing candidates, employee attrition rate, number of
years of service of employees, age, gender, etc. This information can play a pivotal role in
the selection procedure of a candidate.
For example:
HR manager can predict the employee retention rate on the
basis of data given by BA.
CRM:
BA helps one analyse the key performance indicators, which
further helps in decision making and make strategies to boost the relationship with the
consumers. The demographics, and data about other socio-economic factors, purchasing
patterns, lifestyle, etc., are of prime importance to the CRM department.
For example:
The company wants to improve its service in a particular
geographical segment. With data analytics, one can predict the customer’s preferences in
that particular segment, what appeals to them, and accordingly improve relations with
customers.
Manufacturing:
BA can help you in supply chain management, inventory
management, measure performance of targets, risk mitigation plans, improve efficiency in
the basis of product data, etc.
For example:
The Manager wants information on performance of a
machinery which has been used past 10 years. The historical data will help evaluate the
performance of the machinery and decide whether costs of maintaining the machine will
exceed the cost of buying a new machinery.
R:
R is now the most popular analytics tool in the industry. It
has surpassed SAS in usage and is now the tool of choice even for companies that
can easily afford SAS. Over the years, R has become a lot more robust. It handles
large data sets much better than it used to, say even a decade earlier. It has also
become a lot more versatile.
Python:
Python has been a favourite of programmers for long. This
is mainly because it’s an easy to learn language that is also quite fast. However, it
developed into a powerful analytics tool with the development of analytical and
statistical libraries like numpy, scipy etc. Today it offers a comprehensive coverage of
statistical and mathematical functions.
Apache Spark:
Spark is another open source processing engine that is
built with a focus on analytics, especially on unstructured data or huge volumes of
data. Spark has become tremendously popular in the last couple of years. This is
because of various reasons – easy integration with the Hadoop ecosystem being one
of them. Spark has its own machine learning library which makes it ideal for analytics
as well.
Apache Storm:
Storm is the Big Data tool of choice for moving data
or when the data comes in as a continuous stream. Spark works on static data. Storm
is ideal for real time analytics or stream processing.
SAS:
SAS continues to be widely used in the industry. Some
flexibility on pricing from the SAS Institute has helped its cause. SAS continues to be
a robust, versatile and easy to learn tool. SAS has added tons of new modules. Some
of the specialized modules that have been added in the recent past are – SAS
analytics for IOT, SAS Anti-money Laundering, and SAS Analytics Pro for Midsize
Business.
Tableau:
Tableau is an easy to learn tool that does an
effective job of slicing and dicing your data and creating great visualizations and
dashboards. Tableau can create better visualizations than Excel and can most
definitely handle much more data than Excel can. If you want interactivity in your
plots, then Tableau is surely the way to go.
Excel:
Excel is of course the most widely used
analytics tool in the world. I have seldom come across a data scientist who does not
use Excel. Whether you are an expert in R or Tableau, you will still use Excel for the
grunt work. Non-analytics professionals will usually not have access to tools like SAS
or R on their machines. But everyone has Excel. Excel becomes vital when the
analytics team interfaces with the business steam.
QlikView:
QlikView and Tableau are essentially vying for the top
spot amongst the data visualization giants. QlikView is supposed to be slightly faster
than Tableau and gives experienced users a bit more flexibility. Tableau has a more
intuitive GUI and is easier to learn.
Splunk:
Splunk is more popular than some of the more known
names like Cloudera and Hortonworks. It started as a ‘Google for log files’ which
means its primary use was to process machine log files data. It has now become
much more than that. Splunk has great visualization options and a web interface
makes it easy to use.
Data Visualization in R:
In this article, we will create the following visualizations:
Basic Visualization
1. Histogram
2. Bar / Line Chart
3. Box plot
4. Scatter plot
Advanced Visualization
1. Heat Map
2. Mosaic Map
3. Map Visualization
4. 3D Graphs
5. Correlogram
In R the HistData package provides a collection of small data sets that are interesting and
important in the history of statistics and data visualization.
Selecting the Right Chart Type
Comparison
Composition
Distribution
Relationship
To determine which amongst these is best suited for your data, I suggest you should answer
a few questions like,
How many data points will you display for each variable?
Will you display values over a period of time, or among items or groups?
In your day-to-day activities, you’ll come across the below listed 7 charts most of the time.
1. Scatter Plot
2. Histogram
3. Bar & Stack Bar Chart
4. Box Plot
5. Area Chart
6. Heat Map
7. Correlogram
Now let’s see how to use these visualizations in R
1. Scatter Plot
When to use: Scatter Plot is used to see the relationship between two continuous variables.
In our above mart dataset, if we want to visualize the items as per their cost data, then we
can use scatter plot chart using two continuous variables, namely Item Visibility& ItemMRP
as shown below.
2. Histogram
When to use: Histogram is used to plot continuous variable. It breaks the data into bins and
shows frequency distribution of these bins. We can always change the bin size and see the
effect it has on visualization.
From our mart dataset, if we want to know the count of items on basis of their cost, then we
can plot histogram using continuous variable Item MRP as shown below.
3. Bar & Stack Bar Chart
When to use: Bar charts are recommended when you want to plot a categorical variable or
a combination of continuous and categorical variable.
From our dataset, if we want to know number of marts established in particular year, then
bar chart would be most suitable option, use variable Establishment Year as shown below.
. Area Chart
When to use: Area chart is used to show continuity across a variable or data set. It is very
much same as line chart and is commonly used for time series plots. Alternatively, it is also
used to plot continuous variables and analyze the underlying trends.
From our dataset, when we want to analyze the trend of item outlet sales, area chart can be
plotted as shown below. It shows count of outlets on basis of sales.
6. Heat Map
When to use: Heat Map uses intensity (density) of colors to display relationship between
two or three or many variables in a two dimensional image. It allows you to explore two
dimensions as the axis and the third dimension by intensity of color.
From our dataset, if we want to know cost of each item on every outlet, we can plot
heatmap as shown below using three variables Item MRP, Outlet Identifier & Item Type
from our mart dataset.
7. Correlogram
When to use: Correlogram is used to test the level of co-relation among the variable
available in the data set. The cells of the matrix can be shaded or colored to show the co-
relation value.
Darker the colour, higher the co-relation between variables. Positive co-relations are
displayed in blue and negative correlations in red colour. Colour intensity is proportional to
the co-relation value.
From our dataset, let’s check co-relation between Item cost, weight, visibility along with
Outlet establishment year and Outlet sales from below plot.
In our example, we can see that Item cost & Outlet sales are positively correlated while Item
weight & its visibility are negatively correlated.