Module 3
Module 3
Module 3
BUSINESS DATA
ANALYSIS
INTRODUCTION
• Business analytics begins with a data set or commonly with a database.
• As databases grow, they need to be stored somewhere like computer and
data warehousing.
• Storage areas become so large and new term is used t o describe them.
• Big data describes the collection of data set that are so large and
complex that software systems are hardly able to process them.
• Analytics is the use of data, information technology,statistical analysis,
quantitative methods, and mathematical or computer based models to
help managers to gain insight into their business operations and make
better decisions.
FEATURES OF NEW GENERATION
COMPUTERS
Speed of
operation Accuracy Storage
Automatic Diligence
Versatality operation
Complexity Reliability
FEATURES continuation
• Speed –faster, perform complex calculations very fast, powerful computer
capable of performing about 3-4 million simple instructions per second, speed is
expressed in terms of microseconds.
• Accuracy- very accurate in processing data, if correct data is entered with a
correct program, the result will also be correct, GIGO- Garbage in Garbage Out.
• Storage- have the capacity to store large amount of data and instructions and
can retrieve it when required, large memory capacity, memory of CPU is limited,
secondary storage devices can be used. Storage measured in terms of bytes,
kilobytes, megabytes, gigabytes and terabytes.
• Versatility – versatile machines and can perform any task as long as it can be
broken down into a series of logical steps, used for a wide variety of purpose at
the same time, digital computers are used for scientific as well as business
purposes.
FEATURES CONTINUATION
• Automatic operation- computer can carry out a particular task on the
basis of the instructions given to it until a stop instruction is executed.
• Diligence – computers are highly consistent, free from tiredness,
boredom, and resulting lack of concentration, can work for hours without
creating errors.
• Complexity – even a complex mathematical model can be analysed easily.
• Reliability –reliable , reliability is measured in terms of the performance
with a predetermined standard of operations, if a program is executed
any number of times with the same set of data, everytime the result
would be the same.
CONCEPT OF DATA ANALYSIS
• World is becoming more and more data driven.
• Data analysis is the process of evaluating data using analytical or statistical
tools to discover useful information
• Used by all types of businesses.
• Data is collected and sorted, the results are interpreted to make decisions.
• End results are delivered as a summary. Or as a visual like a chart or graph.
• Data analysis is a process of collecting, transforming, cleansing and modeling
data with the goal of discovering required information
• Used in different business, science, and social science domains.
• In today's business world, data analysis plays a role in making decisions more
scientific and helping businesses operate more effectively
DATA
• Data is used to describe things by assigning a value to them.
• These values are then organized, processed and presented within a given
context so that it becomes useful.
• Webster's new Collegiate dictionary 1973 defines data as “factual
information( as measurements or statistics) used as a basis for reasoning,
discussion or calculation.”
• Data are numbers, characters, images or other method of recording, in a
form which can be assessed to make a determination or decision about a
specific action.
DIFFERENT FORMS OF DATA
Categorical Continuous
data data
DIFFERENT FORMS OF DATA-
QUALITATIVE AND QUANTITATIVE
• Qualitative data
Data that uses words and descriptions.
Qualitative data can be observed but is subject and therefore difficult to use for the
purpose of making comparison.
Descriptions of texture, taste, or an experience are examples of quality data.
These types of data are collected through focus groups, interview, open ended
questionnaire items and other structured situations.
• Quantitative data
Data that is expressed with numbers. data which can be put into categories, measured,
or ranked.
Length, weight, rating scales are examples of quantitative data.
They can be represented visually in graphs and tables and can be statistically analysed
TYPES OF QUANTITATIVE DATA
• Two types- categorical and continuous
• Categorical data
Data that has been placed into groups
An item cannot belong to more than one group at a time
Categorical variables represent types of data which may be divided into
groups.eg: race,sex, etc.
Analysis of categorical data involves the use of data tables
• Continuous data
Numerical data measured on a continuous range or scale
All values are possible with no gaps in between. eg:weight, height etc
BUSINESS DATA ANALYSIS
• Data driven businesses make decisions based on data, which means they can
be more confident that their actions will bring success since there is data to
support them.
• Business analytics is the process of collecting sorting, processing and
studying business data, and using statistical models and iterative
methodologies to transform data into business insights.
• The goal of business analytics is to determine which data sets are useful and
how they can be leveraged to solve problems and increase efficiency,
productivity and revenue.
• Business analytics is a subset of business intelligence-business intelligence is
typically descriptive, focusing on the Strategies and tools utilised to acquire,
identify and categorise raw data and report on past or current events.
• Sophisticated data, quantitative analysis, and mathematical models are all
employed by business analysts to find solutions for data driven issues.
• Data Analytics is the science of analysing raw data in order to make
conclusion about that information
• In simple words data analysis is the process of collecting and organising
data in order to draw helpful conclusions from it.
• The main purpose of data analysis is to find meaning in data so that derived
knowledge can be used to make informed decisions
• Data Analytics is used in business to help organisation to make better
decisions, market research, product research customer reviews etc .
• Analysing data will provide insights that organisation need in order to make
the right decisions.
IMPORTANCE OF BUSINESS
ANALYTICS
• A methodology for commercial decision making
• Operational efficiency
• Competitive advantage
• Valuable information
TYPES OF DATA ANALYSIS
Four types
Descriptive Diagnostic
analysis analysis
Predictive Prescriptive
analysis analysis
DESCRIPTIVE ANALYSIS
• Descriptive data analysis looks at past data and tells what happened.
• Summarizes a business’s existing data to get a picture of what has happened in
the past or is happening currently.
• Simplest form of analytics
• Applies descriptive statistics to existing data and make it more accessible to
members of an organization.
• Often used when tracking key Performance Indicators(KPIs) like revenue,sales
leads, and more.
• Can help identify strengths and weaknesses and provide insight into customer
behavior
• Most common physical product of descriptive analysis is a report with visual
statistical aids.
DIAGNOSTIC ANALYSIS
• Aims to determine why something happened.
• Can be done to find out the reason why something negative or positive
happened.
• It shifts from the “what” of past and current events to “how” and “why”,
focusing on past performance to determine which factors influence
trends.
• Employs techniques such as drill-down, data discovery, data mining, and
correlations to uncover the root causes of events.
PREDICTIVE ANALYSIS
• It predicts what is likely to happen in the future.
• Trends are derived from the past data which are then used to form predictions about the future.
• Can also be applied to more complicated issues such as risk assessment, sales forecasting etc.
• It forecasts the possibility of future events using statistical models and machine learning techniques.
• Machine learning experts and data scientists are typically employed to run predictive analysis using
learning algorithms and statistical models, enabling a higher level of predictive accuracy.
• A common application of predictive analysis is sentiment analysis.
• Existing text data can be collected from social media to provide a comprehensive picture of opinions
held by a user.
• This data can be analysed to predict their sentiment towards a new subject.
• A common physical product of predictive analysis is a detailed report used to support complex
forecasts in sales and marketing.
PRESCRIPTIVE ANALYSIS
• Combine the information found from the above mentioned types and forms a
plan of action for the organization to face the issue or decision.
• This is where data driven choices are made.
• It goes a step beyond predictive analysis, providing recommendations for next
best actions and allowing potential manipulation of events to drive better
outcomes.
• It recommends specific actions to deliver the most desired result.
• It relies on strong feedback system and constant iterative analysis and testing to
continually learn more about the relationships between different actions and
outcomes.
• A common physical product of prescriptive analysis is a focused recommendation
for next best actions, which can be applied to clearly identified business goals.
PHASES OF DATA ANALYSIS
DATA REQUIREMENTS
SPECIFICATION
First step to determine the data requirements or how the data is grouped.
The data required for analysis is based on a question or an experiment.
Based on the requirements of those directing the analysis, the data
necessary as inputs to the analysis is identified (e.g., Population of
people).
Specific variables regarding a population (e.g., Age and Income) may be
specified and obtained.
Data may be numerical or categorical.
DATA COLLECTION
• Data that is processed, organized and cleaned would be ready for the
analysis.
• Various data analysis techniques are available to understand, interpret, and
derive conclusions based on the requirements.
• Data Visualization may also be used to examine the data in graphical format,
to obtain additional insight regarding the messages within the data.
• Statistical Data Models such as Correlation, Regression Analysis can be used
to identify the relations among the data variables.
• These models that are descriptive of the data are helpful in simplifying
analysis and communicate results.
• The process might require additional Data Cleaning or additional Data
Collection, and hence these activities are iterative in nature.
COMMUNICATION
MISRO
AI
SEGMENTATION
TRENDS
DEEP
IoT
LEARNING
NEURAL
NETWORK
TRENDS IN BUSINESS ANALYTICS
• Lack of alignment, availability and trust- In most organizations, the analysts are
organized according to the business domains. Unfortunately, the analysis is shared with
the top executives and thus the results are not easily communicated to the business
users for whom they provide the greatest value.
• Lack of Commitment- implementation of business analytics can be very costly and the
ROI is not immediate. By nature, these analytics models are prepared to improve
accuracy over time but it is a complex model that requires dedication to implement the
solution. Because the business users do not see the promised results immediately, they
lose interest which results in loss of trust as a result of which the models fail.
• Low quality of underlying transactional data-
• Implementation of the solutions provided by the business analysts fail because either
the data is not available, the data sources are too complex or they are poorly
constructed.
BUSINESS DATA ANALYST
• A data analyst collects, processes and performs statistical analysis on large dataset.
• They discover how data can be used to answer questions and to solve problems.
• With the development of computers and an ever increasing move toward
technological development, data analysis has evolved.
• In smaller organisations Business Data Analytics may be used interchangeably to
describe roles that involve data analysis. Larger organisations, however, often
employee both data analyst and business analyst, making the difference between
the two careers important to understand.
• While data analysts and business analysts both work with data, the main difference
lies in what they do with it. Business analyst use data to help organisation make
more effective business decisions, while Data Analytics are more interested in
gathering and analysing data for the business development and used to make
decisions on their own.
• Most jobs in data analytics involved gathering and cleaning data to
uncover Trends and business insights and it varies depending on the
industry or company.
• Most data analyst work with IT teams, management and data scientists
to determine organisational goals. in most cases, they pinpoint trends,
correlation and patterns in complex data sets and identify new
opportunities for process improvement.
RESPONSIBILITIES OF DATA
ANALYST
• Producing Reports
- data analysts spend a significant amount of time producing and
maintaining both internal and client-facing reports.
- These reports give management insights about new trends on the horizon
as well as areas the company may need to improve upon.
- Writing up a report is not simple. Successful data analysts understand
how to create narratives with data.
- To remain valuable, the reports, answers and insights that data analysis
provides have to be understood by the next decision-maker, who is not an
analyst but a manager.
• Spotting Patterns
- The most effective data analysts can use data to tell a story.
- In order to produce a meaningful report, a data analyst first has to be
able to see important patterns in the data.
- At the base level, data is used to find trends and insights that we can use
to make recommendations to clients.
- Reporting in regular increments, such as weekly, monthly or quarterly, is
important since it helps an analyst notice significant patterns.
- They all contribute to see trends over time.
• Collaborating with others
- The wide variety of data analyst roles and responsibilities means that one
has to collaborate across many other departments in the organization
including marketers, executives and salespeople.
- They are also likely collaborate closely with those who work in data
science like data architects and database developers.
- Hence it is vital to be able to communicate well as a data analyst.
- His success is dependent on his ability to work with people.
• Collecting data and setting infrastructure
- The most technical aspect of an analyst’s job is collecting the data itself.
- This often means working together with web developers to optimize data
collection.
- Streamlining this data collection is key for data analysts.
- They work to develop routines that can be automated and easily modified
to reuse in other areas.
- Analysts keep a handful of specialized software and tools to help them
accomplish this.
SKILLS REQUIRED FOR DATA
ANALYSTS
• Programming Languages : data analysts should be proficient in one language
and have working knowledge of a few more. Data analysts use programming
languages such as R and SAS for data gathering, data cleaning, statistical
analysis, and data visualization.
• Creative and Analytical Thinking: Curiosity and creativity are key attributes of
a good data analyst. It’s important to have a strong grounding in statistical
methods, but even more critical to think through problems with a creative and
analytical lens. This will help the analyst to generate interesting research
questions that will enhance a company’s understanding of the matter at hand.
• Strong and Effective Communication: Data analysts must clearly convey their
findings — whether it’s to an audience of readers or a small team of
executives making business decisions. Strong communication is the key to
success.
SKILLS
• Data Visualization: Effective data visualization takes trial and error. A
successful data analyst understands what types of graphs to use, how to scale
visualizations, and know which charts to use depending on their audience.
• Data Warehousing: Some data analysts work on the back-end. They connect
databases from multiple sources to create a data warehouse and use querying
languages to find and manage data.
• SQL Databases: SQL databases are relational databases with structured data.
Data is stored in tables and a data analyst pulls information from different
tables to perform analysis.
• Database Querying Languages: The most common querying language data
analysts use is SQL and many variations of this language exist, including
PostreSQL, T-SQL, PL/SQL (Procedural Language/SQL).
SKILLS
• Data Mining and Cleaning : When data isn’t neatly stored in a database,
data analysts must use other tools to gather unstructured data. Once
they have enough data, they clean and process through programming.
• Advanced Microsoft Excel: Data analysts should have a good handle on
excel and understand advanced modeling and analytics techniques.
• Machine Learning: Data analysts with machine learning skills are
incredibly valuable, although machine learning is not expected skill of
typical data analyst jobs.
ORGANISATION AND SOURCE OF
DATA
• All decision makers needs three terms
Informatio
Data Knowledge
n
• These three terms are sometimes used interchangeably and may have
several definitions.
• Data- items about things, events, activities and transactions are recorded,
classified and stored but are not organized to convey any specific
meaning. It can be numeric, alphanumeric, figures, sounds and images.
• Information – data have been organized in a manner that gives them
meaning for the recipient. They confirm something the recipient knows.
Data items are processed so that the results are meaningful for an
intended action or decision.
• Knowledge- consists of data items and information organized and
processed to convey understanding, experience, accumulated learning,
and expertise that are applicable to a current problem or activity.
Knowledge can be the application of data and information in making a
better decision.
SOURCES OF DATA
Internal data
• Stored in one or more places
• Data about people ,products, services and processes
• Private data that organistion owns, controls or collects.
• Exs –sales data, financial data
INTERNAL SOURCES OF DATA
• Accounting resources-- gives much information which can be used by
marketing researcher-internal factors
• Sales force report- information about the sale of a product- info provided
is outside the organisation
• Internal experts- people who are heading the organization- can give idea
about how a particular thing is working
• Miscellaneous reports-reports from operational reports.
EXTERNAL DATA
• There are many sources of external data- sources outside the
organisation
• Range from commercial databases to data collected by sensors and
satellites.
• Govt reports and files are a major source
• Also available by using GIS (geographical information systems)and other
demographic sources.
EXTERNAL DATA
Classified as- Govt Publications and Non Govt Publications
Govt pubications- Government sources provide an extremely rich pool of data
for the researchers. In addition, many of these data are available free of cost on
internet websites. There are number of government agencies generating data.
These are:
• Registrar General of India- It is an office which generate demographic data. It
includes details of gender, age, occupation etc.
• Central Statistical Organisation- This organization publishes the national
accounts statistics. It contains estimates of national income for several years,
growth rate, and rate of major economic activities. Annual survey of Industries
is also published by the CSO. It gives information about the total number of
workers employed, production units, material used and value added by the
manufacturer.
• Director General of Commercial Intelligence- This office operates from
Kolkata. It gives information about foreign trade i.e. import and export.
These figures are provided region-wise and country-wise.
• Ministry of Commerce and Industries - This ministry through the office of
economic advisor provides information on wholesale price index. These
indices may be related to a number of sectors like food, fuel, power, food
grains etc. It also generates All India Consumer Price Index numbers for
industrial workers, urban, non manual employees and cultural labourers.
• Reserve Bank of India- This provides information on Banking Savings and
investment. RBI also prepares currency and finance reports.
• Labour Bureau- It provides information on skilled, unskilled, white collared
jobs etc.
• National Sample Survey- This is done by the Ministry of Planning and it
provides social, economic, demographic, industrial and agricultural
statistics.
• Department of Economic Affairs- It conducts economic survey and it also
generates information on income, consumption, expenditure, investment,
savings and foreign trade.
• Non Government Publications- These includes publications of various industrial and
trade associations, such as
• The Indian Cotton Mill Association
• Various chambers of commerce
• The Bombay Stock Exchange (it publishes a directory containing financial accounts, key
profitability and other relevant matter)
• Various Associations of Press Media.
• Export Promotion Council.
• Confederation of Indian Industries ( CII )
• Small Industries Development Board of India
• Different Mills like - Woolen mills, Textile mills etc
The only disadvantage of the above sources is that the data may be biased. They are likely to
colour their negative points.
DATA COLLECTION
• Need to extract data from internal and external sources is a complicate
task
• Data collection is one of the first step of the data life cycle.
• To collect the right data, need to know the effort in collecting it,
• Data scientists help companies interpret and manage data and solve complex
problems using expertise in a variety of data niches.
• They generally have a foundation in computer science, modeling, statistics,
analytics, and math - coupled with a strong business sense. The following
explains the role:
Empowering Management to Make Better Decisions
- A trusted advisor and strategic partner to the organization’s upper
management by ensuring that the staff maximizes their analytics capabilities.
- A data scientist communicates and demonstrates the value of the institution’s
data to facilitate improved decision-making processes across the entire
organization, through measuring, tracking, and recording performance
metrics and other information.
Directing Actions Based on Trends - Examines and explores the organization’s data,
after which they recommend and prescribe certain actions that will help improve the
institution’s performance, better engage customers, and ultimately increase
profitability.
Challenging the Staff to Adopt Best Practices- One of the responsibilities of a data
scientist is to ensure that the staff is familiar and well-versed with the organization’s
analytics product. They prepare the staff for success with the demonstration of the
effective use of the system to extract insights and drive action. Once the staff
understands the product capabilities, their focus can shift to addressing key business
challenges.
Identifying Opportunities- During their interaction with the organization’s current
analytics system, data scientists question the existing processes and assumptions for
the purpose of developing additional methods and analytical algorithms. Their job
requires them to continuously and constantly improve the value that is derived from
the organization’s data.
Decision Making with Quantifiable data- There is no need to take high
risk in data gathering and analyzing from various channels with the
arrival of data scientists. Data scientists create models using existing
data that simulate a variety of potential actions. Thus, an organization
can learn which path will bring the best business outcomes.
Testing the Decisions- The primary role of data scientists involves making
certain decisions and implementing those changes. It is crucial to know
how those decisions have affected the organization. This is where a data
scientist comes in. They measure the key metrics that are related to
important changes and quantify their success.
Identification and Refining of Target Audiences- The importance of data
scientist is based on the ability to take existing data that is not
necessarily useful on its own and combine it with other data points to
generate insights an organization can use to learn more about its
customers and audience. With this in-depth knowledge, organizations
can tailor services and products to customer groups, and help
profit increases.
Recruiting the Right Talent for the Organization- The process of recruiting
is changed due to big data. In addition to resume, vast amount of
information available on talent—through social media, corporate
databases, and job search websites. Data science specialists can work
their way through all these data points to find the candidates who best fit
the organization’s needs.
ROLE OF AI AND INTELLIGENT
AGENTS IN E-BUSINESS
• Artificial Intelligence is now a common concept.
• In fact, AI is embedding itself in each and every aspect of our life. Now
when you visit a mall you see several self-checkout cash counters. Even
the airports are now being equipped with state of art advanced security
check systems.
• Some examples of AI in E-Commerce
• Chatbots- The e-commerce sites now offer 24/7 assistance and this is
because of chatbots. Earlier these chatbots just offered customary
replies, now they have turned into intelligent beings which understand all
issues which the visitors have to deal with.
• CRM- Previously human resource was the base of Customer Relationship
Management (CRM). Huge efforts were made to gather and evaluate data
so as to offer superb services to the clients. CRM pinpoints buying trends
so that your actions are directed in a right way. With advanced CRM
solutions, predictions are made with enhanced accuracy, thus the sales
team can concentrate on building long-lasting associations with
customers.
• Internet of Things- IoT intends to do-offering connectivity in all
elements of our life. From syncing different devices to programming your
washing machine, lights, household appliances and even your car, IoT is
simply everywhere.
BENEFITS OF AI IN E-BUSINESS
• Sales Forecasting- analyzing big volumes of user data and on that basis it offers
useful insights about consumer buying patterns.
• Superior services at affordable costs- It automates routine processes thus offering
personalized marketing options.
• Enhance customer satisfaction and promote sales- Besides offering superb
customer service, AI also helps in conversational commerce. This is basically real-time, a
human-like interaction between client and messenger, chatbot, or voice-chat.
• Personalized content- The Artificial technology offers any business a huge competitive
edge regardless to its size of the operation. It allows the users to pick up any item in any
picture online, and then ask to come up with similar things by making use of the image.
• AI in marketing- small business firms can use even modest budgets due to AI solutions
meant for marketing.
• Customer service- AI enables to follow an excellent customer service with perfection.
INTELLIGENT AGENTS
• An intelligent agent is a software program that supports a user with the
accomplishment of some task or activity by collecting information
automatically over the internet and communicating data with other
agents depending on the algorithm of the program.
• Business establishments want to improve sales by pulling the right
shoppers to their websites. Consumers are generally interested in
comparing various products and services to find the best product at best
price. These have led to the emergence of variety of intelligent agents
working for buyers and sellers of products and services over the web.
• Intelligent agents play an important role in e-marketing and enhancing
the eCommerce aspect of an organization. It enables both buyers and
sellers in eCommerce transactions by providing efficient, precise, and
enhanced comprehensive searches on the web and information
repository. Intelligent agents are very interactive and can perform
multiple tasks at a variety of locations. They are very powerful and
versatile than traditional search engines, spiders, and web crawlers.
• Features of Intelligent Agents
Mobility: Intelligent agents engaged in eCommerce travel from computer
to computer, across different system architecture and platforms and
gather information until search parameters are exhausted.
Goal oriented: Intelligent agent has the ability to accept the user
statement of goals and carry out the task delegated to it. It can move
around from one machine to another and can act proactively in response
to their environment, and can exhibit goal-directed behavior by taking the
initiative.
Independent: Intelligent agents function on its own without human
intervention and must have the ability to make decisions and to initiate
action without direct human supervision. They communicate
independently with repositories of information and other agents and
accomplish the objectives and tasks on the behalf of the user.
Intelligent: Intelligent Agents are able to crawl for data more intelligently.
They have intelligence to reason things out based on the existing
knowledge of its user and environment and on past experiences.
Intelligent agents follow preset rules to evaluate conditions in the
external environment.
Reduces net traffic: Agents can communicate and cooperate with other
agents quickly. This enables them to perform tasks, such as information
searches quickly and efficiently and reduces network traffic.
Multiple tasks: An intelligent agent can perform multiple tasks
simultaneously. It relieves the human user from monotonous clerical
work.
INTELLIGENT AGENTS IN E
COMMERCE
• Artificial intelligence (AI) continues to play a significant role in many
leading information systems.
• AI contribution is now crucial in systems such as workflow, data mining,
production scheduling, supply chain logistics, and most recently,
ecommerce.
• It is useful to explore the roles of agents as mediators in electronic
commerce in the context of a common framework.
Identification: This stage characterizes the buyer becoming aware of
some unmet need by inspiring through product information. Agents can
play an important role for those purchases that are repetitive (supplies) or
predictable (habits).
Brokering: two types- a) Product Brokering: once a buyer has identified a
need to make a purchase, the buyer has to determine what to buy
through a critical evaluation of retrieved product information. There are
several agents systems that lower consumers’ search cost when deciding
which products best meet their needs: B)Merchant Brokering: this stage
combines the consideration set from the previous stage with merchant-
specific alternatives to help determine who to buy from.
Negotiation: in this stage, price and other terms of the transaction are
settled on. Real-world negotiation increases transaction costs that may be
too high for either consumers or merchants. The majority of business-to-
business transactions involve negotiation.
Payment and Delivery: this stage can either signal the termination of the
negotiation stage or occur sometimes afterwards. In some cases, the
available payment or delivery options can influence product and
merchant brokering.
Product Service and Evaluation: this post-purchase stage involves product
service, customer service, and an evaluation of the satisfaction of the
overall buying experience and decision.
ETHICAL AND LEGAL COSIDERATION
IN BUSINESS ANALYTICS
• Comply with legal requirements
• Cultural and social norms
• Interest of stake holders
• Accountability
• Data protection
• Due care
• Confidentiality