Customer Analytics
Customer Analytics
Customer Analytics
CUSTOMER ANALYTICS
TABLE OF CONTENTS
0. FOREWORD 4
1. CUSTOMER ANALYTICS 6
1.2 CUSTOMERS 44
2. CUSTOMER SEGMENTATION 59
3.2 CLUSTERS 93
Page 2
CUSTOMER ANALYTICS
4. APPENDIX 104
5. BIBLIOGRAPHY 107
Page 3
CUSTOMER ANALYTICS
O. FOREWORD
As seen in other manuals, Big Data refers to all the data available to companies
regarding their customers and their processes, as well as their use in order to address
customers in a personalised manner and get better results.
This manual will focus on what is known as Customer Analytics. In other words, it will
focus on the study of customer data in order to help us to design customised actions,
plans and strategies to satisfy and retain our customers.
As seen in other manuals about business strategy, customers are an essential part of
companies since companies could hardly exist without them. That is why our
customers should not be ignored and they should be offered products and promotions
adapted to their needs and expectations because, otherwise, they will buy from our
competitors.
This manual will discuss the different customer data available to the companies and
the origin of these data, as well as the importance of using them in a way that helps us
to obtain better benefits and to make our customers happy.
We will deal with how to analyse these data, as well as the importance of
segmentation according to certain variables that allow us to personalise actions.
- Customer analytics
- Customer segmentation
Page 4
CUSTOMER ANALYTICS
These three topics will introduce the origin of our customer data, what tools can be
used to analyse this data and segment our customers based on variables. And,
customer value management will also be discussed.
Page 5
CUSTOMER ANALYTICS
1. CUSTOMER ANALYTICS
That is why customer analytics should never be ignored since thanks to this analysis we
know what our customers are like and what they need and expect from us, and this
will help us to adapt to them and offer personalised and exclusive actions and
promotions.
Before delving into customer analytics, we should first consider briefly the amount of
data available to companies today.
Some years ago, just before the rise of the internet and social media, companies only
had the customer they had reached themselves. Thus, a company could create a
database with information about customers such as their employment or address, as
well as information related to their purchases, frequency, amount spent or
satisfaction, among other aspects. These data were collected when customers
requested information, bought or purchased a product or service, or by means of
surveys.
In addition to these data, companies should now consider all information coming from
the internet or social media since users are increasingly using the networks to inform
or comment on companies and their products. The information that can be extracted
from these sources is really important. In addition to what users publish and share
with other people in the network about us or our products or services, other variables
Page 6
CUSTOMER ANALYTICS
should be taken into account such as user activity, their friends or followers and their
power of influence.
Companies have a large amount of information about their customers that may or may
not be very useful. This is because, depending on how this information is analysed and
used, it may or may not help companies to present a strategy or strategies according
to the profiles of their customers, and will also help acquiring new customers and
retaining current customers.
This new reality born through the democratisation of the internet is forcing companies
to completely change their marketing strategies. As just mentioned, the way this new
information is handled will influence our work positively or negatively.
Companies need to be aware of the importance of all this information; they should not
ignore it and should analyse it and use it to their benefit.
Thus, customer analytics can be defined as the process in which customer behaviour
data are used by companies to help them make key decisions through market
segmentation and predictive analysis. This is a very important process, since the
collected indicators should come from customer actions or attitudes. This means that
the analysed data should come from a source that has collected the customer
behaviour or the data related to it.
These data or this information about our customers can come from different sources:
- Web analytics.
Page 7
CUSTOMER ANALYTICS
- Etc.
As seen, companies have many sources to gather information and data about their
customers. However, these data and information belong to different departments,
which is a difficulty since these data do not stay in the same department. So, its
collection, analysis and use is very important.
In short, all these data and information allow the company to describe, explain and
predict the behaviour of its customers.
In addition, the main benefit that customer analytics can have is being able to make
decisions based on the data since this leads to having specific results such as:
Page 8
CUSTOMER ANALYTICS
It is important to address the two types of data that should be considered in our
analysis. Structured and unstructured data should be differentiated.
▪ Structured data
Structured data is information that is usually found in most databases of any company.
Structured data are text files usually displayed in rows and columns with titles.
These data can be ordered and easily processed by all data mining tools.
Structured data have a structure or predefined form, in which fields are fixed. In
addition, as has already been said, they are stored in tables.
This data type may come from different sources such as:
- Created data: data businesses purposely create data, generally for market
research. This may consist of customer surveys or focus groups. It also
includes more modern methods of research, such as creating a loyalty
programme that collects consumer information or asking users to create an
account and login while they are shopping online.
Page 9
CUSTOMER ANALYTICS
▪ Unstructured data
Unstructured data do not have a specific format so they cannot be stored in a table
because its information cannot be broken down into basic data types. In other words,
these data can be obtained in multimedia or PDF files or social media content.
Page 10
CUSTOMER ANALYTICS
These data cannot be structured or organised. However, they are the most relevant
data and information for companies since this information gives us data on behaviour,
preferences and user and customer satisfaction. This information is very useful when
designing marketing strategies.
It is increasingly common to find social media users who share their purchases, their
meals in a restaurant or their treatments in a beauty salon with their followers.
Although these interactions may seem very simple, they can provide very valuable
information about customers and users to a company. As is known, it is essential to
collect these data, analyse them and use them adequately to obtain benefits. It is
important to emphasise that it is unstructured data, so it is very important to make
sense of them.
Page 11
CUSTOMER ANALYTICS
In short, companies have a huge volume of data that increases very rapidly every day
and data can be structured or unstructured.
These three Vs of Big Data force the company, to a certain extent, to carry out a data
quality process since this will allow us to reduce the risk that the data are not accurate
and to make sure that the information used in our analysis is not wrong.
It is also very important to have tools to meet the objectives set that can be used by
different people involved in data analysis. As just mentioned, companies have a large
volume of data that is spread out among all their departments so these tools must be
accessible to both business users, the managers of the marketing department or the
analyst.
This helps us to monitor what is being done in order to understand what happened,
what is happening at this very moment and what is going to happen.
Page 12
CUSTOMER ANALYTICS
Given these considerations regarding data, it is important to briefly discuss the origin
of customer analytics, as this will help us to better understand the importance of this
process.
Customer analytics focuses on the interest that companies have in their customers in
order to understand them and adapt their strategies to them.
This is not new since the first actions and applications of companies aimed at
automating the activities associated with sales were in the early 90's. This meant that,
little by little, these applications of customer information management were
multiplied, giving specific solutions to the specific needs of customers. This gave rise to
independent information systems for call centre, customer service, help desk and
customer and service support departments.
However, at the end of the 1990s, the first integral solutions were created so that a
comprehensive customer management strategy, now known as CRM, was developed.
This strategy refers to two different concepts. On the one hand, it refers to a
management model of the entire organisation based on customer orientation, while,
on the other hand, it refers to the information system that supports customer
relationship management, sales force and marketing. By implementing a CRM,
companies have a broader view of the interactions with their customers, and they can
respond more efficiently.
Page 13
CUSTOMER ANALYTICS
Customer relationships
Basically a positive relationship with customers should be built to benefit both parties.
On the one hand, customers who get what they want from the company and, on the
other hand, the company that acquires or retains a customer.
Companies need a project that allows them to predict how and when they will satisfy
the needs of their customers. For this, the company must know the preferences,
behaviours and attitudes of their customers as this is the only way they will achieve
positive results. It is crucial that all departments have this information. This means that
departments or areas cannot simply know a part of the information related to
customers. Having all the information will help them to carry out complete actions
oriented at these customers and avoid generalised actions that do not take into
account the preferences, behaviours and attitudes of customers. It is very important
not only to focus on the data collected in a department, but also to consider the data
of all the departments of the company.
Page 14
CUSTOMER ANALYTICS
For this, it is essential to have a 360-degree view of customers and four data types
should be considered for that:
Processing of data
Before moving on to discuss the main features of customer analytics, as well as its
process and the technologies that are used to carry out the analysis, it is important to
introduce the processing of the data.
As seen, companies have a very large volume of data regarding their customers so
their processing and analysis is key to finding the patterns and associations among
them in order to predict the potential behaviour of customers as best as possible.
When dealing with these data to achieve the best results, it should be noted that the
analysts who perform these tasks do not use a single methodology, but rather
different methodologies to group together customers with similar attributes, optimise
marketing efforts, handle service issues and assess brand awareness. They use the
following methods or techniques:
Page 15
CUSTOMER ANALYTICS
Along with the processing of data, visualisation is very important. Data should not only
be collected and analysed, but this analysis should be understandable for all the
members of the organisation. Otherwise, actions and strategies will hardly be
successful. The visualisations of analysis should help us to understand, inform and
share information quickly and efficiently.
Page 16
CUSTOMER ANALYTICS
▪ Dashboards
Dashboards help us to quickly understand the complexity of the available data because
they allow us to create a picture of what is available and go into detail to analyse more
specific aspects.
- The number of indicators included should not be too high; only the
indicators that are really necessary should be included. This is very
important since, as already said, it is a visualisation tool that will help us to
quickly and easily understand the data available so we should avoid
including indicators that are not significant for what we want to analyse.
- These KPI should also be actionable for the business, that is, they must
allow us to carry out strategic actions to improve.
Companies can use a wide range of tools to create a dashboard. Data should be
introduced and they are converted into graphic information to make it easier to
interpret them.
Page 17
CUSTOMER ANALYTICS
In the examples below, a representation made with Excel and another representation
obtained through the Google Analytics tool are seen.
Other tools to create these panels are Cyfe, Chart.io, Klipfolio and Clicdata, among
others.
Page 18
CUSTOMER ANALYTICS
▪ Custom reports
As its name suggests, they are customised reports, that is, they are reports created
according to the current needs, in which information and data on the chosen topic will
be obtained. Thus, they only give that specific information that our company wants.
Once we have all the information regarding our company, custom reports will limit this
information to the data that really concerns us.
For example, we may have information related to the sales department of a specific
quarter or year. We have data such as the invoicing, the amount of the average
receipt, the number of daily, weekly and monthly purchases, the most and least sold
Page 19
CUSTOMER ANALYTICS
products, products that were sold together, the origin of the buyers, the payment
method, etc.
Tools like those offered by Google Analytics can be used to create these reports.
In short, dashboards give us general information, while custom reports give us only the
information related to a topic.
Given this introduction, it can be said that customer analytics refer to the processes to
capture, manage, analyse and generate strategic value of the customer data of a
company or organisation. In other words, it is about collecting all the information and
all the data related to our customers, visualising them and using them in a way that
provides value.
Page 20
CUSTOMER ANALYTICS
The features that differentiate customer analytics from other analytical systems are:
- Predictive: not only describes what the data show, but tries to predict the
reason for this behaviour.
Page 21
CUSTOMER ANALYTICS
In addition to these five general types, there are many other analyses that can be
carried out in customer analytics depending on our objectives or strategies.
Below is a list of strategies along with some of the analysis that should be considered
for their implementation.
Customer segmentation:
Page 22
CUSTOMER ANALYTICS
Customer acquisition:
- Cross sales and additional sales: involves knowing the products that are
sold together, as well as complementary or additional products that
customers consume or intend to consume.
Page 23
CUSTOMER ANALYTICS
Page 24
CUSTOMER ANALYTICS
Customer retention:
Efficiency:
Page 25
CUSTOMER ANALYTICS
In addition, the type of analysis used can be linked to the customer knowledge that is
intended to be created. These following categories or types of knowledge should be
considered:
Before moving on to deal with the customer journey and the implementation of a
customer analytics project in our company, it is worth commenting briefly on customer
analytics and Big Data.
In the era of Big Data, given the amount of data and customer information available to
companies, it is very important that they consider the following:
Page 26
CUSTOMER ANALYTICS
Big Data technologies can play a fundamental role in the design of customer
experiences, not only in data analysis, but also in their creation since they enable
further personalisation and analysis of the customer journey.
Given these considerations, the customer journey will be addressed below. After that,
we will learn how to implement a customer analytics process.
It is a Design Thinking tool that captures each of the stages, interactions, channels and
elements that customers go through when interacting with our company and brand.
Instead of looking at just a part of a transaction or experience, the customer journey
documents the full experience of being a customer.
It is important to highlight that the customer journey does not have to start when the
customer starts being our customer and end when our relationship ends, but rather
this map can be focused on a specific stage, such as the purchase process or the
Page 27
CUSTOMER ANALYTICS
When mapping this journey, the following aspects that make up the process should be
considered:
Page 28
CUSTOMER ANALYTICS
Page 29
CUSTOMER ANALYTICS
Source: https://medium.com/@azviss4/customer-journey-mapping-in-7-bookmarks-
d116da67a0c2
Page 30
CUSTOMER ANALYTICS
Examples: https://josecantero.com/2015/03/07/customer-journey-map-o-mapa-del-
viaje-del-cliente-para-gestionar-la-experiencia-de-cliente/
As seen in the examples, the objective is basically to track the journey of consumers
during a process and to assess how they feel in each of the phases of this process. This
allows us to determine the adequate processes and the processes that should be
improved to achieve a positive experience.
Now that we know how to map a customer journey, we will discuss the different
phases of a customer analytics project.
Projects of this type are usually carried out by an external company. However, the
essential steps and considerations will be discussed in order to analyse customer data
and information correctly and beneficially for the company.
The first thing to do when starting any kind of project is to have a strategic objective.
It is very important to clearly and accurately define what we want to achieve when
starting our analysis project.
To do this, two types of procedures can be used: setting SMART goals and a SWOT
analysis.
Page 31
CUSTOMER ANALYTICS
▪ SMART criteria
When we say that the objectives should be set using the SMART criteria, we mean that
these objectives should be:
- S à Specific
- M à Measurable
- A à Attainable
- R à Relevant
- T à Time-bound
Specific
Goals should be clear and specific, otherwise we will not be able to focus our efforts or
feel truly motivated to achieve them. When drafting goals, try to answer the five "W"
questions:
Page 32
CUSTOMER ANALYTICS
Attainable
Goals can be challenging but they also need to be realistic and attainable to be
successful. In other words, it should stretch our abilities, but still remain possible.
When setting an achievable goal, we may be able to identify previously overlooked
opportunities or resources that can bring us closer to it.
Relevant
Relevance is about ensuring that our goal matters to us, and that it also aligns with
other relevant goals. For example, if the goal is to launch a new product, it should be
something that is in alignment with the overall business objectives. Our team may be
able to launch a new consumer product, but if our company is a B2B that is not
expanding into the consumer market, then the goal would not be relevant.
Time-bound
Every goal needs a target date, so that we have a deadline to focus on and something
to work toward. There are three types of objectives: long-term, mid-term and short-
term.
- Customised promotions for each customer within certain segments for the
Christmas campaign.
To set goals to help us improve before setting objectives, research on the company
and the environment should be carried out to know the situation. A SWOT analysis will
be conducted for this.
Page 33
CUSTOMER ANALYTICS
▪ SWOT analysis
The SWOT analysis is usually represented as a matrix using the following methodology:
POSITIVE NEGATIVE
INTERNAL ORIGIN STRENGTHS WEAKNESSES
Attributes of the company Internal factors of the Internal factors of the
company that may be company that may be
helpful to achieve the harmful to achieve the
objectives. objectives.
EXTERNAL ORIGIN OPPORTUNITIES THREATS
Attributes of the External factors of the External factors of the
environment company that may be company that may be
helpful to achieve the harmful to achieve the
objectives. objectives.
This analysis has two dimensions, internal and external, to identify the elements of the
company that can be strengths and the elements that are weaknesses as well as the
elements and factors of the environment that can help the company to succeed.
In the external analysis, all elements of the environment that may have some type of
relationship with the company should be considered.
Page 34
CUSTOMER ANALYTICS
- Political: the stability of the country, the government system and restrictions on
imports and exports should be taken into account.
- Legal: fiscal trends and legislation should be taken into account.
- Economic: refer to public debt, wage levels, prices and foreign investment.
- Social: demographic distribution, employment and unemployment, and the
sanitation and hygiene system should be taken into account.
- Technological: the speed of technological progress and changes in the systems
should be taken into account.
By analysing these factors, we should be able to identify the elements that can help
the company to achieve its objectives, and the elements that can be harmful.
The internal analysis focuses on identifying the strengths and weaknesses that the
company has internally regarding capital resources, personnel, assets, product quality,
internal and market structure, and consumer perception, among others.
It is very important to establish what we want to achieve before analysing the data we
have since, as stated, there is a huge amount of data available so we are more than
likely to get lost among all these data if we are not clear about what we want to
achieve with them.
Once our strategic objective is established, we should work on asking the right
questions. In this way, we should ask ourselves why things happen so that with each
answer we will outline the issues that need to be analysed, thus building the path to
reach the final result.
It is very important to make sure that our objectives are well defined, so that they are
aligned with our strategy. For this, it is very important to ask the right questions,
otherwise we will not be able to identify the aspects in which we must work in our
data analysis. This means that strategies may be designed for other objectives and not
for the established objectives.
Page 35
CUSTOMER ANALYTICS
In this way, if our goal is to ensure that our customer service is excellent and is well
valued by our customers, we should ask questions about the factors that involve
customer service so that they give us the data or information that should be studied
and analysed in order to design a strategy and improvement actions.
In this phase of questions, it is also important to prioritise the problems that arise by
importance so that the most urgent problems are dealt with first. Organisation is key
since we should not try to solve all problems at the same time, but rather gradually.
Once the objectives are set and the right questions are asked, the data and
information analysis will be conducted.
As mentioned throughout the manual, companies have a large amount of data, both
structured and unstructured, so we should be very clear about the data types that will
be analysed in order to achieve our goal. For example, if we want to increase the
loyalty of our customers, it is very important to analyse the data referring to their
transactions. Otherwise, we will not know their buying tendency and it will be
impossible to predict what their behaviour will be in future purchases.
Although the company has a large amount of data, sometimes, depending on what we
want to analyse, we may lack relevant information. For this, we can always send
surveys to our customers with questions whose answers can help us in the analysis.
As previously seen, there are several types of data. They will be chosen depending on
our objective.
Once the analysis is finished and conclusions are drawn, as well as certain actions and
campaigns to achieve the established objectives, the results of our customer analytics
project will be presented. As seen, there are two tools for the visualisation and
presentation of results, dashboard and custom reports. Both will be used to present
results. It is particularly important to accompany the presentation with actions. These
actions will also involve a subsequent analysis to verify if they caused the desired
effect. As in any strategy, it is fundamental to monitor the development of the action,
since deviations can be identified and corrected.
Page 36
CUSTOMER ANALYTICS
In addition to guiding the company or organisation when carrying out strategies and
actions to achieve specific objectives, this process aims to generate a series of tangible
and intangible benefits for the organisation. These are:
Tangible benefits:
Page 37
CUSTOMER ANALYTICS
Intangible benefits:
- Identify market changes and acquire the ability to react to them quickly
thanks to our knowledge.
Although the analysis process may seem easy, it is important to note that we can often
come across problems or barriers that make our work more difficult. Customer
analytics strategies are not a simple process. Some of the most frequent barriers that
can be faced during this process are:
- Difficulty in identifying and quantifying the return on investment and the value
generated for the organisation and the customer.
Page 38
CUSTOMER ANALYTICS
- Difficulty in relating and integrating the result of the analysis with the different
business processes.
Knowing and taking into account the barriers or problems that can be faced when
launching a customer analytics project, it is important to highlight some good practices
that can help us to implement this project. These practices are:
Page 39
CUSTOMER ANALYTICS
1.2 CUSTOMERS
Having discussed what customer analytics is as well as the different data that should
be considered and how to carry out a customer analytics project; this section will deal
with another of the main elements of data analysis, customers.
In our relationship with our customers, data are essential since they allow us to
segment customers in different groups, as well to know what they want or need, or
the reasons why they came to us and not to the competition. This is why they are
incredibly helpful not only to retain customers with promotions and personalised
campaigns, but also to achieve what is known as the ideal client.
This section will focus on two aspects: the ideal customer and segmentation, which will
be discussed more in depth in the second chapter.
The objective of every business is to make money. This means that the company
should take care of certain aspects or elements such as its customers. Companies
should always pay attention to their customers in order to understand their interests
and needs so that they can adapt to them and make them become their ideal
customer.
For this, it is fundamental that, before launching any campaign or strategic action, the
customers of the company should be analysed and studied to know their interests,
motivations, needs, the reason why they came to us, their behaviour when shopping,
their origin, social status, etc.
Page 40
CUSTOMER ANALYTICS
In addition, the results and impact of each campaign should be assessed so that it
allows us to improve and achieve full satisfaction of our customers.
To achieve the ideal customer, the following actions should be carried out:
▪ Understand the attributes of our most valuable customers: the most valuable
customers refer to the customers who spend a lot of money in the company so
they bring the most benefits. It is important to know where they come from,
the most used shopping channels, whether they have an influence on social
media or not, their opinion about the company, their needs, etc. It is basically
about knowing our most valuable customers, in order to establish common
characteristics that define our ideal customer. This allows us to create a
customer profile that will be used to attract other customers that fit that
profile.
Page 41
CUSTOMER ANALYTICS
As seen, it is essential not to bother our customers with campaigns or offers they are
not interested in, or with campaigns or offers about products or services that they
have already purchased. We cannot forget that, most of the time, an excessive amount
of offers can diminish brand loyalty so we should not bombard customers, but only
send them what is necessary and may interest them.
Before analysing the data available, it is worth understanding what type of data we
have and how we have achieved it. In this way, we must ask ourselves questions like:
- The users that are going to be analysed were obtained when they subscribed or
when purchasing?
- Was the data obtained through surveys with closed questions or through the
user's own comments in surveys or in the networks?
This type of question will help us focus our analysis and our data sets.
Descriptive and behavioural data are good to start segmenting our customer base in
order to find behaviours of certain groups. However, to truly have a 360-degree view
of our customers, we should also focus on the study and analysis of interaction and
attitude data.
Page 42
CUSTOMER ANALYTICS
- Identify immediately small problems that could become bigger. This will be
achieved by analysing the differences between customers who have left and
our current customers since deviations identify data points that are
significantly different from the rest.
- Identify customers that are at risk of leaving and classify them with a note
depending on the stage of abandonment they are in.
In relation to important customers and their loyalty, Customer Life Value (CLV) should
be mentioned.
- Margin generated by the customer (annual benefit per customer less the
annual retention cost per customer).
- Retention rate.
Since marketing is now more focused and oriented to the use of data, a much more
dynamic approach for the analysis of CLV results has emerged. CLV is no longer
Page 43
CUSTOMER ANALYTICS
- Identify customers with whom our internal resources are being wasted.
The CLV calculation will make us better sellers since we will know how much we can
actually spend to acquire or retain a customer.
There are different ways to calculate the CLV. The easiest way is the following formula:
AP * PF * CL
Example:
- AP: 70
- PF: 5
- CL: 3
Page 44
CUSTOMER ANALYTICS
CLV = 70 * 5 * 3 = 1050
An approximation to the CLV of our customers is obtained with this formula, but a
much more accurate result can be obtained by adding the margin in each sale. In that
case, the formula would be as follows: AP * PF * CL * margin.
However, we should bear in mind that the value of money changes over time and not
all customers are regular so that the previous formula becomes: (AP * PF * CL *
margin) * (r / I + go)
Where:
Once the CLV is calculated, the way it is used is crucial to obtain good or bad results. In
order to achieve good results, the following five key points should be considered:
In most companies, almost all marketing budgets are aimed at acquiring new
customers. However, it is very important that the company determines how much it is
willing to spend for the acquisition of new customers, as this will help us to make
better decisions about where to allocate our marketing resources.
Acquisition tolerance can be determined using the net present value of future cash
flows, including the allocation of a percentage of the income of future promotions.
The CLV and the acquisition budget should be recalculated on a regular basis such as
the organisation and considering that customer behaviour changes.
Page 45
CUSTOMER ANALYTICS
We should make sure to focus on high value customers. We should avoid making
efforts and spending resources on customers who waste our resources and reduce our
benefits. For this, a cost analysis should be conducted.
We can send a standard offer to all our customers, or send a specific personalised offer
to a specific segment of our company. The same or a better result will be obtained, but
our marketing efforts will be focused, costs will decrease and the ROI (return on
investment) of our campaigns will increase.
- Find the best way to grow the relationship with our customers
Thanks to the information and analysis about each customer, the right message can be
sent to the right customer through the right channel.
Basically, this is about knowing our customers very well to be able to anticipate their
needs and offer them what they want. This will make them feel valued and they will
continue their relationship with us.
Thanks to the algorithms and calculation processes, deviations can be identified as well
as behaviours that put a customer at risk of loss and we can prevent them from leaving
with proactive actions to stay. They also allow us to identify customers who waste our
resources and who do not provide us with benefits, thus being able to allocate those
resources to other customers with higher value.
Page 46
CUSTOMER ANALYTICS
Each touchpoint of each department of our company can be analysed. With this, data
and key behavioural information at these points will be gathered, and we will also be
able to analyse in detail the message our customers are receiving to ensure that it is
aligned with our marketing strategy and the message of our campaigns and actions.
One of the best ways to increase CLV is to offer customers actions and promotions
based on their behaviour and preferences. That is, offering them personalised service
based on the results and conclusions drawn from the study and analysis of their data.
A good way to offer a promotion or discount to customers is right at the point of sale.
Some of the strategies that can be carried out at the point of sale are:
Adding unstructured data in our predictive models can help us to personalise even
more the cross-selling and up-selling offers, making them more attractive. It is basically
about knowing customers very well to offer them products and services that they
usually consume.
Page 47
CUSTOMER ANALYTICS
https://www.astutesolutions.com/blog/articles/what-are-the-stages-of-the-customer-
lifecycle
This second section will introduce segmentation, which will be discussed in depth in
the next chapter.
Customer segmentation involves dividing our customers into parts; each part is
characterised by features that make it different from the rest.
The first step to segment our customers is using the available data, such as descriptive
data or behavioural data.
Page 48
CUSTOMER ANALYTICS
principle, which states that 20% of a company's customers generate 80% of the
revenue.
Pareto noticed that 80% of Italy's land was owned by 20% of the population. He then
carried out surveys on a variety of other countries and found to his surprise that a
similar distribution applied.
With this analysis, it can be seen to what extent this principle is real in our case and
each customer can be placed in its step of the value pyramid.
The RFM analysis consists of classifying customers by their value according to three
variables:
Scales should be created based on these three variables in order to classify customers.
Each customer is given a value according to their group (a number of groups of the
same size, or with the same number of customers, is made). We usually work with 5
values, even though 9-10 values are also commonly used, especially in direct and
online sales. Here is an example:
Page 49
CUSTOMER ANALYTICS
It is worth mentioning that the RFM analysis has clear benefits both in strategic
segmentation and tactical segmentation of retail customers. These benefits are:
- Easily integrated into the usual promotional dynamics in a marketing area and
ideal for direct marketing and relationship marketing.
- Excellent starting point when segmenting our audience, as it tells us who the
most valuable customers are.
Page 50
CUSTOMER ANALYTICS
As said, this is the first step in segmentation. To offer promotions and offers as
personalised as possible, we should not only focus on group segmentation, but also on
segment individuals.
Micro-segmentation will be used for this, which will allow us to identify the
preferences, needs and behaviours of each of our customers. If offers are personalised
at this level, the revenue of our marketing campaign can be maximised and the
relationship with our customers can be improved.
There may be customers who have not bought for a while within this group, but
because they did it frequently and used to spend a lot, we want them back. To make
them buy again, we send a specific promotion for them. However, not every customer
will react to our communication in the same way. Some of them may prefer to receive
it by post or by email, some others may prefer an online discount code, and some
others may prefer a discount or direct promotion in store, etc.
We should know these customers very well, so that they can be classified according to
specific criteria, which allow us to carry out strategies and actions as personalised as
possible. We should basically create profiles within the groups. There are two types of
analysis to create these profiles:
Page 51
CUSTOMER ANALYTICS
Having introduced segmentation, the second subject of this manual will deal with
customer segmentation in a more specific and detailed manner.
Page 52
CUSTOMER ANALYTICS
2. CUSTOMER SEGMENTATION
As seen, customer segmentation involves dividing the general group of customers into
different groups based on a wide range of characteristics and attributes.
This chapter will continue dealing with segmentation, but it is essential to address data
exploration before. We should know perfectly what type of data we have before
segmenting our customers by groups, since this will help us to segment accurately. For
example, if we want to segment our customers according to their level of education, it
is important to verify that we have this information from most of our customers.
Before analysing our data, whether it is to segment or to carry out specific marketing
strategies, it is very important to perform data exploration tasks. These will allow us to
perfectly know the data that we have as well as to identify incorrect data.
Although there are several tools for this, here we will focus on the study of the R tool
and the RStudio work platform, which will facilitate the use of R.
RStudio is an interface that allows us to easily access the full power of R. R needs to be
previously installed to be able to use it. R is an object-oriented language for statistical
calculation and graphic generation that has a wide range of statistical and graphic
techniques that will help us in our analysis and exploration.
Page 53
CUSTOMER ANALYTICS
S and R are currently the two most widely used languages in statistical research. Their
main attractions are:
- High quality graphics: data visualisation and graphic production for articles.
- A very dynamic community, with high growth in the number of packages and
made up of highly renowned statisticians.
- Object-oriented language.
Regardless of what our goal is, it is very important to explore our data before taking
action.
Page 54
CUSTOMER ANALYTICS
By exploring the data, we will also realise what type of customers we have, which will
help us in the construction of the model.
This section will address the different ways to obtain information about our data. Data
exploration uses a combination of general statistics (means, medians, variance and
counters) and visualisations or data graphs.
▪ Problem identification
As already mentioned, our databases may be not perfect. That is why data exploration
is so important because it will help us to identify any problems that may arise in
relation to our data. The most frequent problems that can be found and how to
identify them graphically will be discussed here.
- Null values
A few null values or missing data in a variable are not a problem, but many of them in
the same variable are since this will prevent us from performing a real analysis of what
interests us.
Regardless of the reason for having null values, we should decide what to do with
them. We can either try to obtain the data related to that variable or include a variable
with null values in our model. In this second case, we should decide whether to delete
all the records where the value is null or to convert these null values into an additional
category or 0 value.
Page 55
CUSTOMER ANALYTICS
- Outliers
Not only should we check to see if there are null values or missing data, but also
determine that non-null values are correct.
- If there are data in text value when they should be expressed in numerical
value.
- If there are data out of range, as having data with value 0 or value 120 in the
field of age.
In most cases, outliers are simply because they were entered incorrectly. As with null
data, we should decide what to do with this incorrect data.
It is very important to pay attention to the variation of our data, which is to say, of the
data needed to segment our customers in relation to the objective or objectives set.
We should ensure that there is enough variation.
We should also consider the opposite case, in other words, that a range of a set of
data is too broad.
The important thing here is to determine if the range of the set of data available is very
broad or very narrow depending on the objectives.
Page 56
CUSTOMER ANALYTICS
A good way to identify these problems is visualisation. Images can offer us another
type of information different from the information given by numbers. For example, a
graph is a good way to see the customer age distribution.
These are the types of graphs that can be used to identify problems in our data:
Page 57
CUSTOMER ANALYTICS
Histograms
If we want to determine the number of customers in each age range, the histogram
will show how each column referring to an age range corresponds to a quantity.
Density plot
A density plot visualises the distribution of data over a continuous interval or time
period. It is a variation of a histogram that uses kernel smoothing to plot values,
allowing for smoother distributions by smoothing out the noise. The peaks of a density
plot help display where values are concentrated over the interval.
Page 58
CUSTOMER ANALYTICS
An advantage density plots have over histograms is that they are better at determining
the distribution shape because they are not affected by the number of bins used (each
bar used in a typical histogram).
quick-start-guide-r-software-and-data-visualisation
Bar graphs
A bar chart presents categorical data with rectangular bars with heights or lengths
proportional to the values that they represent. The bars can be plotted vertically or
horizontally.
Page 59
CUSTOMER ANALYTICS
Bars are vertical in this example. This is due to the fact that few variables are
presented. However, when the number of variables is high, it is recommended to plot
bars horizontally.
tutorial/
Page 60
CUSTOMER ANALYTICS
In graphs with horizontal bars, it is best to order variables from the highest to the
lowest since this will help reading and interpreting the graph.
The previous graphs showed one variable only. However, there are cases in which a
relationship between two variables needs to be established as well as the type of
relationship. For this, line graphs, scatter charts, hexbin plots and bar graphs for two
categorical variables can be used.
Line graphs
A line graph is used to display quantitative values changing over a continuous interval
or time span.
They are particularly useful to establish if there is a relationship between two values or
variables, since we can see how they are evolving over time.
*Ex
amp
le of
a
line
gra
ph.
Sour
ce:
https://docs.tibco.com/pub/spotfire_web_player/6.0.0-november-2013/es-
ES/WebHelp/GUID-60AD831A-D7C3-4407-AA54-44EBD23A29D0.html
Page 61
CUSTOMER ANALYTICS
Scatter plot
A scatter plot is a two-dimensional data visualisation that uses dots to represent the
values obtained for two different variables, one plotted along the x-axis and the other
plotted along the y-axis. It shows how much one variable is affected by another.
If the indicators are almost aligned in the scatter plot, it means that the two variables
are highly correlated. If the indicators are distributed evenly throughout the graph, the
correlation will be low or zero.
Page 62
CUSTOMER ANALYTICS
Hexbin plots
This type of graph is like a two-dimensional histogram. Data are divided into hexagons
and the number of data in each hex is represented by a colour.
This type of graph is exactly the same as the previously discussed bar graph. The only
difference is that two different variables are presented so there should be two bars for
each value.
The placement of the information relative to each variable can be done in several
ways, either one bar on top of the other or one bar next to another, either horizontally
or vertically. The important thing is that the person that visualises the information
knows what variable each bar refers to, in order to be able to establish the existing
relationship between the two variables.
Page 63
CUSTOMER ANALYTICS
Although we have talked about establishing the relationship of two variables, it can
also be established with more variables.
Page 64
CUSTOMER ANALYTICS
This is basically about gathering information about the data available in order to know
the types of data and their variations. Otherwise, customer segmentation and analysis
may be carried out with unreal or unrepresentative information, and the actions to be
undertaken once the analysis is completed will not be carried out successfully.
Page 65
CUSTOMER ANALYTICS
Once data are explored and verified, customer segmentation can be carried out. As
already seen, customer segmentation involves dividing our customers based on a wide
range of characteristics or attributes.
- How our customers and key potential customers are: type, consumption,
means, profitability, habits, etc.
- Types: is the set of consumers that share common features in terms of activity.
- Niches: a smaller part of the market, a very specific segment and delimited by a
specific variable.
Page 66
CUSTOMER ANALYTICS
Once the general scope of action is defined, we should identify to whom the
segmentation is directed. In customer analytics, we will focus on our customers, but
depending on the objective, such as expanding market share, segmentation can be
directed to other audiences:
- Population: refers to the entire group of people who are part of the
established set.
- Potential customers: consumers who are not our customers but who match
the profile of our customers.
As for the features or attributes to segment, there are infinite possibilities. We should
always take into account the data we have, if they have enough variation and the
established objectives. Some of the most common and most used variables or criteria
for segmentation are:
Page 67
CUSTOMER ANALYTICS
- Geographic
- Seniority
- Psychographic
- Sales volume
- Demographic
- Business profitability
- Based on benefits
- Based on use
- Strategic
- Based on behaviour
These are just some of the variables or attributes. As already said, their use will
depend on the set objectives and the available data.
When we have already decided what we are going to segment, to whom and according
to what variables, we must be clear about the strategy we want to carry out, that is,
we must be clear about the objective of the segmentation since it may be aimed at:
Page 68
CUSTOMER ANALYTICS
Given these previous considerations, we will focus on the most common classifications
of segmentation types, such as status, sales volume and frequency of purchase. We
will also discuss strategic segmentation, operational segmentation and will review RFM
analysis, previously addressed.
- Active customers: customers who buy frequently, and who bought recently or
during a specific period of time established by the company such as a period of
three months. This period of time depends on the type of company or the
product or service it offers.
- Potential customers: people who have not purchased, but who have shown
interest in the company and its products by requesting information or quotes.
They can become customers and income generators for the company at any
time.
- Probable customers: have never bought and have not shown interest in us.
However, due to their features, the company considers that they could become
generators of future income.
Page 69
CUSTOMER ANALYTICS
To segment our customers based on the sales volume, the 80/20 theory should be
taken into account, that is, 80% of our sales are made by 20% of our customers.
- Top customers: this first group includes customers that generate a volume that
is far above the average. It is important to know our top customers so that our
efforts and resources can be established based on those criteria.
- Big customers: customers that generate medium-high sales volume. They are
important customers, but they do not have the volume of top customers.
Depending on our type of business, the average purchase frequency will vary. For
example, in the case of a supermarket, the average frequency can be one or two
weeks, while in a clothing store this frequency can be higher.
Page 70
CUSTOMER ANALYTICS
- Regular customers: buy regularly, but do not purchase the average. It is very
important to maintain a high level of satisfaction by generating actions and
strategies adapted to them in order to achieve an increase in frequency.
- Casual customers: buy from time to time in our company. They also deserve
good service from the company, although most efforts and resources will be
allocated to frequent and regular customers.
Once customers are segmented in these categories, these segments should also be
segmented according to other variables, such as age, income, level of education, place
of residence, the number of people living in the household, tastes and preferences,
etc. It is very important to set clear and specific objectives so that our customers are
segmented according to the ongoing needs.
Strategic segmentation
Page 71
CUSTOMER ANALYTICS
For example, if we have a luxury fashion store and we want to grow, we can
open another store like the store we already have but in an area where we
have no presence, but where there is a large volume of potential customers or
even current customers that go to where we currently are. We can also open
an outlet offering clothes from past seasons at a lower price in order to win
customers who, due to the prices, cannot buy our products in our current
store. This will also allow us to get rid of clothing from past seasons that our
current customers do not want.
Basically, the type of product that customers want should be known, and they
should be offered that product. Zara is an example of this strategy. They
replace the pieces of clothes that are the biggest sellers in each store, which is
why the same clothes are not found in all stores. It depends on what each store
sells the most.
Page 72
CUSTOMER ANALYTICS
customers are reluctant to use the web to buy, it will make no sense to create
an application to buy since it will not be used.
Operational segmentation
Page 73
CUSTOMER ANALYTICS
RFM analysis
As already seen, RFM analysis is a simple and effective segmentation method based on
the Pareto principle, which states that 20% of a company's customers generate 80% of
the revenue.
With this analysis, we intend to verify to what extent this principle is real in our case so
that each customer can be placed in their step in the value pyramid. In other words, it
allows us to order our customers from higher to lower value so that we can identify
customers that generate higher profitability and customers who do not.
The RFM analysis consists of classifying customers by their value according to three
variables:
Page 74
CUSTOMER ANALYTICS
Knowing the types and segments of customers of our company or organisation as well
as the data referring to them, we should continue to what is known as customer value
management. This will help us to adapt to each type and each segment in the best way
possible in order to achieve the total satisfaction of our customers, their loyalty and,
therefore, increase our profits.
As has already been said, adapting to different types of customers as well as having a
good relationship with them is the key to lasting benefits. We are only not interested
in selling a lot with a specific campaign that attracts many new customers. In addition
to attracting new customers, we should also retain current customers so that they
become permanent or lasting customers who are happy with the company and with
the treatment they receive.
- Customer identification
- Contact management
- Campaign management
- Customer score
Page 75
CUSTOMER ANALYTICS
- Customer identification
- Campaign management
- Appropriate retention
▪ Customer identification
As seen in the previous two chapters, for adequate customer management that gives
good results, it is essential to make an appropriate identification in order to know
everything about customers such as residence, education, age, economic status,
consumption habits, needs and satisfaction, among others.
Data and information about our customers should be collected in order to segment
them according to certain variables and know perfectly the different profiles of
customers we have.
Page 76
CUSTOMER ANALYTICS
will not be targeted with the same campaigns as customers with lower value who
purchase monthly.
In short, this is about knowing how to identify the customer types or segments of our
company to then offer them what best suits their characteristics and needs.
- Basic structure: together with operating systems and flat files, it provides raw
data that are stored together with metadata. End users can access them for
analysis, report generation and mining.
- Test area: this area can be placed between the data sources and the
warehouse, and provides a place where data can be cleaned before entering
the warehouse.
- Data marts: systems designed for a particular business line. Data marts are
data subsets that aim to help a specific area of the business to make better
decisions. There may be separate data marts in different departments or
categories such as sales, purchases, inventory, etc., to only access the category
that is needed for the analysis.
Page 77
CUSTOMER ANALYTICS
We mean that we should focus on the analysis of the customer in relation to the data
available, that is to say, in relation to the data we have entered in our Data
Warehouse.
In short, the budget for the actions and marketing campaign for each group or
segment of customers should be established. Most resources are used for customers
that bring higher profits.
▪ Campaign management
Page 78
CUSTOMER ANALYTICS
The success of customer value management lies mainly in the ability to analyse
customer data, predict their behaviour as well as their needs and focus customer
relationship management mainly on how to treat individual customers so that actions
and promotions for each type of customer can be personalised. It is an essential cycle
or process for successful marketing campaigns and actions.
In this process, advanced data analysis will provide us with a collection of prediction
models. The execution of these models translates into each customer being assigned
with a predictive score that expresses the probability that the customer behaves or
responds to a specific offer, action or promotion in a particular way. These models,
which are functions of a reduced set of variables and are derived from a sample of
customer data, should be executed in the entire customer base.
As already seen, our customers can be segmented based on many variables such as
demographic data, behavioural data or data referring to the frequency of purchase. A
prediction model for each type of segment should be established, since each group or
segment is different. This means that what works with some customers may not work
with some others so it is very important to adapt to each group.
Each of the segments or groups of customers that were identified in the first phase will
be assigned a basic customer marketing strategy. This will allow us to know what kind
of campaigns or actions are appropriate and will be profitable for each segment. As
said, personalisation of campaigns based on the type of customer is fundamental to
obtaining value and achieving our customers’ loyalty. This means that general
campaigns and strategies will not be carried out since they will not work for all
customers.
Customer relationship management is very important here. It is the main vehicle for
delivering results of the prediction models and treatment recommendations of the
decision engine of the workers who are in contact with customers. It also helps us to
Page 79
CUSTOMER ANALYTICS
track any interaction with customers as well as the reactions and responses of
customers to the different inputs they receive.
▪ Adequate retention
The design and implementation of actions and specific marketing campaigns for each
group or customer segment aims to retain customers.
Customer retention is effective when we manage to retain the right customers, that is,
the most valuable customers for the company.
As already mentioned, it is particularly important to identify and get to know all of our
customers in order to make an adequate segmentation based on certain criteria. This
will help us to know what customers have higher and lower value in order to retain
high-value customers and turn low-value customers into high-value customers.
The company should know the value that customers perceive of our company since
this will help us when planning and defining strategies. To achieve better results, we
Page 80
CUSTOMER ANALYTICS
will not only focus on the most valuable customers, but also on the elements that
make the customer perceive value of our company.
Customer Perceived Value is the evaluated value that a customer perceives to obtain
by buying a product. It is the difference between the total obtained benefits according
to the customer perception and the cost that he had to pay for that. Customer
perceived value is seen in terms of satisfaction of needs a product or service can offer
to a potential customer. The customer will buy the same product again only if he
perceives to be getting some value out of the product. Hence, delivering this value
becomes the motto of marketers.
As seen, managing these elements will help us to contribute to our customers having a
good perception of our company so they should not be ignored in our customer value
management process.
Customers are the main element of any company or organisation, and that is why their
satisfaction and loyalty should be our main objectives. If they are happy, they will
consider our company has higher value and this will make them become a valuable
customer, which will have a positive impact on our profits.
customers.
Page 81
CUSTOMER ANALYTICS
3.2. CLUSTERS
The previous subject showed a series of techniques and graphic representations that
allow us to establish relationships between certain variables in order to segment our
customers and identify higher-value customers.
These techniques are visual so they show relationships between variables that are easy
to see. This section will focus on establishing the relationships between variables that
cannot be easily seen. The so-called unsupervised methods will be used for this.
The main objective of the cluster analysis is to group the observations of our data in
different clusters with similar data.
Customers are clustered based on variables that seem to be unrelated. For example, in
a company that sales package holidays, customers will not be only be clustered
according to variables such as age or purchasing power, but also according to their
preferred means of transport, the destinations they have visited and would like to visit,
their preferred type of accommodation, or the leisure activities in which they
participate during trips, among others.
Thanks to dividing our customers into more complex groups, we will be able to offer
them the products that best suit their needs, tastes and preferences.
- Hierarchical clustering
- K-means clustering
Before dealing with these two approaches, cluster analysis will be further discussed. As
already mentioned, it is a set of techniques used to classify a group of individuals into
Page 82
CUSTOMER ANALYTICS
homogeneous groups. The main feature of this type of analysis is that groups are
unknown beforehand so these groups should be established.
The main objective of this analysis is to obtain classifications through an analysis with a
significant exploratory nature. Our customers are basically clustered according to
variables that are not easily seen, but require further analysis and exploration.
Cluster analysis seeks to find groups to assign individuals according to some uniformity
criterion so it is essential to establish a measure of similarity or divergence in order to
classify individuals in groups.
- Before starting the analysis, it is important to decide if the analysis will start
either from an initial cluster already made, or if each individual element will be
considered as an initial cluster that will be later clustered until obtaining the
final clusters. That is, we should decide if the analysis will start from groups or
segments already made such as age segments, social class or place of
residence, or if each customer will be considered individually.
The complete process of analysis will be structured based on the following points:
- The analysis will be based on a certain number of individuals that, as said, may
already be clustered, or each member will be considered individually.
Page 83
CUSTOMER ANALYTICS
Similarity criteria
As stated previously, similarity criteria are very important to analyse clusters. This is
the starting point for establishing clusters of our customers.
Once the selection variables to be considered in our analysis are established, similarity
criteria should be established. In other words, we should establish how similar or
different the individuals are to each other since groups will be created based on this.
To measure how similar the individuals we want to cluster are, there is a wide range of
similarity and dissimilarity indices. Here we will focus on the concept of distance. Four
types will be differentiated:
- Euclidean distance
- Hamming distance
- Manhattan distance
- Cosine similarity
Page 84
CUSTOMER ANALYTICS
- Euclidean distance
It is the most popular type of distance and it measures the distance between two
vectors.
This distance will be used when variables are homogeneous and are measured in
similar units and/or when the variance matrix is unknown.
Page 85
CUSTOMER ANALYTICS
- Hamming distance
- Manhattan distance
The distance between two points measured along axes at right angles. Manhattan
distance is often used in integrated circuits where wires only run parallel to the X or Y
axis.
- Cosine similarity
Measures the smallest angle between two vectors (the angle between two vectors is
supposed to be between 0 and 90 degrees).
- Two parallel vectors are the most similar. They are even identical if both are
assumed to be based on the origin.
Page 86
CUSTOMER ANALYTICS
Data preparation
Having discussed the distances in cluster analysis, it is worth dealing with data
preparation. As mentioned throughout the manual, it is not enough to just take the
data and analyse and segment them, it is also very important to prepare such data and
verify that there is enough correct and valid information for the analysis.
If a cosmetics company wants to establish age groups with gender differentiation for
their best-selling products, they should have the whole information, since on the
contrary, they can hardly segment correctly. If they do not know the age of part of
their customers, that part will not be segmented and the analysis and the following
campaign will not have the expected benefits.
Another essential aspect to take into account in data preparation refers to the units or
metrics in which some of these data are expressed as well as their names. We must
ensure that data are expressed in the same unit of measure or with the same name.
Otherwise, they will be considered as different variables.
For example, it is not the same to measure the height of someone as 1.60 m and 160
cm, or to write the name of a city as Barcelona, Barna or Bcn; three names that refer
to the same city but that would be considered different variables.
Given these considerations, the two approaches mentioned at the beginning will be
discussed. After these, the PCA (principal component analysis) will be addressed.
▪ Hierarchical clustering
This approach presents a wide range of methods that aim to group clusters in order to
make a new cluster or to divide an already existing cluster into two different clusters.
Thus, if this agglomeration or division process is carried out successively, some
distance is minimised or some similarity measure is maximised.
Page 87
CUSTOMER ANALYTICS
- Divisive: This variant is also called top down clustering. All observations start in
one cluster, and splits are performed recursively as one moves down the
hierarchy.
Page 88
CUSTOMER ANALYTICS
The different types that can be found in the bottom up and top down methods will not
be discussed. Their calculation will not be addressed either since it is currently
automated through computer software.
▪ K-means clustering
K-means clustering is a type of unsupervised learning which is used when you have
unlabelled data (i.e., data without defined categories or groups). The goal of this
algorithm is to find groups in the data, with the number of groups represented by the
variable K. The algorithm works iteratively to assign each data point to one of K groups
based on the features that are provided. Data points are clustered based on feature
similarity.
The Κ-means clustering algorithm uses iterative refinement to produce a final result.
The algorithm inputs are the number of Κ clusters and the data set. The data set is a
collection of features for each data point. The algorithms start with initial estimates for
the Κ centroids, which can either be randomly generated or randomly selected from
the data set. The algorithm then iterates between two steps:
- Data assignment step: Each centroid defines one of the clusters. In this step,
each data point is assigned to its nearest centroid based on the squared
Euclidean distance.
- Centroid update step: In this step, the centroids are recomputed. This is done
by taking the mean of all data points assigned to that centroid's cluster.
This is the most popular clustering algorithm when data are numerical and the
distance between them is Euclidean. Although the number of clusters should be
chosen in advance, it is an easy application and implementation method in addition to
being faster than hierarchical clustering in large data sets.
Page 89
CUSTOMER ANALYTICS
Before dealing with PCA in the development of clusters, it is important to consider the
following aspects regarding clustering:
- The main objective of clustering is to find the similarities between our data that
are not easily seen.
- In a good clustering, the points of the same cluster have to be more similar
than the relationship of each of the points to the points of other clusters.
- When clustering, the units of each variable matter. Different units cause
different distances and, potentially, different clusters.
The new principal components after the reduction will be a linear combination of the
original variables and they will also be independent of each other.
Interpreting factors is fundamental in PCA since this is not given and should be
deduced after observing the relation of the factors with the initial variables. This
means that both the sign and the magnitude of the correlations should be studied. This
Page 90
CUSTOMER ANALYTICS
task is not always easy, so the knowledge of the expert on the subject of research will
be essential.
- Selection of the factors: this selection is made in such a way that the first factor
gathers as much as possible of the original variability, the second factor should
collect the maximum possible variability not collected by the first factor and so
on. Of the total of these factors, those that collect the percentage of variability
that is considered sufficient will be chosen. These selected factors will be called
principal components.
- Factor analysis: when the principal components are already selected, they are
represented as a matrix. Each element of this matrix represents the factorial
coefficients of the variables (the correlations between the variables and the
principal components). This matrix will have as many columns as principal
components and as many rows as variables.
- Calculation of factor scores: scores of the principal components for each case
that allow their graphic representation.
Page 91
CUSTOMER ANALYTICS
4. APPENDIX
After dealing with the theory of how customer analytics work and all the elements and
considerations that should be taken into account for its implementation to be
beneficial for our company or organization, this section will offer all some practical
examples and information.
As seen, the customer journey is a tool that allows us to visually trace and see the
journey of customers in our company.
The following links offer practical examples on how to effectively create a customer
journey map.
Link: https://www.ngdata.com/how-to-create-a-customer-journey-map/
Link: https://business.tutsplus.com/tutorials/customer-journey-map--cms-27014
Link: https://www.youtube.com/watch?v=A2LFJF1SUBg
Link: http://ncss-
tech.github.io/stats_for_soil_survey/chapters/1_introduction/1_introduction.html
Link: https://www.youtube.com/watch?v=mcYcjH-1giM
Page 92
CUSTOMER ANALYTICS
Link: http://www.statisticssolutions.com/cluster-analysis-2/
Link: https://www.focus-
balkans.org/res/files/upload/file/9%20Cluster_Analysis%20Schaer.pdf
These links extend the information regarding hierarchical clustering. The different
types of methods and the formulas for their calculation are presented.
Link: https://www.saedsayad.com/clustering_hierarchical.htm
Link: https://uc-r.github.io/hc_clustering
4.5 K-MEANS
The following links are practical examples of this algorithm that will help us to review
and expand the previous information.
Link: https://www.edureka.co/blog/k-means-clustering/
Link: https://people.revoledu.com/kardi/tutorial/kMean/NumericalExample.htm
Link: https://www.utdallas.edu/~herve/abdi-awPCA2010.pdf
Link: https://www.youtube.com/watch?v=eJ08Gdl5LH0
Page 93
CUSTOMER ANALYTICS
5. BIBLIOGRAPHY
5.1 BOOKS
Bill Schmarzo. Big Data. El poder de los dtos. Anaya Multimedia, 2014.
5.2 WEBSITES
Gabriela Campos Torres, 2016. Customer joruney: qué es y cómo crear uno.
https://blog.fromdoppler.com/customer-journey-map-como-crear-uno/
Page 94
CUSTOMER ANALYTICS
Iván Ruíz Sevilla. Sobre el cálculo del valor de vida del cliente.
http://www.ivanruiz.es/como-calcular-el-valor-del-ciclo-de-vida-de-un-cliente-o-
customer-lifetime-value-2/
Page 95