Unit 5

Recommendation Evaluation and Validation
In machine learning, recommendation evaluation and validation are essential processes to
assess the performance, accuracy, and effectiveness of recommendation systems. These
processes ensure that the recommendation system provides relevant and useful
recommendations to users. Here's a brief description of recommendation evaluation and
validation:
Recommendation Evaluation:
• Definition: Recommendation evaluation involves assessing the quality and relevance
of the recommendations generated by the recommendation system.
• Purpose: The goal of recommendation evaluation is to measure how well the
recommendation system performs in terms of accuracy, precision, recall, coverage,
diversity, and other relevant metrics.
• Methods: Various evaluation metrics are used to measure the performance of
recommendation systems, such as precision, recall, F1-score, mean average precision
(MAP), normalized discounted cumulative gain (NDCG), and accuracy.
• Process: The recommendation system's predictions are compared with ground truth
data (actual user preferences or interactions) to calculate evaluation metrics. These
metrics provide insights into the system's strengths and weaknesses and guide
improvements.
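As a rough illustration of the ranking metrics named above, the following sketch computes average precision and NDCG for a single user. The item names and the binary notion of relevance are assumptions made for the example; MAP is simply the mean of the average precision values over all users.

```python
import math

def average_precision(ranked_items, relevant):
    """Average of precision@k taken at each rank k where a relevant item appears."""
    hits, total = 0, 0.0
    for k, item in enumerate(ranked_items, start=1):
        if item in relevant:
            hits += 1
            total += hits / k
    return total / len(relevant) if relevant else 0.0

def ndcg_at_k(ranked_items, relevant, k):
    """Binary-relevance NDCG@k: DCG of the ranking divided by the ideal DCG."""
    dcg = sum(1.0 / math.log2(i + 1)
              for i, item in enumerate(ranked_items[:k], start=1)
              if item in relevant)
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(i + 1) for i in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0

ranked = ["a", "b", "c", "d"]   # hypothetical system output, best first
truth = {"a", "c"}              # hypothetical ground-truth items
ap = average_precision(ranked, truth)
ndcg = ndcg_at_k(ranked, truth, k=4)
```

Both metrics reward placing relevant items near the top of the list, which plain precision and recall do not capture.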
Recommendation Validation:
• Definition: Recommendation validation involves validating the recommendation
system's performance using separate datasets to ensure its generalization ability and
reliability.
• Purpose: The purpose of recommendation validation is to assess how well the
recommendation system generalizes to unseen data and performs under various
conditions.
• Process: Data is split into training, validation, and test sets. The training set is used to
train the recommendation model, the validation set is used to tune hyperparameters
and assess performance during training, and the test set is used to evaluate the final
performance of the model. The recommendation system's predictions are compared
with ground truth data in the validation and test sets to measure its performance using
evaluation metrics.
• Benefits: Validation helps confirm that the recommendation system will perform well in
real-world scenarios, gives a less biased estimate of its performance than results on the
training data, and guides improvements and optimizations.
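The split described above can be sketched in a few lines. This is a minimal example that assumes interactions are stored as (user, item, rating) tuples and uses a random split; the function name, the fractions, and the data are illustrative, and splitting by time is a common alternative when interactions carry timestamps.

```python
import random

def split_interactions(interactions, val_frac=0.1, test_frac=0.1, seed=42):
    """Shuffle interactions and cut them into train, validation, and test lists."""
    rows = list(interactions)
    random.Random(seed).shuffle(rows)
    n_test = int(len(rows) * test_frac)
    n_val = int(len(rows) * val_frac)
    test = rows[:n_test]
    val = rows[n_test:n_test + n_val]
    train = rows[n_test + n_val:]
    return train, val, test

# Hypothetical (user, item, rating) records
data = [(u, i, (u + i) % 5 + 1) for u in range(10) for i in range(10)]
train, val, test = split_interactions(data)
```

Fixing the seed keeps the split reproducible, so metric changes between runs reflect the model rather than the partition.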
In summary, recommendation evaluation and validation are critical processes in machine
learning that assess the performance and reliability of recommendation systems, ensuring that
they deliver accurate and relevant recommendations to users. These processes involve
measuring the system's performance using evaluation metrics and validating its
generalization ability using separate datasets.
Describing Recommendation Engines
Recommendation engines are advanced data filtering systems that predict which
content, products, or services a customer is likely to consume or engage with. One doesn’t
need to look far to see one in action. Every time someone chooses a TV show using Netflix’s
“You May Also Like…” feature or buys a product Amazon recommends, they’re using
powerful recommendation engines.
Recommendation engines (sometimes called recommenders) are win-win features for
both customers and the businesses that deploy them. Customers enjoy the level of
personalization and assistance a well-tuned recommendation engine provides. Businesses
build them because they fuel engagement and encourage sales.
Accurate recommendations don’t appear out of thin air. Businesses must invest in
data solutions capable of analyzing a high volume of products and identifying patterns in
customer behavior. Only then can they unlock the true value of their customer data and make
recommendations that positively impact revenue.
Key takeaways
• Recommendation engines are advanced data filtering systems that use behavioral data,
machine learning, and statistical modeling to predict the content, products, or services
customers will like.
• Customers are drawn to businesses that offer personalized experiences.
• The three main types of recommendation engines are collaborative filtering, content-based
filtering, and hybrid filtering.
• Recommenders improve revenue by encouraging cross-selling, suggesting product
alternatives, and drawing attention to items abandoned in a digital shopping cart.
What is a recommendation engine?
Recommendation engines are tools that leverage predictive analytics to help companies
anticipate their customers’ wants and needs. The engines use machine learning and statistical
modeling to create advanced algorithms based on a business’s unique historical and
behavioral data. The resulting recommendations are based on some combination of:
• A customer’s past behaviors and history
• A product’s ranking by consumers
• The behaviors and history of a similar cohort
Recommendations are most accurate when there’s a great volume of data at a company’s
disposal. The more active users a product has, the more data there is to compare behaviors
and preferences across demographics.
However, not every bit of data collected will be relevant or even reliable. Building
recommendations on bad data results in recommendations that are inaccurate and unhelpful.
The first step in creating a workable recommendation engine is adopting a proper data
management strategy and analytics stack that collects and verifies data before it is put to use.
Types of Recommendation Engines
Not every recommendation engine uses the same methodology to form predictions.
Recommenders typically achieve results using one of three types of data filtering: content-
based, collaborative filtering, or a combination of the two.
Content-based filtering
This type of filtering is used in “Similar items include…” recommenders. Content-based
filtering creates predictions on the actual qualities of the products and services being offered.
Products in this system are assigned attributes that can be compared to other products
directly. Companies choose the types of attributes used by the engine based on the type of
products being consumed.
For instance, an e-commerce website that specializes in selling groceries might tag its
products with the following attributes:
• Type of food (e.g., “fruit” or “cereal”)
• Established taste (e.g., “bitter” or “sweet”)
• Container (e.g., “box” or “can”)
• Brand
The recommender would then compare items historically purchased by the user, or those
currently in their shopping cart, to other similar or linked items. Attributes are weighted by the
number of items in the database that share the tag, with more common tags receiving higher
rankings than uncommon ones. This weighting determines which items appear first in a list of
recommendations.
Content-based filtering doesn’t require the input of other customers to make predictions. It
bases its predictions on similarities within a customer’s own behavioral and historical profile.
A well-designed content-based filtering engine will identify specific quirks and interests that
may not have broad appeal to other customers.
A major drawback of this type of recommendation engine is that it requires a great deal of
maintenance. Attributes must be added and updated constantly to keep recommendations
accurate, a daunting task for businesses with a high volume of products. Additionally, the
attributes themselves must be accurate. Labeling a Honeycrisp apple “red” is easy, but more
complex content may require a dedicated team of subject matter experts to correctly label
each individual product.
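A small sketch can make the attribute comparison concrete. It assumes each product is described by a set of tags and scores two products by Jaccard similarity (shared tags divided by all tags). The catalog, brand names, and function names are invented for illustration, and the tags are unweighted here, which is simpler than the weighting scheme described above.

```python
def jaccard(a, b):
    """Share of tags two items have in common (0 = none, 1 = identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Hypothetical catalog: item -> set of attribute tags
catalog = {
    "honeycrisp apple": {"fruit", "sweet", "bag", "BrandA"},
    "gala apple":       {"fruit", "sweet", "bag", "BrandB"},
    "corn flakes":      {"cereal", "sweet", "box", "BrandC"},
    "black coffee":     {"drink", "bitter", "can", "BrandC"},
}

def similar_items(item, catalog, top_n=2):
    """Rank every other item by tag overlap with `item`."""
    scores = [(other, jaccard(catalog[item], tags))
              for other, tags in catalog.items() if other != item]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]

recs = similar_items("honeycrisp apple", catalog)
```

Note that no other customer's behavior is involved: the ranking depends only on how the items themselves are labeled.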
Collaborative filtering
This method of filtering is what’s used in “People who watched this show also watched…”
types of recommenders. Collaborative filtering uses behavioral data to determine what a
person will like based on how their preferences compare to other users. Whereas content-
based filtering focuses on linking products to other products, collaborative filtering builds
predictions by linking similar customer profiles.
For example, imagine using a video streaming platform that uses collaborative filtering.
When you go to find a movie, you create data based on a number of behaviors, including:
 Movies you watch
 Titles you select but ultimately do not watch
 Selections you hover over
 Searches you make
 Rankings you give films
The recommender then effectively builds a user profile for you based on this data set. It then
compares your profile against a cohort of users who behave similarly. The resulting
predictions are based on the movies this cohort has consumed and enjoyed rather than on the
actual content of each film.
Collaborative filtering doesn’t require product feature information. This makes maintenance
less time-consuming than that of a content-based engine. However, a reliance on other
customers’ behaviors can create data gaps. Say no one interacts with your favorite movie on a
streaming service. A movie that’s perfectly suited to your interests won’t be recommended
because the recommendation engine won’t have any behavioral data with which to form a
prediction.
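The “people who watched this also watched” idea can be sketched with simple co-occurrence counts. The watch histories and the function name below are hypothetical, and a production system would use far richer behavioral signals, but the sketch shows both the mechanism and the data gap described above.

```python
from collections import Counter

# Hypothetical watch histories: user -> set of titles watched
histories = {
    "ann":  {"Alien", "Blade Runner", "Dune"},
    "ben":  {"Alien", "Blade Runner"},
    "cara": {"Alien", "Dune", "Notting Hill"},
    "dev":  {"Notting Hill", "Amelie"},
}

def also_watched(title, histories, top_n=2):
    """Count which other titles appear in the histories of users who watched `title`."""
    counts = Counter()
    for watched in histories.values():
        if title in watched:
            counts.update(watched - {title})
    return counts.most_common(top_n)

recs = also_watched("Alien", histories)
```

No one who watched "Alien" also watched "Amelie", so it can never be suggested here regardless of how well it might suit the viewer.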
Hybrid filtering
Hybrid filtering attempts to address the shortcomings of both content-based filtering and
collaborative filtering by combining the two methods. As such, it is often the most effective of
the three types of recommendation systems, at the cost of added implementation complexity.
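One simple way to combine the two methods is a weighted blend of their scores. The sketch below assumes both recommenders already produce scores between 0 and 1; the weight `alpha`, the item names, and the scores are illustrative, and weighted blending is only one of several possible hybrid designs.

```python
def hybrid_scores(cf_scores, content_scores, alpha=0.7):
    """Blend two score dictionaries; items missing from one source score 0 there."""
    items = set(cf_scores) | set(content_scores)
    return {item: alpha * cf_scores.get(item, 0.0)
                  + (1 - alpha) * content_scores.get(item, 0.0)
            for item in items}

# Hypothetical scores in [0, 1]; "new_film" has no behavioral data yet
cf = {"film_a": 0.9, "film_b": 0.4}
content = {"film_a": 0.2, "film_b": 0.6, "new_film": 0.8}
blended = hybrid_scores(cf, content)
ranking = sorted(blended, key=blended.get, reverse=True)
```

Because the content-based score still contributes, "new_film" receives a nonzero score even though collaborative filtering has nothing to say about it.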
How Recommendation Engines Are Used
Recommendation engines are used across a variety of industries, and have become a popular
means of improving both customer experience and a company’s bottom line.
E-COMMERCE
In e-commerce, recommendation engines play a crucial role in driving sales. About 35
percent of purchases on Amazon come from product recommendations, according to a
McKinsey & Company report. These days, messages like “you may also like this” and “buy
this product again” are a familiar sight on just about every online retail site.
Recommendation engines are also used to identify products that are frequently bought
together by customers and present them as bundled or related items. For example, if a
shopper is searching for dumbbells, the recommendation engine may suggest compatible
accessories like yoga mats and resistance bands.
Recommendations based on things like location, season, price point and similar users are also
common tactics in e-commerce, and are used as a way to incentivize customers to keep
shopping.
SOCIAL MEDIA
Social media platforms like Facebook and Instagram use recommendation engines to suggest
friends or groups based on a user’s existing network, interests and location. They also use
them to show relevant posts and advertisements, depending on a user’s preferences.
For example, YouTube considers a viewer’s watch history and ratings to suggest new videos.
And TikTok considers videos the user has interacted with in the past, accounts and hashtags
they’ve followed, the type of content they create, and their location and language preferences
to determine what videos to show on their For You page.
MEDIA STREAMING
When a user browses movies and TV shows on a streaming platform like Netflix, Hulu or
Max, the recommendation engine analyzes their viewing history, searches and previous
ratings to suggest content they’re likely to watch and enjoy. Once a user finishes watching
that content, the recommendation engine suggests the next title to watch. All of this is a
useful way of keeping users engaged and reducing the time they spend searching for content.
Gaming platforms, like Steam and PlayStation Store, and music streaming services, like
Spotify and SoundCloud, also use recommendation engines to suggest relevant content based
on a user’s preferences and historical data.
Benefits of Recommendation Engines
Recommendation engines can be beneficial both to the companies that deploy them and the
users that encounter them.
IMPROVES CUSTOMER EXPERIENCE
A more personalized experience can lead to more satisfied, engaged and loyal customers,
mainly because they are being fed the content or products they want without having to put in
the effort of finding it themselves.
After all, a lack of a recommendation engine creates a “pretty subpar experience” for
customers, as Amplitude’s Thompson put it. Without it, our social media feeds would be full
of content we don’t care about. And we’d have to search for every product, movie, show and
song ourselves, which would be a pretty time-consuming undertaking.
INCREASES TIME SPENT ON PLATFORM
Social media platforms, media streaming services and even news outlets all want people to
spend as much time as possible on their sites. Consistently providing relevant
recommendations of more videos to watch, songs to listen to and articles to read keeps users
hooked. This translates to higher click-through rates, more conversions and, as is often the
case with websites, more dollars.
BOOSTS REVENUE
Perhaps the biggest benefit of recommendation engines — on the business side, at least — is
that they can help platforms make more money. Not only do recommendation engines
incentivize people to make more purchases (a technique known as cross-selling), but they can
also suggest product alternatives and draw attention to items that have been abandoned in a
customer’s online shopping cart.
Even if a company isn’t in the business of selling physical products, recommendation engines
can still do wonders for their bottom line. For example, if Netflix’s recommendation engine
consistently feeds viewers content they enjoy watching, they’re less likely to cancel their
subscription or choose another streaming service, saving Netflix about $1 billion a year,
according to the company.
“If you’re an organization that’s looking to increase revenue, being able to provide tailored
experiences for your customers based off their likelihood to purchase or likelihood to
complete a particular action, drives growth for your business,” Thompson said.
Challenges of Recommendation Engines
Recommendation engines do come with some challenges, though.
LIMITED TO WHAT THEY ALREADY KNOW
A recommendation engine is only as good as the data it’s fed. If it doesn’t have accurate or
abundant information about users or items, it likely won’t work correctly.
“They’re limited in their knowledge,” Alexander Marmuzevich, founder and CTO of InData
Labs, told Built In. “They can’t propose something which doesn’t exist, they can’t generate
completely new ideas.”
A common example of this is what Alexei Tishurov, a lead data scientist at InData Labs, calls
a “cold start problem.” This is when a recommendation engine struggles to deal with new
users who have not yet provided enough data for the engine to make accurate
recommendations. New items with little or no historical data tied to them can be challenging
for the engine as well.
“You need to have users interacting with items to do collaborative filtering,”
Tishurov explained. “But if you have a completely new service you do not have such
history.”
CAN BE BIASED
Like any machine learning system, recommendation engines can produce biased results if
they are based on biased data. This can result in inaccurate or even discriminatory
recommendations, posing both functional and ethical problems.
By extension, recommendation engines may fall victim to popularity bias, where popular
items tend to be suggested more frequently than lesser-known items. This can lead to a lack
of diversity in the recommendations, and prevent users from discovering niche or less popular
items.
GATHERING CUSTOMER DATA CAN BE TRICKY
Data is the backbone of recommendation engines. But as regulations and policies regarding
the collection and storage of data continue to evolve, acquiring enough accurate customer
data to generate decent recommendations will be an ongoing challenge.
Companies have to be sure they’re compliant with whatever security and privacy regulations
exist within the jurisdictions they’re operating out of. And even then, customers can often opt
out of providing the data recommendation engines need.
“If a customer is not giving you permission to track them or track their behavior while they’re
browsing your website, it’s a lot harder for you to provide those tailored experiences,”
Thompson said. Sites like Netflix and Amazon “can’t operate without being able to use the
models to provide tailored recommendations,” he continued. “It’s a core, business critical
system when it comes to providing their service.”
Comparing the Types of Recommendation Engines
| Recommendation Engine Type | Description | Pros | Cons |
|---|---|---|---|
| Collaborative Filtering | Recommends items based on user-item interactions and similarities with other users. | Effective for recommending items based on user preferences; Can handle sparse data | Cold start problem for new users/items; Difficulty in handling large datasets; May suffer from the "popularity bias" |
| Content-based Filtering | Recommends items similar to those a user has liked in the past, based on item attributes/features. | No cold start problem as it relies on item features; Can provide explanations for recommendations | Limited to recommending items with known features; May suffer from the "over-specialization" problem |
| Hybrid Recommender Systems | Combine collaborative and content-based approaches to leverage the strengths of both. | Can mitigate limitations of individual approaches; Provides more accurate recommendations | Increased complexity in implementation; Requires more computational resources |
| Knowledge-based Recommender Systems | Recommends items based on explicit knowledge about user preferences and item characteristics, often using rules or knowledge graphs. | Can handle cold start problem effectively; Can provide personalized recommendations based on user preferences | Requires significant initial knowledge about users and items; Maintenance of knowledge base can be labor-intensive |
| Matrix Factorization | Techniques that decompose user-item interaction matrices to discover latent factors and make recommendations. | Effective in handling sparsity and scalability; Can capture complex user-item relationships | Requires careful tuning of parameters; Cold start problem for new users/items |
| Deep Learning Recommender Systems | Utilizes deep learning models to capture complex patterns in user-item interactions and make recommendations. | Can handle complex data and relationships; Can automatically learn features from raw data | Requires large amounts of data for training; Computationally expensive and resource-intensive |
This table outlines the key characteristics, advantages, and limitations of different
recommendation engine types.
Collecting and Manipulating Data
1. Problem Identification & Goal Formulation
The first step is to clearly define the problem that the recommendation system will solve. For
instance, we want to build an Amazon-like recommendation system that suggests products to
customers based on their past purchases and browsing history.
A well-defined goal helps in determining the data required, selecting the appropriate
machine-learning models, and evaluating the performance of the recommender system.
2. Data Collection & Preprocessing
The next step is to collect data on customer behavior, such as their past purchases, browsing
history, reviews, and ratings. To process large amounts of business data, we can use Apache
Hadoop and Apache Spark. After data collection, the data engineers preprocess and analyze
this data. This step involves cleaning the data, removing duplicates, and handling missing
values. Also, the data engineers transform this data into a format suitable for machine
learning algorithms.
Here are some popular Python-based data preprocessing libraries:
• Pandas: Provides methods for data manipulation, transformation, and analysis.
• NumPy: Provides powerful numerical computations for arrays and matrices.
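The cleaning steps mentioned above (removing duplicates and handling missing values) might look like this in Pandas. The column names and data are invented for the example, and filling missing ratings with the mean is just one simple choice among several.

```python
import pandas as pd

# Hypothetical raw interaction log
raw = pd.DataFrame({
    "user_id": [1, 1, 2, 3, None],
    "item_id": ["A", "A", "B", "C", "D"],
    "rating":  [5.0, 5.0, None, 3.0, 4.0],
})

clean = (
    raw.drop_duplicates()             # remove exact duplicate rows
       .dropna(subset=["user_id"])    # a row without a user is unusable
       .copy()
)
# Fill missing ratings with the mean of the observed ratings (one simple choice)
clean["rating"] = clean["rating"].fillna(clean["rating"].mean())
# With no missing values left, user IDs can be stored as integers again
clean["user_id"] = clean["user_id"].astype(int)
```

Whether to drop or fill a missing value depends on the column: an interaction with no user cannot be attributed to anyone, while a missing rating can sometimes be estimated.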
3. Exploratory Data Analysis
Exploratory Data Analysis (EDA) helps you understand the data distribution and the
relationships between variables, which can be used to generate better recommendations.
For instance, you can visualize which items sold the most in the last quarter, or which
items sell more when customers purchase a specific item (for example, eggs may sell more
alongside bread and butter).
Here are some popular Python libraries for carrying out exploratory data analysis:
• Matplotlib: Provides data visualization methods to create different plots like
histograms, scatterplots, pie charts, etc.
• Seaborn: Provides methods to create more advanced visualizations such as heatmaps
and pair plots.
• Pandas Profiling (since renamed ydata-profiling): Generates a report with descriptive
statistics and visualizations for each variable in a dataset.
4. Feature Engineering
Feature engineering involves selecting the best-suited features to train your machine learning
model. This step involves creating new features or transforming existing ones to make them
more suitable for the recommendation system.
For example, within customer data, features such as product ratings, purchase frequency, and
customer demographics are often among the most relevant for building an accurate
recommendation system.
Here are some popular Python libraries for performing feature engineering:
• Scikit-learn: Includes tools for feature selection and feature extraction, such as
Principal Component Analysis (PCA) and Feature Agglomeration.
• Category Encoders: Provides methods for encoding categorical variables, i.e.,
converting categorical variables into numerical features.
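To show what these two kinds of feature engineering amount to, here is a library-free sketch that creates a new feature (purchase frequency per customer) and one-hot encodes a categorical variable. The purchase log and the helper name `one_hot` are hypothetical; the libraries above provide more complete versions of the same ideas.

```python
def one_hot(values):
    """Map each distinct category to its own 0/1 column."""
    categories = sorted(set(values))
    return categories, [[1 if v == c else 0 for c in categories] for v in values]

# Hypothetical purchase log: (customer, product category)
purchases = [("u1", "fruit"), ("u1", "fruit"), ("u1", "cereal"), ("u2", "dairy")]

# New feature: purchase frequency per customer
frequency = {}
for customer, _ in purchases:
    frequency[customer] = frequency.get(customer, 0) + 1

# Transformed feature: product category as numeric columns
columns, encoded = one_hot([category for _, category in purchases])
```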
5. Model Selection
The goal of model selection is to choose the best machine learning algorithm that can
accurately predict the products that a customer is likely to purchase or a movie they are likely
to watch based on their past behavior.
Some of these algorithms are:
i. Collaborative Filtering
Collaborative filtering is a popular recommendation technique. It assumes that users who
share similar preferences will most likely buy similar products (the user-based view), or that
products bought by the same customers will most likely appeal to similar customers (the
item-based view).
ii. Content-Based Filtering
This approach involves analyzing the attributes of products, such as the brand, category, or
price, and recommending products that match a user's preferences.
iii. Hybrid Filtering
Hybrid filtering combines collaborative filtering and content-based filtering techniques to
overcome their limitations by leveraging their strengths to provide more accurate
recommendations.
6. Model Training
This step involves dividing the data into training and testing sets and using the most
appropriate algorithm to train the recommender model. Some of the popular recommendation
system training algorithms include:
i. Matrix Factorization
This technique predicts missing values in a sparse matrix. In the context of recommendation
systems, Matrix Factorization predicts the ratings of products that a user has not yet
purchased or rated.
ii. Deep Learning
This technique involves training neural networks to learn complex patterns and relationships
in the data. In recommendation systems, deep learning can learn the factors that influence a
user's preference or behavior.
iii. Association Rule Mining
It is a data mining technique that can discover patterns and relationships between items in a
dataset. In recommendation systems, Association Rule Mining can identify groups of
products that are frequently purchased together and recommend these products to users.
These algorithms can be effectively implemented using libraries such as Surprise, Scikit-
learn, TensorFlow, and PyTorch.
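As an illustration of association rule mining, the sketch below scores a single rule using the three standard measures: support, confidence, and lift. The baskets are made up, and a full miner would also search for candidate rules rather than score one chosen by hand.

```python
def rule_metrics(baskets, antecedent, consequent):
    """Support, confidence, and lift for the rule antecedent -> consequent."""
    n = len(baskets)
    both = sum(1 for b in baskets if antecedent <= b and consequent <= b)
    ante = sum(1 for b in baskets if antecedent <= b)
    cons = sum(1 for b in baskets if consequent <= b)
    support = both / n                           # how often the full rule occurs
    confidence = both / ante if ante else 0.0    # P(consequent | antecedent)
    lift = confidence / (cons / n) if cons else 0.0
    return support, confidence, lift

# Hypothetical shopping baskets
baskets = [
    {"bread", "butter", "eggs"},
    {"bread", "butter"},
    {"bread", "eggs"},
    {"milk"},
    {"bread", "butter", "eggs", "milk"},
]
support, confidence, lift = rule_metrics(baskets, {"bread", "butter"}, {"eggs"})
```

A lift above 1 suggests the items are bought together more often than their individual popularity would predict, which is what makes the pair worth recommending.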
7. Hyperparameter Tuning
To optimize the performance of the recommender system, hyperparameters, such as the
learning rate, regularization strength, and number of hidden layers in a neural network, are
tuned. This technique involves testing different combinations of hyperparameters and
selecting the combination that gives the best performance.
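Testing combinations exhaustively is usually called grid search, and a minimal version is sketched below. The function `validation_error` is a hypothetical stand-in for training the recommender with the given settings and scoring it on the validation set; the grid values are likewise illustrative.

```python
from itertools import product

def validation_error(learning_rate, regularization):
    """Stand-in for 'train the model, then score it on the validation set'.
    Hypothetical: a real implementation would fit a recommender here."""
    return (learning_rate - 0.01) ** 2 + (regularization - 0.1) ** 2

grid = {
    "learning_rate": [0.001, 0.01, 0.1],
    "regularization": [0.01, 0.1, 1.0],
}

best_params, best_error = None, float("inf")
for values in product(*grid.values()):
    params = dict(zip(grid.keys(), values))
    error = validation_error(**params)
    if error < best_error:          # keep the best combination seen so far
        best_params, best_error = params, error
```

The number of combinations grows multiplicatively with each added hyperparameter, so larger searches often sample the grid instead of covering it.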
8. Model Evaluation
Model evaluation is critical to ensure that the recommendation system is accurate and
effective in generating recommendations. Evaluation metrics such as precision, recall, and F1
score can measure the accuracy and effectiveness of the system.
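For a single user, these three metrics can be computed as in the sketch below, which treats the recommendation list and the user's actual likes as sets. The item names are placeholders.

```python
def precision_recall_f1(recommended, relevant):
    """Compare a recommendation list with the items the user actually liked."""
    recommended, relevant = set(recommended), set(relevant)
    hits = len(recommended & relevant)
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Hypothetical data: 4 items recommended, 3 items actually liked, 2 in common
p, r, f1 = precision_recall_f1(["a", "b", "c", "d"], ["a", "c", "e"])
```

Precision answers "how much of what was recommended was relevant?", recall answers "how much of what was relevant got recommended?", and F1 is their harmonic mean.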
9. Model Deployment
Once the recommendation system has been developed and evaluated, the final step is to
deploy it in a production environment and make it available to customers.
Deployment can be done using in-house servers or cloud-based platforms such as Amazon
Web Services (AWS), Microsoft Azure, and Google Cloud.
For instance, AWS provides various services such as Amazon S3, Amazon EC2, and Amazon
Machine Learning, which can be used to deploy and scale the recommendation system.
Regular maintenance and updates should also be performed based on the latest customer data
to ensure the system continues to perform effectively over time.
Describing Similarity and Neighborhoods
In recommendation systems, similarity and neighborhood-based approaches are fundamental
concepts, especially in collaborative filtering methods. Here's an explanation of each:
Similarity:
Definition: Similarity measures how alike two items or users are based on some criteria, such
as their attributes or behavior.
Purpose: It helps identify items or users that are closely related to each other.
Calculation: There are various methods to calculate similarity, including cosine similarity,
Pearson correlation coefficient, and Jaccard similarity, depending on the nature of the data
and the context of the recommendation problem.
Example: In user-based collaborative filtering, similarity between users is computed based
on their ratings or interactions with items. Users who rate the same items in similar ways
receive a higher similarity score.
Neighborhood:
Definition: Neighborhood refers to a subset of items or users that are most similar to a given
item or user.
Purpose: It narrows the candidates to the items or users most relevant to a particular user or
item, so that predictions are based on the closest matches.
Types: There are two types of neighborhoods:
User-based neighborhood: Consists of users who are most similar to the target user.
Item-based neighborhood: Consists of items that are most similar to the target item.
Size: The size of the neighborhood can vary based on parameters like the number of nearest
neighbors to consider.
Example: In item-based collaborative filtering, the neighborhood of a target item consists of
other items that have high similarity scores with the target item. These similar items are then
used to make recommendations to users who have interacted with the target item.
In summary, similarity measures quantify how alike items or users are, while neighborhoods
represent subsets of items or users that are most similar to a given item or user. These
concepts are crucial for building effective recommendation systems, especially in
collaborative filtering approaches where recommendations are based on the preferences of
similar users or items.
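The two concepts fit together as in the following sketch: cosine similarity scores each pair of users, and the neighborhood is the k highest-scoring users. The ratings are invented, unrated items are treated as 0, and Pearson correlation or Jaccard similarity could be substituted for `cosine` without changing the neighborhood logic.

```python
import math

def cosine(u, v):
    """Cosine similarity between two rating dicts, treating unrated items as 0."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v)

def neighborhood(target, ratings, k=2):
    """Return the k users most similar to `target`, best first."""
    sims = [(other, cosine(ratings[target], r))
            for other, r in ratings.items() if other != target]
    return sorted(sims, key=lambda pair: pair[1], reverse=True)[:k]

# Hypothetical ratings: user -> {item: rating}
ratings = {
    "ann":  {"m1": 5, "m2": 4, "m3": 1},
    "ben":  {"m1": 4, "m2": 5},
    "cara": {"m3": 5, "m4": 4},
    "dev":  {"m1": 5, "m2": 5, "m4": 2},
}
neighbors = neighborhood("ann", ratings)
```

The parameter `k` is the neighborhood size mentioned above: a larger value draws on more users but admits weaker matches.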
Creating a Recommendation Engine
Recommendation engines bring together lots of data and then use machine learning to
recommend the “next best action,” Thompson said, and that could be anything from buying a
product to clicking on a video.
There are two main categories at play in a recommendation engine — users and items,
according to Eugene Medved, an AI developer at recommendation engine provider InData
Labs. “The task itself,” he explained, “is all about ranking the items for a specific user by
probability of the interaction.”
This is accomplished by a standard order of operations, starting with data collection. The
phases below follow the process from the initial data collection to the final presentation of
recommendations, showing how these systems analyze data and turn it into personalized
item suggestions.
Working Process of Recommendation Engine
Phase 1: Data Collection
During this initial phase, the engine collects a wide range of data, including user interactions
(such as clicks, views, or purchases), user demographics (such as age and location), and
detailed item information (such as descriptions and categories). A challenge in this step,
known as the “cold start problem,” occurs when there is insufficient data on new users or
items, making it difficult to provide accurate recommendations initially.
In the data collection phase of a recommendation engine, various methods are used to gather
comprehensive information.
One of the primary tools used is the web crawler, an automated program that navigates
the internet to collect data from various websites. Crawlers are particularly useful for
gathering detailed information about items such as product descriptions, customer reviews,
and ratings.
In addition, user information is collected through techniques such as the use of cookies.
Cookies are small files stored on users’ devices that track their visits to and interactions with
websites. This allows the recommendation system to understand user behavior on the site by
tracking actions such as clicks, views, and purchases. Together, these methods provide a rich
data set that forms the basis for generating accurate and personalized recommendations.
Here are the types of data collected by recommendation systems:
• User Behavior Data: This includes data on the actions users take, such as the items they
view, purchase, or add to their wishlist. It also tracks the frequency of these actions and the
time spent on each item.
• User Demographic Data: This refers to personal information about the user, like age,
gender, location, and possibly income level or educational background.
• Item Data: This encompasses details about the products or content available for
recommendation, such as descriptions, categories, price, brand, specifications for products, or
genre and author for books.
• Contextual Data: It includes information about the context in which user interactions take
place, such as the time of day, season, or whether the interaction was on a mobile device or a
desktop.
• Feedback Data: User ratings, reviews, and preferences explicitly provided by the users are
also vital. This data helps in understanding the user’s satisfaction and preferences more
directly.
Phase 2: Data Processing
The second step in the functioning of a recommendation engine is data processing, a critical
phase in which the collected data is refined and prepared for analysis.
This step is all about ensuring the quality and usability of the data.
First, data cleansing is performed to remove irrelevant, incomplete, or erroneous information.
This may involve filtering out noise or correcting data inconsistencies to ensure that the
remaining data is accurate and reliable.
Next, data transformation is performed to convert the raw data into a structured format
suitable for analysis. This can include normalizing data (scaling it to a certain range),
categorizing unstructured data (such as text or images), and creating user or item profiles.
Another key aspect is data integration, where data from different sources is combined to
create a comprehensive view. For example, users’ demographic data can be merged with their
behavioral data. Finally, feature extraction is critical, where specific attributes or “features”
are identified and extracted from the data.
These features, such as the frequency of item views or the types of products viewed, are what
the recommendation algorithms will later use to make predictions.
Overall, data processing transforms raw, unorganized data into a clean, structured format that
is essential for the recommendation engine to function effectively.
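Two of the processing steps described in this phase, normalization and integration, can be sketched as follows. The user IDs, ages, and view counts are hypothetical, and min-max scaling to the range [0, 1] is one common way to normalize.

```python
def min_max(values):
    """Scale numbers into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Hypothetical inputs from two separate sources
demographics = {"u1": {"age": 25}, "u2": {"age": 40}, "u3": {"age": 31}}
view_counts = {"u1": 12, "u2": 2, "u3": 7}

# Normalization: put view counts on a common 0-1 scale
users = sorted(demographics)
scaled_views = min_max([view_counts.get(u, 0) for u in users])

# Integration: merge demographic and behavioral data into one profile per user
profiles = {u: {**demographics[u], "views_scaled": s}
            for u, s in zip(users, scaled_views)}
```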
Phase 3: Filtering
At this stage, methods such as matrix factorization are used.
Matrix factorization is a mathematical technique for predicting user preferences. It works by
breaking down a large user-item interaction matrix into smaller, more manageable matrices
representing users and items. These matrices are then used to identify latent factors that
influence user preferences.
By applying specific mathematical recommendation algorithms, the system can predict how
likely a user is to prefer an item, even if they haven’t interacted with it before.
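The following is a minimal sketch of matrix factorization trained with stochastic gradient descent, one common way to fit the two smaller matrices. The ratings, the number of latent factors, and the learning settings are all assumptions chosen for a toy example; libraries such as Surprise offer tuned implementations.

```python
import random

def factorize(ratings, n_users, n_items, n_factors=2,
              lr=0.01, reg=0.02, epochs=3000, seed=0):
    """Learn user and item factor vectors with stochastic gradient descent."""
    rng = random.Random(seed)
    P = [[rng.uniform(0, 0.5) for _ in range(n_factors)] for _ in range(n_users)]
    Q = [[rng.uniform(0, 0.5) for _ in range(n_factors)] for _ in range(n_items)]
    for _ in range(epochs):
        for u, i, r in ratings:
            pred = sum(P[u][f] * Q[i][f] for f in range(n_factors))
            err = r - pred
            for f in range(n_factors):
                pu, qi = P[u][f], Q[i][f]
                # Move both factors to reduce the error, with L2 regularization
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q

def predict(P, Q, u, i):
    """Predicted rating: dot product of the user's and the item's factors."""
    return sum(pf * qf for pf, qf in zip(P[u], Q[i]))

# Hypothetical observed ratings as (user index, item index, rating)
observed = [(0, 0, 5), (0, 1, 3), (1, 0, 4), (1, 2, 1),
            (2, 1, 1), (2, 2, 5), (3, 0, 5), (3, 2, 1)]
P, Q = factorize(observed, n_users=4, n_items=3)
estimate = predict(P, Q, 3, 1)   # user 3 never rated item 1
```

Only the observed ratings are used for training, yet the learned factors yield an estimate for every user-item pair, including pairs with no interaction.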
Phase 4: Generating Recommendations
The fourth step in the operation of a recommendation engine is the generation of
recommendations, a crucial phase in which the processed data and the insights gained from
the previous steps are used to suggest relevant items or content to the user.
In this stage, the engine applies algorithms to predict and match user preferences with
available items to provide personalized and relevant suggestions. The engine considers
factors such as past user behavior, similarities between items, and user profiles to generate
these recommendations.
In making these recommendations, the engine strives to balance relevance, user engagement,
and business goals, such as promoting new products or increasing sales in certain categories.
The ultimate goal is to enhance the user experience by providing timely and relevant
suggestions that are tailored to the user’s needs and interests.
Which types of recommendations are generated? Here is what many e-commerce sites do with their
recommendations:
 Personalized Recommendations: Tailored specifically to an individual’s preferences and
past behavior, these suggestions are based on items the user has previously interacted with,
showing similar or complementary products.
 Best Sellers: These are popular items across the platform, often recommended to new users
or those with limited interaction history. They represent what is trending or most purchased
in a certain category.
 Related Items: Often seen as “Customers who viewed this also viewed” suggestions, these
are based on the correlation between products, recommending items that other users have
looked at or purchased in relation to the current item.
 New Arrivals: Recommendations focusing on the newest items in a category, useful for
returning users to discover the latest products or content.
Recommending Another Item
Recommending another item typically involves suggesting items that are similar to the ones a
user has interacted with or shown interest in. This is commonly known as item-based
collaborative filtering. In item-based collaborative filtering, recommendations are made
based on the similarity between items. The idea is that if a user likes one item, they are likely
to enjoy similar items. Here's how it works:
1. Calculate Item Similarity:
 Compute the similarity between items using a similarity measure such as
cosine similarity, Pearson correlation, or Jaccard similarity.
 The similarity between items can be determined by comparing their attributes,
features, or user interactions.
2. Select Target Item:
 Given a target item for which we want to make recommendations, find items
that are most similar to it.
3. Rank Similar Items:
 Rank the similar items based on their similarity to the target item.
4. Filter Top Items:
 Select the top-N similar items to recommend to the user.
Example:
Let's say we have a dataset of user interactions with items, such as movies watched or
products purchased. We want to recommend another movie to a user based on the movie they
recently watched. Here's how we can do it:
 User's Recently Watched Movie: "The Matrix"
 Items in the Dataset: A list of movies with their attributes (e.g., genre, director,
actors).
Item Similarity Table:
Movie           Similarity Score
The Matrix      1.00
Inception       0.85
Interstellar    0.80
Blade Runner    0.75
Terminator 2    0.70
In this table, we calculate the similarity scores between "The Matrix" and other movies in the
dataset. For example, "Inception" has a high similarity score of 0.85, indicating it's closely
related to "The Matrix" and thus a good candidate for recommendation.
Recommendation:
Based on the item similarity table, we recommend "Inception" to the user who watched "The
Matrix" as it has the highest similarity score among the other movies.
This approach can be scaled to handle large datasets and can provide personalized
recommendations to users based on their past interactions with items.
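The four steps above (compute similarity, select a target, rank, keep the top N) can be sketched in a few lines of Python using cosine similarity. The binary attribute vectors below are hypothetical and were invented for this illustration, so the resulting scores do not reproduce the values in the similarity table.

```python
import math

# Hypothetical binary attribute vectors: (sci-fi, action, thriller, drama).
# Invented for illustration; they do not reproduce the scores in the table.
items = {
    "The Matrix":   [1, 1, 1, 0],
    "Inception":    [1, 1, 1, 1],
    "Interstellar": [1, 0, 0, 1],
    "Blade Runner": [1, 0, 1, 1],
    "Titanic":      [0, 0, 0, 1],
}

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def top_n_similar(target, items, n=3):
    """Score every other item against the target, rank, and keep the top n."""
    scores = {
        name: cosine(items[target], vec)
        for name, vec in items.items()
        if name != target
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:n]

recommendations = top_n_similar("The Matrix", items, n=3)
for name, score in recommendations:
    print(f"{name}: {score:.2f}")
```

With these assumed vectors, "Inception" ranks first, followed by "Blade Runner" and "Interstellar". Swapping in Pearson correlation or Jaccard similarity only requires replacing the `cosine` function.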
Finding Items to Recommend
In a recommendation engine, finding items to recommend means identifying items that are
likely to interest a user, based on factors such as user preferences, item characteristics,
and historical interactions. Here's an overview of the process:
1. User Profile and Preferences:
 Build a user profile based on their past interactions, ratings, and preferences.
 Consider explicit feedback (ratings, likes, dislikes) and implicit feedback
(viewing history, purchase history).
2. Item Attributes and Characteristics:
 Analyze the attributes and characteristics of items in the dataset.
 This could include genre, category, tags, features, and metadata associated
with the items.
3. Algorithm Selection:
 Choose a recommendation algorithm suitable for the scenario, such as
collaborative filtering, content-based filtering, or hybrid approaches.
 Collaborative filtering methods recommend items based on user behavior and
preferences, while content-based methods focus on the attributes of items.
4. Generate Recommendations:
 Utilize the selected algorithm to generate a list of recommended items for each
user.
 This can involve calculating similarity scores between items, predicting user
ratings, or using matrix factorization techniques.
5. Evaluation and Refinement:
 Evaluate the effectiveness of the recommendations using metrics like
precision, recall, and user satisfaction.
 Refine the recommendation algorithm based on user feedback and
performance metrics.
Example:
Let's consider a scenario where we have a dataset of movies and user ratings. We want to
recommend movies to a user based on their preferences and past interactions. Here's how we
can do it:
 User Profile: User A has previously rated several movies, and those ratings
indicate a preference for the action and sci-fi genres.
 Items in the Dataset: A list of movies with attributes such as genre, director, actors,
and user ratings.
Recommendation Table:
Movie           Genre               Director     User Rating
The Matrix      Action, Sci-Fi      Wachowski    4.5
Inception       Sci-Fi              Nolan        4.8
Terminator 2    Action, Sci-Fi      Cameron      4.7
Avatar          Action, Adventure   Cameron      4.6
Interstellar    Sci-Fi              Nolan        4.9
In this table, we have a list of movies with their respective genres, directors, and user ratings.
Recommendations:
Based on User A's preferences for action and sci-fi genres, we can recommend movies such
as "The Matrix," "Terminator 2," and "Inception" as they match the user's preferences and
have high user ratings.
This approach allows for personalized recommendations tailored to the individual user's
preferences and can enhance user engagement and satisfaction with the platform.
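One possible way to turn this example into code is a simple content-based filter over the movies in the recommendation table. The scoring rule used here (count the preferred genres a movie matches, then break ties by user rating) is an assumption made for this sketch. The text does not specify how matches are ranked.

```python
# Catalog taken from the recommendation table in this example.
movies = [
    {"title": "The Matrix",   "genres": {"Action", "Sci-Fi"},    "rating": 4.5},
    {"title": "Inception",    "genres": {"Sci-Fi"},              "rating": 4.8},
    {"title": "Terminator 2", "genres": {"Action", "Sci-Fi"},    "rating": 4.7},
    {"title": "Avatar",       "genres": {"Action", "Adventure"}, "rating": 4.6},
    {"title": "Interstellar", "genres": {"Sci-Fi"},              "rating": 4.9},
]

def recommend(movies, preferred_genres, top_n=3):
    """Rank by number of preferred genres matched, breaking ties by rating.

    This scoring rule is an assumption of the sketch, not a standard.
    """
    scored = []
    for movie in movies:
        overlap = len(movie["genres"] & preferred_genres)
        if overlap:  # skip movies that match none of the preferred genres
            scored.append((overlap, movie["rating"], movie["title"]))
    scored.sort(reverse=True)
    return [title for _, _, title in scored[:top_n]]

user_a_preferences = {"Action", "Sci-Fi"}
print(recommend(movies, user_a_preferences))
```

Under this rule, movies matching both genres ("Terminator 2", "The Matrix") come first, followed by the highest-rated single-genre match. A different weighting of genre overlap against rating would produce a different ordering.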
Recommending Items Based on Other Items (Item-Based Collaborative Filtering):
Recommending items based on other items, also known as item-based collaborative filtering,
is a technique used in recommendation engines to suggest items that are similar to those a
user has interacted with or shown interest in. The underlying idea is that if a user likes
one item, they are likely to enjoy similar items. Here's how it works:
1. Calculate Item Similarity:
 Compute the similarity between items using a similarity measure such as
cosine similarity, Pearson correlation, or Jaccard similarity.
 The similarity between items can be determined by comparing their attributes,
features, or user interactions.
2. Select Target Item:
 Choose a target item for which we want to make recommendations.
3. Identify Similar Items:
 Find items that are most similar to the target item based on the calculated
similarity scores.
4. Rank Similar Items:
 Rank the similar items based on their similarity to the target item.
 Select the top-N similar items to recommend to the user.
Example:
Let's consider a scenario where we have a dataset of products and user interactions (e.g.,
purchases, views). We want to recommend other products to users based on the products they
have previously interacted with. Here's how we can do it:
 User's Interacted Items: User A has previously purchased the item "iPhone 11."
 Items in the Dataset: A list of products with attributes such as category, brand, price,
and user interactions.
Item Similarity Table:
Product               Similarity Score
iPhone 11             1.00
Samsung Galaxy S10    0.85
Google Pixel 4        0.80
iPhone XR             0.75
OnePlus 7T            0.70
In this table, we calculate the similarity scores between "iPhone 11" and other products in the
dataset based on their attributes or user interactions.
Recommendations:
Based on the item similarity table, we recommend products such as "Samsung Galaxy S10"
and "Google Pixel 4" to User A, who has previously purchased the "iPhone 11," as they are
the most similar items to the target item.
This approach leverages the concept of item similarity to provide personalized
recommendations to users, enhancing their experience and engagement with the platform.
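When similarity is derived from user interactions rather than item attributes, one simple option is Jaccard similarity over the sets of users who bought each product. The sketch below assumes hypothetical purchase sets. The user IDs and the resulting scores are invented and do not correspond to the similarity table.

```python
# Hypothetical purchase data: the set of users who bought each product.
# User IDs are invented; scores will not match the similarity table.
buyers = {
    "iPhone 11":          {"u1", "u2", "u3", "u4"},
    "Samsung Galaxy S10": {"u2", "u3", "u4", "u5"},
    "Google Pixel 4":     {"u3", "u4", "u6"},
    "OnePlus 7T":         {"u4", "u7", "u8"},
}

def jaccard(a, b):
    """Shared buyers divided by all buyers of either product."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def similar_items(target, buyers, n=2):
    """Return the n products whose buyer sets overlap most with the target's."""
    scores = [
        (jaccard(buyers[target], users), name)
        for name, users in buyers.items()
        if name != target
    ]
    scores.sort(reverse=True)
    return [(name, round(score, 2)) for score, name in scores[:n]]

print(similar_items("iPhone 11", buyers))
```

Because the score depends only on co-purchase patterns, no product attributes such as brand or price are needed, which is what distinguishes this collaborative approach from attribute-based similarity.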
Evaluating a Recommendation System
Evaluating a recommendation system is crucial to ensure that it provides accurate and
relevant recommendations to users. This involves assessing its performance using various
metrics to determine how effectively it predicts user preferences and generates relevant
recommendations. Here's an overview of the evaluation process:
1. Define Evaluation Metrics:
 Choose appropriate metrics to evaluate the performance of the
recommendation system.
 Common evaluation metrics include precision, recall, F1-score, mean average
precision (MAP), normalized discounted cumulative gain (NDCG), and
accuracy.
2. Split Data into Training and Test Sets:
 Divide the dataset into training and test sets so that the model is evaluated on
interactions it did not see during training.
 The training set is used to train the recommendation model, while the test set
is used to evaluate its performance.
3. Generate Recommendations:
 Use the recommendation model to generate recommendations for users in the
test set.
4. Compare Predictions with Ground Truth:
 Compare the recommendations generated by the recommendation system with
the actual preferences or interactions of users in the test set.
5. Calculate Evaluation Metrics:
 Compute the evaluation metrics to measure the performance of the
recommendation system.
 Evaluate metrics both globally (across all users) and on a per-user basis to
understand performance variations.
6. Interpret Results and Iterate:
 Analyze the evaluation metrics to understand the strengths and weaknesses of
the recommendation system.
 Iterate on the recommendation algorithm or system parameters to improve
performance based on the evaluation results.
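Some of the metrics named in step 1 take the order of the recommendations into account. The sketch below shows one common formulation of F1-score, average precision (whose mean over users gives MAP), and NDCG with binary relevance. Definitions vary slightly between libraries, in particular how average precision is normalized, so treat these as illustrative rather than canonical. The sample ranking is invented.

```python
import math

def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

def average_precision(ranked, relevant):
    """Mean of precision@rank over the ranks where a relevant item appears.

    Normalized by the number of relevant items (one common convention).
    """
    hits, total = 0, 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def ndcg_at_k(ranked, relevant, k):
    """NDCG with binary relevance: DCG of the list divided by the ideal DCG."""
    dcg = sum(
        1 / math.log2(rank + 1)
        for rank, item in enumerate(ranked[:k], start=1)
        if item in relevant
    )
    ideal_hits = min(len(relevant), k)
    idcg = sum(1 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg else 0.0

# Invented example: a ranked list of three recommendations for one user.
ranked = ["Inception", "Avengers", "Interstellar"]
relevant = {"Matrix", "Inception", "Interstellar"}

print("F1:", round(f1_score(2 / 3, 2 / 3), 2))
print("AP:", round(average_precision(ranked, relevant), 2))
print("NDCG@3:", round(ndcg_at_k(ranked, relevant, 3), 2))
```

Unlike plain precision and recall, average precision and NDCG reward a system for placing relevant items near the top of the list.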
Example:
Let's consider a scenario where we have a recommendation system for movies, and we want
to evaluate its performance using precision and recall metrics.
 Ground Truth: Actual movies that users have interacted with in the test set.
 Predicted Recommendations: Movies recommended by the recommendation system
for each user in the test set.
Evaluation Metrics Table:
User      Ground Truth                         Predicted Recommendations             Precision   Recall
User 1    [Matrix, Inception, Interstellar]    [Inception, Interstellar, Avengers]   0.67        0.67
User 2    [Inception, Avengers, Titanic]       [Inception, Titanic, Avatar]          0.67        0.67
User 3    [Matrix, Avengers, Titanic]          [Matrix, Inception, Interstellar]     0.33        0.33
User 4    [Inception, Interstellar]            [Inception, Interstellar, Matrix]     0.67        1.00
...       ...                                  ...                                   ...         ...
Average   ...                                  ...                                   0.58        0.67
In this table, we have evaluated the recommendation system's performance for several users.
For each user, we compare the predicted recommendations with the ground truth and
calculate precision and recall. Finally, we compute the average precision and recall across all
users to assess the overall performance of the recommendation system.
Interpretation:
 Precision: The proportion of recommended items that are relevant to the user, out of
all recommended items.
 Recall: The proportion of relevant items that are successfully recommended, out of all
relevant items.
 In this example, the recommendation system achieves an average precision of 0.58
and an average recall of 0.67: on average, 58% of the recommended movies were relevant,
and 67% of each user's relevant movies were recommended.
By evaluating the recommendation system using appropriate metrics, we can gain insights
into its performance and make informed decisions to improve its accuracy and relevance.
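The per-user and average figures in the evaluation table can be reproduced with a short Python sketch. The ground truth and predicted lists are copied from the table. The function and variable names are chosen for this illustration.

```python
# Ground truth and predictions copied from the evaluation metrics table.
ground_truth = {
    "User 1": ["Matrix", "Inception", "Interstellar"],
    "User 2": ["Inception", "Avengers", "Titanic"],
    "User 3": ["Matrix", "Avengers", "Titanic"],
    "User 4": ["Inception", "Interstellar"],
}
predictions = {
    "User 1": ["Inception", "Interstellar", "Avengers"],
    "User 2": ["Inception", "Titanic", "Avatar"],
    "User 3": ["Matrix", "Inception", "Interstellar"],
    "User 4": ["Inception", "Interstellar", "Matrix"],
}

def precision_recall(predicted, actual):
    """Precision = hits / recommended items; recall = hits / relevant items."""
    hits = len(set(predicted) & set(actual))
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(actual) if actual else 0.0
    return precision, recall

per_user = {
    user: precision_recall(predictions[user], ground_truth[user])
    for user in ground_truth
}
mean_precision = sum(p for p, _ in per_user.values()) / len(per_user)
mean_recall = sum(r for _, r in per_user.values()) / len(per_user)

for user, (p, r) in per_user.items():
    print(f"{user}: precision={p:.2f}, recall={r:.2f}")
print(f"Average: precision={mean_precision:.2f}, recall={mean_recall:.2f}")
```

For User 4, all relevant movies are recommended (recall 1.00), but one of the three recommendations is not relevant, so precision is 0.67.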
Validating a Recommendation System
Validating a recommendation system is essential to ensure its performance and reliability
before deploying it to production. Validation assesses how well the recommendation system
generalizes to unseen data and how it performs under various conditions. This includes
testing it on a separate validation dataset that was not used for training. The process has
the following steps:
1. Split Data into Training, Validation, and Test Sets:
 Divide the dataset into three separate sets: training, validation, and test sets.
 The training set is used to train the recommendation model, the validation set
is used to tune hyperparameters and evaluate performance during training, and
the test set is used to assess the final performance of the model.
2. Choose Evaluation Metrics:
 Select appropriate evaluation metrics to measure the performance of the
recommendation system.
 Common evaluation metrics include precision, recall, F1-score, mean average
precision (MAP), normalized discounted cumulative gain (NDCG), and
accuracy.
3. Train Recommendation Model:
 Train the recommendation model using the training dataset.
 Tune hyperparameters and adjust the model architecture as needed based on
performance on the validation set.
4. Evaluate on Validation Set:
 Use the trained recommendation model to generate recommendations for users
in the validation set.
 Calculate evaluation metrics to assess the performance of the recommendation
system on the validation set.
5. Optimize and Iterate:
 Analyze the validation results and identify areas for improvement.
 Adjust the recommendation algorithm, feature engineering, or model
parameters based on the validation feedback.
 Repeat the training and validation process iteratively until satisfactory
performance is achieved.
6. Assess Generalization to Test Set:
 Once the recommendation system has been optimized based on the validation
set, evaluate its performance on the test set.
 This provides an unbiased estimate of the recommendation system's
performance on unseen data.
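The workflow above can be sketched end to end in plain Python. Everything here is an assumption made for illustration: the synthetic (user, book, rating) data, the 70/15/15 split, and the toy "damped item mean" rating predictor, which stands in for a real recommendation model so that there is a hyperparameter (`damping`) to tune.

```python
import random

def split_interactions(interactions, val_frac=0.15, test_frac=0.15, seed=42):
    """Shuffle interactions and split them into train, validation, and test lists."""
    shuffled = list(interactions)
    random.Random(seed).shuffle(shuffled)
    n_val = round(len(shuffled) * val_frac)
    n_test = round(len(shuffled) * test_frac)
    val = shuffled[:n_val]
    test = shuffled[n_val:n_val + n_test]
    train = shuffled[n_val + n_test:]
    return train, val, test

def fit_damped_means(train, damping):
    """Toy model: per-item mean rating, shrunk toward the global mean by `damping`."""
    global_mean = sum(rating for _, _, rating in train) / len(train)
    sums, counts = {}, {}
    for _, item, rating in train:
        sums[item] = sums.get(item, 0.0) + rating
        counts[item] = counts.get(item, 0) + 1
    item_means = {
        item: (sums[item] + damping * global_mean) / (counts[item] + damping)
        for item in sums
    }
    return global_mean, item_means

def neg_rmse(model, data):
    """Negative RMSE, so that a higher score is better."""
    global_mean, item_means = model
    squared = [(r - item_means.get(item, global_mean)) ** 2 for _, item, r in data]
    return -((sum(squared) / len(squared)) ** 0.5)

# Synthetic (user, book, rating) interactions, generated only for this sketch.
rng = random.Random(0)
interactions = [
    (f"user{rng.randrange(30)}", f"book{rng.randrange(20)}", rng.randint(1, 5))
    for _ in range(200)
]
train, val, test = split_interactions(interactions)

# Steps 3-5: tune the hyperparameter using the validation set only.
candidates = [0, 1, 5, 25]
best_damping = max(candidates, key=lambda d: neg_rmse(fit_damped_means(train, d), val))

# Step 6: report final performance once, on the untouched test set.
final_model = fit_damped_means(train, best_damping)
test_score = neg_rmse(final_model, test)
print(f"best damping = {best_damping}, test RMSE = {-test_score:.3f}")
```

The key design point is that the test set plays no part in choosing `best_damping`. If it did, the final score would no longer be an unbiased estimate of performance on unseen data.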
Example:
Let's consider a scenario where we have a recommendation system for books, and we want to
validate its performance using precision and recall metrics.
 Validation Set: A separate dataset containing user-book interactions for validation.
 Test Set: A separate dataset containing user-book interactions for final evaluation.
Validation Results Table:
User 1
  Ground Truth: [Introduction to Algorithms, Computer Networking: A Top-Down Approach,
  Operating System Concepts]
  Predicted Recommendations: [Computer Networking: A Top-Down Approach, Artificial
  Intelligence: A Modern Approach, Database System Concepts]
  Precision: 0.33   Recall: 0.33
User 2
  Ground Truth: [Computer Networking: A Top-Down Approach, Artificial Intelligence: A
  Modern Approach, Database System Concepts]
  Predicted Recommendations: [Introduction to Algorithms, Database System Concepts,
  Operating System Concepts]
  Precision: 0.33   Recall: 0.33
User 3
  Ground Truth: [Operating System Concepts, Database System Concepts, Computer
  Architecture: A Quantitative Approach]
  Predicted Recommendations: [Introduction to Algorithms, Computer Networking: A Top-Down
  Approach, Operating System Concepts]
  Precision: 0.33   Recall: 0.33
User 4
  Ground Truth: [Introduction to Algorithms, Computer Networking: A Top-Down Approach]
  Predicted Recommendations: [Introduction to Algorithms, Computer Networking: A Top-Down
  Approach, Operating System Concepts]
  Precision: 0.67   Recall: 1.00
...
Average
  Precision: 0.42   Recall: 0.50
In this table, we have evaluated the recommendation system's performance on the validation
set for several users, using real computer science textbook titles. For each user, we compare
the predicted recommendations with the ground truth and calculate precision and recall. For
Users 1 to 3, exactly one of the three recommended books appears in the ground truth; for
User 4, two of the three recommended books do, and they cover both relevant titles.
Interpretation:
 Precision: The proportion of recommended items that are relevant to the user, out of
all recommended items.
 Recall: The proportion of relevant items that are successfully recommended, out of all
relevant items.
 In this example, averaged over the four users shown, the recommendation system achieves
a precision of 0.42 and a recall of 0.50 on the validation set.
By validating the recommendation system on a separate dataset containing real-world
interactions, we can assess its performance and ensure that it provides accurate and relevant
recommendations to users interested in computer science textbooks.
