Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
102 views

Recommender Systems Notes

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
102 views

Recommender Systems Notes

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 21

Unit - I

Recommender Systems
Definition?
A recommender system (or recommendation system) is a class of machine learning
that uses data to help predict, narrow down, and find what people are looking for among an
exponentially growing number of options.
A recommender system, or a recommendation system, is a subclass of information
filtering systems that provide suggestions for items that are most pertinent to a particular
user.
Role:
Based on the user's profile, these systems can predict whether a product will be
preferable by a user or not. More broadly, recommender systems represent user preferences
for the purpose of suggesting items to purchase or examine and are now an integral part of a
lot of e-commerce sites.
Types/Techniques:
 Collaborative Filtering. The collaborative filtering method is based on gathering and
analyzing data on user's behavior.
 Content-Based Filtering.
 Knowledge-based system.
 Hybrid Recommendation Systems.
Algorithm:
The most common is the Linear Regression Algorithm. The linear regression
algorithm is used to find the best linear approximation to a data set. In a recommender
system, this algorithm is used to predict how a user will rate an item based on their past
ratings.
Benefits:
 Revenue and sales increase.
 User satisfaction growth.
 Turnover increase.
 Deficiency of information.
 Information variability.
 But here are some moments, that we should take into consideration:
o Unpredictable performance.
Recommender System use case:
News and media: Recommendation systems are used in news and media platforms to
recommend articles, videos, and other content that is relevant to a user's interests.
Social media: Recommendation systems are used in social media to recommend
friends, groups, or posts that are likely to be of interest to users.
Learning:
It is ―a process that leads to change, which occurs as a result of experience and
increases the potential for improved performance and future learning‖.
Types of Learning:
• Auditory learning (―by listening and speaking―).
• Visual learning (―through the eyes, by watching‖).
• Kinesthetic (Learning through the intellect - experience and practice).
• Haptic learning (―by touching and feeling‖).
Artificial Intelligence:
Artificial intelligence (AI) is intelligence demonstrated by machines, as opposed to
intelligence displayed by humans or by other animals. Also refers to the simulation of human

Page 1 of 21
Unit - I

intelligence in machines that are programmed to think and act like humans using robust
datasets.
Machine Learning:
Machine learning is a branch of artificial intelligence (AI) and computer science
which focuses on the use of data and algorithms to imitate the way that humans learn,
gradually improving its accuracy.
Data Science:
Data science is the study of data to extract meaningful insights for business. It is a
multidisciplinary approach that combines principles and practices from the fields of
mathematics, statistics, artificial intelligence, and computer engineering to analyze large
amounts of data.
Deep Learning:
Deep learning is a subset of machine learning, which is essentially a neural network
with three or more layers. These neural networks attempt to simulate the behavior of the
human brain—albeit far from matching its ability—allowing it to ―learn‖ from large amounts
of data.
Big Data:
Big data is a collection of data from many different sources and is often describe by
five characteristics: volume, value, variety, velocity, and veracity. Put simply, big data is
larger, more complex data sets, especially from new data sources.
Analysis vs Analytics
Analysis is the division of a whole into small components, and analytics is the science
of logical analysis. While analysis looks backward over time and works on the facts and
figures of what has happened, analytics work towards modeling the future or predicting a
result.
Labeled and unlabeled data:
It is the data that has one or more predefined tags such as name, type, or number. For
example, an image has an apple or banana. At the same time, unlabelled data contains no tags
or no specified name. Labeled data is used in Supervised Learning techniques, whereas
unlabelled data is used in Unsupervised Learning.
Supervised Learning:
Supervised learning, also known as supervised machine learning, is a subcategory of
machine learning and artificial intelligence. It is defined by its use of labeled datasets to train
algorithms that to classify data or predict outcomes accurately.
Steps: 1. Prepare Data. 2. Choose an Algorithm. 3. Fit a Model
Eg.:
1. Linear regression for regression problems.
2. Random forest for classification and regression problems.
3. Support vector machines for classification problems.
Unsupervised learning:
It is also known as unsupervised machine learning, uses machine learning algorithms
to analyze and cluster unlabeled datasets. These algorithms discover hidden patterns or data
groupings without the need for human intervention.
Supervised Learning vs Unsupervised Learning:
Supervised machine learning is generally used to classify data or make predictions,
whereas unsupervised learning is generally used to understand relationships within datasets.
Supervised machine learning is much more resource-intensive because of the need for
labelled data.

Page 2 of 21
Unit - I

Underfitting:
Underfitting is a situation when your model is too simple for your data. More
formally, your hypothesis about data distribution is wrong and too simple — for example,
your data is quadratic and your model is linear. This situation is also called high bias. This
means that your algorithm can do accurate predictions, but the initial assumption about the
data is incorrect.

Page 3 of 21
Unit - I

Overfitting:
Overfitting is a situation when your model is too complex for your data. More
formally, your hypothesis about data distribution is wrong and too complex — for example,
your data is linear and your model is high-degree polynomial. This situation is also called
high variance. This means that your algorithm can’t do accurate predictions — changing the
input data only a little, the model output changes very much.

low bias, low variance is a good result, just right.


low bias, high variance (overfitting) the algorithm outputs very different predictions for
similar data.
high bias, low variance (underfitting) the algorithm outputs similar predictions for similar
data but predictions are wrong (algorithm ―miss‖).
high bias, high variance very bad algorithm. You will most likely never see
this.
Underfitting means that your model makes accurate, but initially incorrect
predictions. In this case, train error is large and val/test error is large too.

Overfitting means that your model makes not accurate predictions. In this case, train
error is very small and val/test error is large.

When you find a good model, train error is small (but larger than in the case of
overfitting), and val/test error is small too.

In the above case, the test error and validation error are approximately the same. This
happens when everything is fine, and your train, validation, and test data have the same
distributions. If validation and test error are very different, then you need to get more data
similar to test data and make sure that you split the data correctly.

Tools and Techniques


Now let’s look at techniques to prevent underfitting and overfitting, considering
exactly why we should use them.
General Intuition You Should Remember
As we remember:
 underfitting occurs when your model is too simple for your data.
 overfitting occurs when your model is too complex for your data.
Based on this, simple intuition you should keep in mind is:
 to fix underfitting, you should complicate the model.

Page 4 of 21
Unit - I

 to fix overfitting, you should simplify the model.


In fact, everything that will be listed below is only the consequence of this simple rule. I
will try to show why certain actions will complicate or simplify the model.

More Simple / Complex Model


The easiest way that comes to mind based on the intuition above is to try a more
simple or more complex algorithm (model).

To complicate the model, you need to add more parameters (degrees of freedom).
Sometimes this means to directly try a more powerful model — one that is a priori capable to
restore more complex dependencies (SVM with different kernels instead of logistic
regression). If the algorithm is already quite complex (neural network or some ensemble
model), you need to add more parameters to it, for example, increase the number of models
in boosting. In the context of neural networks, this means adding more layers / more neurons
in each layer / more connections between layers / more filters for CNN, and so on.

To simplify the model, you need contrariwise to reduce the number of parameters.
Either completely change the algorithm (try random forest instead of deep neural network),
or reduce the number of degrees of freedom. Fewer layers, fewer neurons, and so on.

Regularization:
Regularization refers to techniques that are used to calibrate machine learning models
in order to minimize the adjusted loss function and prevent overfitting or underfitting.
Regularization is a set of techniques that can prevent overfitting in neural networks
and thus improve the accuracy of a Deep Learning model when facing completely new data
from the problem domain.

L1 regularization:
It is also known as Lasso regularization, is a machine-learning strategy that inhibits
overfitting by introducing a penalty term into the model's loss function based on the absolute
values of the model's parameters.
L2 regularization:
It acts like a force that removes a small percentage of weights at each iteration.
Therefore, weights will never be equal to zero. L2 regularization penalizes (weight). There is
an additional parameter to tune the L2 regularization term which is called regularization rate
(lambda).

Page 5 of 21
Unit - I

Primary Models of Recommender Systems:


1. Prediction version of problem.
2. Ranking version of problem.

1. Prediction version of problem:


• Predict the rating value for a user-item combination. It is assumed that training data is
available, indicating user preferences for items.
• For m users and n items, this corresponds to an incomplete m × n matrix, where the
specified (or observed) values are used for training.
• The missing (or unobserved) values are predicted using this training model.
• This problem is also referred to as the matrix completion problem because we have an
incompletely specified matrix of values, and the remaining values are predicted by the
learning algorithm.
2. Ranking version of problem:
• In practice, it is not necessary to predict the ratings of users for specific items in order to
make recommendations to users.
• Rather, a merchant may wish to recommend the top-k items for a particular user, or
determine the top-k users to target for a particular item.
• The determination of the top-k items is more common than the determination of top-k
users, although the methods in the two cases are exactly analogous.
• The top-k items, is the more common setting and is also referred to as the top-k
recommendation problem.

Goal:
• Increasing the product sales is the primary goal of a recommender system, utilized by
merchants to increase their profit.
• How? By recommending carefully selected items to users, recommender systems bring
relevant items to the attention of users.
• To achieve the broader business-centric goal of increasing revenue, the common
operational and technical goals of recommender systems are as follows:
• Relevance (logical connection):
a. To recommend items that is relevant to the user at hand.
• Novelty(new & unusual):
a. To recommend the item that is something that the user has not seen in the past.
Eg. Popular movies of a preferred genre would rarely be novel to the user.
Repeated recommendation of popular items can also lead to reduction in the sales
diversity.
• Serendipity(unexpected, good luck, by chance in finding good things- coincidence):
a. Serendipity is different from novelty in that the recommendations are truly
surprising to the user, rather than simply something they did not know about
before. The serendipitous methods focus on discovering such recommendations.
b. Eg. if a new Indian restaurant opens in a neighborhood, then the recommendation
of that restaurant to a user who normally eats Indian food is novel but not
necessarily serendipitous.
c. On the other hand, when the same user is recommended Ethiopian food, and it
was unknown to the user that such food might appeal to him, then the
recommendation is serendipitous.
d. Serendipity has the beneficial side effect of increasing sales diversity or beginning
a new trend of interest in the user results in long-term and strategic benefits to the
merchant.

Page 6 of 21
Unit - I

e. The algorithms that provide serendipitous recommendations often tend to


recommend irrelevant items. In many cases, the longer term and strategic benefits
of serendipitous methods outweigh these short-term disadvantages.
• Increasing recommendation diversity:
a. Recommender systems typically suggest a list of top-k items.
b. When all these recommended items are very similar, it increases the risk that the
user might not like any of these items.
c. On the other hand, when the recommended list contains items of different types,
there is a greater chance that the user might like at least one of these items.
d. Diversity has the benefit of ensuring that the user does not get bored by repeated
recommendation of similar items.
• Conclusion:
• From the perspective of the user, recommendations can help improve overall user
satisfaction with the Web site.
a. For example, a user who repeatedly receives relevant recommendations from
Amazon.com will be more satisfied with the experience and is more likely to use
the site again. This can improve user loyalty and further increase the sales at the
site.
• At the merchant end, the recommendation process can provide insights into the needs of
the user and help customize the user experience further. Finally, providing the user an
explanation for why a particular item is recommended is often useful.
a. For example, in the case of Netflix, recommendations are provided along with
previously watched movies providing explanations.
• Facebook, do not directly recommend products. Rather they may recommend social
connections, which have an indirect benefit to the site by increasing its usability and
advertising profits.
GroupLens Recommender System:
• a research prototype for recommendation of Usenet news.
• The system collected ratings from Usenet readers and used them to predict whether or not
other readers would like an article before they read it.
• The general ideas developed by this group were also extended to other product settings
such as books and movies, referred to as BookLens and MovieLens
Amazon.com Recommender System:
• Originally founded as a book e-retailer, the business expanded to virtually all forms of
products.
• The recommendations in Amazon.com are provided on the basis of explicitly provided
• ratings, buying behavior, and browsing behavior.
• The ratings in Amazon.com are specified on a 5-point scale, with lowest rating being 1-
star, and the highest rating being 5-star. The customer-specific buying and browsing data
can be easily collected when users are logged in with an account authentication
mechanism supported by Amazon.
• Recommendations are also provided to users on the main Web page of the site, whenever
they log into their accounts. In many cases, explanations for recommendations are
provided.
• For example, the relationship of a recommended item to previously purchased items may
be included in the recommender system interface.
• The purchase or browsing behavior of a user can be viewed as a type of implicit rating,
as opposed to an explicit rating, which is specified by the user.
Netflix Movie Recommender System:

Page 7 of 21
Unit - I

• Netflix was founded as a mail-order digital video disc (DVD) rental company of movies
and television shows, which was eventually expanded to streaming on a subscription
basis.
• Netflix provides users the ability to rate the movies and television shows on a 5-point
scale.
• Furthermore, the user actions in terms of watching various items are also stored by
Netflix. These ratings and actions are then used by Netflix to make recommendations.
• Netflix does an excellent job of providing explanations for the recommended items.
• It explicitly provides examples of recommendations based on specific items that were
watched by the user.
• Such information provides the user with additional information to decide whether or not
to watch a specific movie
• The Netflix Prize contest is notable for its numerous contributions to recommendation
research.
Google News Personalization System:
• Able to recommend news to users based on their history of clicks. The clicks are
associated with specific users based on identification mechanisms enabled by Gmail
accounts.
• but no mechanism exists for them to show their dislike. (implicit and unary ratings).
Facebook Friend Recommendations:
• Social networking sites often recommend potential friends to users in order to increase
the number of social connections at the site.
• This kind of recommendation has slightly different goals than a product recommendation.
While a product recommendation directly increases the profit of the merchant by
facilitating product sales, an increase in the number of social connections improves the
experience of a user at a social network. This, in turn, encourages the growth of the social
network. Social networks are heavily dependent on the growth of the network to
increase their advertising revenues.
• Therefore, the recommendation of potential friends (or links) enables better growth
and connectivity of the network. This problem is also referred to as link prediction in the
field of social network analysis. Such forms of recommendations are based on structural
relationships rather than ratings data.
Computational advertising?
• Computational Advertising (CA) is a scientific sub-discipline at the intersection of
information retrieval, statistical modeling, machine learning, optimization, large
scale search and text analysis. The core problem addressed in Computational
Advertising is of match-making between the ads and the context.
Goal of computational advertising:
• The goals of computational advertising are to achieve a more efficient allocation of
advertising resources through better targeting and improve effectiveness through
enhanced ad relevance and personalization.

Basic Models of Recommender Systems:


• The basic models for recommender systems work with two kinds of data, which are
1. The user-item interactions, such as ratings or buying behavior. (Collaborative Filtering)
2. The attribute information about the users and items such as textual profiles or relevant
keywords. (Content-based recommendations) - content-based systems also use the ratings
matrices in most cases, although the model is usually focused on the ratings of a single
user rather than those of all users.

Page 8 of 21
Unit - I

• Knowledge-based recommender systems, the recommendations are based on explicitly


specified user requirements.
• Instead of using historical rating or buying data, external knowledge bases and constraints
are used to create the recommendation.
• Some recommender systems combine these different aspects to create hybrid systems.
• Hybrid systems can combine the strengths of various types of recommender systems to
create techniques that can perform more robustly in a wide variety of settings.
Collaborative Filtering Models:
• Uses the collaborative power of the ratings provided by multiple users to make
recommendations.
• The main challenge in designing collaborative filtering methods is that the underlying
ratings matrices are sparse.
• Consider an example of a movie application in which users specify ratings indicating
their like or dislike of specific movies.
• Most users would have viewed only a small fraction of the large universe of available
movies. As a result, most of the ratings are unspecified (or ―unobserved‖ or ―missing.‖).
The specified ratings are also referred to as observed ratings.
• These unspecified ratings can be imputed because the observed ratings are often highly
correlated across various users and items.
• For example, consider two users named Alice and Bob, who have very similar tastes. If
• the ratings, which both have specified, are very similar, then their similarity can be
identified by the underlying algorithm.
• In such cases, it is very likely that the ratings in which only one of them has specified a
value, are also likely to be similar. This similarity can be used to make inferences about
incompletely specified values.
• Most of the models for collaborative filtering focus on leveraging either inter-item
correlations or inter-user correlations for the prediction process.
• There are two types of methods that are commonly used in collaborative filtering, which
are referred to as memory-based methods and model-based methods:
1. Memory-based methods:
• These are referred to as neighborhood based collaborative filtering algorithms in which
the ratings of user-item combinations are predicted on the basis of their neighborhoods.
These neighborhoods can be defined in one of two ways:
• User-based collaborative filtering:
In this case, the ratings provided by like-minded users of a target user A are used in order
to make the recommendations for A. Thus, the basic idea is to determine users, who are
similar to the target user A, and recommend ratings for the unobserved ratings of A by
computing weighted averages of the ratings of this peer group.
• Therefore, if Alice and Bob have rated movies in a similar way in the past, then one can
use Alice’s observed ratings on the movie Terminator to predict Bob’s unobserved ratings
on this movie. In general, the k most similar users to Bob can be used to make rating
predictions for Bob.
• Similarity functions are computed between the rows of the ratings matrix to discover
similar users.

• Item-based collaborative filtering:


• In order to make the rating predictions for target item B by user A, the first step is to
determine a set S of items that are most similar to target item B. The ratings in item set S,
which are specified by A, are used to predict whether the user A will like item B.

Page 9 of 21
Unit - I

• Therefore, Bob’s ratings on similar science fiction movies like Alien and Predator can be
used to predict his rating on Terminator.
• Similarity functions are computed between the columns of the ratings matrix to discover
similar items.
Advantage:
• The advantages of memory-based techniques are that they are simple to implement and
the resulting recommendations are often easy to explain.
Disadvantage:
• On the other hand, memory-based algorithms do not work very well with sparse ratings
matrices.
• For example, it might be difficult to find sufficiently similar users to Bob, who have rated
Gladiator. In such cases, it is difficult to robustly predict Bob’s rating of Gladiator. In
other words, such methods might lack full coverage of rating predictions.
• The lack of coverage is often not an issue, when only the top-k items are required.
2. Model-based methods:
• In model-based methods, machine learning and data mining methods are used in the
context of predictive models.
• In cases where the model is parameterized, the parameters of this model are learned
within the context of an optimization framework.
• Some examples of such model-based methods include decision trees, rule-based
models, Bayesian methods and latent factor models.
• Many of these methods, such as latent factor models, have a high level of coverage even
for sparse ratings matrices.

Types of Ratings:
• The ratings are often specified on a scale that indicates the specific level of like or dislike
of the item at hand.
• The ratings to be continuous values, can take on any value between -10 and 10.
• Usually, the ratings are interval-based, where a discrete set of ordered numbers are used
to quantify like or dislike. Such ratings are referred to as interval-based ratings. For
example, a 5-point rating scale might be drawn from the set {−2,−1, 0, 1, 2}, in which a
rating of −2 indicates an extreme dislike, and a rating of 2 indicates a strong affinity to
the item. Other systems might draw the ratings from the set {1, 2, 3, 4, 5}.
• The use of 5-point, 7-point, and 10-point ratings is particularly common.
binary ratings 5-point interval ratings Unbalanced rating scale Ordinal ratings
Like, Dislike ***** Loved it Excellent Strongly Disagree
0,1 **** I liked it Very Good Disagree
*** It was ok Good Neutral
** I didn’t like it Fair Agree
* I hated it Poor Strongly Agree
• There may be an even number of possible ratings, and the neutral rating might be
missing. This approach is referred to as a forced choice rating system.
• One can also use ordered categorical values such as {Strongly Disagree, Disagree,
Neutral, Agree, Strongly Agree} in order to achieve the same goals referred to as ordinal
ratings.
• Binary ratings, the user may represent only a like or dislike for the item and nothing
else. For example, the ratings may be 0, 1, or unspecified values. The unspecified values
need to be predicted to 0-1 values.
• A special case of ratings is that of unary ratings, in which there is a mechanism for a user
to specify a liking for an item but no mechanism to specify a dislike.

Page 10 of 21
Unit - I

• Unary ratings are particularly common, especially in the case of implicit feedback data
sets.
• In these cases, customer preferences are derived from their activities rather than their
explicitly specified ratings.
• For example, the buying behavior of a customer can be converted to unary ratings. When
a customer buys an item, it can be viewed as a preference for the item. However, the act
of not buying an item from a large universe of possibilities does not always indicate a
dislike.
• Similarly, many social networks, such as Facebook, use “like” buttons, which provide
the ability to express liking for an item. However, there is no mechanism to specify
dislike for an item.
• The implicit feedback setting can be viewed as the matrix completion analog of the
positive-unlabeled (PU) learning problem in data classification.
• A ratings matrix is sometimes referred to as a utility matrix, although the two may not
always be the same.
• In the unary rating, the matrix is referred to as a positive preference utility matrix
because it allows only the specification of positive preferences.

Content-Based Recommender Systems:


• In content-based recommender systems, the descriptive attributes of items are used to
make recommendations. The term ―content‖ refers to these descriptions.
• In content-based methods, the ratings and buying behavior of users are combined with the
content information available in the items.
• For example, consider a situation where John has rated the movie Terminator highly, but
we do not have access to the ratings of other users. Therefore, collaborative filtering
methods are ruled out.

Page 11 of 21
Unit - I

• However, the item description of Terminator contains similar genre keywords as other
science fiction movies, such as Alien and Predator. In such cases, these movies can be
recommended to John.
Advantage:
• It will make recommendations for new items, when sufficient rating data are not available
for that item. This is because other items with similar attributes might have been rated by
the active user.
• Therefore, the supervised model will be able to leverage these ratings in conjunction with
the item attributes to make recommendations even when there is no history of ratings for
that item.
Disadvantage:
• In many cases, content-based methods provide obvious recommendations because of the
use of keywords or content. For example, if a user has never consumed an item with a
particular set of keywords, such an item has no chance of being recommended. This is
because the constructed model is specific to the user at hand, and the community
knowledge from similar users is not leveraged. This phenomenon tends to reduce the
diversity of the recommended items, which is undesirable.
• Even though content-based methods are effective at providing recommendations for new
items, they are not effective at providing recommendations for new users. This is because
the training model for the target user needs to use the history of her ratings. In fact, it is
usually important to have a large number of ratings available for the target user in order to
make robust predictions without overfitting.

Cold Start Scenarios:


• The cold-start problem, which describes the difficulty of making recommendations when
the users or the items are new, remains a great challenge for Collaborative Filtering.
Traditionally, this problem is tackled by resorting to an additional interview process to
establish the user (item) profile before making any recommendations.
• The predictions from recommender systems, in most cases, are fetched from previous
ratings from the user or from ratings from similar users. The cold start problem occurs
when either new items or users are added to the system, and then it is not possible to
create the predictions.

Knowledge-Based Recommender Systems:

• Knowledge-based recommender systems are particularly useful in the context of items


that are not purchased very often.
• Examples include items such as real estate, automobiles, tourism requests, financial
services, or expensive luxury goods.
• Sufficient ratings may not be available for the recommendation process.
• The items are bought rarely, and with different types of detailed options, it is difficult to
obtain a sufficient number of ratings for a specific instantiation (i.e., combination of
options) of the item at hand.
• This problem is also encountered in the context of the cold-start problem, when sufficient
ratings are not available for the recommendation process.
• The nature of consumer preferences may evolve over time when dealing with such items.
For example, the model of a car may evolve significantly over a few years, as a result of
which the preferences may show a corresponding evolution.

Page 12 of 21
Unit - I

• A particular item may have attributes associated with it that correspond to its various
properties, and a user may be interested only in items with specific properties.
• For example, cars may have several makes, models, colors, engine options, and interior
options, and user interests may be regulated by a very specific combination of these
options. Thus, in these cases, the item domain tends to be complex in terms of its varied
properties, and it is hard to associate sufficient ratings with the large number of
combinations at hand.
• The recommendation process is performed on the basis of similarities between customer
requirements and item descriptions, or the use of constraints specifying user
requirements.
• The process is facilitated with the use of knowledge bases, which contain data about rules
and similarity functions to use during the retrieval process.
• The explicit specification of requirements results in greater control of users over the
recommendation process.
• In both collaborative and content-based systems, recommendations are decided entirely
by either the users past actions/ratings, the action/ratings of her peers, or a combination of
the two.
• Knowledge-based systems are unique in that they allow the users to explicitly specify
what they want.

Knowledge-based recommender systems can be classified on the basis of the type of the
interface (and corresponding knowledge) used to achieve the goals.

Constraint-based recommender systems:

Page 13 of 21
Unit - I

• In constraint-based systems, users typically specify requirements or constraints (e.g.,


lower or upper limits) on the item attributes.
• Domain-specific rules are used to match the user requirements to item attributes. These
rules represent the domain-specific knowledge used by the system.
• Such rules could take the form of domain-specific constraints on the item attributes (e.g.,
―Cars before year 1970 do not have cruise control.‖).
• Furthermore, constraint-based systems often create rules relating user attributes to item
attributes (e.g., ―Older investors do not invest in ultra high-risk products.‖).
• In such cases, user attributes may also be specified in the search process. Depending on
the number and type of returned results, the user might have an opportunity to modify
their original requirements.
• For example, they might relax some of their constraints when too few results are returned,
or they might add more constraints. This search process is interactively repeated until the
user arrives at her desired results.

Case-based recommender systems:

• In case-based recommender systems, specific cases are specified by the user as targets or
anchor points.
• Similarity metrics are defined on the item attributes to retrieve similar items to these
cases.
• The similarity metrics are often carefully defined in a domain-specific way. Therefore,
the similarity metrics form the domain knowledge that is used in such systems.
• The returned results are often used as new target cases with some interactive
modifications by the user.
• For example, when a user sees a returned result, which is almost similar to what they
want, they might re-issue a query with that target, but with some of the attributes changed
to the user’s liking.
• This interactive process is used to guide the user towards items of interest.

Page 14 of 21
Unit - I

constraint-based systems case-based systems


• In constraint-based systems, rules (or • In case-based systems, examples (or cases) are
constraints) are used to guide the used as anchor points to guide the search in
search. combination with similarity metrics.

• The form of the guidance may often • Critiquing interfaces are particularly popular
take the form of search-based systems, for expressing feedback in such systems,
where users specify their constraints where users iteratively modify one or more
with a search-based interface attributes of a preferred item in each iteration

The interactivity in knowledge-based recommender systems achieved through one or


more of the following methods

Conversational systems:
• The user preferences are determined iteratively in the context of a feedback loop. The
main reason for this is that the item domain is complex and the user preferences can be
determined only in the context of an iterative conversational system.
Search-based systems:
• The user preferences are elicited by using a preset sequence of questions such as the
following: ―Do you prefer a house in a suburban area or within the city?‖ In some cases,
specific search interfaces may be set up in order to provide the ability to specify user
constraints.
Navigation-based recommendation:
• The user specifies a number of change requests to the item being currently recommended.
• Through an iterative set of change requests, it is possible to arrive at a desirable item.
• An example of a change request specified by the user, when a specific house is being
recommended is as follows: ―I would like a similar house about 5 miles west of the
currently recommended house.‖ Such recommender systems are also referred to as
critiquing recommender systems
Note: The main difference is that content-based systems learn from past user behavior,
whereas knowledge-based recommendation systems recommend based on active user
specification of their needs and interests.

Utility-Based Recommender Systems:


• In utility-based recommender systems, a utility function is defined on the product features
in order to compute the probability of a user liking the item.
• The central challenge in utility-based methods is in defining an appropriate utility
function for the user at hand.
• It is noteworthy that all recommender schemes, whether collaborative, content-based, or
knowledge-based methods, implicitly rank the recommended items on the basis of their
perceived value (or utility) for the target user.
• In utility-based systems, this utility value is based on a function that is known a priori.
• In this sense, such functions can be viewed as a kind of external knowledge. Therefore,
utility-based systems can be viewed as a specific case of knowledge-based recommender
systems.
• Utility functions are used frequently in various ways for ranking items in knowledge-
based recommender systems.
Demographic Recommender Systems:

Page 15 of 21
Unit - I

• Recommender systems that are based on demographic characteristics of consumers and


recommend a list of items that have good feedback from the consumers that are
demographically similar to the target consumer.
Hybrid and Ensemble-Based Recommender Systems:
Recap:
• Collaborative filtering systems rely on community ratings
• Content-based methods rely on textual descriptions and the target user’s own ratings
• Knowledge-based systems rely on interactions with the user in the context of knowledge
bases.
• Demographic systems use the demographic profiles of the users to make
recommendations.

• Different systems use different types of input, and have different strengths and
weaknesses.

• Knowledge-based systems, are more effective in cold-start settings where a significant


amount of data is not available
• Collaborative methods are more effective when a lot of data is available.
• The hybrid recommendation system is a special type of system that uses data of both
collaborative data and content-based data simultaneously which helps to suggest a similar
or close item to the users. Combining the two above approaches helps to resolve the big
problems in more effective cases sometimes.
Advantages:
• The advantages include more detailed and personalized recommendations, no cold start
problem, etc.
Disadvantages:
• They uses high computational complexity and have large database of ratings.

Ensemble based recommender systems:


• Ensemble methods are techniques that aim at improving the accuracy of results in models
by combining multiple models instead of using a single model.
• The combined models increase the accuracy of the results significantly. This has boosted
the popularity of ensemble methods in machine learning.

Evaluation of Recommender Systems:
• Recommender systems share several conceptual similarities with the classification and
regression modeling problem.
• In classification and regression modeling, the missing class variable needs to be predicted
from the feature variables.
• In recommender systems, any of the matrix entries may be missing and need to be
predicted in a data-driven way from the observed entries in the remaining matrix.
• In this sense, the recommendation problem can be viewed as a generalization of the
classification problem. Therefore, many of the models used for evaluation of classifiers
can be used for evaluating recommender systems, with some modifications.
• There are significant variations in the evaluation techniques used for different aspects of
recommender systems, such as rating prediction or ranking.
• The former is closely related to classification and regression modeling,
• The latter is closely related to the evaluation of retrieval effectiveness in search and
information retrieval applications.

Page 16 of 21
Unit - I

• Predictive accuracy metrics, classification accuracy metrics, rank accuracy metrics, and
non-accuracy measurements are the four major types of evaluation metrics for
recommender systems
• Predictive measures address the subject of how close ratings of recommender systems are
to the user ratings. They are a good choice for non-binary tasks.
• Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) are the most popular
and easy to interpret predictive metrics.
Domain-Specific Challenges in Recommender Systems:
• In different domains, such as temporal data, location-based data, and social data, the
context of the recommendation plays a critical role.
• Therefore, the notion of contextual recommender systems was developed to address the
additional side information that arises in these domains.
• This notion is used with different modifications for various types of data, such as
temporal data, location data, or social data.
Context-Based Recommender Systems:
• Context-based or context-aware recommender systems take various types of contextual
information into account, while making recommendations.(time, location, social media).
• Eg. Cloths – season & location of customer. – festival, holidays affects these.

Time-Sensitive Recommender Systems:


• The recommendations for an item might evolve with time. Eg. the recommendations for a
movie may be very different at the time of release from the recommendations received
several years later.
• The temporal aspect in recommender systems can be reflected in several ways.
1. The rating of an item might evolve with time, as community attitudes evolve
and the interests of users change over time. User interests, likes, dislikes, and
fashions inevitably evolve with time. (time as an explicit parameter in
collaborative filtering systems.)
2. The rating of an item might be dependent on the specific time of day, day of week,
month, or season. For example, it makes little sense to recommend winter clothing
during the summer, or raincoats during the dry season.(Special case of context-
based recommender systems)
• Temporal recommender systems are challenging because of the fact that the matrix of
ratings is sparse, and the use of specific temporal context aggravates the sparsity problem.
Location-Based Recommender Systems:
• With the increasing popularity of GPS-enabled mobile phones, consumers are often
interested in location-based recommendations.
• For example, a traveling user may wish to determine the closest restaurant based on his
previous history of ratings for other restaurants.
• The recommendation of places always has a location aspect built into it.
• Two types of spatial locality that are common:
1. User-specific locality: The geographical location of a user has an important role
in her preferences. For example, a user from Chennai, CBE might not have the
same movie preferences as a user from MDU, TCN. This type of locality is
referred to as preference locality. (context-sensitive systems)
2. Item-specific locality: The geographical location of an item (e.g., restaurant)
might have an impact on the relevance of the item, depending on the current
location of the user. Users are generally not willing to travel very far from their
current location. This type of locality is referred to as travel locality. (ad hoc
heuristics)

Page 17 of 21
Unit - I

Social Recommender Systems:


• Social recommender systems are based on network structures, social cues(signal) and
tags, or a combination of these various network aspects.
• The recommender systems that are based on social cues and tags are slightly different
from those that are based purely on structural aspects
Structural Recommendation of Nodes and Links:
• Various types of networks, including social networks, are composed of nodes and links
which are recommended largely.
• The problem of node recommendation is closely related to the problem of Web search.
• Both problems require the use of various forms of ranking algorithms.
• A key component of these methods is the use of the PageRank algorithm, although the
personalization of such algorithms is more closely related to recommendation algorithms
also referred to as personalized PageRank algorithms.
• In cases where examples of nodes of interest are available, such nodes can be used as
training data in order to determine other nodes of interest.
• This problem is referred to as collective classification.
• A closely related problem is that of the link recommendation or link prediction problem,
where it is desirable to suggest friends (or potential links) for a user in a social network.
• The link prediction problem also has numerous applications beyond social networks.
• Interestingly, the problems of ranking, collective classification, and link recommendation
• are closely related.
• In fact, solutions to one problem are often used as subroutines for other problems. For
example, ranking and link prediction methods are often used for traditional product
recommendations in user-item graphs.
• These methods can be used to perform recommendations in many problem settings,
which can be transformed into graphs.
Product and Content Recommendations with Social Influence:
• Many forms of product and content recommendation are performed with the help of
network connections and other social cues.
• This problem is also referred to as viral marketing.
• In viral marketing, products are recommended with the use of word-of-mouth systems.
• In order to achieve this goal, it is important to be able to determine influential and
topically relevant entities in the network. This problem is referred to as influence
analysis in social networks.
• For example, determining the influential users in a Twitter stream for specific topics
may be very useful for viral marketing. In other cases, social cues are harvested from
social networks in order to make recommendations.
Trustworthy Recommender Systems:
• Many social media sites, such as Epinions or Slashdot, allow users to express their trust
and distrust in one another, either in a direct way, or through various feedback
mechanisms.
• For example, users can express their trust or distrust in reviews of other users, or they
may directly specify their trust or distrust relationships with other users.
• This trust information is very useful for making more robust recommendations.
• In reality, it is evident that a user-based neighborhood method should be computed with
the use of trustworthy peers to obtain robust recommendations.
Leveraging Social Tagging Feedback for Recommendations:
• Users have numerous methods for incorporating their feedback in recommender systems.

Page 18 of 21
Unit - I

• The most common form of feedback is social tagging. Such forms of feedback are
particularly common on content sharing sites on the Web, such as Flickr (photo sharing),
last.fm (music sharing), and Bibsonomy (scientific literature sharing).
• Tags are meta-data that users utilize to add short informative keywords to the content.
• For example, a user on a music site might tag Michael Jackson’s Thriller album as
―rock.‖
• Such tags provide useful information about the interests of both the user and the content
of the item because the tag is associated with both.
• The tags serve as useful context for performing the recommendations. Methods for
context-sensitive recommendations can be directly used to incorporate this feedback into
the recommendation process.
Advanced Topics and Applications:
The Cold-Start Problem in Recommender Systems:
• One of the major problems in recommender systems is that the number of initially
available ratings is relatively small.
• In such cases, it becomes more difficult to apply traditional collaborative filtering models.
• While content-based and knowledge-based methods are more robust than collaborative
models in the presence of cold starts, such content or knowledge might not always be
available.
• Therefore, a number of specific methods have been designed to improve the problem of
cold start in the context of recommender systems.
Attack-Resistant Recommender Systems:
• The use of recommender systems has a significant impact on the sale of various products
and services.
• As a result, the sellers of products and services have significant economic incentives to
manipulate the output of recommender systems.
• One example of such a manipulation would be to submit inflated ratings of their own
products to the recommender systems.
• A malicious rival might submit biased and negative reviews about the products of a
competitor.
• Over the years, numerous sophisticated strategies have been developed for attacking
recommender systems.
• Such attacks are highly undesirable because they reduce the overall effectiveness of the
recommender system and reduce the quality of experience for legitimate users.
• Therefore, methods are needed that enable robust recommendations in the presence of
such attacks.
Group Recommender Systems:
• The recommendation system is tailored to recommend a particular activity to a group of
users rather than a single user.
• Eg. might include the watching of movie or television by a group, the selection of music
in a fitness center, or the travel recommendations to a group of tourists.
• Simple averaging strategies do not work well when groups are heterogeneous and contain
users with diverse tastes.
• This is because users often have an impact on each other’s tastes based on phenomena
from social psychology, such as emotional contagion (Contamination) and conformity
(orthodoxy).
Multi-Criteria Recommender Systems:
• In multi-criteria systems, ratings might be specified on the basis of different criteria by
a single user.
• For example, a user might rate movies based on the plot, music, special effects, and so on.

Page 19 of 21
Unit - I

• Such techniques often provide recommendations by modeling the user’s utility for an
item as a vector of ratings corresponding to various criteria.
• In multi-criteria recommender systems, one can often obtain misleading results by using
only the overall rating in conjunction with a traditional recommender system.
• For example, if two users have the same overall rating for a movie, but their component
ratings for the plot and music are very different, then the two users should not be
considered similar from the perspective of a similarity-based collaborative filtering
algorithm.
• In some of the multi-criteria systems, users may not specify an overall rating at all. In
such cases, the problem is even more challenging because it is needed to present ranked
lists of items to various users on the basis of multiple criteria.
• Some of the methods for group recommender systems can also be adapted to multi-
criteria recommender systems.
• However, the two topics are generally considered different because they emphasize
different aspects of the recommendation process.
Active Learning in Recommender Systems:
• A major challenge in recommender systems is the acquisition of sufficient ratings in order
• to make robust predictions.
• The sparsity of the ratings matrix continues to be a significant impediment in effective
functioning of recommender systems.
• The acquisition of sufficient ratings can reduce the sparsity problem.
• A variety of real-world recommender systems have mechanisms to encourage users to
enter ratings in order to populate the system.
• For example, users might be provided incentives to rate certain items.
• In general, it is often difficult to obtain too many ratings from the single user because of
the high cost of the acquisition process.
• Therefore, one must judiciously select the items to be rated by specific users.
• For example, if a user has already rated a lot of action movies, then asking the user to rate
another action movie does not help much in predicting ratings of other action movies, and
it helps even less in predicting ratings of movies belonging to unrelated genres.
• On the other hand, asking the user to rate movies belonging to less populated genres will
help significantly in predicting ratings of movies belonging to that genre.
• Of course, if a user is asked to rate an unrelated movie, it is not necessary that she will be
able to provide feedback because she might not have watched that movie at all.
• There are many interesting trade-offs in the problem of active learning of recommender
systems, that are not encountered in other problem domains like classification
Privacy in Recommender Systems:
• Recommender systems are based heavily on feedback from the users, which might be
implicit or explicit.
• This feedback contains significant information about the interests of the user, and it might
reveal information about their political opinions, sexual orientations, and personal
preferences.
• In many cases, such information can be highly sensitive, which leads to privacy concerns.
• Such privacy concerns are significant in that they hinder the release of data necessary for
the advancement of recommendation algorithms.
• The availability of real data is crucial for algorithmic advances.
• For example, the contribution of the Netflix Prize data set to the recommender systems
community is invaluable, in that it can be credited with motivating the development of
many state-of-the-art algorithms.

Page 20 of 21
Unit - I

• The recommendation domain is no exception, and numerous privacy-preserving


algorithms have been developed.
Application Domain:
• Recommender systems are used in numerous application domains, such as retail, music,
content, Web search, querying, and computational advertisements.
• Some of these domains require specialized methods for adapting recommender systems.
• There are three specific domains corresponding to news recommendations, computational
advertising, and reciprocal recommender systems.
• All these application domains are Web-centric in nature.
• They assume the existence of strong user-identification mechanisms in order to track and
identify long-term user interests

Page 21 of 21

You might also like