Understanding Customer Analytics Strategies
Understanding Customer Analytics Strategies
Ticket Sales Third-party platforms Price, seat location Dealing with partner
(e.g., Ticketmaster), (e.g., courtside), ticket agreements and data
Resale markets. type (single game vs. sharing. High
season ticket), speed of percentage (~80%) of
sale sales are season
tickets, often resold
Merchandise Team store (stadium), Item bought (jersey, Ownership of the store
[Link] (online) hat, etc.) (Team, Stadium, or
NBA) dictates data
flow. Difficult to
connect if cash is used
Week 3:
➔ Purpose fo marketing: match the genuine needs and desires of consumers with the
offers of suppliers particularly suited to satisfy those needs and desires
➔ Marketing planning: logical sequence and series of activities leading to setting of
marketing objectives + fromulation of plans to achieving theme
◆ Consists of 2 components
● Strategic plan: Macro/Long-term
○ Identifies consumer needs and desires
○ Assess company strengths and weaknesses + external
opportunities and threats the org may face
○ Determine which consumers to focus on
◆ 3 C’s
○ 2 key decisions: which consumers to focus on and which image of
org to create in the market
◆ Segmentation+Targeting: Which consumers to focus on
● Segmentation: Splitting the broad, heterogeneous
market into smaller, homogenous segments
● Targeting: Deciding which of those segments to
focus on
◆ Positioning: Micro, Tactical implementation
● Implementing the strategy through the 4 P's
(Product, Price, Place, Promotion) to appeal
specifically to the chosen target segment.
● Tactical marketing plan translates the long-term strategic plan into
detailed instructions for short-term marketing action
○ Development and modification of product in view of needs and
desires of target segment
○ Determination of price in view of cost, competition, and willingness
to pay of target segment
○ Selection of most suitable distribution channels to reach target
segment
○ Communication and promotion of offer in a way that’s most
appealing to target sgment
◆ 4 P’s
➔ 3 C’s: situation Analysis
◆ Customer
◆ Company
◆ Competition
◆ Can include: Collaborators & Context
➔ From the 3C’s it goes down to STP: Strategy
◆ Segmentation
◆ Targeting
◆ Positioning
➔ This goes into the 5P’s: Tactics
◆ Product
◆ Price
◆ Promotion
◆ Place
◆ Peaple/Process/Packaging
➔ Segmentation: identify groups of prospective customers who have
◆ Similar attributes within the group
◆ Different attributes between the groups
➔ Segmentation fundamental premise: to not make avg product
◆ Response to Heterogeneity: segmentation is the irm;s response to the main
market feature than every customer is different
● A unique snowflake
◆ Contrast to early economics which uses single avg consumer to model deicisons
➔ Sources of customer heterogeneity
◆ Demographic
● easiest data to collect
● often insufficient on its own
● EX) The Prince Charles/Ozzy Osbourne Problem:
○ Identical demographics (male, born 1948, raised UK, married
twice, wealthy, famous)
○ can describe two radically different individuals
○ demonstrating why demographics alone are a poor predictor of
behavior
○ Ozzy Osbourne: Heavy metal artist
◆ Info&Experiences
● Does/Doesn’t understand benefits of product
● Has purchased product before
● Prior experience and brand loyalty
○ EX) driving a Ford because your father drove a Ford
● amount of information a consumer has
○ EX) knowing how a certain brand of shirt fits you
● heavily influences their decision-making, often more so than
demographics
◆ Attitudes: (+)/(-) beliefs about product/brand
◆ Psychographics: lifestyles, values, art, status, religions
◆ Needs: requirements that are non-negotiable based on life stage or constraints
● EX) Have kids: bigger car
● EX) Watch videos: bigger screen
● Leads to better segmentation solutions but
○ Costly to collect
○ Lead to segments difficult to identify
◆ Geographics
● Cultural norms in different countries or states.
● Micro-Location: In countries where permitted (e.g., China), real-time
location data can be used to serve a consumer an ad when their phone is
physically near a competitor's store
➔ Linking data types: challenge in segmentation is bridging the gap between data that is
predictive (psychographics) and data that is accessible (demographics).
◆ Demographic-First (Easy to Identify): Start by creating segments based on
easy-to-collect demographics (e.g., Young Women, Old Men).
● Requirement: This solution is only good if those demographic groups truly
behave and respond to marketing actions differently regarding the specific
product/industry.
◆ Psychographic-First (Best Predictors): Start by surveying consumers to create
segments based on mindsets, needs, and behavior (the most predictive data).
● Requirement: You must then tie these segments back to identifiable
differences in demographics to make the segments accessible and
measurable in the real world
○ EX) finding that the "High-Value Seeking" segment
disproportionately uses certain social media platforms
● Substantiality Redefined: A segment must be large enough to be
profitable, not just physically large.
○ Low-Cost Niche Example (Online Dating): Websites like
"Gluten-Free Singles" or "Clown Dating" can be profitable despite
a tiny market share because the cost of replication is extremely
low
◆ re-doing an existing website platform
➔ Tactical Implementation: segmentation + 4P’s
◆ Segmentation is used to tailor the marketing mix (Product, Price, Place,
Promotion) to maximize appeal and profitability for each segment
◆ Pricing (Segmentation by Willingness to Pay)
● Disneyland Example: Price is segmented both geographically and
temporally.
○ Goal: Practice price discrimination to capture consumer surplus
(charge each customer close to their maximum willingness to
pay).
○ Segments:
◆ Tourists/Out-of-State Visitors: Charged the highest price,
as they are often die-hards who require the experience
regardless of cost (low price sensitivity).
◆ SoCal Residents: Offered substantial discounts (often tied
to specific days/restrictions) because they are more
sensitive to price and are flexible on when they visit, filling
park capacity on less popular days (Monday-Thursday).
◆ Promotion/Communication (Same Product, Different Message)
● McDonald's Southwest Chicken Salad Example: The exact same product
was promoted using different themes based on the racial/ethnic segment
they were targeting.
○ White Segment: Emphasized the salad as "fresh and gourmet."
○ Black Segment: Emphasized the salad as "hearty and interesting,"
addressing a common insight that this group might not view a
salad as a satisfying meal replacement.
○ Hispanic Segment: Emphasized "cultural flavors," highlighting
ingredients like cilantro and lime.
○ Asian Segment: Emphasized "variety and newness."
◆ Product (Designing Offerings for Specific Needs)
● Procter & Gamble (P&G) Laundry Detergents: P&G owns multiple
detergent brands to appeal to different psychological needs and price
points, filling the entire aisle.
○ Tide: Appeals to high quality, long-term brand loyalty (often
passed down), and convenience (Tide Pods).
○ Dreft: Appeals to parents of newborns ("Hopefuls") seeking
products that are perceived as organic, gentle, and un-chemical
(willingness to pay a premium for child-related safety).
● Coca-Cola (Diet Coke vs. Coke Zero): Two identical calorie-free products
targeting two different gender-based segments.
○ Diet Coke (1980s): Developed when "Diet" was the mainstream
term associated with women's health and weight-consciousness.
Branding and taste were calibrated for this segment.
○ Coke Zero (2000s): Developed to appeal to men who were now
also seeking calorie reduction but avoided Diet Coke due to its
historical association with women, its name, and its distinct taste.
Coke Zero used black/gray branding (masculine) and was
reformulated to taste more like regular Coke.
◆ Place & Timing
● Tesla (Timing/Staggered Release): Releasing products sequentially from
high-end to mainstream.
○ Strategy: Start with the high-priced Roadster and Model S
(2005-2012).
○ Reasons: Captures high-margin profit from early adopter/die-hard
segment, which funds operational build-up, and establishes a
premium brand image before releasing cheaper, mass-market
vehicles (e.g., Model 3).
● QuickVue Home Pregnancy Tests (Place/Product Differentiation):
○ "The Hopefuls" Segment (Ready for Family):
◆ Packaging: Features a baby, sold in smaller counts (25).
◆ Place: Located near diapers, formula, and other
baby-related items.
◆ Price: Higher per-test price (people will pay more for
products related to children).
○ "The Fearfuls" Segment (Not Ready):
◆ Packaging: Features only a woman, sold in bulk packs
(50).
◆ Place: Located near birth control and medicinal/pharmacy
items.
◆ Price: Lower per-test price.
● Hardee's vs. Carl's Jr. (Geographic): Same parent company, two brands
segmented regionally.
○ Carl's Jr. (West Coast): Marketed as "Charbroiled Burgers,"
offering "restaurant quality in a fast-food setting."
○ Hardee's (Midwest/South): Marketed as the "Charbroiled
Thickburger," appealing to a consumer preference for a "big, full,
proper meal."
➔ The Consulting Segmentation Process (GBK Example)
◆ common, structured approach used by consulting firms to deliver a segmentation
solution:
◆ Get Up to Speed: Thoroughly understand the client's industry, competition, and
existing consumer knowledge.
◆ Build Datasets: Survey consumers to collect data (especially psychographics)
and build the predictive segmentation model.
◆ Cluster Analysis: Run statistical models (like k-means clustering) to divide the
market into distinct, homogenous groups.
◆ Profile Segments: Analyze each resulting segment:
● Measure size and approximate market share.
● Use latent data (demographics, other behaviors) to create rich
descriptions of each segment for identification.
◆ Targeting: Recommend to the client which specific segments they are best
positioned to serve and focus on.
◆ Activation and Internal Embedding: Integrate the segmentation solution deeply
across the organization. This requires buy-in from departments beyond marketing
(e.g., R&D, Finance, PR/Advertising).
➔ Activation: Naming, Imagery, and Internal Adoption (Mozilla/Firefox)
◆ For a segment solution to be successful, it must be easily communicated and
adopted internally by all relevant teams.
● Creating Distinct Identity: Every segment is given a name, a color, and
imagery to make it sticky and easily understood across the organization.
● EX) The Mozilla/Firefox Example: Mozilla created a six-segment solution
to guide product development and marketing for its browser.
○ Importance for Open Source: This is particularly critical for
open-source companies, where development teams are
decentralized. The segment profiles (e.g., "The Pragmatist," "The
Seeker") help coders and contributors understand exactly who
they are building features for.
○ Result: The segments become internal dashboards, serving as a
constant reminder of the customer profiles, behaviors, and needs
they are trying to address.
➔ Good segmentation
◆ Relevant to firm’s objectives
◆ Substantial: enough to profit
● Substantial: large enough to be worth the effort
● Low-cost cusinesses can compete in markets with may segments
◆ Identifiable/Measurable: consumers can be identifies to segments
◆ Accessible: must be able to reach them
◆ Actionable: distinct segments have differential responses to firm’s marketing
actions
◆ EX) good segment solution - Finance
● Data Used: Risk Score (X-axis) vs. Yield/Profit (Y-axis).
● Resulting Segments:
○ The "Safe" Segment (Low Risk): They pay off debt perfectly,
resulting in negative yield for the firm (because the firm makes no
money on interest or fees).
○ The "Profitable" Segment (Moderate Risk): They sometimes carry
a balance or incur fees, providing the highest positive yield for the
firm.
○ The "Risky" Segment (High Risk): They often default or file
bankruptcy, resulting in negative yield for the firm.
● Actionability: The firm can now market different products to each group
○ EX) high-interest products to the profitable segment, or basic
accounts to the safe segment
➔ Segmentation criteria: Identifiable and accessible/actionable -> Look for a pattern:
Predictive of behavior
➔ Beenfit segmentation: Predictive of behavior -> Look for pattern here: Identifiable and
accessible/actionable
➔ Can be multiple segmentations within a firm
➔ Segmentation witht he 4P’s:
◆ Pricing: price discrimination, discounts
◆ Promotions: ads, placement
◆ Product: design, time fo release
◆ Placement: geographic/retailer specific
◆ Brand extension: product portfolio
➔ Firm’s approach of consumer insights
◆ Shore up foundational knowledge
● Ourside-In thinking
◆ Build robust dataset for segmentation
● What are the needs/desires/behaviors
● Is it moral to market different products to different genders?
● Survey-based
◆ Segmentation analysis
● How to cluster
● Segmentation bases
◆ Sizing and profiling
● How big is each category + characteristics
● Segmentation and CLV drivers
◆ Segment prioritization
● Which segment to focus on
○ Key to segmentation is sacrifice
◆ Activation
● How to bring segments to life + learn more about them
● More than nice to know
➔ When tailoring products or marketing efforts to specific segments, marketers must tread
carefully to avoid crossing ethical lines.
◆ EX) The Gray Area - Nerf: Differentiating products by color (e.g., blue vs. pink
Nerf blasters) might seem sexist,
● it could also be seen as appealing more directly to consumer preferences
and providing tailored joy
● the intent and consumer reception are key here.
◆ EX) Crossing the Line - "Bic for Her": When segmentation becomes explicitly
patronizing or reinforces outdated stereotypes (like marketing pastel pens as
"BIC for Her"), it can lead to public backlash
● example of poorly executed segmentation that focuses on gender
stereotypes rather than actual differentiated needs for the product
function.
➔ Cluster Analysis is the statistical tool used to group consumers into segments.
◆ Unlike predictive modeling: where you have a known outcome, or 'Y' variable, like
"Did the customer buy?"
◆ cluster analysis is unsupervised
● We do not know the "right" segment for a consumer
● algorithm must discover the optimal groupings using only the available
input variables (X variables).
● Goal: To create groups (clusters) where people within the cluster are very
similar to each other, and people between different clusters are very
dissimilar.
◆ Types of Clustering
● Tree-Based Methods (Hierarchical):
○ Agglomerative (AGNES): Starts with every person as their own
cluster and groups them upward (builds tree bottom-up)
○ Divisive (DIANA): Starts with one large group and splits them
downward (builds tree top-down).
○ The process visualizes the groupings as a tree (dendrogram)
○ Same branch and lower the branch is the higher similarity
○ Pick horizontal slice for cluster
◆
◆ K-means is the most common and visual distance-based clustering method,
requiring a predetermined number of clusters, K.
◆ What it does: randomly assign ppl to cluster middle(centroid) -> assign ppl to
closest cluster middle -> move them to be in middle of everyone assigned
● Use nstarts: 10-20 for algorithm to start is diff places
● Some convert to best and some converge to local min -> give single best
one
◆ Trying to find how far someone is from avg -> measures distance -> units matter:
scale all variables in dataset to have mean=0 and sd=1
◆ The primary objective of the K-means algorithm is to minimize the Total
Within-Cluster Variation (W)
◆ Total within-cluster variation: measure of the homogeneity (closeness) of the
points within all the identified clusters
● Hard Partition: K-means creates a hard partition, meaning every
consumer is fully and exclusively assigned to one cluster
(non-overlapping).
● Within-Cluster Variation (W): This is calculated as the sum of squared
distances between every point in a cluster and that cluster's central point
(the average). Minimizing W means making the points in each cluster as
close to their respective centers as possible (high homogeneity).
● Distance Metric: For continuous numeric variables, K-means typically
uses Euclidean Distance (the square root of the sum of squared
differences, like the hypotenuse of a right triangle).
◆ The algorithm follows a simple, repetitive process:
● Initialization: Randomly assign K initial cluster center points (centroids)
within the data space.
● Assignment Step: Assign every single data point (consumer) to the cluster
whose centroid is closest to it.
● Update Step: Recalculate the position of each centroid by taking the
mean (average) of all the points newly assigned to that cluster.
● Repeat: Repeat the Assignment and Update steps until the cluster
assignments stop changing—the solution is said to have converged.
➔ Considerations for K-means
◆ The Local Minimum Problem: Because the algorithm starts randomly, it might
converge on a good solution that is not the absolute best possible solution (a
local minimum).
● Solution: nstarts: To increase the chance of finding the globally optimal
solution, we use the nstarts argument in R, telling the algorithm to run
the process multiple times (e.g., 10, 25, or 50) from different random
starting points. R then returns only the single best result (the one with the
lowest final W).
◆ The Unit Problem (Variable Scaling): Variables measured in different units (e.g.,
age in years vs. income in thousands of dollars) would unfairly weight the
clustering process toward the variable with the largest numeric range.
● Solution: Scaling: All variables must be scaled before clustering to have
● a mean of zero and a standard deviation of one. This ensures all
variables contribute equally to the distance calculation.
○ Scaled: all the same units
◆ Cluster Labeling: The computer arbitrarily labels the final clusters (e.g., "Cluster
1," "Cluster 2").
● Solution: When communicating results, ignore the arbitrary label. Instead,
focus on describing the segment by its characteristics (e.g., "the largest
cluster," or "the segment with the highest income").
➔ Determining the Optimal Number of Clusters (K) - how to find K
◆ Scree plot->run kmeans 10 times w/ k=1 ->k=10 and plot clusters on horizontal
axis and plot within cluster sum(w)
● Adding clusters bg improvement and and some point small infelction
point(elbow)
◆ Since the true number of segments is unknown, we need a method to estimate
the best value for K:
◆ The Elbow Method (Scree Plot):
● steps to find the optimal K are:
○ Generate the Scree Plot: You plot the number of clusters (K) on
the horizontal axis against the resulting Total Within-Cluster
Variation (W) on the vertical axis. The line should always
decrease.
◆ Keep trying different values for K
○ Identify the Inflection Point (The "Elbow"): You look for the point on
the graph where the line's rate of descent sharply changes—it
goes from falling steeply to flattening out significantly. This bend in
the graph is the "elbow."
○ Select K: The value of K corresponding to the elbow is considered
the best choice.
◆ Example below the K should be 3(right number of clusters)
● Bigger effects early on and smaller effects after
elbow point
◆ Silhoutte scores: Mathematical formula way to evaluate cluster solution
● Right number of clusters? -> evaluate cluster solution
● HDB scan: new method
◆ Greatest and Latest(2016) method: HDB scan
➔ segmentiation: find structure of rows in dataset with cluster qanaysis and market
mapping
◆ People who buy some product do so for diverse reasions
◆ Firms response to main market feature, heterogeneity
➔ Marketing pursue: match tru need and desires of consumers
◆ Marketing plan: logical series of events to match
● Strategic plan: long terms/big pre plan -> find consumer needs, sowt
analysis
○ Which consumers to focus on and create image of org to creat
● Tactical plan: what to do to put plan into action - micro
○ Develop product in way to match segment
○ Find right price based on comp
○ Best way to distribute
○ Communicate and promote is most appealing way
➔ Segmentation: Tl, DR version - put people into groups
◆ Within groups=similar
◆ Across groups or b/w diff groups = diff
◆ Important b/c if you try to cater to all = avg product
◆ Either tailor product, the way we talk about it, promotuibs, pricing to appeal to diff
groups
➔ How are groups heterogeneous
◆ Demographcus: age, race, income
● Easiest to collenct, wasn to segment just on this but
○ Prince charles vs. ozzy
○ EX) chris evans: 50’s, actor, famous, rich VS. ryan: more
entreprenaurial
◆ Info and expeirences: abotu a product: underatnad pros of porcuct, if they
pruchased it before
◆ Attitued: (+)/(-) feel from revuews/social media
● Better segmentation, but once segment based only on mind
○ How do we find them iRL
○ Cant survey every consumer
◆ Psychogrpahucs: choices - lifestyle?relgious?art?
◆ Needs: bigger car=family, streamer=bigger screen
◆ Geographic: diff countries = diff norm = diff beaviorus
● Across states
● How to choose to store
● Diff area = diff wifi carriers
➔ Good segementatuib
◆ relevant: to firms goals
● Hwo to think about your customers
● EX) how to ad to diff consumers
● Can be narrow/vague
◆ Substantial: segments can’t all be at individual level
● Need big enough groups to pofit and focus and care about
● Tiktok can b/ digital and have a lot of info on consumers at individual level
● Not practical at individual elvel
◆ identify/measruable: can be assigned into segments
● How to find which segment someones in to carry diff marketing
◆ Accessible: if main target elderly can’t do only online sales
◆ Actionable: ppl respond different to diff marketing
● Need actioabilt thats didd across diff acting segments
➔ Need to connect dots b/w easy to collect(demographuc) and what’s in ppl’s minds
◆ Descriptive: start w/ demographic based segment
● Confirm these segments have psychographic/behavioral differences
● EX) young women, young men, old women, old men: good segmentation
if groups behave and respond differently based on prodcuts/industry
◆ Benefit segmentation: get info on most predicit ve behaviro and tie back to
demographic differences o find these segments IRL
➔ EX) walmart segmentation: can have multiple semgnets/firms
◆ Income segmentation: cater to lower income
◆ Price-sensitivity: can afford higher price for cars but comporsmise with cheaper
goods
◆ Expand into area w/ less stores and become dominant - suburbs
➔ How do firms use segments
◆ Pricing: disneyland: $239 uf socal residement but $400 if nonsocal
● Geogrpahucal pricing change and everyday diff pricing
● Some willing to pay higher pruce to higher prices
● Some not willing to pay so have some days with lower prices(M-Thurs)
◆ Promotions: mcdonalds salad promotions
● White: fresh and grommet
● Black: hearty and internesting
● Hispanic: highlight culturally relatable flavors
● asiansL variety and new
● Product: P&G laundry detergents - own many brands to fill up 1 aisle
○ Tide: dorm friendly
○ Dreft: newborns
○ Maybe all rpdoduct have exact same content
● Prodcut release timing: tesla first cart roadstar(2005
->$100-$120k)->model S(2012->$80k)->model 3(2017->$60k)
○ Build operations slowly to develop diff models
○ Start w/ facienct sow when think tesla think fancy
○ Start w/ highest pruce so most margin/car to build operations for
other cars
◆ Release overtime: excited about tesla but only get
cheapest
● Release overtimeL excited about tesla then only 1
choice
● Placement: restaurants
○ Wet coast: carl’s JR - charbroiled burgers
○ East coast: same restaurant called Hardees: charibroiled
thickburgers
● Placement: Quidel preganancy
○ Segment 1: hopeful for family
◆ Have packaging with baby
◆ Placed near other baby things
◆ Higher pricing
○ Segment 2: fearful for (+)
◆ Package: no baby
◆ Placed near meds/birth control
➔ Helpind company segment
◆ Get up to speed fast -> build datasets on which to segment on: survey
consumers’ needs/desires/behaviors -> segment analysis: seleclting variables to
do cluster analysis of to find segments -> sizing and profiling: how big and market
share/segment -> segment priortiization: client focu on few segments that they
well prioritize and well positions to address(targetting) -> activiation
➔ Segmentation consideration:
◆ Blue and pink: maybe meant like women=pink, but men still buy pink
◆ Bic pastel for her: too far and bad
➔ Clustering -> cluster analysisL consumers grouped to segments
◆ Dont know right segment for consumer
◆ Algorithm needs to find optimal groups w. Available variables(x)
◆ Create clusters where within very similar and b/w cluster very different
◆ Diff wats to evaluate clusters -> diff things being called best
Week 4:
➔ Market mapping: use customer data to draw comp situation ad inform strategic
markeitng
➔ Why market mapping
◆ Undertnad brand/rpdocyt position in market
● EX) epsom salt
◆ targetting/position
● Blue aporn: healthy eaters?convenvience?
◆ Realt truth for customers is what they think truth is -> thei perception matters
◆ Where to enter/waht products/feautres to develop
● Room in current market for differentiation/innovation
● MVMT watches
➔ Mapping occurs overtime: b/c news and political events
➔ When mapping do it using product attributes: what do consumer sperceive about
attributes
➔ K-means clustering - Segmentation:
◆ Goal: finding structure in the rows of dataset
◆ Purpose: an unsupervised machine learning algorithm used for segmentation:
which people(rows) belong together
◆ Output: groups of consumers - segments
➔ Principal Component Analysis(PCA):
◆ Goal: finding structure on the columns of the dataset
◆ Purpose: primarily a dimension reduction algorithm: taking a lo of
variables(columns) and representing the info content in theme with less
variables(components)
◆ Application: used on product attributes to understand how variables relate to
each other
➔ Market analysis has to be based on consumer’s viewpoint
◆ Decisions consumers make are based on the info in their head and what they do
with it
◆ Regardless of what company/engineers believe to be true - customer’s
belief(perception) is the real truth that matters for marketing
◆ EX) optical illusion of monsters
● 1 perceived as smaller than other
● Emphasized that what’s perceived(even if wrong) is most critical factor in
consumer decision-making
➔ PCA commonly used to perform market mapping: process of visually representing
market based on attributed consumers care about
◆ Goal: Understand product position
● Primary reason to map a market is to understand exactly where product
sits in competitive landscape
○ EX) Dr. Teal’s Bath salths
◆ Product: Epsom bath salts has an unclear position in the
store
◆ Medicinal? Bath product? Spa/Relaxation product?
◆ Conclusion: b/c product can be used differently by different
consumers - mapping helps determine the dominant
consumer view and use case
◆ Goal: targeting and positioning decisions
● Mapping clarifies who the product is truly for and how it should be
positioned against comepittiorys
○ EX) Blue apron
◆ Blue apron delivers specific ingredients and recipe for a
meal
● Unlike instacart which deliver groceries/restaurant
delivery
◆ Targeting ambiguity: who’s target consumer? Is it someone
who needs convenience?help/creativity in kitchen?healht
conscious people?
◆ Positions: mapping helps decide which of these
benefits(convenvice/health/guidance) to emphasize to
attract the desire consumer base
◆ Deciding where to enter/innovate - Gap finding
● Mapping identifies gaps in the market where a new or differentiated
product can succeed
○ EX) Movement(MVMT) watches
◆ Watch market had 2 extremes: cheap(plastic) and very
expensive(rolex)
◆ Gap found: movement found a gap for watch that looked
nice(appeared to cost $1,000+) but was priced
affordable(around $200-$400)
◆ Success: the company filled the space - growing into major
business by offering variety, fashion, and low cost over
mechanical perfection
○ EX) BMW 3 series
◆ Car market polarixed b/w non-luxury and luxury expensive
cars
◆ Gap found: BMW created 3 series ad “introductory luxury”
● pulling new customers up into the luxury category
who could not afford the larger sedans
◆ Impact: The product was wildly successful and has since
been copied by competitors (e.g., Audi A4, Lexus IS)
◆ When mapping market, attributes that consumers use to evaluate products fall
into 2 categories
● Vertical Differentiation (Universal Agreement)
○ Attributes where virtually all consumers agree that one direction is
better than the other
○ Better to worse scale
Taste/Flavor Strong loyalty exists for Diet Coke vs. Diet Pepsi;
opposing options. people are often fiercely loyal
to one over the other.
Platform/Operating System Driven by familiarity and Mac vs. Windows (or iPhone
switching costs. vs. Android). Users often
prefer the system they grew
up with, and switching incurs
a learning curve and "pain."
Game Console Loyalty based on ecosystem Sony vs. Nintendo (or the
or exclusive games. platform-specific debate for
gamers).
➔ Market mapping is not a one-time activity because markets and consumer tastes evolve
◆ Essential to map the market over time to track these changes.
◆ EX) Ford Bronco
● The car was a competitor to the Jeep Wrangler and FJ Cruiser.
● After O.J. Simpson's famous police chase in 1994, the white Ford Bronco
received extensive, negative national press.
● Result: Sales fell, and Ford decided to discontinue it for 20-25 years due
to the negative association. Ford recently brought the Bronco back,
suggesting the market perception has finally shifted.
◆ EX) Peloton and Sex and the City
● In the Sex and the City reboot (And Just Like That...), the character Mr.
Big dies of a heart attack shortly after riding a Peloton.
● Result: This was a high-profile example of bad press for the product,
negatively impacting perception among its core demographic.
➔ Hotelling's Model of Differentiation (The Single-Dimension Street) -> horizontal
differentiation model
◆ Developed by economist Harold Hotelling nearly 100 years ago to understand
product differentiation in a single dimension
◆ dimension is often represented geographically (a "main street")
◆ but it applies abstractly to any product dimension
● EX) "More Android" to "More Mac"
◆ The Ice Cream Vendor Analogy (The Beach):
● Initial Logic: Two identical ice cream vendors (Gold and Blue) might
initially locate themselves far apart on a beach (a one-dimensional
space), believing they should avoid competition.
● Consumer Behavior: Consumers will choose the vendor closest to them,
creating a break point in the middle.
● Strategic Move: A vendor (Gold) realizes they can move toward the
competition (Blue) to capture a larger share of the consumers between
them while keeping all consumers on their far side.
● Equilibrium: Both vendors continue to move inward until they are located
very close to each other in the center of the dimension.
◆ Implication: When products are identical (no horizontal differentiation other than
position/location),
● competitors will tend to cluster near the center of the market dimension to
maximize their market share
◆ EX)
● Gas Stations: Shell and Chevron are often located patty-cornered or
directly next to each other, despite there not being enough demand to
support two stations independently, confirming the Hotelling model.
● Retail/Big Box Stores: In the Orlando market, Lowe's and Home Depot
stores are located within walking distance
○ sharing parking lots or within one mile
● Airlines (Post-Deregulation): Before deregulation in the 1980s, when
pricing was controlled, competing flights (Delta, American) would depart
at the exact same time (e.g., 8 AM)
○ After deregulation allowed price variation, they separated their
departure times (one earlier, one later) to cater to different
segments (business vs. vacation travelers)
○ differentiating beyond just price.
● Political Spectrum: Candidates in a two-party system often do not
moderate to the center, as the most moderate candidate (C) would only
capture a small group of central voters,
○ while the slightly more extreme candidates (B and D) capture
large segments of voters to their respective flanks.
● Vending Machines: A study found that if you place a Coke machine alone,
the consumer decision is "Do I want a Coke? Yes/No."
○ If you place Coke and Pepsi machines next to each other, the
decision changes to "Do I want a Coke vs. a Pepsi?"
○ works better for Coke, as it leads to more overall soft drink
purchases
➔ PCA for market mapping
◆ PCA as a Dimension Reduction Algorithm
● Purpose: Dimension Reduction. It takes a high-dimensional space (many
variables) and collapses the information into fewer, more meaningful
dimensions (Principal Components).
○ New axes: a new set of coordinate axes that are mathematically
calculated to replace the original variables in your dataset
● New Axes are Ordered by Importance:
○ PCA starts with N original variables (e.g., 9 product attributes) and
returns N new axes (9 Principal Components).
○ The axes are ordered by the amount of variance (spread) they
capture in the data.
○ PCA1: The most important axis, running in the direction where the
data is most spread out (captures the most variation).
○ PCA2: The second most important axis, which is orthogonal
(90-degree angle) to PCA1
○ J products(rows) and K attributes(rows) = J x K matrix
◆ Math to find PC: calculate K x K matrix
○ Diagonal: left to bottom right: vaiecnes of solo variables
◆ How spread variable 1 is from its avg
○ Off-diagonal:
◆ (+): both vairables move together
◆ (-): both go in oppositie
◆ (0): o linear relation
◆ Covariance of how both variables mvoe together
◆ Understadning PCA mechanics
● focusing on Variation: If you start with 9 dimensions, you might only need
the first 2 or 3 axes (e.g., PCA1 and PCA2 -> can go in any direction(/|\-)
to summarize 80-90% of the interesting information
○ The last few axes can be dropped because they capture very little
variation
● Technical Concept (Matrix Algebra):
○ The algorithm calculates the Covariance Matrix of the product
attributes (K x K matrix).
○ The Eigenvectors of that covariance matrix are the Principal
Components (the new axes).
● Interpretation is Key: PCA outputs these new axes without labels
○ The analyst (you) must interpret the dimensions based on how the
original variables (attributes) map onto the new axes
◆ EX) is PCA1 "Luxury vs. Economy," or "Speed vs.
Reliability"?
➔ PCA in Practice: Academic and Marketing Examples
◆ Teacher Quality Index:
● Input: 12 teachers rated on 9 measures of teaching quality (organization,
friendliness, fostering improvement, etc.).
● PCA Output: The 9 dimensions were collapsed into a single index of
teacher quality (PCA1).
● Result: This single dimension allowed the paper to rank teachers (e.g.,
"Teacher 9 is ranked best") without having to weigh the complex
trade-offs of the original 9 attributes.
◆ Country Food Consumption:
● Input: Consumption data for different food types (fish, fruit, potatoes,
cereals) across four countries (England, Scotland, Wales, North Ireland).
High-dimensional data makes patterns hard to find.
● PCA Output: When reduced to a single dimension, the analysis clearly
shows that North Ireland is far apart from England, Scotland, and Wales
in its consumption patterns.
● Interpretation: Plotting the data on two dimensions (PCA1 and PCA2)
might reveal that PCA1 is "Nut Consumption vs. Meat Consumption"
(source of protein) and PCA2 is "Grains vs. Fish and Fruits"
(cultural/geographical factors).
● Result: Two components could explain 89% of the variation in food
consumption patterns across countries, suggesting that factors like
geography (landlocked vs. coastal) and culture are key drivers.
◆ Car/Truck Market Mapping:
● Input: 9 different dimensions describing cars and trucks.
● PCA Output: The first two principal components (PCA1 and PCA2) are
used to make a market map (perceptual map).
● Goal: To visualize areas of high concentration (saturated competition) and
gaps (white space) in the market, which represent potential opportunities
for new product development or innovation.
➔ Cons
◆ How de we know its the 2 most important attributes
● What are the mos important, the make map on thesed dimensions
◆ What about categorical attributes?
● So far only for continious
➔ Selecting Important Attributes (MaxDiff Scaling)
◆ main challenges in market mapping is determining which product attributes are
most important to consumers to use as the axes for mapping
● The Problem: Asking consumers to rank a long list of attributes (e.g., 10
or more) results in unreliable data because people struggle to hold that
many distinct preferences in their mind simultaneously
○ The middle items get "jumbled up."
● Solution: MaxDiff Scaling (Best-Worst Scaling): This technique, often
used in consumer surveys, manages complexity while generating a
complete rank order.
○ Task Design: Consumers are shown a small subset of attributes
(e.g., 3 or 4) from the complete list (e.g., 20 or 30).
○ Choice: For that small subset, they must select the Most Preferred
(Best) and the Least Preferred (Worst) attribute.
○ Design: The survey designer carefully balances which attributes
appear together and how often, ensuring every item is seen
multiple times.
○ Scoring: The results are aggregated to create a score for each
attribute: (% Times Marked Best) - (% Times Marked Worst).
○ Result: This MaxDiff score provides a highly reliable, rank-ordered
list of attribute importance (e.g., "handling and steering response"
vs. "blind spot detection"), which can then inform which attributes
should be emphasized in the product design or marketing
communication.
● 1. Collect attribtues to be tested
● 2. Make this choice task
○ On survey show few choices and pick best and least fav -> fro few
choices get your preferences a little
◆ Choices look random to consumers, but use design in
survey background: figures what attributes to show on
which page
◆ Next: paired comparisons -> make sure options show up =
times
● 3. Collect data: fit into models and calculate
○ (% times option was chosen as best/totoal times it was
shown)-((%times it was worse/totoal times show)
◆ Handling Unknown or Hard-to-Measure Attributes (MDS)
● defining attributes of a product are vague (e.g., "quality" or "brand
feeling") or unknown, Multi-Dimensional Scaling (MDS) provides an
alternative to PCA
○ Concept: MDS is a cousin of PCA that uses perceived
similarity/dissimilarity to reconstruct a map.
○ Task: Instead of comparing attributes, consumers are simply
asked to rate how similar or different two products are (e.g., "On a
scale of 0 to 100, how similar are BMW and Acura?"). Netzer
◆ Both gave 3 groups w/ some differences b/w cars put in
messages together and trade in behavior
○ Process: The resulting matrix of similarity scores (the "distances"
between all products) is fed into the algorithm. MDS then works
backward, creating a spatial map where the distance between
points accurately reflects the stated similarity/dissimilarity.
○ Result: This allows the creation of a perceptual map even when
the underlying attributes that drive consumer preference are
initially unknown.
➔ Principles of dimension reduction and market mapping extend beyond survey data into
massive datasets derived from consumer behavior and language
◆ Validating Market Structure with Multiple Data Sources
● Oded Netzer and co-authors performed a crucial study that validated the
structural integrity of market maps using two completely independent data
sources:
○ Perceptual Data ([Link]): Researchers scraped over half
a million message threads from car forums. If two car models
(e.g., a Nissan Altima and a Honda Accord) were mentioned in the
same message, they were considered more similar in the
consumer's mind.
○ Behavioral Data (J.D. Power/DMV): Researchers used trade-in
data to see which new cars were purchased when old cars were
traded in. Cars that are traded in for one another are also
considered similar.
● Finding: Despite the data coming from two totally different domains (what
people say vs. how people spend their money), the resulting market maps
showed the same basic structure—three clusters of vehicles:
○ American-made cars (lower right)
○ Premium European/Luxury cars (left)
○ Predominantly Asian/Budget cars (top right)
● Conclusion: This cross-validation confirmed that the observed structure
(the grouping of brands) was not an artifact of one dataset but a robust
representation of the actual automotive market's competitive landscape.
◆ The Rise of Embeddings: The idea of mapping relationships in high-dimensional
space is foundational to modern language models.
● Word Embeddings: This process is analogous to PCA for language. A
massive amount of text is processed, converting sentences into a sparse
matrix where words that appear together often are linked.
● Semantic Space: The resulting model creates a spatial map (the
"embedding space") where the distance and direction between words
hold meaning. For example, the vector that separates "man" from
"woman" is parallel and similar in magnitude to the vector that separates
"king" from "queen."
● If people liike at 2 prcuts in 2 session, then these 2 products thought to be
more similar than 2 products they didnt look at together
○ Map all products in retailer - map figures out organic vs. inorganic
◆ Jerky seen as more organic b/c higher protein thant
organic cookie
● Significance: This ability to capture semantic and relational meaning
through word position is a fundamental step toward building Large
Language Models (LLMs) that can understand and generate human-like
text.
◆ E-Commerce and Substitution (Eccles)
● Using data from millions of shopping sessions, researchers mapped
product relationships based on the frequency of products being viewed
together.
○ Insight: The resulting map reveals consumer substitution behavior
that transcends simple product categories.
● Example 1: Cookies: Organic cookies were often mapped closer to
non-organic cookies than to other organic health foods. This suggests
that for many consumers, the product category ("cookie") is a stronger
factor than the attribute ("organic").
● Example 2: Beef Jerky: Conversely, inorganic beef jerky was sometimes
mapped near organic health foods because, in the consumer's mind, it
shares the characteristic of being a natural, high-protein snack, making it
a viable substitute in that specific context.
● Application: These insights are valuable for store layout decisions (e.g.,
should orange juice be placed with other juices, or near breakfast foods
like milk and eggs?) and competitive strategy.
Week 5
Week 6
Week 7
◆ Log-Linearity: Taking the natural log (ln) of both sides transforms the equation
into a simple linear regression:
➔
◆ Elasticity as the Coefficient: In this specific functional form, the slope coefficient
(beta) of the log-linear regression is the price elasticity of demand.
◆ Optimal Price Formula (Convenience): Given this constant elasticity (beta), the
profit-maximizing price can be easily calculated using the formula:
➔
➔Multinomial Logit (MNL) Model Optimization (Realistic Approach)
◆ Deriving Quantity: In the MNL model, the market share for product J is calculated
first, which is a function of price. Quantity Sold (Qj) is then found by multiplying
the market share by the market size.
◆ The Profit Function (pi): Profit is calculated as revenue minus cost (Quantity x
Price - Cost).
Week 8:
➔ Brand purpose
◆ Firms spend vast sums on brand building, and intangible brand capital often
makes up the majority of a company's sale value.
◆ Branding helps build reputation (e.g., expecting high quality from a new Apple
product based on past experience).
◆ It reduces search or transaction costs for the consumer.
◆ It gives the firm monopoly power by lowering competition and capturing
consumer loyalty.
➔ Celebrity alignment
◆ Aligning with a celebrity can leverage their existing brand equity (e.g., Ryan
Reynolds buying a soccer team) to lift the company's brand.
◆ The major risk is unfavorable association if the celebrity has a public scandal
(e.g., Kanye West or P. Diddy).
➔ McGurk Effect: Demonstrates top-down processing, where visual cues (seeing the
mouth shape for "FA") override audio input (the actual sound is "BA"). We hear what we
expect to hear.
◆ Top-down processing
◆ Use existing knowledge to interpret sensory info -> brain sends down info to
sense: top-down
➔ Color Perception (The Dress/Shadows): Our expectations about lighting and shade
manipulate our perception of color (seeing black/blue vs. white/gold).
➔ Face Recognition: Humans are extremely good at recognizing faces, emphasizing the
importance of specific features like eyes and the mouth (demonstrated by the
upside-down Obama photo error).
➔ Defining a brand
◆ A brand is intangible—an idea or a collection of associations that lives in the
consumer's mind. It can be different for each person.
◆ It is not just the logo, symbol, or physical product.
◆ It is about relationships, not transactions.
➔ Branding (The Process): The act of developing the brand and establishing a
psychological structure in the consumer's mind. The three big pillars of good branding
are Identity, Relevance, and Differentiation.
◆ identiity: something it stands for
◆ relevance: need to matter, connect, and build relation with people
◆ Differentiation: dont call company a common word thatll be confused when
search up
➔ Origin of Branding: Traced back to cattle ranchers who used hot irons (now liquid
nitrogen) to mark animals to prevent theft and allow buyers to easily identify animals
from the same, reliable ranch.
➔ The psych of brand association:
➔ Memory and Association (How Branding Works)
◆ Memory as an Auto-Associative Network: The core theory is that memory works
like a network where activating one idea (or node) cues other related ideas (e.g.,
trying to recall the happy birthday song requires internally singing the melody).
◆ Retrieve things fro cues
◆ Brand building: wen we activate 1 thing w/ brad so we want that one thing to
automatically activate other (+) things
◆ Primacy effectL remember first few items
◆ Recency effect: remember latest items
◆ The Goal of Branding: To establish positive cues so that when consumers think of
related good ideas (e.g., speed, quality, celebration), their minds automatically
fire the link back to the brand itself.
◆ The "Sleep" Test (Memory Experiment):
● The words shown were related to sleep (e.g., nighttime, slumber, drowsy,
dream, blanket).
● Many students incorrectly remembered seeing the word "sleep," even
though it was not on the list.
● Conclusion: The sheer volume of related concepts primed the idea of
"sleep" in the students' minds. Branding aims for this effect—to have
positive, cued associations lead directly to the brand.
◆ The "Prefix" Test (Priming Experiment):
● Students were given the same prefix (OCT) twice but with different initial
prompts (SQUID and NOVEMBER).
● The typical result is the first prompt yields "Octopus" and the second
yields "October."
● Conclusion: A brand can use an associative cue (the "prompt") to
influence how a consumer interprets an ambiguous mark (the "prefix"),
demonstrating how context is everything in perception.
➔ What can be branded
◆ Answer: Everything can be branded because a brand is merely a collection of
associations in the mind.
◆ Examples of Brands:
● Platform Brands: Airbnb (matchmaker, not product or consumer).
● Direct-to-Consumer Brands: Warby Parker, Kirkland (cutting out the
middleman to offer lower prices).
● Smart Brands: Google Nest, Amazon Alexa.
● Idea Brands: The Me Too Movement.
● Person Brands: Elon Musk.
◆ The Patagonian Toothfish Example (Rebranding Success):
● Original Name: Patagonian Toothfish (unappetizing, "horrid-looking").
● Rebrand: Chilean Sea Bass (a non-fishy fish, making it versatile for chefs
to pair with any flavor profile).
● The new name is misleading ("sea bass" is a nonsensical term for a
non-freshwater fish), but it successfully changed perception and made it a
popular menu item.
➔ Successful Branding Techniques
◆ Building Positive Associations:
● Kodak (1995 Ad): Associated the brand with special, non-everyday life
moments (birthdays, weddings, graduation) to justify the cost and effort of
film photography.
● The Economist: Used a clever ad (a lightbulb turning on when walked
under) to convey that the magazine gives readers "bright ideas."
◆ Avoiding Bad Associations:
● Twitter/X: After Elon Musk purchased the company and fired the content
moderation teams, the change in rules ("anything goes") caused major
advertisers to pull back.
● Advertiser Fear: Companies do not want their ads appearing next to "hate
speech" or "bad content" (e.g., a Disney ad next to a KKK discussion on
Reddit).
◆ Logo Design (Hidden Meanings/White Space): Logos are small, recognizable
cues that embed meaning.
● Goodwill: The 'G' is designed to look like a happy face.
● Toblerone: The triangular chocolate shape represents the Swiss Alps; the
white space contains a bear (from Bern).
● VAIO: Combines an analog wave (VA) and the digital 1-0 (IO) to represent
the shift from analog to digital electronics.
● Baskin-Robbins: The number 31 (representing a different flavor for every
day of the month) is embedded in the 'B' and 'R' of the logo.
● Amazon: The arrow goes from A to Z (representing the wide variety of
products) and forms a smile (for customer happiness).
● FedEx: The arrow in the white space between the 'E' and 'x' suggests
speed and delivery.
➔ Brand Recognition Test
◆ Given different colors and figure out what matches
◆ EX) brown = padres
➔Creative Advertising (Enabled by Branding)
◆ Once strong associations are established, a brand can use highly abstract and
creative advertising that doesn't need to explicitly name the product:
● Subway: An ad showing the Big Mac (bun in the middle) to suggest
Subway is a healthier alternative or competitor.
● KitKat: An image of a calendar full of online meetings, playing on their
slogan "Have a break, have a KitKat."
● Snickers: An ad using the familiar brown, white, and red packaging design
and colors, with the size resembling a Snickers bar on a taxi, without
showing the logo or name.
● Guinness: Stacked phones resembling a dark beer with white foam,
encouraging people to connect over a beer.
● Mercedes: Ran a campaign showing their three-pronged logo appearing
naturally in National Geographic-style photos, implying quality and
ubiquity.
➔ Illusion of product differentiation: When products are fundamentally identical, their
success relies entirely on the associations built by the brand.
◆ Vodka: Branding a "Neutral Spirit"
● Vodka is legally defined as a "neutral spirit" that is "without distinctive
character, aroma, taste, or color." The product is, by definition,
homogenous, forcing firms to create value solely through branding:
○ Grey Goose (Luxury Brand): Focuses on exclusivity. Associations
include: French flag colors, frosted glass bottles, and the swan
logo, all conveying luxury. Advertisements push the slogan, "Drink
Luxury."
○ Absolut (Fun/Social Brand): Focuses on local relevance and
socializing. Ads are bright, local (e.g., the LA swimming pool
shaped like the bottle), and emphasize a fun, accessible, and
social experience.
○ Price: The perceived luxury (e.g., Grey Goose) is reflected in a
higher price premium compared to the more social or generic
brands
◆ Bottled Water: The Ultimate Homogeneity
● The most extreme example is water, where consumers are asked to pay
up to 4,400 times more for chemical H₂O that they already have access
to.
● Success here is achieved by creating a perception of added benefits
(electrolytes, purity, natural spring sourcing) rather than delivering a
functionally different product.
➔ How brands work: 4 core mechanisms
◆ Brands are successful because they function in four distinct ways in the
consumer's mind:
◆ Establishing Credibility: Consumers rely on brands as a signal of quality,
particularly when they lack expertise.
● The Price Premium for Credibility (Aspirin Study - headache meds):
Consumers will pay four times more for Bayer Aspirin (4 cents/pill) over
the generic Kirkland brand (1 cent/pill) for the same chemical compound
(acetylsalicylic acid). This is a price premium for the credibility and trust
associated with the original brand.
● Expert vs. General Consumer: Studies show that consumers who work in
the medical field (i.e., experts) are far more likely to buy the cheaper,
store-brand pain relief, demonstrating that their knowledge overrides the
brand's credibility. General consumers, lacking that knowledge, rely on
the brand.
● The Yogurt Taste Test Paradox: In a blind test, participants learned they
preferred the cheaper Roundy's yogurt over the expensive Chobani.
While they initially intended to switch to the cheaper brand, the data
showed that over the medium-to-long term, this behavioral change
dissipated, and they drifted back to buying the national Chobani brand.
The national brand's pull eventually outweighs the consumer's direct,
recent taste preference.
◆ Rapid communication
● Brands use mascots, names, and visual language to instantly
communicate the product's function, target demographic, or benefit.
● Mr. Clean: Instantly communicates cleanliness, strength, and efficacy with
his appearance.
● Go-Gurt: The name itself is a blend of "Go" and "Yogurt," communicating
a product intended for "people on the go," packaged in a portable tube for
consumption by active children.
● Mountain Dew: Everything about the brand (extreme colors, font, and
advertising) conveys high energy and "extreme" caffeine intake, targeting
a specific, high-intensity consumer.
◆ Personality and self-expression
● Brands develop distinct personalities that consumers use as a form of
self-expression.
● Mr. Peanuts: Was created to give the low-class, common bar snack a
"tuxedo and monocle" personality to distance the product from its humble
origins.
● Dove's "Real Beauty" Campaign: Successfully changed its brand
personality from traditional sex appeal to one of inclusivity by featuring
"normal-looking people," resonating deeply with its target audience.
● Insurance Industry (Homogenous Product, Distinct Personalities):
Insurance is inherently bland, making it a perfect canvas for personality
differentiation:
○ GEICO: The Gecko emphasizes convenience and savings by not
having physical brick-and-mortar stores.
○ Progressive (Flo): Focuses on being friendly, trustworthy, and
personable.
○ Farmers (Professor Burke): Projects an image of wisdom,
experience, and reliability ("seen a thing or two").
○ Allstate (Mayhem): Emphasizes the fear factor and the necessity
of protection against chaotic events.
● Brand Tattoos: The ultimate form of self-expression where consumers
permanently commit to brands like Nike or Harley-Davidson.
◆ Gestalt (The "Extra" Factor): Gestalt means "the whole is greater than the sum of
its individual parts." In branding, this refers to the unexplainable, quantifiable,
psychological boost a preferred brand provides.
● The Coke-Pepsi fMRI Experiment:
○ When participants drank soda without knowing the brand (blind
taste test), their brain activity was similar for both Coke and Pepsi.
○ However, when participants drank the soda knowing which brand
it was, the one they preferred triggered a "bonus" of extra brain
activity (measured in areas related to reward and preference).
○ This suggests that the brand knowledge (the collection of
associations, or the "gestalt") is separate from and adds
psychological value to the chemical product, leading to a
measurable difference in consumption experience.
➔ Brand evaluation
◆ Colors are Inalienable: Switching the colors of two competing brands (e.g.,
Coke's red/white and Pepsi's blue/white) makes both look immediately "wrong" or
"out of place."
◆ Keller's Brand Report Card: A successful brand should score high on 10
fundamental measures, including proper positioning, consistency, and a sensible
portfolio.
◆ Steve Jobs' Philosophy: The idea that a successful brand requires demanding
standards and a clear focus to set itself apart.
➔ Gestalt: the principle that the "whole is greater than the sum of its parts," by presenting
further evidence that brand perception can chemically alter a consumer's experience,
even when the product is identical or swapped.
➔Brand-Induced Preference
◆ Peanut Butter Study: When researchers swapped the contents of peanut butter
jars, people consistently reported preferring the peanut butter that came from the
jar with the well-known brand name on it, even if that jar contained the inferior
product. The mere sight of the trusted brand name altered their perceived taste
experience.
◆ Search Engine Study: Researchers created two search pages—one styled like
Google and one like Bing. When users searched, the back-end results were
secretly swapped (Google-styled page running Bing results, and vice versa).
Users consistently reported that the page that looked like Google offered better
results, showing that the brand's reputation for quality superseded the actual
functionality of the search engine.
➔Brand Portfolio Strategies
◆ When managing multiple products, a business must choose how to structure its
brands, typically falling into one of two categories:
● Branded House: The company uses a single, powerful master brand
across all its offerings. The brand equity is reinforced with every product.
○ EX) Google (Gmail, Google Drive, Google Calendar) or Apple
(iPhone, Apple Watch, Apple Music).
● House of Brands: The parent company remains in the background, and
distinct brands are created for separate product categories. This is
common in consumer packaged goods (CPG) to target diverse
demographics without cross-contamination.
○ EX) Procter & Gamble (P&G) owns Tide, Duracell, and Pringles,
with the P&G name rarely appearing on the product packaging
itself.
◆ It's noted that the entire grocery store is dominated by a few major companies
(like Kraft Heinz) that collectively own vast numbers of individual brands.
➔Brand Evolution and the "Debranding" Trend
➔ Brands are not static; they must evolve to remain relevant. A recent trend observed
across major corporations is Debranding—the move toward simpler, flatter, and less
detailed logos.
➔The Forces Driving Debranding
◆ Mobile-First Design: The most immediate pressure is the need for logos to scale
effectively. A complex logo with gradients and shadows becomes illegible when
shrunk to a favicon or a tiny app icon. This drove companies like Burger King,
VW, Kia, and Pfizer to adopt simpler, flat (2D) designs.
◆ Maturity: As companies grow from "playful innocence" to large, professional
corporations, their logos must mature to reflect the increased seriousness and
high stakes of their business (e.g., GoDaddy).
◆ Versatility: A minimalist, neutral logo can act as a better "empty vessel" that is
flexible enough to span multiple product genres, allowing one symbol to cover 31
different flavors or diverse services.
➔Case Studies in Rebranding Failures
◆ Changing an established brand identity is extremely difficult, as strong brand
equity can lead to consumer rejection if the change is perceived as damaging or
confusing.
◆ confusing.
HBO Multiple changes: HBO Executives eventually The name Max was
Go, HBO Now, HBO reversed course, often too generic and
Max, and ultimately returning to "HBO immediately
dropping "HBO" to just Max." sacrificed the HBO
"Max." brand's legacy of
premium, high-quality
content.
Gap Switched from its classic Rolled back within a Violated decades of
serif font in a blue box to week due to massive strong visual identity
a modern, flat logo in public outcry and that consumers relied
2010. hatred for the design. on.
UC System Replaced the traditional Abandoned within five It was criticized for
crest with a minimalist days. having no connection
blue and gold wave logo to heritage and
in 2012. visually resembling a
"loading screen" or
"flushing toilet."
Cracker Barrel Attempted to remove the Brought the man back Consumers felt the
image of the old man after consumer guy was part of the
from the logo in backlash. brand's identity and
conjunction with heritage.
modernizing the in-store
atmosphere.
Week 9:
● Research question: how well does each factor explain merchant revenue
● The revenue that comes into the company(merchant revenue) comes from number of
stores that the company operats*the number of cusotemrs that come/store*avg number
of transactions/customer*for each translation the avg amount of money they spend
○ D = S * C/S * T/C * D/T
○ Run regression on 4 different acts and the beta coe3ffieicnt from the regression
will tell you importance/fctors in explaining revenue of merchant
○ Run multiple regressions -> got a matrix
○ Fixed effect: if put gender into model it would come out as 1 binary viarble for
each category
■ But need to leavr out one variable for multicollinearity
■ EX) what year are you: b/c 4 years, code in 3 dummy variables and put in
3 binary vairuables into regression
■ Each coeff you get on that variable is something that shifts the intercept
for that group of people
■ Fitting the model and putting some variables in fixed effects
○ Findings:
■ Number of customers explains most vairiaiton in merchant revenue
■ Then size of trqsaction more important that number of trqnsactions made
● If more expensive price items then size of transaction hold more
weight thant buying from 7/11
● Market size: how big is the market that we operate in
○ M: total size of the market - number of people who have some core needs that
you may serve which can be measured in dollars/units/volune
■ EX) oil and gas industry dont measure by number of people they serve
but by the thousands of barrels of oils they can produce
○ Trying to estimate the market size:
○ Top-down: look at total market and try to estimate each person that has this need
■ Would it be possible to serve all of these people?
○ Bottom-up: some people are currently being served. They have a need and they
are being served by some company
■ How much more room for groweht is there above the service that’s
already happening?
○ Ask expert: give us an estimate of the market size
○ None of these are accurate on their own so need to take average
● EX) Total addressable market is everyone who needs a amttress ~ same size as the
served market(everyone at this point has a market)
○ Supposed to replace mattress every 7 years - but we may do less often
○ Huge range for mattress costs: we take average of $300
○ Assume population of 340 million
○ CANT just do 340 milion/7 B/C NOT every single person needs a mattress
■ Creates overcounting b/c couples amy sleep together
■ Temporary need for mattresses: hotels/dorms
■ Extra beds, maybe each eprosn has multiple houses so more
mattresses/person
■ Assume the adjustments all offset each other so every 7 years mattress is
getting replaces and there’s 1 mattress/person of the 240 million people
● 340 million/7 years = 48.6 million mattresses
■ Costs $300/matrress so
● $300*48.6 = $15 billion industry is the size of the market
● Charge different pricing to try to get as much consumer surplus as possible b/c everyone
is willing to pay a diff price
● EX) Google trends
○ The graph goes up and down constnalty - the ups are peaking during college
szn, the low times are when no one is applying
● Market size can change overtime
○ mass adoption - lack of adoption - start adoption again
○ Can be caused by political events/demand side effects where consumer out of
work so purchase less, supply side with mandates or making prices increase
○ Rate of adoption for newer things tend to be faster, even faster with software
products
● Diffusion - spread of adoption: Frank Bass - borrows idea from the spred of
disease(epidemiology) - idea tht spread of technology into a group of consumers can be
modeled like the spread of disease among group of population
○ Starts idea of innovation and diffusion of products within market
● Bas model equation Inputs:
■ t: time period
■ M: market size - number of adopters
■ N(t): number of new adopters in period t
■ A(t): number of accumulated customers who adopted before period t
● Cumulative sum of all people who have adopted so far
● At the start of the period - everyone who bought up until period T
so doesnt include N(t)for same peiro
■ R(t): number of remaining customers who can potentially adopt in period t
● Total market size-A(t)
○ Speed at which people adopt
■ p: coeff of innovation - curios about product so they are going to buy it
■ q: coeff of imitation - as some people adopt, how that affects other
adopters
● Try to control network effect: idea that if some people have this,
then its going to make others want it to
■
○ 1st equation: relationship b/w size of market M and breaking it down to total
nu,ber of people that have accumulated and those who need to accumulate
○ 2nd formula: rate of adoption of people and the P and Q seen as rates
■ The total number of people adopting in period T over those tha are left to
adopt that haven’t adopted yet
■ Percent(p): the number of people who will buy the tech the next time
period out of those that are remaining to purchase
○ Need to specify values like M, P, Q
■ EX) total market size - 100M users, P=2% of adoption, Q=0.4, simulate
BAS over 25 time periods(J)
● 1st period no one has adopted yet
○ Cumulative number of adopters before the first time
period(A)=0
○ Use the equations: A(1)=0, R(1)=M-0=M, N(1)=Mp
■ Use second equation for N
■ Cumulative number of people who adopted=0=p ->
N(t)/R(t) = 0+q(A(t)/M) = N(1)=Mp
■ Next create vectors for A,R,N and each length of 25
to store the values
○ Code follows same math(Bass mode simulation slide)
○ During the next time period: total people who accumulated
in period T is the number who accumulated until the last
time period+in the last time period
■ T=5 -> want everyone who is accumulated up
through the 4th time period+people who
accumulated during 4th time period = everyone
who hs accumulated until start of fifth time period
● Take the equations and write them into code->run the for loop and
get values of A,R,N for each of the first 25 time periods
● Chart on the slide: plot 1-25 on horizontal axis and number of
people who adopt in each period on vertical xis
○ Interp: some scorpion and rhe number of adopters/time
period will grow
○ Added to natural rate of adoption is something tha depend
on total number of people who have adopted so-called
○ As tech starts to diffuse into market, more people jump on
board, as we start to saturate the market there’s fewer
people left who cna adopt the product
○ Rate of adoption has to come down because there’s fewer
people left to adopt so upside down curve
■ If plotted total cumulative number of adopters(instead of per period) =
cumulative sum(bottom chart on the slide) -> slower start and then picks
up faster, but as we start to fill up the market its going to tail off
● Can get different shapes by playing with diff numbers for P and Q
○ Using BASS IRL: collect data on N(number of people who have adopted) and we
can use the N’s to make A’s = cumulative number of people who have dopted
■ How many people have bought the product in eahc time period: we can
estimate P,Q,M - once we estimate, we can project forward
■ Each of the adopter/time eperiod hasa an A, B*A, (C*A)^2
■ From running regression we will know A,B, and C
■ N: outcome variable
■ Adopters: first period 3 adopted, 2nd period 6 adopted, so start o 3rd
period: 6+3=9 adopters
○ If smeone collected the data, over some periods of time - how many people
bought the product?
■ Take this vector od data and make the cumulative nu,ber of people who
bought the product
● Bass model algebra: take the 2 bass model equation and put them together to rewrite
the relationship
● If dont want to leave it upt ot model to make guesses about market size [ find P and Q
from BASS model, if you know M
● BASS model rewritten as
●
● A little but of error b/w left and right side fro any values of P and Q
● Use nonlinea least squares: take left side of equation, N - the right side
○ For eahc time period look at the difference, square it , and sum differences = sum
of squared errors from models
○ Pick P and Q to minimuze sum of squared errors
an Yavorsky: If you had then done that, so you collect a little bit of data, like we put on the board,
use that data to fit the BAS model, bind P, Q, and M,
Dan Yavorsky: Once you've collected that data, you could then project forward to see what the
future is going to look like. So we estimate, we collect some data, we estimate the BAS model
parameters.
Dan Yavorsky: And then we're going to project forward, the dots are red on the projection, what
the… what the growth is going to look like in the industry, according to P and Q, and where growth
is going to taper off, and the market's going to be saturated, like, how big is the market?
Dan Yavorsky: And we're going to learn all this, P, Q, and M, 3 very interesting things to know about
the market, just from the little bits of adoption that we see at the beginning, and the structure that's
built into the VAST model.
Dan Yavorsky: So, when Frank Bass went to go do this in his paper, this was, like, he wrote the
paper in the 1960s, and the data was from, like, the 1940s, 50s? Yeah, 1947 to 1961.
Dan Yavorsky: After he does this, he then projects forward, so he has data before what's plotted,
and that's the… what he's fit his model to. And then he predicts forward what his model thinks is
going to happen in the industry, and then he charts out what actually happened in the solid line.
Dan Yavorsky: And you'll see that his predictions aren't perfect, right? His model a little bit
over-predicted at first, and then a little bit under-predicted in the middle.
Dan Yavorsky: But by and large, his model is fitting these adoption curves really pretty well.
Dan Yavorsky: This is the same reason the multinomial logic became so popular. I mean, first of all,
it's a good model, but second, its grand debut on the BART system in San Francisco was really
accurate. It did a really good job. It proved itself as being useful. And Frank's model does the same
thing.
Dan Yavorsky: Right? He's showing you that this does actually a very good job at predicting,
Dan Yavorsky: the future of the number of sales that are going to happen in each time period, and
also giving you, like, the… when the market becomes saturated and sales will start to decline. Kind
of shows you this rate of adoption in the industry.
Dan Yavorsky: The statistics are actually not very hard. You run one linear regression model, like
you learned in ECOM 120D, and you solve the three equations for the three unknowns, and you
get P, Q, and F.
Dan Yavorsky: If, however, you don't want to leave it up to the model and the little bit of data you've
collected to make an important guess about the size of the market app.
Dan Yavorsky: you might have, like we did with the Mattress Global Industries Trend Report, some
other very good guess at M that you get from some other data.
Dan Yavorsky: If you have that, if you know M, so to speak, then all you need to do is find the P
and the Q from the BAS model.
Dan Yavorsky: This feels like it should be easier. We have fewer things to learn. Before, we were
learning PQM, now I just want to learn P and Q. It seems like it should be an easier question, but in
fact, it needs slightly harder, statistics.
Dan Yavorsky: And it's a chance for me in the code to show you something that's,
Dan Yavorsky: I think it's pretty cool, so I'm going to show you on the slide, and then we'll get to
doing the code together.
Dan Yavorsky: The Bass model equation can be written as this N is equal to P plus QA over M
times R. So we've seen this relationship before.
Dan Yavorsky: We've collected data on N, and we think it should fit these things on the right-hand
side exactly.
Dan Yavorsky: For any values of P and Q, we have something on the right-hand side we can
calculate that may or may not exactly equal the left-hand side.
Dan Yavorsky: There's probably going to be a little bit of an error between the left-hand side and
the right-hand side.
Dan Yavorsky: So what we're going to use is something called, nonlinear least squares.
Dan Yavorsky: The same idea as least squares. We're going to take the left-hand side of the
equation, N, minus the right-hand side, whatever we calculate involving M,
Dan Yavorsky: It may not exactly equal, so there's going to be a little bit of error between the
left-hand side and the right-hand side.
Dan Yavorsky: And we're gonna… for each time period, we're gonna look at this difference, we're
gonna square it up, and we're gonna take the sum of those differences. This is the sum of squared
errors from the model. We want to pick P and Q in order to minimize the sum of squared errors.
Dan Yavorsky: So I'll show you in code how to do some nonlinear least squares as the last thing
today.
Dan Yavorsky: Okay, Bass's model doesn't have a lot to say for why adoption happens the way it
does, right? He borrowed this simple mathematical model from the spread of disease, and he
applied it to the rate of adoption of a technology in marketing.
●
● Market Sizing and Industry Analysis
● Target Audience for Research Reports (B2B):
○ Expensive industry reports (e.g., mattress trend reports priced $3,000 - $7,000)
are sold to businesses, not curious consumers.
○ Strategic Buyers: Companies (like Casper, Purple) need detailed industry data
to understand their market and competition.
○ Investment Buyers: Investors managing large funds (e.g., $500 million) require
reports to allocate capital sensibly across industries.
● Pricing Strategy and Consumer Surplus:
○ Companies often use "Good, Better, Best" (3-tiered) pricing (e.g., single-user,
multi-user, enterprise license) to practice price discrimination.
○ The goal is to extract maximum consumer surplus by offering different
products (access/licensing) tailored to different customers (e.g., an analyst vs. a
large corporation's research team).
○ The middle option often serves as a focal point, sometimes alongside a decoy
option (like an overly expensive bottle of wine) to push consumers toward the
next best-priced item.
Key Components
● $M$: Total market size (maximum number of potential adopters).
● $N_t$: New adopters (first-time purchases) in time period $t$.
● $A_t$: Cumulative adopters at the start of period $t$.
● $R_t$: Remaining non-adopters ($M - A_t$).
● $P$ (Coefficient of Innovation): Represents adoption driven by external factors (e.g.,
media, curiosity, core need). A natural rate of trial.
● $Q$ (Coefficient of Imitation): Represents adoption driven by network effects or
word-of-mouth (seeing others use the product).
Non-Linear Least Squares (NLS) to Find $P$ and $Q$ (When $M$ is Known)
● If the market size $M$ is estimated externally, a standard linear regression cannot be
used directly.
● The goal is to find $P$ and $Q$ that minimize the Sum of Squared Errors (SSE):
$$SSE = \sum (N_t - \hat{N}_t)^2$$
● This requires specialized optimization routines (e.g., the optim function in R) to search
for the $P$ and $Q$ values that yield the smallest error.
A - Relative Advantage
● Definition: The degree to which a new product is perceived as being better than the
idea, product, or process it supersedes.
● Adoption Effect: Better products are adopted much quicker.
● Examples & Caveats:
○ AI Competition: A superior AI model (e.g., Claude) is expected to be adopted
faster than competitors.
○ First-Mover Advantage (Caveat): An older product (e.g., ChatGPT) may have
an initial adoption edge due to being first or being forcibly integrated into existing
ecosystems (e.g., Microsoft Office), giving it an unfair diffusion advantage
regardless of current superiority.
C - Compatibility
● Definition: The degree to which an innovation is consistent with the existing values, past
experiences, and current workflow of potential adopters.
● Adoption Effect: A technology compatible with existing things is adopted faster.
● Examples:
○ ChatGPT/Search: The landing page is intentionally minimal, looking similar to
the Google search bar, making it compatible with the user's existing online
search behavior.
○ Google Gemini Integration: Google integrates its AI response directly into the
standard search results page, avoiding making users change their workflow or
go to a separate site.
○ Robotics Design: Designing a home robot to look like a human makes it more
compatible with the social expectation of having a helper.
C - Perceived Complexity
O - Observability
● Definition: The degree to which the results or use of an innovation are visible to
others.
● Adoption Effect: Products that are highly observable are more likely to be imitated and
adopted.
● Examples:
○ High Observability (Fast Adoption): Products that stand out aesthetically
when used in public, like the Tesla Cybertruck.
○ Low Observability (Slow Adoption): Products used privately, like a new brand
of shampoo, where consumers do not typically discuss or see its use among
friends.
R - Risk
● Definition: The perceived chance of loss or negative consequences from adopting the
innovation. Risk is not limited to financial loss.
● Adoption Effect: Products perceived as risky are less likely to be adopted.
● Types of Risk:
○ Financial Risk: Losing money if the product is ineffective (e.g., an in-home robot
fails to perform).
○ Social/Perceptual Risk: Fear of being judged by others for adopting a new trend
(e.g., being one of the first to wear baggy jeans as they come back into fashion).
D - Divisibility (Trialability)
● Definition: The degree to which an innovation can be tested or tried on a limited basis
before a full financial commitment is made.
● Adoption Effect: Trialability increases the likelihood of adoption by lowering the
commitment hurdle.
● Examples:
○ High-Cost Goods: Test driving a car.
○ Food/Retail: Offering free samples (e.g., bagels) to allow customers to
experience the product.
● The framework is criticized by economists and marketers for focusing primarily on the
product side and neglecting the consumer side.
● Customer Heterogeneity: Adoption timing is often due to differences between people
(heterogeneity), not just product features.
○ Example: A commuter living far from campus may be an early adopter of an
Electric Vehicle (EV) to save on fuel costs, while a student living close to
campus may have no compelling economic need to adopt early.
● Market Evolution: The framework also misses the reality that markets evolve over time:
○ Price: Prices fall as production becomes more efficient.
○ Features: Products gain new features via updates (e.g., an AI robot receiving a
software update to enhance functionality).
● The ACCORD framework, while useful, is seen as incomplete by many economists and
marketers because it focuses too heavily on the product's attributes and neglects
crucial factors related to the customer and the evolving market.
● Price: As production becomes more efficient, prices for the product typically fall,
opening the door to new market segments.
● Features: Products and technologies are continually updated and improved, often
through software.
○ Example (AI Robot): A customer may buy a "barely functioning" AI robot initially,
knowing that software updates and new features will eventually turn it into a
high-performing tool.
● The BAS model assumes a smooth S-curve from early to mainstream adoption.
● The concept of Crossing the Chasm suggests there is a discontinuity or hurdle
between the early adopters and the mainstream majority.
○ Early Adopters: Often non-price-sensitive, interested in new technology for its
own sake.
○ Discontinuity: The market must overcome a barrier to achieve mainstream
adoption. The book's focus was on the strategies needed to successfully
navigate this transition.
●
●
● Revenue: Revenue generated from the customer in period t
● Costs: Costs incurred for the customer in period t (e.g., cost of goods, service).
● R: discount rate
● T: the time horizon - lifetime
● CAC: Customer Acquisition Cost (often an upfront cost, subtracted once)
When deciding whether to acquire a customer, the financial threshold is an NPV > 0. However,
managers often use a more conservative rule of thumb:
● The Rule: Total expected future profits (CLTV) should be 3 times the Customer
Acquisition Cost (CAC).
○ CLTV >= 3 * CAC
○ Rationale: The 3X rule acts as a risk buffer. Since CLTV relies on
approximations for things like future revenue, churn rates, and the discount rate,
this gap ensures that the acquisition remains profitable even if many of the
underlying assumptions are wrong.
● Using CLTV to "Fire" Customers
CLTV is primarily used to identify profitable customers worth acquiring, but it can also be used to
identify unprofitable customers who cost the company money.
QUIZZES
Quiz 1:
- True or False: A basic data object in R is the [Link] and the Tidyverse set of
packages are designed to work with an enhanced version of the [Link]
called a "tibble": True
- Suppose you read in the smartphone customer dataset csv file like so:
- cust_dat <-
read_csv("path/to/smartphone_customer_data.csv")
- True: calling nrow(cust_dat) or running cust_dat |> summarize(n())
returns the exact same values
- geom_point(): Ralph has data for two thousand new social media accounts. For
each account, the data indicate the number of number of posts during the first
365 days the account was active, and the number of acquired followers during
the same time period. Ralph wants to create a scatterplot of these data to
visualize the relationship between posts and followers. Which ggplot geometry is
most appropriate?
- A ggplot template might look something like:
- ggplot(data = <DATA>) +
- <GEOM_FUNCTION>(mapping = aes(<MAPPINGS>))
- where "aes" is shorthand for "aesthetics."In the terminology of ggplot, what is an
aesthetic: The visual properties that can be perceived on the graphic.
Quiz 2:
- Which of the following can you use to add a linear regression trend line in a
scatterplot with ggplot2: geom_smooth(method = "lm")
- Which of the following options correctly describes what this code returns: A
barplot of the number of customers for each brand, split into separate panels by
gender
- ggplot(cust_dat) +
facet_grid(rows = vars(gender)) +
geom_bar(aes(x = brand), fill = "dodgerblue4")
labs(x = "Brand", y = "Customer Count",
title = "Customer Count",
subtitle = "by Gender and Brand") +
theme_bw()
- facet_grid(rows = vars(gender)) split into separate panels by gender
- True: If you facet on a continuous variable, each unique value of the
continuous variable is treated as a category, so you get one facet per unique
value
- True: David is a product manager at a company that rents out high-end
camping gear. The company wants to grow its customer base by targeting
specific outdoor enthusiast segments. David created a segmentation based
on psychological factors such as adventure-seeking levels and
environmental consciousness from a survey of potential customers. He feels
confident about the segmentation because each segment is large and
distinct. David shared his segmentation with his colleague, Sarah.
- Sarah raises concerns that the segments aren't actionable because they can't be
easily identified based on straightforward customer characteristics, like location
or income, which makes it difficult to tailor marketing efforts effectively. True or
False: Sarah's concerns are valid.
- Jenna is a marketing analyst at a subscription-based online learning platform.
She created customer segments based on consumer's behaviors, such as how
frequently they engage with courses, the types of subjects they study, and their
subscription plan. Jenna showed her segmentation to her friend Alex.
- Alex pointed out that two segments have similar demographic characteristics:
both are mostly made up of young adults between the ages of 25 and 35. He
argues that these segments aren't truly different from each other because their
demographics overlap. Are Alex's concerns valid?
- No, Alex’s concerns are not valid because the segmentation is based on
customer behavior.
Quiz 3:
Quiz 5:
- Fit a multinomial logit model that includes product-specific intercepts and price
as explanatory variables on mdat2 (we called this out4 in the week 5 class
script). After estimating the model, you find that the predicted and actual
product-level market shares are identical. Which feature of this MNL model
specification explains this result?
- The use of product-specific intercepts (phone_id).
- The Angler Case
- Train Chapter 6 presents a case study of angler's preferences for fishing sites.
The empirical work for the case study utilizes a "mixed" logit model.
- Table 6.1 in Train presents the means and standard deviations of the
individual-level parameters (ie, the "random" coefficients).
- The image below corresponds to the Campground coefficients, with the mean
marked by a dashed vertical line, zero marked by a solid vertical line, and the
area to the right of zero shaded blue.
-
- What is the mean of the above distribution? (Round to 3 decimal places): 0.116
- What is the standard deviation of the above distribution? (Round to 3 decimal
places): 1.655
- In the distribution in above, the horizontal axis is for different values of…:
Angler-specific campground preferences
Quiz 6:
- In the basic Multinomial Logit (MNL) demand model, why do all customers facing the
same set of options have identical predicted choice probabilities Pnj=Pj:
- Because the model assumes identical preference parameters (no
subscript on α and β)
- Why can’t we simply add customer attributes (e.g., income, total minutes) directly
to the MNL model to account for heterogeneity?
- They are constant across alternatives and thus cancel out in the choice
probability formula
- What is the main purpose of performing k-fold cross-validation?
- To assess how well the model’s predictions generalize to unseen
customers
- If you use 10-fold cross validation, like we did in Class 6, on models out9 and
out10, you will find that the MSPE (sometimes called just the MSE) is higher for
out10 than out9. What does this mean?
- out9 is the better model because its MSPE is lower
- Which of the following patterns best indicates that a model is overfitting?
- In-sample prediction error decreases, but cross-validated prediction error
increases
- Given that out10 has slightly better in-sample fit (higher hit rates and
log-likelihood ratio) but a higher cross-validated MSPE than out9, which
statement best describes their performance?
- out9 achieves the best balance between in-sample fit and out-of-sample
predictive accuracy, while out10 adds unnecessary complexity that slightly
harms generalization.
-
- Scenario:
- Alex is a pricing strategist at EconoGoods, a company that sells household
essentials. The CMO at EconoGoods is considering revising the pricing strategy
for its flagship product, UltraClean Dish Soap. Alex provides the CMO with
historical monthly sales data and corresponding average prices:
393 0.853
351 0.921
345 0.946
306 1.002
340 0.949
348 0.943
337 0.978
395 0.884
363 0.912
343 0.971
- The CMO assumes that EconoGoods faces a constant price elasticity demand
curve and that the cost to manufacture each ounce of UltraClean Dish Soap is
40 cents.
- What is the price elasticity of demand for UltraClean Dish Soap?
(This should be a negative number, as a percentage increase in price
should be associated with a percentage decrease in sales.)
- Hint: You can calculate the elasticity by regressing the natural
log of quantity on the natural log of price and taking the slope
coefficient estimate. You can do this in Excel, Google Sheet, R, or
wherever you're most comfortable running regressions.
(Round to 2 decimals)
- 1.27
- True: True or False: If the CMO assumes a constant elasticity demand curve,
this implies that the percentage change in quantity demanded divided by the
percentage change in price is always equal to the price elasticity of demand, at
any point on the demand curve.
Quiz 7:
- What is the economic value to the consumer (ie, their max willingness to pay) to
purchase an otherwise identical Ford truck compared to a Nissan truck, based on
the following conjoint results? In other words, what is the maximal Ford price
premium relative to Nissan, based on the conjoint?
- Round to the nearest dollar if necessary, include a negative sign if necessary,
and omit commas and the dollar sign. For example,if the value is negative
$12,345.67 you would enter "-12345.67".
-
Attribute Level
Part-Worth
(estimated
coefficient)
Ford 40
Nissan -20
Vehicle Truck 80
Type
SUV 20
Cross-over -100
Price $50,000 50
$60,000 20
$70,000 -10
$80,000 -40
- 20,000
We began by optimizing the price for the S1 phone in order to optimize the profit of the
S1 phone only. We found that the optimal price for the S1 phone was...?
- $709
We again tested multiple prices for the S1 phone (again holding constant the price of
the S2 phone), but this time we analyzed the total profit to Samsung (from both the S1
and S2 phones). The total-profit maximizing price of the S1 phone was found to be...?
- $749
- The S1 price that optimized the S1 profit is different from the S1 price that
optimized total Samsung profit. This is because changing the S1 price causes
the estimated probability of buying the S1 and S2 phones to change.
- Specifically, compared to the observed prices, a consumer facing a lower S1
price will have an increased probability of buying the S1 phone and a decreased
probability of buying the S2 phone. The estimated probabilities translate into
estimated market shares, revenues, and profits. So changing the price of any
one phone is likely to affect the profit from all other phones.
- Yes
- Which of the following quotes are in the Bronnenberg, Dube, Syverson article
from the week 8 reading? Select any or all that apply.
-
- Quote 1:
- Brands would likely exist even in the absence of systematic advertising or other
corporate investments in brand-building.
-
- Quote 2:
- The total brand value represented by these 100 most valuable brands is
estimated to be $4.14 trillion in 2021 (more than the entire tangible capital stock
of Belgium) and has been growing at an average annual growth rate of 8.1
percent.
-
- Quote 3:
- Branding can form associations in the consumer’s memory that assist with recall
and consideration of the branded product.
-
- Quote 4
- The economics literature in recent decades has largely overlooked the role of
branding and marketing human capital for our understanding of markets and their
organization as well as firm productivity and macroeconomic growth... Given the
large economic magnitude of intangible brand capital and its recent growth, these
issues seem likely to be of first-order importance.
- All of these quotes
Quiz 8:
- A dollar today is worth more than a dollar tomorrow, since I can invest that dollar
today and make it worth more tomorrow.
- To discount future dollars back into today's dollars (also called the "present
value"), you divide those future dollars by (1+r)^t where r is your discount rate
and t is the number of time periods you need to discount back. You might
familiar with this idea from "discounted cash flows" often discussed in a finance
class.
- For example, if you have an 8% annual discount rate and you expect to receive
$100 in 2 years, you compute 100 / (1+0.08)^2 = 85.73.
- Often, you have a stream of future payments, say you expect $100 in each of the
next 3 years, then you compute (100 / 1.08^1) + (100 / 1.08^2) + (100 / 1.08^3).
This is often written with summation notation as:
- While these equations are compact and look nice in a textbook, the most
common way to do present value calculations is with a spreadsheet:
-
0 0 1 0
NPV 257.71
(ie, sum or
"net"
present
value)
-307.23
- Now, you might incur an upfront cost and face attrition.
- Suppose you operate a gym. You could pay $1,000 for an advertisement that
reached 100 people. You expect 4% of those people to "convert" into customers
by signing up for the gym. The gym membership is $100 per year, but it costs
you on average $10 per customer per year to service the equipment. In addition,
on average 20% of your customers cancel their membership every year and you
have a 7% discount rate. What is the NPV over the next 3 years for the
advertisement?
NPV = -19.14
-
- This negative net present value indicates that the cost exceeds the benefits, and
so it would be a bad advertising investment.
- 238,083.83