An Introduction to Predictive Customer Lifetime Value Modeling
Jean-Rene Gauthier 02.27.17
How can you predict the value of a customer over the course of his or her
interactions with your business? That's a question many companies are trying
to answer, and it was the subject of my Feb. 28 webcast on O’Reilly Media.
Accounting for these network effects can be challenging at first, which is why,
for the sake of simplicity, I will be focusing on direct purchases only in this post.
Historical Customer Lifetime Value
There are many methodologies that deal with the portion of CLV associated
with direct purchases, but the two broadest classes are generally defined as
historical and predictive CLV. Historical methods look at past data and make a
judgment on the value of customers solely based on past transactions, without
any attempt to predict what those customers will do next.
https://www.datascience.com/blog/intro-to-predictive-modeling-for-customer-lifetime-value 2/17
29/06/2018 An Introduction to Predictive Customer Lifetime Value Modeling
In principle, this is a valid approach if the customers behave similarly and have
been interacting with the company for roughly the same amount of time.
However, there’s generally a fair amount of heterogeneity among customers.
The chart below shows a few purchasing trajectories to illustrate my point.
Time goes from left to right. The vertical dashed line represents the present
time, and each small, vertical line represents an order/purchase made by a
customer:
Typical historical approaches will apply a recency-of-last-purchase criterion to
distinguish between active and inactive users. Average past purchase behavior
is employed to measure the relative (or in some cases, absolute) value of
customers.
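
As a minimal sketch of such a historical approach, the snippet below labels customers as active or inactive with a recency cutoff and summarizes their average past spend. The order data, the 180-day cutoff, and the function name `historical_summary` are hypothetical and chosen only for illustration.

```python
from datetime import date

# Hypothetical order history: customer id -> list of (order date, amount).
orders = {
    "A": [(date(2016, 1, 5), 40.0), (date(2016, 2, 9), 35.0), (date(2016, 3, 1), 45.0)],
    "B": [(date(2016, 6, 20), 30.0), (date(2016, 11, 15), 50.0)],
}


def historical_summary(history, today, recency_cutoff_days=180):
    """Label a customer by recency of last purchase and average past spend."""
    last_order = max(order_date for order_date, _ in history)
    days_since_last = (today - last_order).days
    amounts = [amount for _, amount in history]
    return {
        # Assumption: a customer is "active" if the last order is recent enough.
        "active": days_since_last <= recency_cutoff_days,
        "n_orders": len(amounts),
        "avg_order_value": sum(amounts) / len(amounts),
        "total_spend": sum(amounts),
    }


today = date(2017, 1, 1)
summary = {cid: historical_summary(history, today) for cid, history in orders.items()}
```

In this toy data, customer A has the higher total spend but fails the recency test, while customer B passes it. The summary says nothing about what either customer will do next.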
However, there are several problems with such methodologies. For example,
the first customer in the chart above has made more purchases than the
second customer, but in fact, the first customer is more likely to be
inactive than the second one. Value based on past averages would claim that
the first customer is more valuable, yet the second customer is still active and
could make many more purchases in the future. Methods that account for
variation in the behavior of customers will allow us to arrive at more accurate
conclusions about customer lifetime and purchase behavior.
Predictive Customer Lifetime Value
The goal of predictive CLV is to model the purchasing behavior of customers in
order to infer what their future actions will be. Whether a predictive CLV model
and methodology makes sense for your use case will largely be determined by
the business context. For the purpose of this post, business context is defined
along two dimensions: non-contractual vs. contractual business settings, and
continuous vs. discrete purchase opportunities. This context definition should
cover the vast majority of business cases. Below, I have included a table
highlighting the differences between these contexts:
Below are some examples of business cases belonging to each one of the four
quadrants. CLV models for fitness clubs or insurance policies will differ from
the ones targeting grocery purchases, for example:
Probabilistic Models For The Non-Contractual And Continuous Purchase Setting
Perhaps the most common business context is the non-contractual one, in
which the purchase opportunity is continuous. A large number of
probabilistic models have been built to address the challenges of modeling
lifetime value in such a context. These types of models have been in use for
several decades. They are applicable to a wide variety of business situations
and, in many cases, are your “go-to” models. Probabilistic models are definitely
a good first step (and sometimes the only one!) toward CLV modeling.
Machine learning and Markov models are also worthy approaches to CLV
modeling, but they need to be tweaked and sometimes customized to fit the
particulars of a business situation. In the few case studies comparing the
outcome of these different models, probabilistic approaches and machine
learning models tend to produce results that are of a similar quality.
Different Probabilistic Models, But Similar Modeling Frameworks
Let’s take a closer look at probabilistic models. There are several different
flavors of probabilistic models out there; however, they all tend to share a
similar modeling framework. In this framework, CLV models typically
constrain the same three latent (unobserved) parameters characterizing
customer behavior:
The Pareto/NBD Model: A Good First Step Toward CLV Modeling
The Pareto/NBD model is perhaps the most well-known and frequently applied
probabilistic model in the non-contractual context. I created the chart below to
illustrate how the model works:
The Pareto/NBD portion is on the left side of the chart in the dashed rectangle.
Pareto/NBD only focuses on the purchase count and lifetime. It does not
address the monetary value component. There are a few models out there that
address monetary value; I've chosen the Gamma-Gamma extension to the
Pareto/NBD model (as seen in the chart above).
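
To give a rough feel for what the Pareto/NBD portion assumes about purchase count and lifetime, here is a small simulation sketch. It follows the standard formulation of the model as I understand it: while a customer is alive, purchases arrive as a Poisson process; the unobserved lifetime is exponential; and the purchase and dropout rates vary across customers according to gamma distributions. The parameter names (`r`, `alpha`, `s`, `beta`) follow common notation, and the values and the weekly time unit are hypothetical.

```python
import random


def simulate_customer(rng, r, alpha, s, beta, observation_period):
    """Draw one customer's lifetime and purchase times under Pareto/NBD-style assumptions."""
    # Latent rates differ across customers and are gamma distributed.
    # random.gammavariate takes (shape, scale), so scale = 1 / rate parameter.
    purchase_rate = rng.gammavariate(r, 1.0 / alpha)
    dropout_rate = rng.gammavariate(s, 1.0 / beta)
    # The unobserved lifetime is exponential with the customer's dropout rate.
    lifetime = rng.expovariate(dropout_rate)
    # While alive, purchases form a Poisson process: exponential gaps between orders.
    horizon = min(lifetime, observation_period)
    times = []
    t = rng.expovariate(purchase_rate)
    while t < horizon:
        times.append(t)
        t += rng.expovariate(purchase_rate)
    return lifetime, times


rng = random.Random(42)  # seeded for reproducibility
customers = [
    simulate_customer(rng, r=2.0, alpha=4.0, s=2.0, beta=20.0, observation_period=52.0)
    for _ in range(1000)
]
purchase_counts = [len(times) for _, times in customers]
```

Fitting the model runs this story in reverse: given only the observed purchase times, it estimates the gamma parameters that best explain them.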
First, you must train the model over a training period whose length is at least
three times the typical inter-purchase time of your customers. Using customer
data and simulations, we found that three times is the minimum; five to ten
times is definitely better.
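
The rule of thumb above can be turned into a quick check. The sketch below assumes that "typical" means the median gap between consecutive orders, pooled over repeat customers; that choice, the function names, and the sample dates are my own illustration rather than part of the method.

```python
from datetime import date
from statistics import median


def typical_interpurchase_days(orders_by_customer):
    """Median gap in days between consecutive orders, pooled over repeat customers."""
    gaps = []
    for order_dates in orders_by_customer.values():
        ordered = sorted(order_dates)
        gaps.extend((later - earlier).days for earlier, later in zip(ordered, ordered[1:]))
    if not gaps:
        raise ValueError("need at least one customer with two or more orders")
    return median(gaps)


def training_period_days(orders_by_customer, multiple=3):
    """Training length as a multiple of the typical inter-purchase time."""
    return multiple * typical_interpurchase_days(orders_by_customer)


# Hypothetical order dates per customer.
orders = {
    "A": [date(2016, 1, 1), date(2016, 1, 31), date(2016, 3, 1)],
    "B": [date(2016, 2, 1), date(2016, 3, 12)],
    "C": [date(2016, 5, 5)],  # single order: contributes no gap
}

typical = typical_interpurchase_days(orders)
minimum_days = training_period_days(orders, multiple=3)
preferred_days = (training_period_days(orders, 5), training_period_days(orders, 10))
```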
The training period will give you an estimate for the model parameters. You
should then be able to compare what the model predicts vs. what you
observed in the training period at the customer level. If the purchase count is
in agreement, the next step is to compare predictions with observations made
in a validation/holdout period. This period has not been observed by the
model. If the model performs well in the validation/holdout period, then you
can forecast for a period of time from several months to several years,
depending on your business needs.
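
The training/holdout comparison described above might be set up along these lines. This is only a sketch: the cutoff dates, the helper names, and especially the `predicted_holdout` numbers are hypothetical placeholders standing in for the output of a fitted model.

```python
from datetime import date


def split_counts(orders_by_customer, calibration_end, holdout_end):
    """Count each customer's orders in the training and holdout windows."""
    counts = {}
    for cid, order_dates in orders_by_customer.items():
        cal = sum(1 for d in order_dates if d <= calibration_end)
        hold = sum(1 for d in order_dates if calibration_end < d <= holdout_end)
        if cal > 0:  # only customers seen during training can be scored
            counts[cid] = (cal, hold)
    return counts


def mean_absolute_error(predicted, observed):
    """Average absolute gap between predicted and observed purchase counts."""
    return sum(abs(predicted[cid] - observed[cid]) for cid in observed) / len(observed)


# Hypothetical order dates per customer.
orders = {
    "A": [date(2016, 1, 10), date(2016, 3, 2), date(2016, 8, 15)],
    "B": [date(2016, 2, 1), date(2016, 9, 1), date(2016, 11, 20)],
    "C": [date(2016, 10, 5)],  # first seen after training ends, so excluded
}

counts = split_counts(orders, date(2016, 6, 30), date(2016, 12, 31))
observed_holdout = {cid: hold for cid, (_, hold) in counts.items()}

# Placeholder predictions; in practice these come from the trained model.
predicted_holdout = {"A": 1.5, "B": 1.0}
holdout_mae = mean_absolute_error(predicted_holdout, observed_holdout)
```

The key design point is that holdout orders are never shown to the model, so agreement there is evidence that the forecast can be trusted.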
The Gamma-Gamma Extension To The Pareto/NBD Model
As mentioned above, the Pareto/NBD model focuses on modeling lifetime and
purchase count. The monetary value extension to the Pareto/NBD model noted
on the right side of the chart, Gamma-Gamma, makes a few assumptions:
Tying These Two Models Together: CLV Estimates At The Customer Level
The Pareto/NBD model allows you to compute the expected number of
purchases in a forecast period at the customer level. The Gamma-Gamma
model, in turn, allows you to assign a value to each of those future purchases.
Forecasting CLV for each customer then becomes straightforward: you simply
multiply the expected values from the two models. The same calculation lets
you compare predicted and observed CLV over the holdout period before
making any forecasts.
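
The multiplication step can be written out in a couple of lines. The per-customer numbers below are hypothetical stand-ins for the outputs of the two fitted models, and `forecast_clv` is a name I made up for this sketch.

```python
def forecast_clv(expected_purchases, expected_order_value):
    """Customer-level CLV: expected purchase count times expected value per purchase."""
    return {
        cid: expected_purchases[cid] * expected_order_value[cid]
        for cid in expected_purchases
    }


# Placeholder model outputs for the forecast period.
expected_purchases = {"A": 0.4, "B": 3.2}      # from the purchase-count model
expected_order_value = {"A": 40.0, "B": 38.5}  # from the monetary-value model

clv = forecast_clv(expected_purchases, expected_order_value)
ranked = sorted(clv, key=clv.get, reverse=True)  # most valuable customers first
```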
Additional Information
To make these concepts concrete, I have created
a public GitHub repo that contains a notebook and a test dataset of an online
retailer in order to supplement my O'Reilly webcast. In the notebook, you will
find the steps to train both the Pareto/NBD and Gamma-Gamma models and
compute CLV at the customer level.