SAS Forum Belux 2011: Talk on Ensemble Learning for Database Marketing / Customer Intelligence

When Predictive Models Join Forces: on the How and Why of Ensemble Learning for Customer Intelligence/ Database Marketing (By Koen W. De Bock, PhD)

In customer intelligence, predictive modeling is a key instrument. Applications such as customer churn prediction, response modeling, cross-sell (Next-Product-To-Buy) analysis or customer lifetime value analysis all depend upon inferences about expected future customer behavior or characteristics in order to make marketing campaigns more targeted and effective. While the success of these activities depends on decisions made in several phases of the modeling process, it is widely acknowledged that the choice of the modeling technique is a prominent one. In this presentation, light is shed upon the advantages of letting predictive models in CI join forces, whereby several models are combined into new and more powerful models. These so-called ensembles have consistently emerged as winning entries in data mining contests, such as the Teradata/Duke CRM competition, KDD Cup or the Netflix Prize since many years. However, despite their strength and intuitive nature, their applications in real-life business are still scarce. This talk will untangle the topic of ensemble learning, include an overview of the most important techniques, how they can be tailored to get the most out of your CI applications. The advantages of the techniques are demonstrated throughout several academic experiments on real-life datasets.



GAMens R Package Tutorial

GAMens R Package for binary ensemble classificationFor a first tutorial on this blog, I want to introduce you to the GAMens R package for binary ensemble classification, which resulted from a research collaboration of myself (Koen De Bock), Kristof Coussement and Dirk Van den Poel.

GAMens is a strongly performing algorithm for binary classification. It is an ensemble classifier (or multiple classifier systems), meaning that it is actually an aggregated model of many constituent classification models. The practice of taking several models and combining their predictions has Continue reading “GAMens R Package Tutorial” »

R for large datasets: Revolution R Enterprise

Revolution_R_Enterprise_LogoIf you are an academic or a practitioners in database marketing you’ve probably at least heard of R, the open-source freeware package for statistical computing. It has a number of attractive features that make it suitable for work on database marketing or customer intelligence. First, its matrix-based programming syntax, resembling the programming languages of Matlab and SAS/IML very closely, is fairly easy to master. Second, it is very popular in academics. As a consequence, many novel, state-of-the-art functions and algorithms are released as by-products of academic literature and become available to any user long before the commercial software package producers consider implementing them.
Continue reading “R for large datasets: Revolution R Enterprise” »

MCS 2011: The 10th International Workshop on Multiple Classifier Systems

MCS 2011Ensemble classification, or Multiple Classifier Systems (MCS), have received an increasing amount of attention in academic research on database marketing recently. Continue reading “MCS 2011: The 10th International Workshop on Multiple Classifier Systems” »

BAQMaR 2010 Conference: RE!SET

BAQMAR RESET bannerBAQMaR, the Belgian Association for Quantitative and Qualitative Marketing Research has recently announced this year’s edition of their annual conference, to be held on the 16th of December 2010 in Ghent Belgium. This years edition is entitled RE!SET. Like the previous editions, several opinion leaders from the market research and customer analytics industry have been announced. Moreover, for the first time the evening conference track will be preceded by an afternoon workshop program, with in-depth tutorials on specific topics within two different tracks; Marketing Research and Data Mining. Registration is open and has been reported to move lightning-fast, so don’t hesitate too long! More information on www.BAQMaR.be.



Customer churn prediction: how to solve the class imbalance problem?

A well-known problem in customer churn prediction is class imbalance. Unless a company is doing extremely bad, the number of customer who are actually going to churn is only a fraction of the total customer database. The problem resides in the fact that there is one majority class and one minority class and that most classification techniques assume balanced classes. Hence, their predictions aboute future customer behavior will be biased towards the majority class. In other words, the model will have severe difficulties to identify the heartbreaking customers. If you don’t do anything about this problem, the accuracy of your model will seem extremely good, but actually your model will not be able to make a good distinction between churners and non-churning customers. Continue reading “Customer churn prediction: how to solve the class imbalance problem?” »

Database Marketing defined and introduced

Let’s start off this blog by explaining the concept of Database Marketing. Database marketing is defined by Blattberg et al. [1] asthe use of customer database to enhance marketing productivity through more effective acquisition, retention, and development of customers. Firstly, this definition stipulates that database marketing is aimed at an improvement of customer relationships and emphasizes the three pillars of Customer Relationship Management: customer acquisition, customer development and customer retention. These three elements are linked to the concept of the customer life cycle, as represented in Figure 1. Continue reading “Database Marketing defined and introduced” »