Data Mining

Data mining blogs

Creating DDL For An Entire Database In SQL Server 2008

February 10, 2010
By

Recently, I started a new project which has a database component. I looked around for some visual data modeling tools, and I settled on just using the diagrams capability of SQL Server. Since the client is using SQL Server, it was simple to download ...

Read more »

Simpson’s Paradox and Marketing

February 2, 2010
By

A reader asked the following question:Hi Michael/Gordon, In campaign measurements, it's possible to get a larger lift at the overall level compared to all the individual decile level lifts or vice versa, because of the differences in sample size acro...

Read more »

Oracle load scripts now avalable for Data Analysis Using SQL and Excel

January 19, 2010
By

Classes started this week for the spring semester at Boston College where I am teaching a class on marketing analytics to MBA students at the Carroll School of Management.  The class makes heavy use of Gordon's book, Data Analysis Using SQL and Ex...

Read more »

Hadoop and Parallel Dataflow Programming

January 9, 2010
By

Over the past three months, I have been teaching myself enough Hadoop to get comfortable with using the environment for analytic purposes.There has been a lot of commentary about Hadoop/MapReduce versus relational databases (such as the articles refere...

Read more »

MapReduce versus Relational Databases?

January 5, 2010
By

The current issue of Communications of the ACM has articles on MapReduce and relational databases. One, MapReduce a Flexible Data Processing Tool, explains the utility of MapReduce by two Google fellows -- appropriate authors, since Google invented th...

Read more »

Hadoop and MapReduce: Normalizing Data Structures

January 2, 2010
By
Hadoop and MapReduce:  Normalizing Data Structures

To set out to learn Hadoop and Map/Reduce, I tackled several different problems. The last of these problems is the challenge of normalizing data, a concept from the world of relational databases. The earlier problems were adding sequential row number...

Read more »

Differential Response or Uplift Modeling

December 28, 2009
By
Differential Response or Uplift Modeling

Some time before the holidays, we received the following inquiry from a reader:Dear Data Miners,I’ve read interesting arguments for uplift modeling (also called incremental response modeling) [1], but I’m not sure how to implement it. I have respo...

Read more »


Subscribe

Email:

  Subscribe