Data Mining

Data mining blogs

Why I don’t recommend MS Access

April 8, 2014

Recently, I was asked: "Why do you not recommend Access to use? Just curious. Read on page xi of your intro in Data Analysis Using SQL and Excel. Just beginning a class in SQL and bought your text. Thanks, Mort" This is a very fair question and o...

Read more »

Doing the Right Thing: Are your measures correct?

March 30, 2014

"A lot of good analysis is wasted doing the wrong thing." Anyone who has worked with data on business problems is probably aware of this adage.  And this past week, I was reminded once again of this fact while analyzing a marketing program.  T...

Read more »

Three SQL Constructs You Can Forget About

March 25, 2014

SQL is a very powerful language, which could, of course, be made even more powerful and useful.  This post discusses three features of the language -- ANSI standard features -- that seem not only unnecessary but downright detrimental.  That i...

Read more »

Big Data and SQL

March 20, 2014

I happen to think that SQL is a very viable option for analyzing big data.  I was thinking about this when I read a book review recently: For instance, Siegel reports, people who buy small felt pads that adhere to the bottom of chair legs (to protect th...

Read more »

Heuristics in Analytics

March 12, 2014

Last week, a book -- a real, hard-cover paper-paged book -- arrived in the mail with the title:  Heuristics in Analytics:  A Practical Perspective of What Influences Our Analytic World.  The book wasn't a total surprise, because I had re...

Read more »

Lines and Circles and Logistic Regression

March 1, 2014

Euclidean geometry, formalized in Euclid's Elements about 2,300 years ago, is in many ways a study of lines and circles.  One might think that after more than two millennia, we have moved beyond such basic shapes, particularly in a realm such as da...

Read more »

Taking a Random Sample on Amazon Redshift

February 26, 2014

Recently, I was approached by Vicky, whom I'm working with at a client, to help with a particular problem.  She wanted to calculate page view summaries for a random sample of visitors from a table containing about a billion page views.  This i...

Read more »

Hopper – new in the travel space

January 19, 2014

Briefly - Hopper is something new in the travel / local space. In their own words: What if you could plan an amazing trip based on a vague idea — like “spring surfing in California” or “Mediterranean cruise”? What if...

Read more »

FitBit: A great product with an even better website

December 29, 2013

Briefly - Wakako gave me (actually us) a FitBit for Christmas. This is a great product if you are (like me) motivated by data to take action. While I appreciate the device design (small but functional), I really like the...

Read more »

Review: Information is Beautiful by David McCandless

December 29, 2013

Information is Beautiful is a thought-provoking labour of love by one of the first true data journalists, David McCandless. It is a simply structured collection of graphical interpretations of a variety of interesting statistics, factoids and opinions. It is...

Read more »

