Posts Tagged ‘ data ’

Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014
By

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library called Parso, which also reads sas7bdat files. They have worked out most of the remaining kinks. For example, the Parso library reads sas7bdat files with compressed […]

Read more »

Trifacta, an attempt to simplify the analyst’s life

September 8, 2014
By

A LinkedIn contact and 538 reader pointed me to this demo video by Joe Hellerstein, from a Bay Area startup called Trifacta. They have a neat product that tries to automate data cleaning/processing tasks for analysts. I love that people are working on this problem. It's an area that I'm interested in getting involved in. Also, they have a sleek user interface, well thought out, and innovative. There is a…

Read more »

Background on my new 538 article on restaurant health grades

September 3, 2014
By
Background on my new 538 article on restaurant health grades

Some behind-the-scenes comments on my recent article on New York's restaurant inspection grades; it appeared on FiveThirtyEight this Tuesday. *** The Nature of Ratings This article is about the ratings of things. I devoted a considerable amount of pages to this topic in Numbersense (link) - Chapter 1 is all about the US News ranking of schools. A few key points are: All rating schemes are completely subjective. There is…

Read more »

Getting Quandl Data Into EViews

September 2, 2014
By
Getting Quandl Data Into EViews

I've sung the praises of Quandl before - e.g., see here. What's not to like about millions of free time series data - especially when they're linked back to their original sources so that updating and accuracy is the least of your worries.If you can th...

Read more »

Mind your Qs

August 20, 2014
By
Mind your Qs

Rescheduling Notice: I have been informed by the organizers that the Meetup tonight has to be rescheduled due to an unexpected problem with the venue. When a new date is set, I will let you know. *** Since I am...

Read more »

David Mimno on "Data Carpentry"

August 19, 2014
By
David Mimno on "Data Carpentry"

There's a post on David Mimno's blog  today titled, "Data Carpentry".I like it a lot, because it emphasises just how much effort, time and creativity can be required in order to get one's data in order before we can get on with the fun stuff ...

Read more »

Davenport on targeted marketing

August 13, 2014
By

Tom Davenport is one of the leading voices on business analytics, and he has a new piece titled "Why are most 'targeted' marketing offers so bad?" in which he expanded on a question I raised in my HBR article. Tom's book Competing on Analytics is a classic. He has a great appreciation for the business of the data business. In the new feature, Davenport classifies marketing offers he gets into…

Read more »

Applied Statistics Lesson of the Day – The Coefficient of Variation

Applied Statistics Lesson of the Day – The Coefficient of Variation

In my statistics classes, I learned to use the variance or the standard deviation to measure the variability or dispersion of a data set.  However, consider the following 2 hypothetical cases: the standard deviation for the incomes of households in Canada is $2,000 the standard deviation for the incomes of the 5 major banks in Canada […]

Read more »

One guaranteed to make Stephen Few cry

August 4, 2014
By
One guaranteed to make Stephen Few cry

Vox published this chart: This sort of chart is, unfortunately, quite common in business circles. Just about the only thing one can read readily from this chart is the overall growth in the plug-in vehicle market (the heights of the...

Read more »

Another day, another misuse of averages

July 30, 2014
By

The other day, we are told that if we walk anywhere in New York, we will bump into a few millionaires (link). This week, we are told that wherever we go in the US, "Roughly, every third person you pass on the street is going to have debt in collections" (link). The woman who said this has a PhD in Economics from Cornell. Oh please. The claim comes from yet…

Read more »


Subscribe

Email:

  Subscribe