Read from one data set and write to another with SAS/IML

September 17, 2014

Many people know that the SAS/IML language enables you to read data from and write results to multiple SAS data sets. When you open a new data set, it is a good programming practice to close the previous data set. But did you know that you can have two data […]

R package to convert statistical analysis objects to tidy data frames

September 16, 2014

I talked a little bit about tidy data in my recent post about dplyr, but you should really go check out Hadley’s paper on the subject. R expects inputs to data analysis procedures to be in a tidy format, but the model output objects that you get back are...

They know my email but they don’t know me

September 16, 2014

This came (unsolicited) in the inbox today (actually, two months ago; we’re on a delay, as you’re probably aware), subject line “From PWC – animations of CEO opinions for 2014”: Good afternoon, I wanted to see if the data my colleague David sent to you was of any interest. I have attached here additional animated […] The post They know my email but they don’t know me appeared first on…

Don’t data puke, says Avinash Kaushik

September 16, 2014

Here are five amazing recommendations by Avinash Kaushik from a post about how to make Web analytics dashboards better by simplifying. Dashboards are not reports. Don't data puke. Include insights. Include recommendations for actions. Include business impact. NEVER leave data interpretation to the executives (let them opine on your recommendations for actions with benefit of their wisdom and awareness of business strategy). When it comes to key performance indicators, segments…

Introduction to Statistical Computing

September 16, 2014

At an intersection of Enigmas of Chance and Corrupting the Young. Class homepage Fall 2014 Class announcement Lectures: Introduction to the Course; Basic Data Types Bigger Data Structures Dataframes and Control Introduction to Strings Regular...

Notes from the Kölner R meeting, 12 September 2014

September 16, 2014

Last Friday we had guests from Belgium and the Netherlands joining us in Cologne. Maarten-Jan Kallen from BeDataDriven came from The Hague to introduce us to Renjin, and the guys from DataCamp in Leuven, namely Jonathan, Martijn and Dieter, gave an ove...

Beyond the Knee-Jerk Reaction

September 16, 2014

There is a tendency to just reflexively make fun of certain types of charts, in particular pie charts and 3D charts. While that is often justified, there are also exceptions. Not all pie charts are bad, and not all 3D charts are terrible. But to spot those outliers, we have to suppress the knee-jerk reflex and give them a moment of thought before ripping them apart.

Data Mining versus (?) Data Science

September 16, 2014

Two of my favorite answers are: "A data scientist is a statistician who lives in San Francisco" and "A data scientist is someone who is better at statistics than any software engineer and better at software engineering than any statistician." (These have both...

More bad news for the buggy-whip manufacturers

September 15, 2014

In a news article regarding difficulties in using panel surveys to measure the unemployment rate, David Leonhardt writes: The main factor is technology. It’s a major cause of today’s response-rate problems – but it’s also the solution. For decades, survey research has revolved around the telephone, and it’s worked very well. But Americans’ relationship with […] The post More bad news for the buggy-whip manufacturers appeared first on Statistical Modeling,…

Applied Statisticians: people want to learn what we do. Let’s teach them.

September 15, 2014

In this recent opinion piece, Hadley Wickham explains how data science goes beyond Statistics and that data science is not promoted in academia. He defines data science as follows: I think there are three main steps in a data science …

On deck this week

September 15, 2014

Mon: More bad news for the buggy-whip manufacturers Tues: They know my email but they don’t know me Wed: What do you do to visualize uncertainty? Thurs: Sokal: “science is not merely a bag of clever tricks . . . Rather, the natural sciences are nothing more or less than one particular application — albeit […] The post On deck this week appeared first on Statistical Modeling, Causal Inference, and…

Relevance, to you or me: a response to Cairo

September 15, 2014

Alberto Cairo discussed a graphic by the New York Times on the slowing growth of Medicare spending (link). The chart on the top is published, depicting the quite dramatic flattening of the growth in average spending over the last years--average...

My dataviz workshop starts in October

September 15, 2014

Course announcement is on my sister blog. This course adopts the creative writing workshop structure to teach data visualization. Think of chart making as a craft. We emphasize the importance of sketches, revisions, feedback from readers, and a critical eye. It's limited enrollment.

1976 NBER-Census Time Series Conference

September 15, 2014

What a great blast from the past -- check out the program of the 1976 NBER-Census Time-Series Conference. (Thanks to Bill Wei for forwarding, via Hang Kim.) The 1976 conference was a pioneer in bridging time-series econometrics and statistics. Econ...

Handling run-time errors in user-defined modules

September 15, 2014

I received the following email from a SAS/IML programmer: I am getting an error in a PROC IML module that I wrote. The SAS Log says "NOTE: Paused in module NAME". When I submit other commands, PROC IML doesn't seem to understand them. How can I continue the program? The […]

How do you say π^π^π?

September 15, 2014

Well, not that you probably want to know how to say such an absurdly large number. However, for those of you who are interested (allowing for rounding), it is: one quintillion, three hundred forty quadrillion, one hundred sixty-four trillion, one h...
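
As a quick check on the size of that number, the tower can be evaluated numerically. The following is a minimal sketch in Python, assuming the tower is read right to left as π^(π^π), which is the reading that lands in the quintillions. The function name `power_tower` is made up for illustration and is not from the original post.

```python
import math


def power_tower(base, height):
    """Evaluate base^base^...^base with `height` copies, right to left."""
    result = 1.0
    for _ in range(height):
        result = base ** result
    return result


# Right-to-left reading: pi ** (pi ** pi)
value = power_tower(math.pi, 3)

# Left-to-right reading for comparison: (pi ** pi) ** pi, which is far smaller
left_to_right = (math.pi ** math.pi) ** math.pi

# Floats carry roughly 15 to 16 significant digits, so only the
# leading digits of the large value are meaningful.
print(f"{value:,.0f}")
print(f"{left_to_right:,.0f}")
```

Only the leading groups (quintillions down to trillions) can be trusted from double-precision arithmetic; spelling out every digit would need arbitrary-precision arithmetic.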

Regression with Python, pandas and StatsModels

September 14, 2014

I was at Boston Data-Con 2014 this morning, which was a great event. The organizer, John Verostek, seems to have created this three-day event single-handedly, so I am hugely impressed. Imran Malek started the day with a very nice IPython tutorial....

One datavis for you, ten for me

September 14, 2014

Over the years of my graduate studies I made a lot of plots. I mean tonnes. To get an extremely conservative estimate I grep’ed for every instance of “plot\(” in all of the many R scripts I wrote over the past five years. The actual number is very likely orders of magnitude larger as 1) many […]
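
The counting step described above can be sketched in a few lines. This is a hedged illustration in Python rather than the author's actual command: it assumes the scripts use the `.R` extension and live under one directory, and the function name `count_plot_calls` is hypothetical.

```python
import re
from pathlib import Path

# Same pattern as in the text: the literal string "plot(".
# Like grep, this also matches calls such as boxplot( or ggplot(.
PLOT_CALL = re.compile(r"plot\(")


def count_plot_calls(root):
    """Count every occurrence of 'plot(' in all .R files under root."""
    total = 0
    for script in Path(root).rglob("*.R"):
        text = script.read_text(encoding="utf-8", errors="ignore")
        total += len(PLOT_CALL.findall(text))
    return total


if __name__ == "__main__":
    print(count_plot_calls("."))
```

Counting occurrences in source files only gives a lower bound, since a single call inside a loop or function can produce many plots.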

If Scotland becomes a country

September 14, 2014

On the 18th of September 2014, Scottish people will vote on secession from the United Kingdom, potentially ending a union that has existed since 1707. If Scots vote “Yes” to end the union, the United Kingdom will consist of England, Wales and Northern Ireland, while the newly created country of Scotland may look like this: […]

Six quotes from Kaiser Fung

September 14, 2014

You may think you have all of the data. You don’t. One of the biggest myths of Big Data is that data alone produce complete answers. Their “data” have done no arguing; it is the humans who are making this claim. Before getting into the methodological issues, one needs to ask the most basic question. […] The post Six quotes from Kaiser Fung appeared first on Statistical Modeling, Causal Inference,…

Trying a prefmap

September 14, 2014

Preference mapping is a key technique in sensory and consumer research. It links the sensory perception of products to the liking of products and hence provides clues for the development of new, good-tasting products. Even though it is a key technique,...

RDataMining Slides Series

September 14, 2014

by Yanchang Zhao, RDataMining.com I have made a series of slides on R and data mining, based on my book titled R and Data Mining — Examples and Case Studies. The slides will be used at my presentations at seminars …

The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

September 13, 2014

"...a metaphor is an affair between a predicate with a past and an object that yields while protesting." Nelson Goodman (1976) It is as if data matrices were alive. The rows are species, and the columns are habitats. At least that seems to be the...
