## Wouldn’t it be cool if Glenn Hubbard were consulting for Herbalife and I were on the other side?

February 6, 2013
By

I remember in 4th grade or so, the teacher would give us a list of vocabulary words each week and we’d have to show we learned them by using each in a sentence. We quickly got bored and decided to do the assignment by writing a single sentence using all ten words. (Which the teacher [...]

## Another cool aggregator site

February 5, 2013
By

There’s another site I’ve found that collects posts from a bunch of statistics/quantitative social science blogs: http://www.statsblogs.com/ Add it to your reader!

## Belly Button Biodiversity: Part One

February 5, 2013
By

This post is a excerpt from Think Bayes: Bayesian Statistics Made Simple, the book I am working on now.  You can read the entire current draft at http://thinkbayes.com.Belly button bacteriaBelly Button Biodiversity 2.0 (BBB2) is a nation-wide citi...

## Glenn Hubbard and I were on opposite sides of a court case and I didn’t even know it!

February 5, 2013
By

Matt Taibbi writes: Glenn Hubbard, Leading Academic and Mitt Romney Advisor, Took \$1200 an Hour to Be Countrywide’s Expert Witness . . . Hidden among the reams of material recently filed in connection with the lawsuit of monoline insurer MBIA against Bank of America and Countrywide is a deposition of none other than Columbia University’s [...]

## Junk Data

February 5, 2013
By

There are junk charts, and there are junk data. That was the thought that ran through my mind when I saw this post about a new FourSquare app (link). For those who are not familiar with it, FourSquare is this website that lets you broadcast your current location to your friends/followers. This new app, which won a competition hosted by FourSquare, allows users to fake their check-ins, in other words,…

## MCMSki IV, Jan. 6-8 (9?), 2014, Chamonix (news #3)

February 5, 2013
By

In case you have not been constantly tracking the changes on the MCMSki IV webpage, here are some news: the number of invited and accepted contributed sessions in the program had considerably increased, to the point of almost filling two parallel sessions for the whole duration of the meeting. This includes an exciting round-table on [...]

## Is Significance Significant?

February 5, 2013
By

Justin Esarey has a post up on his blog about the amount of information contained in a statistically significant result. I offer my quick thoughts and reactions.

## Next Kölner R User Meeting: 6 February 2013

February 5, 2013
By

Quick reminder: The next Cologne R user group meeting is scheduled for tomorrow, 6 February 2013. All details and the agenda are available on the KölnRUG Meetup site. Please sign up if you would like to come along. Notes from the last Cologne R user g...

## Tracking Number of Historical Clusters in DOW 30 and S&P 500

February 5, 2013
By

In the Tracking Number of Historical Clusters post, I looked at how 3 different methods were able to identify clusters across the 10 major asset universe. Today, I want to share the impact of clustering on the larger universe. Below I examined the historical time series of number of clusters in the DOW 30 and […]

## Recently in the sister blog

February 4, 2013
By

I remarked that If you’re havin’ electoral problems I feel bad for you son, I got 538 problems but partisan bias ain’t one, and it got such a strong reaction it caused me to rethink my entire approach to political blogging, so I followed up with, What we’ve got here is failure to communicate. In [...]

## U.S. Gun Murders in 2010: an Alternative View

February 4, 2013
By

How many gun murder victims in the U.S. are black? How many were killed with hand guns (and not with the now fiercely debated assault rifles)? U.S. Gun Murders in 2010 [periscopic.com] by Periscopic combines function and beauty to examine the data ret...

## … and we’re back!

February 4, 2013
By

To be finally back from Mali is kind of weird. It's good, of course. In fact it's great, because I think we were all ready to be back to our more "normal" life. But it's also strange, because it's being such an amazing experience.Not just because of XY...

## Too many MC’s not enough MIC’s, or What principles should govern attempts to summarize bivariate associations in large multivariate datasets?

February 4, 2013
By

Justin Kinney writes: Since your blog has discussed the “maximal information coefficient” (MIC) of Reshef et al., I figured you might want to see the critique that Gurinder Atwal and I have posted. In short, Reshef et al.’s central claim that MIC is “equitable” is incorrect. We [Kinney and Atwal] offer mathematical proof that the [...]

## Too many MC’s not enough MIC’s, or What principles should govern attempts to summarize bivariate associations in large multivariate datasets?

February 4, 2013
By

Justin Kinney writes: Since your blog has discussed the “maximal information coefficient” (MIC) of Reshef et al., I figured you might want to see the critique that Gurinder Atwal and I have posted. In short, Reshef et al.’s central claim that MIC is “equitable” is incorrect. We [Kinney and Atwal] offer mathematical proof that the [...]

## How Close Is The Normal Distribution?

February 4, 2013
By
$How Close Is The Normal Distribution?$

HOW CLOSE IS THE NORMAL DISTRIBUTION? One of the first things you learn in probability is that the average has a distribution that is approximately Normal. More precisely, if are iid with mean and variance then where and means “convergence in distribution.” 1. How Close? But how close is the distribution of to the Normal? [...]

## Simulate discrete variables by using the "Table" distribution

February 4, 2013
By

I wanted to write a blog post about the "Table distribution" in SAS. The Table distribution, which is supported by the RAND and the RANDGEN function, enables you to specify the probability of selecting each of k items. Therefore you can use the Table distribution to sample, with replacement, from [...]

## An infelicity with Value at Risk

February 4, 2013
By

More risk does not necessarily mean bigger Value at Risk. Previously “The incoherence of risk coherence” suggested that the failure of Value at Risk (VaR) to be coherent is of little practical importance. Here we look at an attribute that is not a part of the definition of coherence yet is a desirable quality. Thought … Continue reading →

## Proposed techniques for communicating the amount of information contained in a statistical result

February 4, 2013
By
$Proposed techniques for communicating the amount of information contained in a statistical result$

A couple of weeks ago, I posted about how much we can expect to learn about the state of the world on the basis of a statistical significance test. One way of framing this question is: if we’re trying to come to scientific conclusions on the basis of statistical results, how much can we update […]

## Paper: Storytelling, The Next Step for Visualization

February 4, 2013
By

Visualization is often considered to consist of three phases: exploration, analysis, and presentation. While the former two topics are covered well in the literature, there has been very little work specifically on presentation. In an upcoming paper, Jock Mackinlay and I argue that presentation, and in particular storytelling and communication of data, are the logical next step for the field, and provide some research directions. The paper is titled Storytelling:…

## Data Visualization for Education

February 4, 2013
By

Recently I was invited to give a talk to two cohorts of Strategic Data Project fellows. I was asked to speak about using data visualization to help inform decision-making of policy makers. At the same time, the group had a lot of variation in their int...

## Frequentist properties of Bayesian decision rules? A few more words…

February 4, 2013
By

In recent days I received e-mails from different readers of DBDA, inquiring about frequentist properties of Bayesian decision rules. One question asked about the relative false alarm rates of the traditional t test, on the one hand, and Bayesian estima...