Unemployment of Europe in 2014 by NUTS 2 region

April 25, 2015
By
Unemployment of Europe in 2014 by NUTS 2 region

During the Christmas break I worked on some code to show unemployment by NUTS 2 region. At that point no 2014 data was available. When I noticed the 214 was available I dug up the code and plotted again.Data and CodeAs written, the code was made beginn...

Read more »

Random Data Sets Quickly

April 25, 2015
By
Random Data Sets Quickly

This post will discuss a recent GitHub package I’m working on, wakefield to generate random data sets. The post is broken into the following sections: Demo 1.1 Random Variable Functions 1.2 Random Data Frames 1.3 Missing Values 1.4 Default Data … Continue reading →

Read more »

Bayesian comparison of groups using Python emcee

April 24, 2015
By
Bayesian comparison of groups using Python emcee

Prof. Brain Blais has implemented the BEST model of two groups in emcee, a Python system for MCMC sampling. See his post about it here.

Read more »

“Statistical Concepts in Their Relation to Reality” by E.S. Pearson

April 24, 2015
By
“Statistical Concepts in Their Relation to Reality” by E.S. Pearson

To complete the last post, here’s Pearson’s portion of the “triad”  “Statistical Concepts in Their Relation to Reality” by E.S. PEARSON (1955) SUMMARY: This paper contains a reply to some criticisms made by Sir Ronald Fisher in his recent article on “Scientific Methods and Scientific Induction”. Controversies in the field of mathematical statistics seem largely […]

Read more »

100 Years

April 24, 2015
By
100 Years

Comparing statistical visualizations over a period of 100 years is quite rare. The newly published Atlas of the Swiss Federal …Continue reading →

Read more »

Statistical significance, practical significance, and interactions

April 24, 2015
By
Statistical significance, practical significance, and interactions

I’ve said it before and I’ll say it again: interaction is one of the key underrated topics in statistics. I thought about this today (OK, a couple months ago, what with our delay) when reading a post by Dan Kopf on the exaggeration of small truths. Or, to put it another way, statistically significant but […] The post Statistical significance,…

Read more »

scale acceleration

April 23, 2015
By
scale acceleration

Kate Lee pointed me to a rather surprising inefficiency in matlab, exploited in Sylvia Früwirth-Schnatter’s bayesf package: running a gamma simulation by rgamma(n,a,b) takes longer and sometimes much longer than rgamma(n,a,1)/b, the latter taking advantage of the scale nature of b. I wanted to check on my own whether or not R faced the same […]

Read more »

Gelman speed read

April 23, 2015
By

For those who have found it tough to keep up with Andrew Gelman's prolificacy, here are some brief summaries of several recent posts: On people obsessed with proving the statistical significance of tiny effects: "they are trying to use a bathroom scale to weigh a feather—and the feather is resting loosely in the pouch of a kangaroo that is vigorously…

Read more »

Edmond Malinvaud: A Tribute to his Contributions in Econometrics

April 23, 2015
By
Edmond Malinvaud: A Tribute to his Contributions in Econometrics

I wrote this brief post just after Edmond Malinvaud passed away on 7 March of this year, at the age of 91. Peter Phillips' tribute to Malinvaud is a "must read" piece (see here).Like Peter, I also used Malinvaud's text when undertaking my Masters-...

Read more »

Political Attitudes in Social Environments

April 23, 2015
By

Jose Duarte, Jarret Crawford, Charlotta Stern, Jonathan Haidt, Lee Jussim, and Philip Tetlock wrote an article, “Political Diversity Will Improve Social Psychological Science,” in which the argued that the field of social psychology would benefit from the inclusion of more non-liberal voices (here I’m using “liberal” in the sense of current U.S. politics). Duarte et […] The post Political Attitudes…

Read more »

What if the Washington Post did not display all the data

April 23, 2015
By
What if the Washington Post did not display all the data

Thanks to reader Charles Chris P., I was able to get the police staffing data to play around with. Recall from the previous post that the Washington Post made the following scatter plot, comparing the proportion of whites among police...

Read more »

Thinking big at Yahoo

April 22, 2015
By
Thinking big at Yahoo

I’m speaking in the “Yahoo Labs Big Thinkers” series on Friday 26 June. I hope I can live up to the title! My talk is on “Exploring the boundaries of predictability: what can we forecast, and when should we give up?”  Essentially I will start with some of the ideas in this post, and then discuss the […]

Read more »

Conjoint Analysis and the Strange World of All Possible Feature Combinations

April 22, 2015
By
Conjoint Analysis and the Strange World of All Possible Feature Combinations

The choice modeler looks over the adjacent display of cheeses and sees the joint marginal effects of the dimensions spanning the feature space: milk source, type, origin, moisture content, added mold or bacteria, aging, salting, packaging, price, and m...

Read more »

A message from the vice chairman of surgery at Columbia University: “Garcinia Camboja. It may be the simple solution you’ve been looking for to bust your body fat for good.”

April 22, 2015
By
A message from the vice chairman of surgery at Columbia University:  “Garcinia Camboja. It may be the simple solution you’ve been looking for to bust your body fat for good.”

Should Columbia University fire this guy just cos he says things like this: “You may think magic is make believe but this little bean has scientists saying they’ve found the magic weight loss cure for every body type—it’s green coffee extract.” “I’ve got the No. 1 miracle in a bottle to burn your fat. It’s […] The post A message…

Read more »

Link: The Power of Wee Things

April 22, 2015
By

Lena Groeger (of ProPublica) has written a beautiful piece about the Power of Wee Things. She talks about using small things, multiples, and units to display data and get people interested. The article goes through many, many examples covering many different areas and ideas. She also gave a great talk on the topic at OpenVis 2014. On a somewhat related … Continue reading Link: The…

Read more »

Instead of worrying about multiple hypothesis correction, just fit a hierarchical model.

April 22, 2015
By
Instead of worrying about multiple hypothesis correction, just fit a hierarchical model.

Pejman Mohammadi writes: I’m concerned with a problem in multiple hypothesis correction and, despite having read your article [with Jennifer and Masanao] on not being concerned about it, I was hoping I could seek your advice. Specifically, I’m interested in multiple hypothesis testing problem in cases when the test is done with a discrete finite […] The post Instead of…

Read more »

Sum a series in SAS

April 22, 2015
By
Sum a series in SAS

A customer asked: How do we go about summing a finite series in SAS? For example, I want to compute for various integers n ≥ 3. I want to output two columns, one for the natural numbers and one for the summation of the series. Summations arise often in statistical […] The post Sum a series in SAS appeared first…

Read more »

Travelling Thilaksha

April 22, 2015
By
Travelling Thilaksha

One of my PhD students, Thilaksha Tharanganie, has been very successful in getting travel funding to attend conferences. She was the subject of a write-up in today’s Monash News. We encourage students to attend conferences, and provide funding for them to attend one international conference and one local conference during their PhD candidature. Thilaksha was […]

Read more »

NEYMAN: “Note on an Article by Sir Ronald Fisher” (3 uses for power, Fisher’s fiducial argument)

April 22, 2015
By
NEYMAN: “Note on an Article by Sir Ronald Fisher” (3 uses for power, Fisher’s fiducial argument)

Note on an Article by Sir Ronald Fisher By Jerzy Neyman (1956) Summary (1) FISHER’S allegation that, contrary to some passages in the introduction and on the cover of the book by Wald, this book does not really deal with experimental design is unfounded. In actual fact, the book is permeated with problems of experimentation.  […]

Read more »

R: single plot with two different y-axes

April 21, 2015
By
R: single plot with two different y-axes

I forgot where I originally found the code to do this, but I recently had to dig it out again to remind myself how to draw two different y axes on the same plot to show the values of two different features of the data. This is somewhat distinct from th...

Read more »

The feather, the bathroom scale, and the kangaroo

April 21, 2015
By
The feather, the bathroom scale, and the kangaroo

Here’s something I wrote in the context of one of those “power = .06″ studies: My criticism of the ovulation-and-voting study is ultimately quantitative. Their effect size is tiny and their measurement error is huge. My best analogy is that they are trying to use a bathroom scale to weigh a feather—and the feather is […] The post The feather,…

Read more »

Nice analysis of racial composition of police forces

April 21, 2015
By
Nice analysis of racial composition of police forces

The Washington Post has a good idea. Using Census data, they computed the proportion of police force who are white and the corresponding proportion of citizens who are white, in different cities. In the following scatter plot, they singled out...

Read more »

More on the AA paper

April 21, 2015
By
More on the AA paper

This is a supplement to the previous post about a new research paper on the effect of Alcoholics Anonymous, and an NY Times exposition that I commented on. A misreading of that article led me to complain about per-protocol analysis, which wasn't the methodology behind the Humphrey et. al. research. I will explain their methodology in this post (known as…

Read more »


Subscribe

Email:

  Subscribe