Excel-bashing

April 17, 2013
By
Excel-bashing

In response to the latest controversy, a statistics professor writes: It’s somewhat surprising to see Very Serious Researchers (apologies to Paul Krugman) using Excel. Some years ago, I was consulting on a trademark infringement case and was trying (unsuccessfully) to replicate another expert’s regression analysis. It wasn’t until I had the brainstorm to use Excel [...]The post Excel-bashing appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Venn-Diagram Based Representation of Combinations of Chronic Diseases

April 17, 2013
By
Venn-Diagram Based Representation of Combinations of Chronic Diseases

The simple yet persuasive interactive infographic accompanying the NYTimes article "For the Elderly, Diseases That Overlap" [nytimes.com], reveals the relative frequency with which different chronic diseases overlap for residents in assisted-living fa...

Read more »

NUTS discussed on Xi’an’s Og

April 17, 2013
By

Xi’an’s Og (aka Christian Robert’s blog) is featuring a very nice presentation of NUTS by Marco Banterle, with discussion and some suggestions. I’m not even sure how they found Michael Betancourt’s paper on geometric NUTS ...

Read more »

Data problems, coding errors…what can be done?

April 17, 2013
By

This post is by Phil A recent post on this blog discusses a prominent case of an Excel error leading to substantially wrong results from a statistical analysis. Excel is notorious for this because it is easy to add a row or column of data (or intermediate results) but forget to update equations so that [...]The post Data problems, coding errors…what can be done? appeared first on Statistical Modeling, Causal…

Read more »

Two unhealthy submissions from readers

April 17, 2013
By
Two unhealthy submissions from readers

Josh hated this "dataless visualization" from ABC. (link; warning: ads). Here are his comments: The report has planes leaving China, landing across the globe and instantly infecting us all with bird flu. It doesn't do a good job explaining how...

Read more »

Occupational hazards in data science

April 17, 2013
By

An interesting episode is developing in econometrics over the very high profile Reinhart-Rogoff paper that was heavily cited as a source to "prove" that high levels of national debt impede growth. It appears that that result was based on a combination of spreadsheet errors, and bad assumptions. 1. Andrew Gelman has a great discussion here. His main concern is ethics of data analysts. This is a very important point -…

Read more »

Quantile regression: Better than connecting the sample quantiles of binned data

April 17, 2013
By
Quantile regression: Better than connecting the sample quantiles of binned data

I often see variations of the following question posted on statistical discussion forums: I want to bin the X variable into a small number of values. For each bin, I want to draw the quartiles of the Y variable for that bin. Then I want to connect the corresponding quartile [...]

Read more »

Interview with a forced convert from Matlab to R

April 17, 2013
By
Interview with a forced convert from Matlab to R

Here is an interview with Ron Hochreiter, Assistant Professor at WU Vienna University Economics and Business. In 25 words or less tell us what you do (using German words is cheating). I consider myself as a data scientist (teaching and research) with roots in Mathematical Programming, i.e. Optimization under Uncertainty (Stochastic Programming). You were an […]The post Interview with a forced convert from Matlab to R appeared first on Burns…

Read more »

Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

April 17, 2013
By
Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

You may have already seen a lot written on the replication of Reinhart & Rogoff’s (R &amp R) much cited 2010 paper done by Herndon, Ash, and Pollin. If you haven’t, here is a round up of some of some of what has been written: Konczal, Y...

Read more »

Memo to Reinhart and Rogoff: I think it’s best to admit your errors and go on from there

April 17, 2013
By
Memo to Reinhart and Rogoff:  I think it’s best to admit your errors and go on from there

Jeff Ratto points me to this news article by Dean Baker reporting the work of three economists, Thomas Herndon, Michael Ash, and Robert Pollin, who found errors in a much-cited article by Carmen Reinhart and Kenneth Rogoff analyzing historical statistics of economic growth and public debt. Mike Konczal provides a clear summary; that’s where I [...]The post Memo to Reinhart and Rogoff: I think it’s best to admit your errors…

Read more »

I wish economists made better plots

April 16, 2013
By
I wish economists made better plots

I'm seeing lots of traffic on a big-time economics article by that failed to reproduce and here are my quick thoughts. You can read a pretty good summary here by Mike Konczal. Quick background: Carmen Reinhart and Kenneth Rogoff wrote … Continue reading →

Read more »

My talk in Chicago this Thurs 6:30pm

April 16, 2013
By

Choices in Visualizing Data This time, it’s not at the university, it’s at a data science meetup. Here are the slides. I actually prefer the term “statistical graphics” or “visualizing quantitative information” rather than “visualizing data.” I spend a lot of time graphing inferences and fitted models, understanding my fits and doing exploratory model analysis. [...]The post My talk in Chicago this Thurs 6:30pm appeared first on Statistical Modeling, Causal…

Read more »

Flotsam 11: mostly on books

April 16, 2013
By
Flotsam 11: mostly on books

‘No estaba muerto, andaba the parranda’† as the song says. Although rather than partying it mostly has been reading, taking pictures and trying to learn how to record sounds. Here there are some things I’ve come across lately. I can’t remember if I’ve recommended Matloff’s The Art of R Programming before; if I haven’t, go […]

Read more »

Test Driven Analysis?

April 16, 2013
By
Test Driven Analysis?

At the last LondonR meeting Francine Bennett from Mastodon C shared some of her experience and findings from an analysis of a large prescriptions data set of the UK's national health service (NHS). However, it was her last slide, which I found the most...

Read more »

RStudio is reminding me of the older Macs

April 16, 2013
By
RStudio is reminding me of the older Macs

The only thing missing is the cryptic ID number.Well, the only bad thing is that I am trying to run a probabilistic graphical model on some real data, and having a crash like this will definitely slow things down.

Read more »

Four-day course in doing Bayesian data analysis, June 10-13

April 16, 2013
By
Four-day course in doing Bayesian data analysis, June 10-13

There will be a four-day introductory course in doing Bayesian data analysis, June 10-13 (2013), at the University of St. Gallen, Switzerland. The course is offered through the Summer School in Empirical Research Methods. Complete info is at this link:...

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

April 15, 2013
By
MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

More exciting news about MCMSki IV! First thing first, the 16 contributed sessions are now all-set, having gotten the stamp of approval from the scientific committee! Thanks to everyone who submitted a session proposal. (There were so many proposals that we alas had to reject some, as well as every single talk proposal… Sorry people: […]

Read more »

Isotonic Regression

April 15, 2013
By
Isotonic Regression

My latest contribution for scikit-learn is an implementation of the isotonic regression model that I coded with Nelle Varoquaux and Alexandre Gramfort. This model finds the best least squares fit to a set of points, given the constraint that the f...

Read more »

Data science only poses a threat to (bio)statistics if we don’t adapt

April 15, 2013
By

We have previously mentioned on this blog how statistics needs better marketing. Recently, Karl B. has suggested that “Data science is statistics” and Larry W. has wondered if “Data science is the end of statistics?” I think there are a … Continue reading →

Read more »

How effective are football coaches?

April 15, 2013
By

Dave Berri writes: A recent study published in the Social Science Quarterly suggests that these moves may not lead to the happiness the fans envision (HT: the Sports Economist). E. Scott Adler, Michael J. Berry, and David Doherty looked at coaching changes from 1997 to 2010. What they found should give pause to people who [...]The post How effective are football coaches? appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Doing legwork, doing justice

April 15, 2013
By
Doing legwork, doing justice

The New York Times brought attention to the Bronx courtrooms this weekend. (link) The following small-multiples chart effectively illustrates how the Bronx system is uniquely unproductive, compared to the other boroughs: The above chart shows the outcomes. The next chart...

Read more »

The role of statistics in the top public health achievements of the 20th century

April 15, 2013
By
The role of statistics in the top public health achievements of the 20th century

In this International Year of Statistics, I'd like to describe the major role of statistics in public health advances. In our modern society, it is sometimes difficult to recall the huge advances in health and medicine in the 20th century. To name a few: penicillin was discovered in 1928, risk [...]

Read more »

Stock-picking opportunity and the ratio of variabilities

April 15, 2013
By
Stock-picking opportunity and the ratio of variabilities

How good is the current opportunity to pick stocks relative to the past? Idea The more stocks act differently from each other relative to how volatile they are, the more opportunity there is to benefit by selecting stocks.  This post looks at a particular way of investigating that idea. Data Daily (log) returns of 442 … Continue reading →

Read more »


Subscribe

Email:

  Subscribe