## Hierarchical forecasting with hts v4.0

February 12, 2014
By

A new version of my hts package for R is now on CRAN. It was completely re-written from scratch. Not a single line of code survived. There are some minor syntax changes, but the biggest change is speed and scope. This version is many times faster than the previous version and can handle hundreds of thousands of time series without complaining. The speed-up is due to some new research I…

February 12, 2014
By

These are questions I answered recently for an undergrad who had questions about how best to prepare herself for graduate school. I thought the answers might be more widely of interest, and with a lot of editing, am including the questions and answers ...

## Circular Binning Map Highlights Enjoyable U.S. Weather Conditions

February 11, 2014
By

Binning is a clever method to avoid overlapping data points by aggregating multiple points in a grid of polygons, and using color to denote the relative density (see some interesting explanations here and here). The map The Pleasant Places to Live [k...

## A million ways to connect R and Excel

February 11, 2014
By

In quantitative finance both R and Excel are the basis tools for any type of analysis. Whenever one has to use Excel in conjunction with R, there are many ways to approach the problem and many solutions. It depends on what you really want to do and the size of the dataset you’re dealing with. I […]

## There is no Such Thing as Biomedical "Big Data"

February 11, 2014
By

At the moment, the world is obsessed with “Big Data” yet it sometimes seems that people who use this phrase don’t have a good grasp of its meaning.  Like most good buzz-words, “Big Data” sparks the idea of something grand and complicated...

## An Interview With Bradley Efron

February 11, 2014
By

You've all heard about the bootstrap, and you all know that it was Bradley Efron (Statistics, Stanford) who came up with the idea. (If I'm wrong, you can check this earlier post.)On Twitter yesterday, Joe Blitzstein (Statistics, Harvard; @stat110)...

## My talks in Bristol this Wed and London this Thurs

February 11, 2014
By

1. Causality and statistical learning (Wed 12 Feb 2014, 16:00, at University of Bristol): Causal inference is central to the social and biomedical sciences. There are unresolved debates about the meaning of causality and the methods that should be used to measure it. As a statistician, I am trained to say that randomized experiments are […]The post My talks in Bristol this Wed and London this Thurs appeared first on…

## A deeper look at the Bloomberg report

February 11, 2014
By

In the prior post, I linked to Eric P.'s (link) vetting of the Bloomberg chart on the drop in median male income in the U.S. in the last few decades. Just as a reminder, here is the key chart: In...

## Control an LED with the Raspberry Pi and via the web

February 11, 2014
By

What a great little device the Raspberry Pi is! After my initial setup it is time to play around with the input and output pins. The first example has to be to switch on an LED. This can also be done remotely via a web interface and better, I cannot on...

## Le Vote par Procuration en France

February 11, 2014
By
$P_i\sim\mathcal{B}(N_i,p_i)$

La Vie des Idées a mis en ligne, ce matin, un court texte, écrit par Baptiste Coulmont (a.k.a. @coulmont) et Joël Gombin (a.k.a. @joelgombin), auquel j’ai très modestement contribué, intitulé ”Un homme, deux voix. Le vote par procuration“. Alors que sur son blog, Baptiste a rajouté pas mal d’information sur le vote par procuration en France (et le contexte général, en particulier pourquoi autant de partis courtisent certaines personnes en les incitant…

## How dplyr replaced my most common R idioms

February 10, 2014
By

Having written a lot of R code over the last few years, I've developed a set of constructs for my most common tasks. Like an idiom in a natural language (e.g. "break a leg"), I automatically grasp their meaning without having to think about it. Because they allow me to become more and more productive »more

## Unprincipled Component Analysis

February 10, 2014
By

As a data scientist I have seen variations of principal component analysis and factor analysis so often blindly misapplied and abused that I have come to think of the technique as unprincipled component analysis. PCA is a good technique often used to reduce sensitivity to overfitting. But this stated design intent leads many to (falsely) […] Related posts: Bad Bayes: an example of why you need hold-out testing Don’t use…

## Scraping Pro-Football Data and Interactive Charts using rCharts, ggplot2, and shiny

February 10, 2014
By

UPDATE: THE BLOG/SITE HAS MOVED TO GITHUB. THE NEW LINK FOR THE BLOG/SITE IS patilv.github.io and THE LINK TO THIS POST IS: http://bit.ly/1k0mKWI. PLEASE UPDATE ANY BOOKMARKS YOU MAY HAVE.This post uses pro-football (American) boxscore data from 1966 t...

## Reporter: Nicholas Felton Releases Self-Logging iPhone App

February 10, 2014
By

Nicholas Felton, the person behind the infographic-style setting Feltron annual reports (see 2012, 2011, 2010, 2009, 2008, 2007, 2006, 2005) has released a self-logging app for the iPhone, called Reporter [reporter-app.com]. The app presents the use...

## More on US health care overkill

February 10, 2014
By

Paul Alper writes: You recently posted my moving and widening the goalposts contention. In it, I mentioned “how diagnoses increase markedly while deaths are flatlined” indicating that we are being overdiagnosed and overtreated. Above are 5 frightening graphs which illustrate the phenomenon. Defenders of the system might (ludicrously) contend that it is precisely the aggressive […]The post More on US health care overkill appeared first on Statistical Modeling, Causal Inference,…

## Is it true that all epistemic principles can only be defended circularly? A Popperian puzzle

February 10, 2014
By

Current day Popperians, the “critical rationalists”, espouse the following epistemic principle CR:[i] (CR) it is reasonable to adopt or believe a claim or theory P which best survives serious criticism. What justifies CR?  To merely declare it is a reasonable epistemic principle without giving evidence that following it advances any epistemic goals is entirely unsatisfactory, […]

## An offer for New York Times Business reporters

February 10, 2014
By

Reading the New York Times's coverage of the unemployment report irritates me so much that I'm putting an offer on the table: if they contact me, I'll send them a copy of Numbersense (link) for free so that they can educate themselves about employment statistics (Chapter 6). *** In the meantime, let me run through all the irritating errors they keep reporting month after month (link to article). By the…

## Where Have All the Tenured Women Gone?

February 10, 2014
By

Ingram Olkin has an excellent editorial about gender equality in statistics departments in the US.  Everyone should read it.  Each University has its own business structure, and UCLA has its own structure. Biostatistics also differs from statistics i...

## Modelling Olympic Medal Wins

February 10, 2014
By

With the Sochi Winter Olympics now well underway, I was reminded of the empirical literature that has attempted to model the number of medals that different countries win.Back in 2006, one of our students, Glen Roberts, wrote an excellent paper, titled...

## On deck this week

February 10, 2014
By

This blog has roughly a month’s worth of items waiting to be posted. I post about once a day, sometimes rescheduling posts to make room for something topical. Anyway, it struck me that I know what’s coming up, but you don’t. So, here’s what we have for you during the next few days: Mon: More […]The post On deck this week appeared first on Statistical Modeling, Causal Inference, and Social…

## Applying numbersense to a Bloomberg report

February 10, 2014
By

Eric P. asked me to comment on his recent post via twitter. (Yes, that's another way to submit charts to me.) Overall, I love the spirit of Eric's article. He's using his numbersense, in this case by asking what the...

## NBER Econonometrics "Methods Lectures" Videos

February 10, 2014
By

For nearly a decade, the National Bureau of Economic Research has been holding a day of econometrics "Methods Lectures" during the Summer Institute, with the speakers and sub-topic changing each year.Evidently it's not widely known that the lecture vid...

## Monday data/statistics link roundup (2/10/14)

February 10, 2014
By

I'm going to try Monday's for the links. Let me know what you think. The Guardian is reading our blog. A week after Rafa posts that everyone should learn to code for career preparedness, the Guardian gets on the bandwagon. … Continue reading →