Blog Archives

Piketty in R markdown – we need some help from the crowd

June 30, 2014
By
Piketty in R markdown – we need some help from the crowd

Thomas Piketty's book Capital in the 21st Century was a surprise best seller and the subject of intense scrutiny. A few weeks ago the Financial Times claimed that the analysis was riddled with errors, leading to a firestorm of discussion. A … Continue reading →

Read more »

Privacy as a function of sample size

June 25, 2014
By

The U.S. Supreme Court just made a unanimous ruling in Riley v. California making it clear that police officers must get a warrant before searching through the contents of a cell phone obtained incident to an arrest. The message was put … Continue reading →

Read more »

The difference between data hype and data hope

June 23, 2014
By

I was reading one of my favorite stats blogs, StatsChat, where Thomas points to this article in the Atlantic and highlights this quote: Dassault Systèmes is focusing on that level of granularity now, trying to simulate propagation of cholesterol in human … Continue reading →

Read more »

Heads up if you are going to submit to the Journal of the National Cancer Institute

June 18, 2014
By

Update (6/19/14): The folks at JNCI and OUP have kindly confirmed that they will consider manuscripts that have been posted to preprint servers.  I just got this email about a paper we submitted to JNCI Dear Dr. Leek: I am … Continue reading →

Read more »

The future of academic publishing is here, it just isn’t evenly distributed

June 16, 2014
By

Academic publishing has always been a slow process. Typically you would submit a paper for publication and then wait a few months to more than a year (statistics journals can be slow!) for a review. Then you'd revise the paper … Continue reading →

Read more »

What I do when I get a new data set as told through tweets

June 13, 2014
By
What I do when I get a new data set as told through tweets

Hilary Mason asked a really interesting question yesterday: Data people: What is the very first thing you do when you get your hands on a new data set? — Hilary Mason (@hmason) June 12, 2014 You should really consider reading … Continue reading →

Read more »

10 things statistics taught us about big data analysis

May 22, 2014
By
10 things statistics taught us about big data analysis

In my previous post I pointed out a major problem with big data is that applied statistics have been left out. But many cool ideas in applied statistics are really relevant for big data analysis. So I thought I'd try to answer the … Continue reading →

Read more »

Why big data is in trouble: they forgot about applied statistics

May 7, 2014
By
Why big data is in trouble: they forgot about applied statistics

This year the idea that statistics is important for big data has exploded into the popular media. Here are a few examples, starting with the Lazer et. al paper in Science that got the ball rolling on this idea. The … Continue reading →

Read more »

Picking a (bio)statistics thesis topic for real world impact and transferable skills

April 22, 2014
By

One of the things that was hardest for me in graduate school was starting to think about my own research projects and not just the ideas my advisor fed me. I remember that it was stressful because I didn't quite … Continue reading →

Read more »

The #rOpenSci hackathon #ropenhack

April 10, 2014
By

Editor's note: This is a guest post by Alyssa Frazee, a graduate student in the Biostatistics department at Johns Hopkins and a participant in the recent rOpenSci hackathon.  Last week, I took a break from my normal PhD student schedule … Continue reading →

Read more »


Subscribe

Email:

  Subscribe