Blog Archives

Applied Statisticians: people want to learn what we do. Let’s teach them.

September 15, 2014
By

In this recent opinion piece, Hadley Wickham explains how data science goes beyond Statistics and that data science is not promoted in academia. He defines data science as follows: I think there are three main steps in a data science … Continue reading →

Read more »

Academic statisticians: there is no shame in developing statistical solutions that solve just one problem

July 25, 2014
By

I think that the main distinction between academic statisticians and those calling themselves data scientists is that the latter are very much willing to invest most of their time and energy into solving specific problems by analyzing specific data sets. … Continue reading →

Read more »

The Big in Big Data relates to importance not size

May 28, 2014
By
The Big in Big Data relates to importance not size

In the past couple of years several non-statisticians have asked me "what is Big Data exactly?" or "How big is Big Data?". My answer has been "I think Big Data is much more about "data" than "big". I explain below. … Continue reading →

Read more »

Confession: I sometimes enjoy reading the fake journal/conference spam

April 30, 2014
By

I've spent a considerable amount of time setting up filters to avoid getting spam from fake journals and conferences. Unfortunately, they are exceptionally good at thwarting my defenses. This does not annoy me as much as I pretend because, secretly, … Continue reading →

Read more »

Correlation does not imply causation (parental involvement edition)

April 17, 2014
By

The New York Times recently published an article on education titled "Parental Involvement Is Overrated". Most research in this area supports the opposite view, but the authors claim that "evidence from our research suggests otherwise".  Before you stop helping your children … Continue reading →

Read more »

Writing good software can have more impact than publishing in high impact journals for genomic statisticians

April 7, 2014
By

Every once in a while we see computational papers published in science journals with high impact factors.  Genomics related methods appear quite often in these journals. Several of my junior colleagues express frustration that all their papers get rejected from these journals. … Continue reading →

Read more »

Data Analysis for Genomics edX Course

March 31, 2014
By

Mike Love (@mikelove) and I have been working hard the past couple of months preparing a free online edX course on data analysis for genomics. Our target audience are the postdocs, graduate students and research scientists that are tasked with … Continue reading →

Read more »

The fact that data analysts base their conclusions on data does not mean they ignore experts

March 24, 2014
By

Paul Krugman recently joined the new FiveThirtyEight hating bandwagon. I am not crazy about the new website either (although I'll wait more than one weeks before judging) but in a recent post Krugman creates a false dichotomy that is important to … Continue reading →

Read more »

How to use Bioconductor to find empirical evidence in support of π being a normal number

March 14, 2014
By
How to use Bioconductor to find empirical evidence in support of π being a normal number

Happy π day everybody! I wanted to write some simple code (included below) to the test parallelization capabilities of my  new cluster. So, in honor of  π day, I decided to check for evidence that π is a normal number. A … Continue reading →

Read more »

Per capita GDP versus years since women received right to vote

March 7, 2014
By
Per capita GDP versus years since women received right to vote

Below is a plot of per capita GPD (in log scale) against years since women received the right to vote for 42 countries. Is this cause, effect, both or neither? We all know correlation does not imply causation, but I … Continue reading →

Read more »


Subscribe

Email:

  Subscribe