# Statistics

Statistics Blogs

## Internet use and religion, part five

November 30, 2015
By

In the previous article, I show results from two regression models that predict religious affiliation and degree of religiosity.  I use the models to compare hypothetical respondents who are at their national means for all explanatory factors; the...

## The hidden benefits of open-source software

November 29, 2015
By

I’ve been having discussions with colleagues and university administration about the best way for universities to manage home-grown software. The traditional business model for software is that we build software and sell it to everyone willing to pay. Very often, that leads to a software company spin-off that has little or nothing to do with the university that nurtured the […]

## The proliferation of useless data

November 29, 2015
By

One of the secrets of great data analysis is thoughtful data collection. Great data collection is necessary but not sufficient for great data analysis. I recently had the unfortunate need to select a new doctor. Every time I had to do this, it has been an exercise in frustration and desperation. And after wasting hours and hours perusing the "data" on doctors, inevitably I give up and just throw a…

November 29, 2015
By

They all use statistics …  in the media, in politics, in sports. But they mostly forget that statistics, especially official statistics, are made by professionals in a quite demanding, time- and resource-consuming process. The WO/MAN-IN-THE-MIDDLE, the professionals, providing information and knowledge from facts remain hidden (despite Googles’ statement that statistician will be ‘the sexy job in … Continue reading Attention please!

November 29, 2015
By

A few month ago, I did mention a graph, of some so-called Lorenz curves to compare regression models, see e.g. Progressive’s slides (thanks Guillaume for the reference) The idea is simple. Consider some model for the pure premium (in insurance, it is the quantity that we like to model), i.e. the conditional expected valeur On some dataset, we have our predictions, as well as observed quantities, . The curve are obtained simply…

November 29, 2015
By

Some recent criticisms of statistical tests of significance have breathed brand new life into some very old howlers, many of which have been discussed on this blog. One variant that returns to the scene every decade I think (for 50+ years?), takes a “disagreement on numbers” to show a problem with significance tests even from a “frequentist” perspective.  Since it’s […]

## 3 YEARS AGO (NOVEMBER 2012): MEMORY LANE

November 25, 2015
By

MONTHLY MEMORY LANE: 3 years ago: November 2012. I mark in red three posts that seem most apt for general background on key issues in this blog.[1]. Please check out others that didn’t make the “bright red cut”. If you’re interested in the Likelihood Principle, check “Blogging Birnbaum” and “Likelihood Links”. If you think P-values are hard to explain, see how […]

## Even the tiniest error messages can indicate an invalid statistical analysis

November 25, 2015
By

The other day, I was reading in a data set in R, and the function indicated that there was a warning about a parsing error on one line. I went ahead with the analysis anyway, but that small parsing error kept bothering me. I thought it was just one lin...

## Internet use and religion, part four

November 24, 2015
By

In the previous article, I presented preliminary results from a study of relationships between Internet use and religion.  Using data from the European Social Survey, I ran regressions to estimate the effect of media consumption (television, radio...

## Statistical Models That Support Design Thinking: Driver Analysis vs. Partial Correlation Networks

November 24, 2015
By

We have been talking about design thinking in marketing since Tim Brown's Harvard Business Review article in 2008. It might be easy for the data scientist to dismiss the approach as merely a type of brainstorming for new products or services. Yet, desi...