Significant birthdays in the weekend

May 25, 2014
By

I am a listener to BBC's podcast More or Less. In the program Tim Harford looks at data with both humour and determination to find what the numbers mean. Last week he handled a listener question. Does everybody get a significant birthday (20, 30 years ...

Read more »

An interesting mosaic of a data programming course

May 24, 2014
By

Rajit Dasgupta writes: I have been working on a website, SlideRule that in its present state, is a catalog of online courses aggregated from over 35 providers. One of the products we are building on top of this is something called Learning Paths, which are essentially a sequence of Online Courses designed to help learners […] The post An interesting mosaic of a data programming course appeared first on Statistical…

Read more »

Buzzfeed, Porn, Kansas…That Can’t Be Good

May 24, 2014
By

This post is by David K. Park and courtesy of Alex Palen Ellis… Thought you might find this funny: Buzzfeed set out to study porn consumption versus the red/blue political spectrum. And they failed miserably. An article form opennews.org outlines six major fallacies Buzzfeed committed, the best of which resulted in the Kansas effect: “Pornhub’s writeup omitted […] The post Buzzfeed, Porn, Kansas…That Can’t Be Good appeared first on Statistical Modeling, Causal…

Read more »

De la difficulté de faire des prévisions (quand on a peu de données)

May 24, 2014
By
De la difficulté de faire des prévisions (quand on a peu de données)

Depuis plusieurs mois, on observe un engouement (probablement légitime) pour le big data. Si beaucoup peut être fait pour utiliser les volumes énormes de données à la disposition des assureurs, il convient de garder en mémoire que dans beaucoup de cas, les données sont rares et que la technologie ne devrait pas pouvoir changer grand chose. Le manque de données (fiables) crée une variabilité importante. Loi des grands nombres, approximations…

Read more »

The gremlins did it? Iffy statistics drive strong policy recommendations

May 23, 2014
By
The gremlins did it?  Iffy statistics drive strong policy recommendations

Recently in the sister blog. Yet another chapter in the continuing saga, Don’t Trust Polynomials. P.S. More here. The post The gremlins did it? Iffy statistics drive strong policy recommendations appeared first on Statistical Modeling, Causal ...

Read more »

Structural breaks

May 23, 2014
By
Structural breaks

I’m tired of reading about tests for structural breaks and here’s why. A structural break occurs when we see a sudden change in a time series or a relationship between two time series. Econometricians love papers on structural breaks, and apparently believe in them. Personally, I tend to take a different view of the world. I think a more realistic view is that most things change slowly over time, and…

Read more »

Did Neyman really say of Fisher’s work, “It’s easy to get the right answer if you never define what the question is,” and did Fisher really describe Neyman as “a theorem-proving poseur who wouldn’t recognize real data if it bit him in the ass”?

May 23, 2014
By
Did Neyman really say of Fisher’s work, “It’s easy to get the right answer if you never define what the question is,” and did Fisher really describe Neyman as “a theorem-proving poseur who wouldn’t recognize real data if it bit him in the ass”?

To answer the question in the title of this post: Of course not. Fisher is English. They say arse, not ass. But here’s a quote that is floating around. Joseph Wilson quotes science reporter Regina Nuzzo: Neyman called some of Fisher’s work mathematically “worse than useless”; Fisher called Neyman’s approach “childish” and “horrifying [for] intellectual […] The post Did Neyman really say of Fisher’s work, “It’s easy to get the…

Read more »

Rationality and Bayesian Objectivity

May 23, 2014
By

Frequentism and Subjective Bayes are both special cases of Objective Bayes (to the extent they’re true at all). I’ve detailed (here, here, here, and here) in exactly what sense Frequentism is a special case of Bayes. Here I will do the same...

Read more »

Introducing Probability

May 23, 2014
By
Introducing Probability

I have a guilty secret. I really love probability problems. I am so happy to be making videos about probability just now, and conditional probability and distributions and all that fun stuff. I am a little disappointed that we won’t be … Continue reading →

Read more »

Introducing Probability

May 23, 2014
By
Introducing Probability

I have a guilty secret. I really love probability problems. I am so happy to be making videos about probability just now, and conditional probability and distributions and all that fun stuff. I am a little disappointed that we won’t be … Continue reading →

Read more »

A. L. Nagar

May 23, 2014
By
A. L. Nagar

Earlier this year I had a post in memory of the eminent statistician and econometrician, Anirudh Nagar. His passing was a great loss to our profession. Today, I was pleased to learn about this site that honours A. L. Nagar's life and contributions...

Read more »

10 things statistics taught us about big data analysis

May 22, 2014
By
10 things statistics taught us about big data analysis

In my previous post I pointed out a major problem with big data is that applied statistics have been left out. But many cool ideas in applied statistics are really relevant for big data analysis. So I thought I'd try to answer the … Continue reading →

Read more »

The need for documenting functions

May 22, 2014
By
The need for documenting functions

My current work usually requires me to work on a project until we can submit a research paper, and then move on to a new project. However, 3-6 months down the road, when the reviews for the paper return, it is quite common to have to do some new analyses or re-analyses of the data. […]

Read more »

Big Data needs Big Model

May 22, 2014
By

Gary Marcus and Ernest Davis wrote this useful news article on the promise and limitations of “big data.” And let me add this related point: Big data are typically not random samples, hence the need for “big model” to map from sample to population. Here’s an example (with Wei Wang, David Rothschild, and Sharad Goel): […] The post Big Data needs Big Model appeared first on Statistical Modeling, Causal Inference,…

Read more »

R and Python Meetups, Philippines

May 22, 2014
By

There will be upcoming meet ups for R User Group Philippines and Python Philippines (PythonPH) Community. Below are the details:R Meetuptopic: R for SAS users, and planning of RUG activities venue: 9/F Sun Life Centre, 5th Avenue corner Rizal Driv...

Read more »

Newer dplyr!!

May 22, 2014
By
Newer dplyr!!

Last week Statistical Programming DC had a great meetup with my partner-in-crime Marck Vaisman talking about data.table and dplyr as powerful, fast R tools for data manipulation in R. Today Hadley Wickham announced the release of dplyr v.0.2, which is packed with new features and incorporates the “piping” syntax from Stefan Holst Bache‘s magrittr package. I […]

Read more »

The Birthday Simulation

May 22, 2014
By
The Birthday Simulation

Nothing novel or unique about this problem.  This just extends the problem to measure the probability to three or more people sharing the same birthday using simulation approaches. Though there are other ways to approach this problem with built-in functions the example below shows some of the individual steps. For two people it’s fairly straight forward […]

Read more »

estimation of deformations of densities

May 21, 2014
By
estimation of deformations of densities

Today, Jean-Michel Loubes from Toulouse gave a seminar in Dauphine on the estimation of deformations using Wassertsein distances. This is functional data analysis, where samples from random transforms of the original density are observed towards estimating the baseline (or true) measure As a neophyte, I found the problem of interest if difficult to evaluate, in […]

Read more »

A Coursera course on Machine Learning starts on 16 June

May 21, 2014
By
A Coursera course on Machine Learning starts on 16 June

A 10-week course on Machine Learning by Andrew Ng from Stanford University will start on Coursera on 16 June. Below are descriptions of the course picked up from Coursera. The course provides a broad introduction to machine learning, data mining, … Continue reading →

Read more »

Correlation – NOT Causation

May 21, 2014
By
Correlation – NOT Causation

"Correlation is NOT the same as causation".I don't know how many times I've said it (haven't we all?) in class, to the T.V. announcer, ...........Tyler Vigen is a grad. student at Harvard Law School. He has a fun site called Spurious Correlations. Here...

Read more »

History hour

May 21, 2014
By

This is kind of cool $-$ that's an article by Joseph Mazur, appeared in The Guardian with a very brief (but interesting) discussion of the origin of mathematical symbols. I had never really thought about it and basically gave the use of familiar s...

Read more »

Models with constraints

May 21, 2014
By

I had an interesting conversation with Aki about monotonicity constraints. We were discussing a particular set of Gaussian processes that we were fitting to the arsenic well-switching data (the example from the logistic regression chapter in my book with Jennifer) but some more general issues arose that I thought might interest you. The idea was […] The post Models with constraints appeared first on Statistical Modeling, Causal Inference, and Social…

Read more »

Title IX, causal claims, and plausibility in statistics

May 21, 2014
By

Andrew Gelman discusses a paper and blog post by Ian Ayres on the Freakonomics blog. Their main result is summarized as: We find that a ten percentage-point increase in state-level female sports participation generates a five to six percentage-point rise in the rate of female secularism, a five percentage-point increase in the proportion of women who are mothers, and a six percentage-point rise in the proportion of mothers who, at…

Read more »


Subscribe

Email:

  Subscribe