getstats In 2010 the Royal Statistical Society (RSS) launched the getstats campaign in order to bring statistics to various groups like …Continue reading →

From 2011: Chapter 1 On Sunday we were over on 125 St so I stopped by the Jamaican beef patties place but they were closed. Jesus Taco was next door so I went there instead. What a mistake! I don’t know what Masanao and Yu-Sung could’ve been thinking. Anyway, then I had Jamaican beef patties […]The post Picking pennies in front of a steamroller: A parable comes to life appeared…

Julie Josse sends along this job announcement: A post-doctoral position is available in the applied mathematics department of Agrocampus Rennes. The postdoc will be funded by the Henri Lebesgue Center (see http://www.lebesgue.fr/) if the application is selected. Applicants are expected to send their application before 31 March 2014. The research focus is on development of […]The post Postdoc at Rennes on multilevel missing data imputation appeared first on Statistical Modeling,…

This was originally posted at Point Mass Prior and features MathML. If you’re viewing from StatsBlogs the math probably won’t show up properly and it would be beneficial to view the post here. It’s been a while since my last post which was on using the delta method in R with a specific application to finding the ‘x’ value that corresponds to the maximum/minimum value in a quadratic regression. This…

From 2010: Mark Buchanan wrote a cover article for the New Scientist on random matrices, a heretofore obscure area of probability theory that his headline writer characterizes as “the deep law that shapes our reality.” It’s interesting stuff, and he gets into some statistical applications at the end, so I’ll give you my take on it. But […]The post Random matrices in the news appeared first on Statistical Modeling, Causal Inference, and…

With Matt Moores and Kerrie Mengersen, from QUT, we wrote this short paper just in time for the MCMSki IV Special Issue of Statistics & Computing. And arXived it, as well. The global idea is to cut down on the cost of running an ABC experiment by removing the simulation of a humongous state-space vector, […]

One of the most satisfying experiences for an academic is when someone asks a question that you’ve already answered. This happened in the comments today. Daniel Gotthardt wrote: So for applied stat courses like for sociologists, political scientists, psychologists and maybe also for economics, what do we actually want to accomplish with our intro courses? […]The post Teaching Bayesian applied statistics to graduate students in political science, sociology, public health,…

I happen to think that SQL is a very viable option for analyzing big data. I was thinking about this when I a book review recently:For instance, Siegel reports, people who buy small felt pads that adhere to the bottom of chair legs (to protect th...

Monday, in our MAT8181 class, we’ve discussed seasonal unit roots from a practical perspective (the theory will be briefly mentioned in a few weeks, once we’ve seen multivariate models). Consider some time series , for instance traffic on ...

From 2008: The candy weighing demonstration, or, the unwisdom of crowds My favorite statistics demonstration is the one with the bag of candies. I’ve elaborated upon it since including it in the Teaching Statistics book and I thought these tips might be useful to some of you. Preparation Buy 100 candies of different sizes and […]The post The candy weighing demonstration, or, the unwisdom of crowds appeared first on Statistical…

Overfitting occurs when a statistical model or machine learning algorithm captures the noise of the data. Intuitively, overfitting occurs when the model or the algorithm fits the data too well. Specifically, overfitting occurs if the model or algorithm shows low bias but high variance. Overfitting is often a result of an excessively complicated model, and […]

Having taught and tutored introductory statistics numerous times, I often hear students misinterpret the Central Limit Theorem by saying that, as the sample size gets bigger, the distribution of the data approaches a normal distribution. This is not true. If your data come from a non-normal distribution, their distribution stays the same regardless of the […]