## What Would Cohen Have Titled “The Earth is Round (p < .05)” in 2014?

June 25, 2014
Bibliometrics is not my area of expertise, but it still interests me as a researcher. I sometimes think about how Google has impacted the way we title articles. Gone are the days of witty, snappy titles. Title …

## More on those randomistas

June 25, 2014
Following up on our recent post, I clicked on some of Ziliak’s links and found lots of good stuff, especially the post by Berk Ozler. I have no knowledge of his work but I like his writing; see here, for example. Ziliak replied: Ozler’s post is very good indeed, and well written. Ozler’s suggestion for […] The post More on those randomistas appeared first on Statistical Modeling, Causal Inference, and…

## R Scrabble: Part 2

June 25, 2014
Ivan Nazarov and Bartek Chroł left very interesting comments on my last post on counting the number of subwords in NGSL words. In particular, they proposed large speedups of my code, so I thought I would try a larger data set. So today I will work with...

## How to read Big Data studies

June 25, 2014
This is part 3 of my response to Gelman's post about the DST/heart attacks study. The previous parts are here and here. One of the keys to vetting any Big Data/OCCAM study is taking note of the decisions made by the researchers in conducting the analysis. Most of these decisions involve subjective adjustments or unverifiable assumptions. Not that either of those things is inherently bad - indeed, any analysis one…

## Statistical Challenges in Neuroscience

June 25, 2014
A workshop on statistics and neuroscience, to take place at the University of Warwick, UK, Sept. 3-5 2014. We’ll talk spikes, voxels, pixels, MCMC, and so on. Official call for posters below the fold. We are pleased to announce that a workshop on “Statistical Challenges in Neuroscience” will take place Sept. 3 to 5, 2014 in […]

## Simulating data for a logistic regression model

June 25, 2014
In my book Simulating Data with SAS, I show how to use the SAS DATA step to simulate data from a logistic regression model. Recently there have been discussions on the SAS/IML Support Community about simulating logistic data by using the SAS/IML language. This article describes how to efficiently simulate […]
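
As a rough illustration of the idea in the excerpt above (this is not the SAS DATA step or SAS/IML code from the article), simulating from a logistic regression model amounts to drawing covariates, computing the linear predictor, applying the inverse logit, and drawing Bernoulli outcomes. The coefficients, sample size, seed, and function names below are assumptions chosen purely for illustration.

```python
import math
import random


def inv_logit(eta):
    """Map a linear predictor to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))


def simulate_logistic(n, beta0, beta1, seed=1):
    """Draw n (x, y) pairs with x ~ N(0, 1) and
    y ~ Bernoulli(inv_logit(beta0 + beta1 * x))."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)             # one continuous covariate
        p = inv_logit(beta0 + beta1 * x)    # success probability for this row
        y = 1 if rng.random() < p else 0    # Bernoulli draw
        data.append((x, y))
    return data


# Hypothetical parameter values, not taken from the article.
sample = simulate_logistic(n=1000, beta0=-1.0, beta1=2.0)
print(sum(y for _, y in sample) / len(sample))  # observed event rate
```

Fitting a logistic model to `sample` should recover coefficients near the assumed `beta0` and `beta1`, which is the usual sanity check for a simulation of this kind.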

## It is so random! Or is it? The meaning of randomness

June 25, 2014
The concept of “random” is a tough one. First there is the problem of lexical ambiguity. There are colloquial meanings for random that don’t totally tie in with the technical or domain-specific meanings for random. Then there is the fact …

## ABC model choice by random forests

June 24, 2014
After more than a year of collaboration, meetings, simulations, delays, switches, visits, more delays, more simulations, discussions, and a final marathon wrapping day last Friday, Jean-Michel Marin, Pierre Pudlo, and I at last completed our latest collaboration on ABC, with the central arguments that (a) using random forests is a good tool for choosing the […]

## Bedtools tutorial from 2013 CSHL course

June 24, 2014
A couple of months ago I posted about how to visualize exome coverage with bedtools and R. But if you're looking to get a basic handle on genome arithmetic, take a look at Aaron Quinlan's bedtools tutorials from the 2013 CSHL course. The tutorial uses ...

## New book on implementing reproducible research

June 24, 2014
I have mentioned this in a few places but my book edited with Victoria Stodden and Fritz Leisch, Implementing Reproducible Research, has just been published by CRC Press. Although it is technically in their "R Series", the chapters contain information on …

## Example 2014.7: Simulate logistic regression with an interaction

June 24, 2014
Reader Annisa Mike asked in a comment on an early post about power calculation for logistic regression with an interaction. This is a topic that has come up with increasing frequency in grant proposals and article submissions. We'll begin by showing ...

## Too Linear To Be True: The curious case of Jens Forster

June 24, 2014
Yup, another social psychology researcher from northwestern Europe who got results that people just don’t believe. I’m a fan of Retraction Watch but not a regular reader so I actually heard about this one indirectly, via this email from Baruch Eitam which contained the above link and the following note: Of the latest troubles in […] The post Too Linear To Be True: The curious case of Jens Forster appeared…

## Notes on "Collective Stability in Structured Prediction: Generalization from One Example" (or: Small Pieces, Loosely Joined)

June 24, 2014

## Generating and visualising multivariate random numbers in R

June 24, 2014
This post will present the wonderful pairs.panels function of the psych package [1] that I discovered recently to visualise multivariate random numbers. Here is a little example with a Gaussian copula and normal and log-normal marginal distributions. I ...
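
The sampling step mentioned in the excerpt above can be sketched as follows. This is not the post's R code and does not reproduce `pairs.panels`. It only shows one common way to draw from a Gaussian copula with a normal and a log-normal margin: generate correlated standard normals, map them to uniforms with the normal CDF, then push the uniforms through each margin's inverse CDF. The correlation, the margin parameters, and all names are assumptions for illustration.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for the copula step


def clamp(u, eps=1e-12):
    """Keep u strictly inside (0, 1) so inv_cdf never sees 0 or 1."""
    return min(max(u, eps), 1.0 - eps)


def gaussian_copula_sample(n, rho, seed=1):
    """Draw n pairs (x, y): x has a normal margin, y a log-normal margin,
    with dependence given by a Gaussian copula with correlation rho."""
    rng = random.Random(seed)
    normal_margin = NormalDist(mu=10.0, sigma=2.0)  # assumed margin for x
    log_scale = NormalDist(mu=0.0, sigma=0.5)       # assumed log-scale parameters for y
    pairs = []
    for _ in range(n):
        # Correlated standard normals with correlation rho.
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        # Copula step: transform to dependent uniforms.
        u1 = clamp(STD_NORMAL.cdf(z1))
        u2 = clamp(STD_NORMAL.cdf(z2))
        # Apply each margin's inverse CDF.
        x = normal_margin.inv_cdf(u1)
        y = math.exp(log_scale.inv_cdf(u2))
        pairs.append((x, y))
    return pairs


pairs = gaussian_copula_sample(n=2000, rho=0.7)
print(pairs[:3])
```

A scatterplot matrix of `pairs` would then show the dependence alongside the two different marginal shapes.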

## Machine Learning and Applied Statistics Lesson of the Day – The Line of No Discrimination in ROC Curves

After training a binary classifier, calculating its various values of sensitivity and specificity, and constructing its receiver operating characteristic (ROC) curve, we can use the ROC curve to assess the predictive accuracy of the classifier. A minimum standard for a good ROC curve is being better than the line of no discrimination. On a plot of […]
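
To make the comparison concrete, here is a minimal sketch of building ROC points from scores and checking them against the line of no discrimination, which is the diagonal where the true positive rate equals the false positive rate (area under the curve 0.5). The labels, scores, and function names are made up for illustration and do not come from the post.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) points obtained by sweeping a threshold
    from the highest score down to the lowest."""
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical labels (1 = positive) and classifier scores.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

points = roc_points(labels, scores)
area = auc(points)

# On the diagonal TPR == FPR; a useful classifier sits on or above it.
on_or_above_diagonal = all(tpr >= fpr for fpr, tpr in points)
print(area, on_or_above_diagonal)
```

An area close to 0.5 would indicate a classifier that does no better than the line of no discrimination.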

## The Oracle (5. Or: Calibration, calibration, calibration…)

June 23, 2014
First off, a necessary disclaimer: I haven't been able to write this post before a few of the games of the final round of the group stage have been played, but I have not watched the games so far and have run the model to predict round 3 as if none of ...

## revenge of the pigeons

June 23, 2014
While I had not had kamikaze pigeons hitting my windows for quite a while…, it may be that one of them decided to move to biological warfare: when I came back from Edinburgh, my office at the University was in a terrible state as a bird had entered through a tiny window opening and wrecked […]

## The First European Meeting of the Econometric Society

June 23, 2014
Olav Bjerkholt has alerted me to an interesting new paper of his that documents a milestone gathering of econometricians. Titled, The First European Econometric Society Meeting, September 1931, Lausanne, Olav's paper was presented at the 18th Annu...

## The difference between data hype and data hope

June 23, 2014
I was reading one of my favorite stats blogs, StatsChat, where Thomas points to this article in the Atlantic and highlights this quote: Dassault Systèmes is focusing on that level of granularity now, trying to simulate propagation of cholesterol in human …

## Smullyan and the Randomistas

June 23, 2014
Steve Ziliak wrote in: I thought you might be interested in the following exchanges on randomized trials: Here are a few exchanges on the economics and ethics of randomized controlled trials, reacting to my [Ziliak's] study with Edward R. Teather-Posadas, “The Unprincipled Randomization Principle in Economics and Medicine”. Our study is forthcoming in the Oxford […] The post Smullyan and the Randomistas appeared first on Statistical Modeling, Causal Inference, and…

## Mayo’s Error Statistics as a case study in the inevitability of Bayes

June 23, 2014
Cox’s Theorem implies that we either use Bayes or our methods will violate some simple but desirable properties. This has two consequences: (1) Frequentist methods such as p-values, which aren’t equivalent to posteriors, are guaranteed to b...

## On deck this week

June 23, 2014
Mon: Smullyan and the Randomistas Tues: Too Linear To Be True: The curious case of Jens Forster Wed: More on those randomistas Thurs: Estimating a customer satisfaction regression, asking only a subset of predictors for each person Fri: Quantifying luck vs. skill in sports Sat, Sun: Hey, it’s summer—time to take the weekends off. Have […] The post On deck this week appeared first on Statistical Modeling, Causal Inference, and…

## Getting the basics right is half the battle

June 23, 2014
I have been traveling quite a lot recently, and last week I read the Wall Street Journal cover to cover for the first time in a while. I am happy to report that there are many more data graphics than I remember...

