Cumulative residual plots seem like they could be useful

August 23, 2017
By

Peter Vanney, a statistician at Texas Highway Patrol, writes: I’m wondering if you could comment on CURE (CUmulative REsidual) plots that I’m seeing quite a bit in vehicle crash modeling. Ezra Hauer and Joseph Bamfo champion them as a way to determine model fit for their hierarchical Bayesian generalized linear mixed models. I had not […] The post Cumulative residual plots seem like they could be useful appeared first on…

Read more »

The arithmetic-geometric mean

August 23, 2017
By
The arithmetic-geometric mean

All statisticians are familiar with the classical arithmetic mean. Some statisticians are also familiar with the geometric mean. Whereas the arithmetic mean of n numbers is the sum divided by n, the geometric mean of n nonnegative numbers is the n_th root of the product of the numbers. The geometric [...] The post The arithmetic-geometric mean appeared first on The…

Read more »

Don’t always give ’em what they want: Practicing scientists want certainty, but I don’t want to offer it to them!

August 22, 2017
By

Stephen Senn writes: What the practicing scientist wants to know is what is a good test in practice. I agree with Stephen Senn on most things—even where it seems we disagree, I think we agree on the fundamentals—but in this case I think you have to be careful about giving the practicing scientist what he […] The post Don’t always…

Read more »

Some Neat New R Notations

August 22, 2017
By
Some Neat New R Notations

The R package seplyr supplies a few neat new coding notations. An Abacus, which gives us the term “calculus.” The first notation is an operator called the “named map builder”. This is a cute notation that essentially does the job of stats::setNames(). It allows for code such as the following: library("seplyr") names <- c('a', 'b') … Continue reading Some Neat…

Read more »

Details, details, details: giving Zillow a pie treatment

August 22, 2017
By
Details, details, details: giving Zillow a pie treatment

Kaiser Fung, founder of Junk Charts and Principal Analytics Prep, takes a detailed look at a Zillow chart, showing underwater homes and delinquent mortgages. Surprisingly, pie charts work well here.

Read more »

Two papers and one presentation by Ron Kennett related to workflow

August 22, 2017
By

Ron Kennett sent along these two papers: Statistics: A Life Cycle View Aspects of statistical consulting not taught by academia Also this presentation. They’re somewhat relevant to our current project on statistical workflow, so I’m posting them here for convenience. P.S. I used to think it was a good idea to teach statistical consulting, and […] The post Two papers…

Read more »

Jump-start A Pull Request

Pull requests (PRs) is my favorite thing about Github.1 I have created and merged numerous PRs over the years. If you are new to PRs and contributing to other people’s packages, I strongly recommend you to read the following materials: The presentation “Collaborative Coding” by Charlotte Wickham The blog post “Contributing Code to the Tidyverse” by Jim Hester Jim’s post…

Read more »

Generate A Summary for A Post in Hugo

While I love Hugo for a lot of things, there are a few that I think can still be improved. For example, the default Disqus template bothered me for a long time. I’m very familiar with Disqus, so I know how to set it up correctly and reasonably using my custom template, but Hugo’s default Disqus template could be very…

Read more »

He wants some readings on the replication crisis that are accessible to college freshmen in economics

August 21, 2017
By

Harvey Rosen writes: My query is similar to the one from André Ariew that you posted on August 7, in which he asked if you could suggest readings for his graduate course in philosophy. I occasionally teach an undergraduate course on introductory microeconomics. I like to devote some time to discussing challenges to economists’ conventional […] The post He wants…

Read more »

Mixture models in Stan: you can use log_mix()

August 21, 2017
By
Mixture models in Stan:  you can use log_mix()

From the Stan manual: log_mix() . . . I like it. Super-clean. The post Mixture models in Stan: you can use log_mix() appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Publish your raw data and your speculations, then let other people do the analysis: track and field edition

August 21, 2017
By

There seems to be an expectation in science that the people who gather a dataset should also be the ones who analyze it. But often that doesn’t make sense: what it takes to gather relevant data has little to do with what it takes to perform a reasonable analysis. Indeed, the imperatives of analysis can […] The post Publish your…

Read more »

6 tips for timing the performance of algorithms

August 21, 2017
By
6 tips for timing the performance of algorithms

When you implement a statistical algorithm in a vector-matrix language such as SAS/IML, R, or MATLAB, you should measure the performance of your implementation, which means that you should time how long a program takes to analyze data of varying sizes and characteristics. There are some general tips that can [...] The post 6 tips for timing the performance of…

Read more »

Irish immigrants in the Civil War

August 20, 2017
By

I was cc-ed on a series of emails on a topic I know nothing about, maybe because I’m on the political science faculty here, I don’t know. Anyway, there was some statistical content here so I thought I’d share with you. The email is from James McManus: Analysis of the Civil War Immigrant problem McPherson’s […] The post Irish immigrants…

Read more »

Bayesian Random Projection (More on Terabytes of Economic Data)

August 20, 2017
By

Some additional thoughts related to Serena Ng's World Congress piece (earlier post here, with a link to her paper):The key newish dimensionality-reduction strategies that Serena emphasizes are random projection and leverage score sampling.  I...

Read more »

Is dplyr Easily Comprehensible?

August 20, 2017
By
Is dplyr Easily Comprehensible?

dplyr is one of the most popular R packages. It is powerful and important. But is it in fact easily comprehensible? dplyr makes sense to those of us who use it a lot. And we can teach part time R users a lot of the common good use patterns. But, is it an easy task … Continue reading Is dplyr…

Read more »

“Babbage was out to show that not only was the system closed, with a small group controlling access to the purse strings and the same individuals being selected over and again for the few scientific honours or paid positions that existed, but also that one of the chief beneficiaries . . . was undeserving.”

August 19, 2017
By

Fernando Martel Garcia writes: Here’s an early reference from the Victorian Age. Enjoy! It’s a news article by Rebekah Higgitt called “Fraud and the decline of science,” subtitled, “Charles Babbage’s accusations of fraudulent science underlined his attack on scientific governance, but were also bitterly personal.” My reply: Wow! I think I’m on Babbage’s side on […] The post “Babbage was…

Read more »

Stan Weekly Roundup, 18 August 2017

August 18, 2017
By

Summer? What summer? Stan 2.17 is coming and there’s work to be done. Sebastian Weber has been making huge strides in adding MPI parallel autodiff to the math library (with design maturing for Stan itself and the library interfaces). Ongoing discusions on the Discourse forum and prototypes for a function to add to the Stan […] The post Stan Weekly…

Read more »

It is somewhat paradoxical that good stories tend to be anomalous, given that when it comes to statistical data, we generally want what is typical, not what is surprising. Our resolution of this paradox is . . .

August 18, 2017
By

From a blog comment a few years ago regarding an article by Robert Kosara: As Thomas and I discuss in our paper [When Do Stories Work? Evidence and Illustration in the Social Sciences], it is somewhat paradoxical that good stories tend to be anomalous, given that when it comes to statistical data, we generally want […] The post It is…

Read more »

Just google “Despite limited statistical power”

August 17, 2017
By

Here it is. It’s not always clear what people mean by this expression, but sometimes it seems that they’re making the “What does not kill my statistical significance makes it stronger” fallacy, thinking that the attainment of statistical significance is a particular feat in the context of a noisy study, so that they’re (mistakenly) thinking […] The post Just google…

Read more »

Use the LENGTH statement to pre-set the lengths of character variables in SAS – with a comparison to R

Use the LENGTH statement to pre-set the lengths of character variables in SAS – with a comparison to R

I often create character variables (i.e. variables with strings of text as their values) in SAS, and they sometimes don’t render as expected.  Here is an example involving the built-in data set SASHELP.CLASS. Here is the code: data c1;      set sashelp.class;      * define a new character variable to classify someone as tall or […]

Read more »

Thank You For The Very Nice Comment

August 16, 2017
By
Thank You For The Very Nice Comment

Somebody nice reached out and gave us this wonderful feedback on our new Supervised Learning in R: Regression (paid) video course. Thanks for a wonderful course on DataCamp on XGBoost and Random forest. I was struggling with Xgboost earlier and Vtreat has made my life easy now :). Supervised Learning in R: Regression covers a … Continue reading Thank You…

Read more »

Also holding back progress are those who make mistakes and then label correct arguments as “nonsensical.”

August 16, 2017
By

Here’s James Heckman in 2013: Also holding back progress are those who claim that Perry and ABC are experiments with samples too small to accurately predict widespread impact and return on investment. This is a nonsensical argument. Their relatively small sample sizes actually speak for — not against — the strength of their findings. Dramatic […] The post Also holding…

Read more »

mc-stan.org down & single points of failure

August 16, 2017
By

[update: back up. whew. back to our regularly scheduled programming.] [update: just talked to our registrar on the phone and they say it’ll probably take an hour or two for the DNS to catch up again, but then everything should be OK. I would highly recommend PairNIC—their support was awesome.] mc-stan.org is down because I […] The post mc-stan.org down…

Read more »


Subscribe

Email:

  Subscribe