Choosing bins for histograms in SAS

August 25, 2014

When you create a histogram with statistical software, the software uses the data (including the sample size) to automatically choose the width and location of the histogram bins. The resulting histogram is an attempt to balance statistical considerations, such as estimating the underlying density, and "human considerations," such as choosing […]

Read more »

Poker math showdown!

August 25, 2014

In comments, Rick Schoenberg wrote: One thing I tried to say as politely as I could in [the book, "Probability with Texas Holdem Applications"] on p146 is that there’s a huge error in Chen and Ankenman’s “The Mathematics of Poker” which renders all the calculations and formulas in the whole last chapter wrong or meaningless […] The post Poker math showdown! appeared first on Statistical Modeling, Causal Inference, and Social…

Read more »

Forecasting with R in WA

August 24, 2014

On 23–25 September, I will be running a 3-day workshop in Perth on “Forecasting: principles and practice” mostly based on my book of the same name. Workshop participants will be assumed to be familiar with basic statistical tools such as multiple...

Read more »

JAGS and Stan

August 24, 2014

During the last year I have been running some estimations in both JAGS and Stan. In that period I have seen one example where JAGS could not get me decent samples (in the sense of low Rhat and high number of effective samples) but that was data which I...

Read more »

How Many Mic’s Do We Rip

August 23, 2014

Yakir Reshef writes: Our technical comment on Kinney and Atwal’s paper on MIC and equitability has come out in PNAS along with their response. Similarly to Ben Murrell, who also wrote you a note when he published a technical comment on the same work, we feel that they “somewhat missed the point.” Specifically: one statistic […] The post How Many Mic’s Do We Rip appeared first on Statistical Modeling, Causal…

Read more »

The Econometrics of Temporal Aggregation – III – Unit Roots

August 22, 2014

In two previous posts, I've talked a bit about the effect that aggregating time series data can have on some standard econometric results. The first of those posts was about a talk that I gave last month at the 2014 Conference of the New Zeal...

Read more »

An exercise in hypothesis testing

August 22, 2014

I've just turned in the manuscript for the second edition of Think Stats. If you're dying to get your hands on a copy, you can pre-order one here. Most of the book is about computational methods, but in the last chapter I break out some analytic m...

Read more »

UNECE-coordinated work relating to Big Data

August 22, 2014

From: http://www1.unece.org/stat/platform/display/bigdata/Big+Data+in+Official+Statistics;jsessionid=AE7DF06FDB27C80A30DACD65F6BDADB6 Preliminary results of the survey "Skills necessary for people working with Big Data in Statistical Organisations"...

Read more »

Recently in the sister blog

August 22, 2014

Meritocracy won’t happen: the problem’s with the ‘ocracy’
Does the sex of your child affect your political attitudes?
More hype about political attitudes and neuroscience
Modern polling needs innovation, not traditionalism
Who cares about copycat pollsters?
The mythical swing voter
Mythical swing voter update
No, all Americans are not created equal when it comes to […]
The post Recently in the sister blog appeared first on Statistical Modeling, Causal Inference,…

Read more »

Replication Wiki for economics

August 22, 2014

Jan Hoeffler of the University of Gottingen writes: I have been working on a replication project funded by the Institute for New Economic Thinking during the last two years and read several of your blog posts that touched the topic. We developed a wiki website that serves as a database of empirical studies, the availability […] The post Replication Wiki for economics appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Analyzing activity-tracker data: How many steps per day do YOU take?

August 22, 2014

My wife got one of those electronic activity trackers a few months ago and has been diligently walking every day since then. At the end of the day she sometimes reads off how many steps she walked, as measured by her activity tracker. I am always impressed at how many […]

Read more »

biblatex for statisticians

August 22, 2014

I am now using biblatex for all my bibliographic work as it seems to have developed enough to be stable and reliable. The big advantage of biblatex is that it is easy to format the bibliography to conform to specific journal or publisher styles. It is also possible to have structured bibliographies (e.g., divided into sections: books, papers, R packages, etc.). Here is my default setting which should be suitable…

Read more »

Risk Analysis for Three Precious Metals

August 21, 2014

Qinlu (Louisa) Chen and I have recently completed a paper titled, "Risk Analysis for Three Precious Metals: An Application of Extreme Value Theory". It's based in part on work that Louisa undertook in her B.Sc. Honours program here at UVic. Here's the a...

Read more »

Come hear me talk about A/B Testing next Tuesday #optimizely

August 21, 2014

This coming Tuesday, I'm talking about A/B Testing at the Optimizely Experience event in New York. It's at the Intrepid Museum, a great setting. My talk will address questions that come up frequently in practice when you start doing a lot of online experiments. Testing is one big area of business analytics, and an area in which practical experience has to complement textbook learning. Hope to see you there. You can…

Read more »

Extracting Latent Variables from Rating Scales: Factor Analysis vs. Nonnegative Matrix Factorization

August 21, 2014

For many of us, factor analysis provides a gateway to learning how to run and interpret nonnegative matrix factorization (NMF). This post will analyze a set of ratings on a 218-item adjective checklist using both principal axis factor analysis and NMF....

Read more »

The Feltron Annual Report of 2013 on Communication

August 21, 2014

Each year, Nicholas Felton releases a personal annual report, and the one for 2013 [feltron.com] was just released. These reports always stand out for their immense data-centric detail and consistently original infographic style. This year...

Read more »

The field is a fractal

August 21, 2014

In a blog comment, Winston Lin points to this quote from Bill Thurston: There is a real joy in doing mathematics, in learning ways of thinking that explain and organize and simplify. One can feel this joy discovering new mathematics, rediscovering old mathematics, learning a way of thinking from a person or text, or finding […] The post The field is a fractal appeared first on Statistical Modeling, Causal Inference,…

Read more »

Explanation of laxity

August 21, 2014

Been a little light on the postings recently. Here is why:

Read more »

A (very) brief review of published human subjects research conducted with social media companies

August 20, 2014

As I wrote the other day, more and more human subjects research is being performed by large tech companies. The best way to handle the ethical issues raised by this research is still unclear. The first step is to get some …

Read more »

Converting combination of random variables to hierarchical form for JAGS (BUGS, Stan, etc.)

August 20, 2014

An emailer asks: Hi, John. Long-time listener, first-time caller... I have a model that says X is a function of three (independent) random variables: X ~ normal(mu, sigma) / uniform(a,a+b) - beta(v,w) and I also have N random samples of X. Can I use JAG...

Read more »

Do your "data janitor work" like a boss with dplyr

August 20, 2014

Data “janitor-work”
The New York Times recently ran a piece on wrangling and cleaning data: “For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights.” Whether you call it “janitor-work,” wrangling/munging, cleaning/cleansing/scru...

Read more »

“A hard case for Mister P”

August 20, 2014

Kevin Van Horn sent me an email with the above title (ok, he wrote MRP, but it’s the same idea) and the following content: I’m working on a problem that at first seemed like a clear case where multilevel modeling would be useful. As I’ve dug into it I’ve found that it doesn’t quite fit […] The post “A hard case for Mister P” appeared first on Statistical Modeling, Causal…

Read more »

Mind your Qs

August 20, 2014

Rescheduling Notice: I have been informed by the organizers that the Meetup tonight has to be rescheduled due to an unexpected problem with the venue. When a new date is set, I will let you know. *** Since I am...

Read more »

