The first CREDAM Award for creative data management goes to … the German government!

February 26, 2014
By

“If you torture the data long enough, it will confess.” This aphorism, attributed to Ronald Coase, sometimes has been used in a disrespective manner, as if was wrong to do creative data analysis. This view obviously is misleading. In contra...

Read more »

Further thoughts on post-publication peer review (PPPR)

February 26, 2014
By

Sanjay Srivastava blogged some interesting thoughts about the process of post-publication peer review (PPPR), reflecting about his own comment on a PLOS ONE publication. I agree that open peer commentaries after publication are one important part of th...

Read more »

Installation of WRS package (Wilcox’ Robust Statistics)

February 26, 2014
By

Update Feb 17, 2014: WRS moved to Github – This installation procedure has been updated and still is valid Some users had trouble installing the WRS package from R-Forge. Here’s a method that should work automatically and fail-safe: [cc lan...

Read more »

At what sample size do correlations stabilize?

February 26, 2014
By

Maybe you have encountered this situation: you run a large-scale study over the internet, and out of curiosity, you frequently  the correlation between two variables. My experience with this practice is usually frustrating, as in small sample sizes (a...

Read more »

Finally! Tracking CRAN packages downloads

February 26, 2014
By

[Update June 12: Data.tables functions have been improved (thanks to a comment by Matthew Dowle); for a similar approach see also Tal Galili's post] The guys from RStudio now provide CRAN download logs (see also this blog post). Great work! I always as...

Read more »

Exploring the robustness of Bayes Factors: A convenient plotting function

February 26, 2014
By

One critique frequently heard about Bayesian statistics is the subjectivity of the assumed prior distribution. If one is cherry-picking a prior, of course the posterior can be tweaked, especially when only few data points are at hand. For example, see ...

Read more »

New robust statistical functions in WRS package – Guest post by Rand Wilcox

February 26, 2014
By

Today a new version (0.23.1) of the WRS package (Wilcox’ Robust Statistics) has been released. This package is the companion to his rather exhaustive book on robust statistics, “Introduction to Robust Estimation and Hypothesis Testing”...

Read more »

Interactive exploration of a prior’s impact

February 26, 2014
By

The probably most frequent criticism of Bayesian statistics sounds something like “It’s all subjective – with the ‘right’ prior, you can get any result you want.”. In order to approach this criticism it has been sugg...

Read more »

Applied Statistics Lesson of the Day – The Matched-Pair (or Paired) t-Test

Applied Statistics Lesson of the Day – The Matched-Pair (or Paired) t-Test

My last lesson introduced the matched pairs experimental design, which is a special type of the randomized blocked design.  Let’s now talk about how to analyze the data from such a design. Since the experimental units are organized in pairs, the units between pairs (blocks) are not independently assigned.  (The units within each pair are […]

Read more »

Nonlinear Time Series just appeared

February 25, 2014
By
Nonlinear Time Series just appeared

My friends Randal Douc and Éric Moulines just published this new time series book with David Stoffer. (David also wrote Time Series Analysis and its Applications with Robert Shumway a year ago.) The books reflects well on the research of Randal and Éric over the past decade, namely convergence results on Markov chains for validating […]

Read more »

Useful for referring—2-25-2014

February 25, 2014
By
Useful for referring—2-25-2014

Interview with Nick Chamandy, statistician at Google You and Your Research +  video Trustworthy Online Controlled Experiments: Five Puzzling Outcomes Explained A Survival Guide to Starting and Finishing a PhD Six Rules For Wearing Suits For Beginners Why I Created C++ More advice to scientists on blogging Software engineering practices for graduate students Statistics Matter […]

Read more »

Useful for referring—2-25-2014

February 25, 2014
By
Useful for referring—2-25-2014

Interview with Nick Chamandy, statistician at Google You and Your Research +  video Trustworthy Online Controlled Experiments: Five Puzzling Outcomes Explained A Survival Guide to Starting and Finishing a PhD Six Rules For Wearing Suits For Beginners Why I Created C++ More advice to scientists on blogging Software engineering practices for graduate students Statistics Matter […]

Read more »

Mapping All Intercity Bus Routes in the U.S.

February 25, 2014
By
Mapping All Intercity Bus Routes in the U.S.

AIBRA, short for American Intercity Bus Riders Association, has recently released a detailed map [kfhgroup.com] containing all the intercity bus lines currently in operation within the U.S. Not surprisingly, the resulting transportation grid correlate...

Read more »

Fast matrix computations for functional additive models

February 25, 2014
By
Fast matrix computations for functional additive models

I have just arxiv’ed a new manuscript on speeding up computation for functional additive models such as functional ANOVA. A functional additive model is essentially a model says that a = b + c, where a, b and c are functions. It is a useful model when we want to express things like: I have […]

Read more »

Basketball Stats: Don’t model the probability of win, model the expected score differential.

February 25, 2014
By

Someone who wants to remain anonymous writes: I am working to create a more accurate in-game win probability model for basketball games. My idea is for each timestep in a game (a second, 5 seconds, etc), use the Vegas line, the current score differential, who has the ball, and the number of possessions played already […]The post Basketball Stats: Don’t model the probability of win, model the expected score differential.…

Read more »

Knowledge in the chart and knowledge in the head

February 25, 2014
By
Knowledge in the chart and knowledge in the head

One of the many insights from Don Norman's great design book is that a user's behavior is affected by "knowledge in the world", and "knowledge in the head." Applied to graphics, this means readers of graphics use both knowledge in...

Read more »

Mathematica: Introducing the Wolfram Language

February 25, 2014
By

Finally, here it is, check out the video below as Stephen Wolfram showcases the Wolfram language, From my previous post, I said that I used Wolfram Mathematica for about a year before I embrace R. And frankly, I've been in love with Mathematica; it nev...

Read more »

Next Kölner R User Meeting: 26 February 2014

February 25, 2014
By
Next Kölner R User Meeting: 26 February 2014

The next Cologne R user group meeting is scheduled for tomorrow, 26 February 2014. We are delighted to welcome:Diego de Castillo: R and databasesKim Kuen Tang: Hands on using R and kdb+ togetherFrank Celler: ArangoDB (Lightning Talk)Further details an...

Read more »

LaTeX: How to install TeX Live – qtree package in Ubuntu 12.10

February 25, 2014
By
LaTeX: How to install TeX Live – qtree package in Ubuntu 12.10

There is a question on TeX - StackExchange that has no direct solution to the installation of the qtree - TeX Live package in Ubuntu. And I want to answer that in this post, then just drop the link of this article to the comment section of the said que...

Read more »

Job Trends in the Analytics Market: New, Improved, now Fortified with C, Java, MATLAB, Python, Julia and Many More!

February 25, 2014
By
Job Trends in the Analytics Market: New, Improved, now Fortified with C, Java, MATLAB, Python, Julia and Many More!

I’m expanding the coverage of my article, The Popularity of Data Analysis Software. This is the first installment, which includes a new opening and a greatly expanded analysis of the analytics job market. Here it is, from the abstract onward … Continue reading →

Read more »

The forecast mean after back-transformation

February 25, 2014
By
The forecast mean after back-transformation

Many functions in the forecast package for R will allow a Box-Cox transformation. The models are fitted to the transformed data and the forecasts and prediction intervals are back-transformed. This preserves the coverage of the prediction intervals, and the back-transformed point forecast can be considered the median of the forecast densities (assuming the forecast densities on the transformed scale are symmetric). For many purposes, this is acceptable, but occasionally the…

Read more »

Bayesian First Aid: Two Sample t-test

February 24, 2014
By
Bayesian First Aid: Two Sample t-test

As spring follows winter once more here down in southern Sweden, the two sample t-test follows the one sample t-test. This is a continuation of the Bayesian First Aid alternative to the one sample t-test where I’ll introduce the two sample alternat...

Read more »

Dancing Statistics: Communicating Psychology to the Public through Dance

February 24, 2014
By
Dancing Statistics: Communicating Psychology to the Public through Dance

Do you know what correlation, variance, frequency distributions, sampling and standard errors are? If not, you now have to chance to learn each of these statistical concepts via the medium of... modern dance. Initiated by Lucy Irving (Middlesex Unive...

Read more »


Subscribe

Email:

  Subscribe