Honored oldsters write about statistics

May 4, 2014
By

The new book titled: Past, Present, and Future of Statistical Science is now available for download. The official description makes the book sound pretty stuffy: Past, Present, and Future of Statistical Science, commissioned by the Committee of Presidents of Statistical Societies (COPSS) to celebrate its 50th anniversary and the International Year of Statistics, will be […] The post Honored oldsters write about statistics appeared first on Statistical Modeling, Causal Inference,…

Read more »

New jobs in business analytics at Monash

May 4, 2014
By
New jobs in business analytics at Monash

We have an exciting new initiative at Monash University with some new positions in business analytics. This is part of a plan to strengthen our research and teaching in the data science/computational statistics area. We are hoping to make multiple appointments, at junior and senior levels. These are five-year appointments, but we hope that the positions will continue after that if we can secure suitable funding. What is business analytics?…

Read more »

Choropleths and Bar charts of State Wise Potentially Preventable Deaths using googleVis and shiny

May 4, 2014
By

UPDATE: THE BLOG/SITE HAS MOVED TO GITHUB. THE NEW LINK FOR THE BLOG/SITE IS patilv.github.io and THE LINK TO THIS POST IS: http://bit.ly/1mcofpp. PLEASE UPDATE ANY BOOKMARKS YOU MAY HAVE.

Read more »

European MEP Data

May 4, 2014
By
European MEP Data

Pretty soon we will be having European Elections. I cannot tell when exactly, that depends on country. Over here the elections get a some attention, which is how I ran into data from votewatch.eu on voting of MEPs. That was just too interesting, so I m...

Read more »

GrubHub’s Phasmid Websites

May 4, 2014
By
GrubHub’s Phasmid Websites

The rationale behind mining business data directly from the business's own website is that the business has a clear economic motivation to ensure that the data is up to date. If you own a restaurant that changes location, and your...

Read more »

Jeffreys’ Substitution Posterior for the Median: A Nice Trick to Non-parametrically Estimate the Median

May 4, 2014
By
Jeffreys’ Substitution Posterior for the Median: A Nice Trick to Non-parametrically Estimate the Median

While reading up on quantile regression I found a really nice hack described in Bayesian Quantile Regression Methods (Lancaster & Jae Jun, 2010). It is called Jeffreys’ substitution posterior for the median, first described by Harold Jeffreys i...

Read more »

You can only become coherent by ‘converting’ non-Bayesianly

May 4, 2014
By
You can only become coherent by ‘converting’ non-Bayesianly

“What ever happened to Bayesian foundations?” was one of the final topics of our seminar (Mayo/SpanosPhil6334). In the past 15 years or so, not only have (some? most?) Bayesians come to accept violations of the Likelihood Principle, they have also tended to disown Dutch Book arguments, and the very idea of inductive inference as updating beliefs by […]

Read more »

A clear picture of power and significance in A/B tests

May 3, 2014
By
A clear picture of power and significance in A/B tests

A/B tests are one of the simplest reliable experimental designs. Controlled experiments embody the best scientific design for establishing a causal relationship between changes and their influence on user-observable behavior. “Practical guide to controlled experiments on the web: listen to your customers not to the HIPPO” Ron Kohavi, Randal M Henne, and Dan Sommerfield, Proceedings […] Related posts: Bandit Formulations for A/B Tests: Some Intuition Sample size and power for…

Read more »

“The graph clearly shows that mammography adds virtually nothing to survival and if anything, decreases survival (and increases cost and provides unnecessary treatment)”

May 3, 2014
By
“The graph clearly shows that mammography adds virtually nothing to survival and if anything, decreases survival (and increases cost and provides unnecessary treatment)”

Paul Alper writes: You recently posted on graphs and how to convey information.  I don’t believe you have ever posted anything on this dynamite randomized clinical trial of 90,000 (!!) 40-59 year-old women over a 25-year period (also !!). The graphs below are figures 2, 3 and 4 respectively, of http://www.bmj.com/content/348/bmj.g366 The control was physical […] The post “The graph clearly shows that mammography adds virtually nothing to survival and…

Read more »

How to do a chi-square test in 7 steps

May 3, 2014
By
How to do a chi-square test in 7 steps

What is a chi-square test: A chi square tests the relationship between two attributes. Suppose we suspect that rural Americans tended to vote Romney, and urban Americans tended to vote Obama. In this case, we suspect a relationship between where you live and who you vote for. The full name for this test is Pearson’s […]

Read more »

Calendar Strategy: Option Expiry

May 3, 2014
By
Calendar Strategy: Option Expiry

Today, I want to follow up with the Calendar Strategy: Month End post. Let’s examine the perfromance Option Expiry days as presented in the The Mooost Wonderful Tiiiiiiime of the Yearrrrrrrrr! post. First, I created two convenience functions for creating a calendar signal and back-testing calendar strategy: calendar.signal and calendar.strategy functions are in the strategy.r […]

Read more »

The May Reading List

May 2, 2014
By
The May Reading List

Bjerkholt, O., 2013. Promoting econometrics through Econometrica 1933-39. Memorandum 28/2013, Department of Economics, University of Oslo.Gulesserian, S. G. and M. Kejriwal, 2014. On the power of bootstrap tests for stationarity: A Monte Carlo comparis...

Read more »

Discovering general multidimensional associations

May 2, 2014
By

Continuing our discussion of general measures of correlations, Ben Murrell sends along this paper (with corresponding R package), which begins: When two variables are related by a known function, the coefficient of determination (denoted R-squared) measures the proportion of the total variance in the observations that is explained by that function. This quantifies the strength […] The post Discovering general multidimensional associations appeared first on Statistical Modeling, Causal Inference, and…

Read more »

In the media

May 2, 2014
By
In the media

Yesterday, UCL News Office issued this press release which mentions our (that's Marta and me) paper on the Eurovision contest, which has just been published in the Journal of Applied Statistics.The idea of the paper was to try and quantify the presence...

Read more »

Multidimensional Scaling (MDS) with R

May 2, 2014
By
Multidimensional Scaling (MDS) with R

This page shows Multidimensional Scaling (MDS) with R. It demonstrates with an example of automatic layout of Australian cities based on distances between them. The layout obtained with MDS is very close to their locations on a map. At first, … Continue reading →

Read more »

On demand (but on a very serious topic)

May 2, 2014
By

My friend Virgilio has posted this on his Facebook page and invited me to comment. It is an article by a Spanish cardiologist that tells the story of a patient who has suffered a second stroke in a short amount of time $-$ as it turns out, th...

Read more »

Why Blog?

May 2, 2014
By
Why Blog?

The Blog Review ProcessA series of events in my life have lead me to reconsider the value of blogging.The Back StoryShort story: I got fired.Long story: Recently I was hired to write occasional blog posts for Quandl. They probably figured that due to m...

Read more »

Great papers to read

May 2, 2014
By
Great papers to read

My research group meets every two weeks. It is always fun to talk about general research issues and new tools and tips we have discovered. We also use some of the time to discuss a paper that I choose for them. Today we discussed Breiman’s classic (2...

Read more »

RSS statistical analytics challenge 2014

May 1, 2014
By
RSS statistical analytics challenge 2014

Great news! The RSS is setting a data analysis challenge this year, sponsored by the Young Statisticians Section and Research Section of the Royal Statistical Society: Details are available on the wordpress website of the Challenge. Registration is open and the Challenge goes live on Tuesday 6 May 2014 for an exciting 6 weeks competition. (A […]

Read more »

A bit of the agenda of Practical Data Science with R

May 1, 2014
By
A bit of the agenda of Practical Data Science with R

The goal of Zumel/Mount: Practical Data Science with R is to teach, through guided practice, the skills of a data scientist. We define a data scientist as the person who organizes client input, data, infrastructure, statistics, mathematics and machine learning to deploy useful predictive models into production. Our plan to teach is to: Order the […] Related posts: How does Practical Data Science with R stand out? Data Science, Machine…

Read more »

An anthropological study shows why complete data designs are counterproductive

May 1, 2014
By
An anthropological study shows why complete data designs are counterproductive

I saw Joe N.'s tweet asking me about a study of how professors spend their time, reported by Lisa Wade at Sociological Images. This is an anthropological study, something that I am not at all familiar with although the people in the field seem to believe that they can make statistically valid observations. I'm glad the author of the study, John Ziker, wrote a (really) long article describing what he…

Read more »

Reform academic statistics

May 1, 2014
By
Reform academic statistics

Terry Speed recently gave a talk on the role of statisticians in “Big Data” initiatives (see the video or just look at the slides). He points to the history of statisticians’ discussions of massive data sets (e.g., the Proceedings of a 1998 NRC workshop on Massive data sets) and how this history is being ignored […]

Read more »

Heller, Heller, and Gorfine on univariate and multivariate information measures

May 1, 2014
By

Malka Gorfine writes: We noticed that the important topic of association measures and tests came up again in your blog, and we have few comments in this regard. It is useful to distinguish between the univariate and multivariate methods. A consistent multivariate method can recognise dependence between two vectors of random variables, while a univariate […] The post Heller, Heller, and Gorfine on univariate and multivariate information measures appeared first…

Read more »


Subscribe

Email:

  Subscribe