## NBA Predictions — Finals

June 12, 2012

Now we are on to the finals! The algorithm enters the finals with a 6-4 record so far. Here is what we have for tonight: So, let’s see if OKC wins this one.

## Finding Word Use Patterns in Wikileaks Cables

June 12, 2012

6/18: A follow-up to this post is now available here. Recent Discoveries When I was a diplomat, I was always interested in the Wikileaks cables and what could be done with them. Unfortunately, I never got a chance to look at the site in depth, du...

## Poison gas or…air pollution?

June 12, 2012

From our Beijing bureau, we have the following message from the U.S. embassy that was recently issued to U.S. citizens in China: The Embassy has received reports from U.S. citizens living and traveling in Wuhan that the air quality in the city has be...

## Exploding NBA Basketball Shot Heat Map Analysis

June 12, 2012

The interactive infographic "Where the Heat and the Thunder Hit Their Shots" [nytimes.com] by the New York Times combines pure eye-candy with sophisticated statistical display, as it conveys the overall shooting patterns of the players on the Miami Hea...

## Simple graph WIN: the example of birthday frequencies

June 12, 2012

From Chris Mulligan: The data come from the Center for Disease Control and cover the years 1969-1988. Chris also gives instructions for how to download the data and plot them in R from scratch (in 30 lines of R code)! And now, the background A few months ago I heard about a study reporting that, [...]

## Network Medicine: Using Visualization to Decode Complex Diseases

June 12, 2012

"We should stop training cardiologists and neurologists. The doctor of the future needs to become a 'networkologist', able to understand where diseases are lying in the network and how they relate to each other." Albert László Barab...

## Big Data Needs May Create Thousands Of Tech Jobs

June 12, 2012

Big Data Needs May Create Thousands Of Tech Jobs: The amount of digital data being stored today is ballooning. And this boom in what is known as big data is expected to produce thousands of new jobs over the next several years - especially in the high-tech ...

## Green: E.P.A. Soot Rules Expected This Week

June 12, 2012

Green: E.P.A. Soot Rules Expected This Week: Under an agreement with 11 states that sued over delays, the agency is expected to announce proposed rules on fine particulates on Thursday.

## Bloomberg issues a health warning dressed up as a fast-food menu

June 12, 2012

NYC mayor Michael Bloomberg is getting mixed reviews for his proposal to ban super-sized sugary drinks. Reader John O. wasn't impressed with this graphical effort (link): The key problem: this picture is not scary at all. The reason it's not...

## Volatility Position Sizing 2

June 12, 2012

I have discussed Volatility Position Sizing in the Volatility Position Sizing to improve Risk Adjusted Performance post using the Average True Range (ATR) as a measure of Volatility. Today I want to show how to use historical volatility to adjust portfolio leverage. Let’s start with Buy and Hold strategy using SPY and rescale it to the [...]
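
The excerpt stops before the method is shown, so the following is only a rough sketch of one common way to turn historical volatility into a leverage figure, not the post's own code. It assumes a simple target-volatility rule (leverage = target volatility / realized volatility, with a cap). The function names, the 252-day annualization, the 2x cap, and the made-up return series are all illustrative assumptions.

```python
import math
import statistics


def historical_volatility(returns, periods_per_year=252):
    """Annualized sample standard deviation of periodic returns.

    periods_per_year=252 assumes daily returns (an assumption, not from the post).
    """
    return statistics.stdev(returns) * math.sqrt(periods_per_year)


def volatility_target_leverage(returns, target_vol, max_leverage=2.0):
    """Leverage that rescales realized volatility toward target_vol.

    Assumed rule: leverage = target_vol / realized_vol, capped at max_leverage.
    """
    vol = historical_volatility(returns)
    if vol == 0:
        # No observed variation: fall back to the cap rather than divide by zero.
        return max_leverage
    return min(target_vol / vol, max_leverage)


# Made-up daily returns, purely for illustration (not SPY data).
calm = [0.01, -0.01] * 30
stormy = [2 * r for r in calm]

calm_leverage = volatility_target_leverage(calm, target_vol=0.10)
stormy_leverage = volatility_target_leverage(stormy, target_vol=0.10)
print(f"calm: {calm_leverage:.3f}, stormy: {stormy_leverage:.3f}")
```

Under this rule, doubling the volatility of the return series halves the leverage, which is the sense in which exposure is scaled down in turbulent periods.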

## Transforming subsets of data in R with by, ddply and data.table

June 12, 2012

Transforming data sets with R is usually the starting point of my data analysis work. Here is a scenario which comes up from time to time: transform subsets of a data frame, based on context given in one or a combination of columns. As an example I use...

## \verbatim [beamer package]

June 11, 2012

Once again working on my slides for the AMSI Lecture 2012 tour, it took me a while to get the following LaTeX code (about the family reunion puzzle) to work: \begin{frame}[fragile,label=notleM2] \slidetitle{A family meeting} \begin{block}{Random switch of couples} \only<1>{ \begin{itemize} \item Pick two couples [among the 20 couples] at random with probabilities proportional to the [...]

## Convergence Monitoring for Non-Identifiable and Non-Parametric Models

June 11, 2012

Becky Passonneau and colleagues at the Center for Computational Learning Systems (CCLS) at Columbia have been working on a project for ConEd (New York’s major electric utility) to rank structures based on vulnerability to secondary events (e.g., transformer explosions, cable meltdowns, electrical fires). They’ve been using the R implementation BayesTree of Chipman, George and McCulloch’s [...]

## Stratified Sampling for Oversampling Small Sub-Populations

June 11, 2012

How to do it In stratified sampling, the population is divided into different sub-groups or strata, and then the subjects are randomly selected from each of the strata. So, in the above example, you would divide the population into different linguistic sub-groups (one of which is Yiddish speakers). Here are two simple steps you should follow:
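
The post's two steps are not reproduced in this excerpt. As a rough illustration of the general idea described above (split the population into strata, then sample at random within each), here is a minimal Python sketch. The function name, the population of 10,000 with 50 Yiddish speakers, and the per-stratum sample sizes are hypothetical and chosen only to show how a small sub-population can be oversampled.

```python
import random
from collections import defaultdict


def stratified_sample(records, stratum_of, n_per_stratum, seed=None):
    """Draw up to n_per_stratum[s] records at random from each stratum s.

    records: iterable of items in the population.
    stratum_of: function mapping a record to its stratum label.
    n_per_stratum: dict of stratum label -> requested sample size.
    """
    rng = random.Random(seed)

    # Step 1: divide the population into strata.
    strata = defaultdict(list)
    for rec in records:
        strata[stratum_of(rec)].append(rec)

    # Step 2: sample without replacement inside each stratum.
    sample = {}
    for name, members in strata.items():
        k = min(n_per_stratum.get(name, 0), len(members))
        sample[name] = rng.sample(members, k)
    return sample


# Hypothetical population: 10,000 people, 50 of them Yiddish speakers.
population = [
    {"id": i, "language": "Yiddish" if i < 50 else "Other"}
    for i in range(10_000)
]

sample = stratified_sample(
    population,
    stratum_of=lambda person: person["language"],
    n_per_stratum={"Yiddish": 30, "Other": 100},
    seed=0,
)
print({name: len(rows) for name, rows in sample.items()})
```

With these made-up numbers, a simple random sample of 130 people would be expected to contain fewer than one Yiddish speaker, while the stratified draw guarantees 30. Because the small stratum is deliberately oversampled, population-level estimates built from such a sample would need to weight each stratum back to its true share.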

## FoodMood: Measuring Global Food Sentiments via Twitter

June 11, 2012

FoodMood [foodmood.in] by Affect Lab, Ai Applied and Jana + Koos aims to communicate a better understanding of global food consumption patterns and their impact on the daily emotional well-being of people. The treemap captures, in real-time, the foo...

## Chris Volinsky knows where you are

June 11, 2012

Chris Volinsky knows where you are: An interesting article on how AT&T is using cell phone location data for urban planning and other purposes.

## The HaploREG Database for Functional Annotation of SNPs

June 11, 2012

The ENCODE project continues to generate massive numbers of data points on how genes are regulated.  This data will be of incredible use for understanding the role of genetic variation, both for altering low-level cellular phenotypes (like gene ...

## The unitary nature of consciousness: “It’s impossible to be insanely frustrated about 2 things at once”

June 11, 2012

Dan Kahan writes: We all know it’s ridiculous to be able to go on an fMRI fishing trip & resort to post hoc story-telling to explain the “significant” correlations one (inevitably) observes (good fMRI studies *don’t* do this; only bad ones do– to the injury of the reputation of all the scholars doing good studies [...]

