## Data analysis acquisition “worst deal ever”?

December 3, 2012
By

A little over a year ago I mentioned that data analysis companies were getting gobbled up by larger technology companies. In particular, HP bought Autonomy, a British data analysis company, for about \$11 billion. (By the way, can anyone tell … Continue reading →

## Four numbers say little, even on a busy chart

December 3, 2012
By

Reader Robert J. calls this a "really bad" chart (link). The data-ink ratio, he notes, is horrible. The message of the chart can be stated in one or two sentences. And it's not clear what the other items are buying...

## Wonderful "How-To" Resources for Learning Structural Equation Modeling (SEM) with AMOS

December 3, 2012
By

Structural equation modeling (SEM) is a complex beast, and can be quite intimidating to someone trying to learn the basics. Fortunately, there are some great resources out there for learning! Unfortunately, I think a lot of beginners don't know what those great resources are, or where to find them.

## Wonderful "How-To" Resources for Learning Structural Equation Modeling (SEM) with AMOS

December 3, 2012
By

Structural equation modeling (SEM) is a complex beast, and can be quite intimidating to someone trying to learn the basics. Fortunately, there are some great resources out there for learning! Unfortunately, I think a lot of beginners don't know what th...

## Variability in long-short decile strategy tests

December 3, 2012
By

How to capture return variability when testing strategies with long-short deciles. Traditional practice Question: Does variable X have predictive power for our universe of assets? A common scheme of quants to answer the question is to form a series of portfolios over time.  The portfolio at each time point: is long the equal weighting of … Continue reading →

## Show percentages for bar charts with PROC SGPLOT

December 3, 2012
By

It seemed like an easy task. A SAS user asked me how to use the SGPLOT procedure to create a bar chart where the vertical axis shows percentages instead of counts. I assumed that there was some simple option that would change the scale of the vertical axis from counts [...]

## Affordances

December 3, 2012
By

How do we know what we can do with things in the world or in user interfaces? What makes us push buttons, flip switches, or pick up objects that fit our hands? This guidance comes from affordances, a clever and intuitive theory that has been around for decades but is often misunderstood. The term affordance was coined by James J. Gibson in his book, The Ecological Approach to Visual Perception…

## forecast package v4.0

December 3, 2012
By

A few days ago I released version 4.0 of the forecast package for R. There were quite a few changes and new features, so I thought it deserved a new version number. I keep a list of changes in the Changelog for the package, but I doubt that many people look at it. So for the record, here are the most important changes to the forecast package made since v3.0…

## One year on!

December 2, 2012
By

I have been blogging for just under a year now, and have written over 50 posts. There have been over 30,000 hits on the blog, and some very helpful comments. I’ve had a lot of fun, and there is something … Continue reading →

## Statistical Science meets Philosophy of Science

December 2, 2012
By

Many of the discussions on this blog have revolved around a cluster of issues under the general question: “Statistical Science and Philosophy of Science: Where Do (Should) They meet? (in the contemporary landscape)?”  In tackling these issues, this blog regularly returns to a set of contributions growing out of a conference with the same title [...]

## Sunday data/statistics link roundup (12/2/12)

December 2, 2012
By

An interview with Anthony Goldbloom, CEO of Kaggle. I’m not sure I’d agree with the characterization that all data scientists are: creative, curious, and competitive and certainly those characteristics aren’t unique to data scientists. And I didn’t know this: “We … Continue reading →

## Triangle tests

December 2, 2012
By

IntroductionA triangle test is a test beloved by sensory scientists for its simplicity and general use in detecting presence of product differences. The principle is simple. Test subjects get served three samples. One of these contains A, two of these ...

## A lifetime supply of . . .

December 2, 2012
By

This story reminded me of when I was planning my short course for Procter & Gamble: The question in my mind was: should I take the consulting fee they offered or should I try to get a voucher for a lifetime supply of P&G products (assuming, of course, that such a voucher exists)? Wouldn’t that [...]

## \$241,364.83 – \$13,000 = \$228,364.83

December 2, 2012
By

A blog commenter pointed me to this news article on Sudhir Venkatesh, a sociology professor here: He was the subject last year of a grueling investigation into a quarter-million dollars of spending that Columbia auditors said was insufficiently documented, misappropriated or outright fabricated. According to internal documents from that investigation, which were obtained by The [...]

## Financial Turbulence Example

December 1, 2012
By

Today, I want to highlight the Financial Turbulence Index idea introduced by Mark Kritzman and Yuanzhen Li in the Skulls, Financial Turbulence, and Risk Management paper. Timely Portfolio did a great series of posts about Financial Turbulence: Part 1, Part 2, Part 3. As example, I will compute Financial Turbulence for the equal weight index [...]

## Working in the open

December 1, 2012
By

Open Data flourishes and more and more open-data sites are launched with sophisticated functionalities. So now http://open.undp.org. ‘Open.undp.org presents UNDP’s 6,000+ …Continue reading »

## The purpose of writing

December 1, 2012
By

Basbøll: My aim is Socratic. I don’t want to help you become more knowledgeable. I want to help you better distinguish what you know from what you don’t know. Excellent point. Indeed, laying out what I do know and tracing the boundary of ...

## Screening and False Discovery Rates

December 1, 2012
By
$Screening and False Discovery Rates$

Today we have another guest post. This one is by Ryan Tibshirani an Assistant Professor in my department. (You might also want to check out the course on Optimization that Ryan teaches jointly with Geoff Gordon.) Screening and False Discovery Rates by Ryan Tibshirani Two years ago, as a TA for Emmanuel Candes’ Theory of [...]

## Outburst of interesting news in one hour

December 1, 2012
By

I guess I would carry an “NPR” label when I’m profiled by google’s personalized searching algorithm or other targeted online advertising algorithms. On my way to work or home, I usually pick up one or two good stories from NPR. Nov 29, 2012 was an outlier in some sense. I found much more than a […]

## A Real-Time Map of the Song "I Have Been Everywhere" by Johnny Cash

November 30, 2012
By

Freelance web developer Iain Mullan has developed a map mashup titled "Johnny Cash Has Been EVERYWHERE (Man)!" [iainmullan.com]. The concept is simple yet funny: using a combination of an on-demand music service, an online lyrics catalog and some Go...

November 30, 2012
By

I just was doing my morning reading of a few news sources and stumbled across this Huffington Post article talking about research correlating babies cries to autism. It suggests that the sound of a babies cries may predict their future … Continue reading →

## “The scientific literature must be cleansed of everything that is fraudulent, especially if it involves the work of a leading academic”

November 30, 2012
By

Someone points me to this report from Tilburg University on disgraced psychology researcher Diederik Stapel. The reports includes bits like this: When the fraud was first discovered, limiting the harm it caused for the victims was a matter of urgency. This was particularly the case for Mr Stapel’s former PhD students and postdoctoral researchers . [...]

## Error Statistics (brief overview)

November 30, 2012
By

In view of some questions about “behavioristic” vs “evidential” construals of frequentist statistics (from the last post), and how the error statistical philosophy tries to improve on Birnbaum’s attempt at providing the latter, I’m reblogging a portion of a post from Nov. 5, 2011 when I also happened to be in London. (The beginning just records [...]