Revisiting the Syria chart

October 8, 2013
By

New York/Tri-State residents: Meet me at NYU Bookstore tonight, 6-7:30 pm. (link) *** When I wrote about the graphic showing the vote distribution around Syria in the Congress a few posts ago (link), readers offered opinions about what's a better...

Why models need a certain culture to flourish

October 8, 2013
By

About half a year ago Ian Branagan, Chief Risk Officer of Renaissance Re - a Bermudian reinsurance company with a focus on property catastrophe insurance, gave a talk about the usage of models in risk management and how they evolved over the last twent...

On the treachery of point-and-click "black-box" data analysis

October 8, 2013
By

*in this post, by "black-box" I'm referring to software whose methods are undisclosed and un-audible rather than black-box math models There’s a certain expectation that the analyses that inform not only business and stock trading, but public health and social welfare decisions, are carefully thought out and performed with painstaking attention to detail. However, the »more

The Leek group policy for developing sustainable R packages

October 7, 2013
By

As my group has grown over the past few years and I have more people writing software, I have started to progressively freak out more and more about how to make sure that the software is sustainable as students graduate … Continue reading →

Argentan half-marathon [split times]

October 7, 2013
By

Filed under: R, Running Tagged: Argentan, background, half-marathon, Normandy, race, road races, running, splits

October 7, 2013
By

Sounds silly, but it's not. I got talked into joining a few weeks ago, and I'm glad I did. I rarely tweet (except to announce new No Hesitations posts), but I follow others. Several times in the last few weeks alone, various pieces of valuable informat...

Bing is preferred to Google by people who aren’t like me

October 7, 2013
By

This one is fun because I have a double conflict of interest: I’ve been paid (at different times) both by Google and by Microsoft. Here’s the story: Microsoft, September 2012: An independent research company, Answers Research based in San Diego, CA, conducted a study using a representative online sample of nearly 1000 people, ages 18 […]The post Bing is preferred to Google by people who aren’t like me appeared first…

Numbersense Pros: Interview with Len Testa

October 7, 2013
By

In the first chapter of my first book, Numbers Rule Your World (link), I explored the concept of variability using a pair of examples, one of which was Disney's FastPass virtual reservation system. Truly grasping the ins and outs of variability is one of the most important objectives for a budding statistician (or data scientist). In the discussion, I highlighted the work of Len Testa, whose website, TouringPlans.com, provides custom,…

The look of verifying data

October 7, 2013
By

Get data that fit before you fit data. Why verify? Garbage in, garbage out. How to verify The example data used here is daily (adjusted) prices of stocks.  By some magic that I’m yet to fathom, market data can be wondrously wrong even without the benefit of the possibility of transcription errors.  It doesn’t seem … Continue reading →

How to create a library of functions in PROC IML

October 7, 2013
By

What is the best way to share SAS/IML functions with your colleagues? Give them the source code? Create a function library that they can use? This article describes three techniques that make your SAS/IML functions accessible to others. As background, remember that you can define new functions and subroutines in [...]

Parallel Tempering in R with Rmpi

October 7, 2013
By
$Parallel Tempering in R with Rmpi$

My office computer recently got a really nice upgrade and now I have 8 cores on my desktop to play with. I also at the same time received some code for a Gibbs sampler written in R from my adviser. I wanted to try a metropolis-coupled markov chain monte carlo, , algorithm on it to […] The post Parallel Tempering in R with Rmpi appeared first on Lindons Log.

Story Points

October 7, 2013
By

I consider presentation and storytelling the next step in visualization, after most of the focus has been on exploration and analysis so far. An upcoming version of Tableau will include a feature called Story Points, which supports presentation directly in the visualization tool. A Story A Tableau Story is a new type of sheet, like […]

Absolute and Relative Risk

October 7, 2013
By

It is important that citizens can make sense out of the often outrageous claims of advertisers and pro-screening advocates.  It isn’t what they say, but how they say it. What looks like a very large and scary increase in risk, … Continue reading →

October 6, 2013
By

A fascinating read about applying decision theory to mathematical proofs. They talk about Type I and Type II errors and everything.  Statistical concepts explained through dance. Even for a pretty culture-deficient dude like me this is cool. Lots of good … Continue reading →

Nice & weird people in the Canary island (oh: I went there for work too!)

October 6, 2013
By

The past one has been a very interesting week, which I've spent visiting the University of Las Palmas, in the Canary Island. Since it was the last week on maternity leave for Marta, we all went. I knew the weather would be good, but we didn't expect it...

Ideas that spread fast and slow

October 6, 2013
By

Atul Gawande (the thinking man’s Malcolm Gladwell) asks: Why do some innovations spread so swiftly and others so slowly? Consider the very different trajectories of surgical anesthesia and antiseptics, both of which were discovered in the nineteenth century. The first public demonstration of anesthesia was in 1846. The Boston surgeon Henry Jacob Bigelow was approached […]The post Ideas that spread fast and slow appeared first on Statistical Modeling, Causal Inference,…

Influence Analysis for Repeated Measures Data

October 6, 2013
By

I am trying exercise 59.8 (page 5057) of the SAS/STAT Users Guide 12.3 in R. The interesting thing is that influence is investigated on subject level rather than individual level. The diagnostics in nlme does not do leave-subject-out, at least, not tha...

Was Janina Hosiasson pulling Harold Jeffreys’ leg?

October 6, 2013
By

The very fact that Jerzy Neyman considers she might have been playing a “mischievous joke” on Harold Jeffreys (concerning probability) is enough to intrigue and impress me (with Hosiasson!). I’ve long been curious about what really happened. Eleonore Stump, a leading medieval philosopher and friend (and one-time colleague), and I pledged to travel to Vilnius […]

Crime Against Women in India – Addressing 8 Questions Using rCharts, googleVis, and shiny

October 5, 2013
By

UPDATE: THE BLOG/SITE HAS MOVED TO GITHUB. THE NEW LINK FOR THE BLOG/SITE IS patilv.github.io and THE LINK TO THIS POST IS: http://bit.ly/1lHtVon. PLEASE UPDATE ANY BOOKMARKS YOU MAY HAVE. Recent crimes against women, specifically the 2012 ga...

Estimating rates from a single occurrence of a rare event

October 5, 2013
By

Elon Musk’s writing about a Tesla battery fire reminded me of some of the math related to trying to estimate the rate of a rare event from a single occurrence of the event (plus many non-event occurrences). In this article we work through some of the ideas. Elon Musk wrote that the issues of the […] Related posts: Sample size and power for rare events What is a large enough…

Pure Brilliance From FRB St. Louis: EconomicAcademics.org

October 5, 2013
By

This just in from Christian Zimmermann and the RePEc Team at FRB St. Louis:"Congratulations, you made the list! .. The Federal Reserve Bank of St. Louis is launching a blog aggregator, EconomicAcademics.org, to highlight and promote the discussion of e...

Give me a ticket for an aeroplane

October 5, 2013
By

How long are songs? Gabriel Rossman discusses the two peaks, one at just under 3 minutes and one at just under 4 minutes. He quotes musician Jacob Slichter: In anticipation of “crossing over” the single to radio formats . . . Each mix had to be edited down to under four minutes, an important limit […]The post Give me a ticket for an aeroplane appeared first on Statistical Modeling, Causal…

A functional Gibbs sampler in Scala

October 4, 2013
By

For many years I’ve had a passing interest in functional programming and languages which support functional programming approaches. I’m also quite interested in MOOCs and their future role in higher education. So I recently signed up for my first on-line course, Functional Programming Principles in Scala, via Coursera. I’m around half way through the course […]