My talk midtown this Friday noon (and at Columbia Monday afternoon)

April 24, 2013
By

At the City University of New York Graduate Center, 365 Fifth Avenue (between 34th and 35th street), room 6002. The topic: causality and statistical learning. Announcement is here (scroll down). It says that if you would like to attend any event, please respond by emailing datamining@gc.cuny.edu I’m also giving a shorter talk on the same [...]The post My talk midtown this Friday noon (and at Columbia Monday afternoon) appeared first…

Read more »

The Price is Right Problem: Part Two

April 24, 2013
By
The Price is Right Problem: Part Two

This article is an excerpt from Think Bayes, a book I am working on.  The entire current draft is available from http://thinkbayes.com.  I welcome comments and suggestions.In the previous article, I described presented The Price is ...

Read more »

The Tweets-Votes Curve

April 24, 2013
By
The Tweets-Votes Curve

Fabio Rojas points me to this excellently-titled working paper by Joseph DiGrazia, Karissa McKelvey, Johan Bollen, and himself: Is social media a valid indicator of political behavior? We answer this ques- tion using a random sample of 537,231,508 tweets from August 1 to November 1, 2010 and data from 406 competitive U.S. congressional elections provided [...]The post The Tweets-Votes Curve appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

The Tweets-Votes Curve

April 24, 2013
By
The Tweets-Votes Curve

Fabio Rojas points me to this excellently-titled working paper by Joseph DiGrazia, Karissa McKelvey, Johan Bollen, and himself: Is social media a valid indicator of political behavior? We answer this ques- tion using a random sample of 537,231,508 tweets from August 1 to November 1, 2010 and data from 406 competitive U.S. congressional elections provided [...]

Read more »

The Tweets-Votes Curve

April 24, 2013
By
The Tweets-Votes Curve

Fabio Rojas points me to this excellently-titled working paper by Joseph DiGrazia, Karissa McKelvey, Johan Bollen, and himself: Is social media a valid indicator of political behavior? We answer this ques- tion using a random sample of 537,231,508 tweets from August 1 to November 1, 2010 and data from 406 competitive U.S. congressional elections provided [...]

Read more »

How to overlay a custom density curve on a histogram in SAS

April 24, 2013
By
How to overlay a custom density curve on a histogram in SAS

I've previously described how to overlay two or more density curves on a single plot. I've also written about how to use PROC SGPLOT to overlay custom curves on a graph. This article describes how to overlay a density curve on a histogram. For common distributions, you can overlay a [...]

Read more »

The AllTrials campaign

April 24, 2013
By

The AllTrials campaign is pushing for all data on drug trials to be made public — see the campaign statement. If the public has all the evidence rather than a biased selection of evidence, then it will be possible to make better decisions. There’s been a good start, but more people need to know about […] The post The AllTrials campaign appeared first on Burns Statistics.

Read more »

Foundation for Open Access Statistics

April 23, 2013
By
Foundation for Open Access Statistics

Now here’s a foundation I (Bob) can get behind: Foundation for Open Access Statistics (FOAS) Their mission is to “promote free software, open access publishing, and reproducible research in statistics.” To me, that’s like supporting motherhood and apple pie! FOAS spun out of and is partially designed to support the Journal of Statistical Software (aka [...]The post Foundation for Open Access Statistics appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Bad normal approximation

April 23, 2013
By

Sometimes you can approximate a binomial distribution with a normal distribution. Under the right conditions, a Binomial(n, p) has approximately the distribution of a normal with the same mean and variance, i.e. mean np and variance np(1-p). The approximation works…Read more ›

Read more »

TweetMap ALPHA: Querying a Massive Amount of Tweets on a Map

April 23, 2013
By
TweetMap ALPHA: Querying a Massive Amount of Tweets on a Map

The impressive TweetMap ALPHA [harvard.edu], developed by Harvard University's Center for Geographic Analysis, is based on a dataset of about 95 million tweets, which can be dynamically queried by time, by location or by keyword. Tweetmap makes use o...

Read more »

Interview at Yale Center for Environmental Law & Policy

April 23, 2013
By

Interview with Roger Peng from YCELP on Vimeo. A few weeks ago I sat down with Angel Hsu of the Yale Center for Environmental Law and Policy to talk about some of their work on air pollution indicators. (Note: I … Continue reading →

Read more »

Charles Murray’s “Coming Apart” and the measurement of social and political divisions

April 23, 2013
By
Charles Murray’s “Coming Apart” and the measurement of social and political divisions

Following up on our blog discussions a year ago, I published a review of Charles Murray’s recent book, “Coming Apart,” for the journal Statistics, Politics, and Policy. I invited Murray to publish a response, and he did so. Here’s the abstract to my review: This article examines some claims made in a recent popular book [...]The post Charles Murray’s “Coming Apart” and the measurement of social and political divisions appeared…

Read more »

Spin, spin, spin away

April 23, 2013
By
Spin, spin, spin away

From a purely graphical perspective, the following NYT chart (link) is well executed: Labeling is always a challenge with scatter plots. Here, they have 54 points, and the chart still doesn't look too crammed. I like the axis labels, and...

Read more »

Spin, spin, spin away

April 23, 2013
By
Spin, spin, spin away

From a purely graphical perspective, the following NYT chart (link) is well executed: Labeling is always a challenge with scatter plots. Here, they have 54 points, and the chart still doesn't look too crammed. I like the axis labels, and...

Read more »

Russian dolls

April 23, 2013
By
Russian dolls

This is a bit of a Russian doll situation, because I'm effectively pointing to another blog, which (kind of) points to this blog (and, who knows, may be this blog is somehow pointing to that other blog too... [To enjoy this fully, you should read ...

Read more »

Review: Kölner R Meeting 12 April 2013

April 23, 2013
By
Review: Kölner R Meeting 12 April 2013

Our 5th Cologne R user group meeting was the best attended meeting so far, with 20 members finding their way to the Institute of Sociology for two talks by Diego de Castillo on shiny and Stephan Holtmeier on cluster analysis, followed by beer and schni...

Read more »

Derivative-free estimate of derivatives

April 23, 2013
By
Derivative-free estimate of derivatives

Hey, Arnaud Doucet, Sylvain Rubenthaler and I have just put a technical report on arXiv about estimating the first- and second-order derivatives of the log-likelihood (also called the score and the observed information matrix respectively) in general (intractable) statistical models, and in particular in (non-linear non-Gaussian) state-space models. We call them “derivative-free” estimates because they […]

Read more »

Scripts and Functions: Using R to Implement the Golden Section Search Method for Numerical Optimization

Scripts and Functions: Using R to Implement the Golden Section Search Method for Numerical Optimization

In an earlier post, I introduced the golden section search method – a modification of the bisection method for numerical optimization that saves computation time by using the golden ratio to set its test points.  This post contains the R function that implements this method, the R functions that contain the 3 functions that were […]

Read more »

The Golden Section Search Method: Modifying the Bisection Method with the Golden Ratio for Numerical Optimization

The Golden Section Search Method: Modifying the Bisection Method with the Golden Ratio for Numerical Optimization

Introduction The first algorithm that I learned for root-finding in my undergraduate numerical analysis class (MACM 316 at Simon Fraser University) was the bisection method.  It’s very intuitive and easy to implement in any programming language (I was using MATLAB at the time).  The bisection method can be easily adapted for optimizing 1-dimensional functions with […]

Read more »

The Largest Web Page on the Internet: 7 Billion People on One Page

April 22, 2013
By
The Largest Web Page on the Internet: 7 Billion People on One Page

7 Billion World [7billionworld.com] displays 7 billion people together on a single webpage. Developed by Worldometers - which themselves were originally posted in the good year of 2005 -, the web page itself is generated through some small programmin...

Read more »

Prime Explorer: Exploring Patterns in Prime Number Spatial Layouts

April 22, 2013
By
Prime Explorer: Exploring Patterns in Prime Number Spatial Layouts

Prime Explorer [bigblueboo.com], developed by a San Francisco-based software company called Mode of Expression, provides an interactive view of all the prime numbers ranging from 1 to 62,500. Each prime number is represented by a bright, white squar...

Read more »

The Price is Right Problem

April 22, 2013
By
The Price is Right Problem

This article is an excerpt from Think Bayes, a book I am working on.  The entire current draft is available from http://thinkbayes.com.  I welcome comments and suggestions.The Price is Right ProblemOn November 1, 2007, contestants named Letia...

Read more »

Statistics Sweden’s statistics are available for new services

April 22, 2013
By
Statistics Sweden’s statistics are available for new services

From: http://www.scb.se/Pages/List____354067.aspxNow companies and private individuals have access to Statistics Sweden’s statistics, which can be used for new products and services. This involves data from the Statistical Database that can be digita...

Read more »


Subscribe

Email:

  Subscribe