Working with intraday data

September 24, 2013
By

When working with intraday data, analysts are often facing a large dataset problem. R is well equipped to deal with this but the standard approach has to be modified in some ways. Large dataset means different things to different people. I’m talking here about a dataset of less than 10 columns and 2 to 5 […]

Read more »

Changing the width of bars and columns in googleVis

September 24, 2013
By
Changing the width of bars and columns in googleVis

Changing the plotting width in bar-, column- and combo-charts of googleVis works identical and is defined by the bar.groupWidth argument. The dot in the argument means that it has to be split in R into bar="{groupWidth:'10%'}". Examplelibrary(googleVis...

Read more »

Scalable Stan

September 23, 2013
By
Scalable Stan

Bob writes: If you have papers that have used Stan, we’d love to hear about it. We finally got some submissions, so we’re going to start a list on the web site for 2.0 in earnest. You can either mail them to the list, to me directly, or just update the issue (at least until […]The post Scalable Stan appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Integrity and the Jaynesian/Frequentist divide.

September 23, 2013
By

The heart of the Jaynesian/Frequentist divide can be found in the post “The Definition of a Frequentist”. A Frequentist views weather prediction as one of getting the frequency of weather events right. A Jaynesian’s goal is to pin dow...

Read more »

The limiting reagent for big data is often small, well-curated data

September 23, 2013
By

I've been working on "big" data in genomics since I was a first year student in graduate school (a longer time than I'd rather admit). At the time, "big" meant microarray studies with a couple of hundred patients. Of course, … Continue reading →

Read more »

My talk Tues 24 Sept at 12h30 at Université de Technologie de Compiègne

September 23, 2013
By

Philosophie et practique de la statistique bayésienne. I’ll try to update the slides a bit since a few years ago, to add some thoughts I’ve had recently about problems with noninformative priors, even in simple settings. The location of the talk will not be convenient for most of you, but anyone who comes to the […]The post My talk Tues 24 Sept at 12h30 at Université de Technologie de Compiègne…

Read more »

Deep Inside the LSE

September 23, 2013
By

Mary Morgan, Professor of History and Philosophy of Economics at LSE (Department of Economic History) is giving a talk at Penn today, in our Department of History and Sociology of Science. She's done interesting work in the history of econometrics...

Read more »

Detecting Weak Instruments in R

September 23, 2013
By
Detecting Weak Instruments in R

Any instrumental variables (IV) estimator relies on two key assumptions in order to identify causal effects: That the excluded instrument or instruments only effect the dependent variable through their effect on the endogenous explanatory variable or variables (the exclusion restriction), That the correlation between the excluded instruments and the endogenous explanatory variables is strong enough […]

Read more »

Detecting Weak Instruments in R

September 23, 2013
By
Detecting Weak Instruments in R

Any instrumental variables (IV) estimator relies on two key assumptions in order to identify causal effects: That the excluded instrument or instruments only effect the dependent variable through their effect on the endogenous explanatory variable or variables (the exclusion restriction), That the correlation between the excluded instruments and the endogenous explanatory variables is strong enough […]

Read more »

More on Bayesian methods and multilevel modeling

September 23, 2013
By

Ban Chuan Cheah writes: In a previous post, http://andrewgelman.com/2013/07/30/the-roy-causal-model/ you pointed to a paper on Bayesian methods by Heckman. At around the same time I came across another one of his papers, “The Effects of Cognitive and Noncognitive Abilities on Labor Market Outcomes and Social Behavior (2006)” (http://www.nber.org/papers/w12006 or published version http://www.jstor.org/stable/10.1086/504455). In this paper […]The post More on Bayesian methods and multilevel modeling appeared first on Statistical Modeling, Causal…

Read more »

Generate permutations in SAS

September 23, 2013
By
Generate permutations in SAS

I've written several articles that show how to generate permutations in SAS. In the SAS DATA step, you can use the ALLPEM subroutine to generate all permutations of a DATA step array that contain a small number (18 or fewer) elements. In addition, the PLAN procedure enables you to generate [...]

Read more »

Going to Plot Some Proportions? Why not Flog ‘em First?

September 23, 2013
By
Going to Plot Some Proportions? Why not Flog ‘em First?

Fractions and proportions can be difficult to plot nicely for a number of reasons: If the proportions are based on small counts (e.g., two of his three computing devices were Apple products) then the calculated proportions will only take on a numbe...

Read more »

Waiting in One Line or Multiple Lines

September 23, 2013
By
Waiting in One Line or Multiple Lines

Whenever I go to the grocery store it always seems to be a lesson in statistics. I go get the things I need to buy and then  I try to select the checkout register that will decrease the amount of time I have to wait. Inevitably, I select the one line where there is some […]

Read more »

De la significativité (statistique)

September 23, 2013
By
De la significativité (statistique)

On va encore dire que je fais mon numéro d’anti-journaliste primaire, mais je voulais revenir sur une histoire, relayée par certains journaux (français) qui me laisse songeur sur les liens entre les journalistes et le milieu scientifique. Je ne reviendrais pas sur l’information que j’avais relayée alors (que @Tournyol a reprise sur Twitter ce week-end, et qui a beaucoup circulé ces dernières heures sur Twitter) où plusieurs journalistes traduisaient “augmenté de…

Read more »

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Introduction Continuing my recent series on exploratory data analysis, today’s post focuses on quantile-quantile (Q-Q) plots, which are very useful plots for assessing how closely a data set fits a particular distribution.  I will discuss how Q-Q plots are constructed and use Q-Q plots to assess the distribution of the “Ozone” data from the built-in […]

Read more »

On-line learning and teaching resources

September 22, 2013
By
On-line learning and teaching resources

Twenty-first century Junior Woodchuck Guidebook I grew up reading Donald Duck comics. I love the Junior Woodchucks, and their Junior Woodchuck Guidebook. The Guidebook is a small paperback book, containing information on every conceivable subject, including geography, mythology, history, literature … Continue reading →

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #9)

September 22, 2013
By
MCMSki IV, Jan. 6-8, 2014, Chamonix (news #9)

This a reminder about the October 15 deadlines for MCMSki IV: First, the early bird rate for the registration ends up on October 15. Second, the young investigator travel support can only be requested up to October 15 as well. Be sure to book your hotel or rental place early too as Chamonix is quite popular in […]

Read more »

Statistical Theater of the Absurd: “Stat on a Hot Tin Roof”

September 22, 2013
By
Statistical Theater of the Absurd: “Stat on a Hot Tin Roof”

Memory lane: Did you ever consider how some of the colorful exchanges among better-known names in statistical foundations could be the basis for high literary drama in the form of one-act plays (even if appreciated by only 3-7 people in the world)? (Think of the expressionist exchange between Bohr and Heisenberg in Michael Frayn’s play […]

Read more »

Guns are cool

September 21, 2013
By
Guns are cool

Reddit has a subreddit Guns are cool which in turn contains the MASS SHOOTINGS IN 2013 page. I pulled those data to see if there was anything of interest to plot or examine. I ended with modelling the frequency of shootings (once a ...

Read more »

Parum PI

September 20, 2013
By
Parum PI

A couple of weeks ago, the MRC-funded research project on the Regression Discontinuity Design (of which I'm the Principal Investigator) has officially started, so I thought I wrote a few lines of update about it, after the couple of posts (here an...

Read more »

“Six red flags for suspect work”

September 20, 2013
By

Raghu Parthasarathy sends along this article by C. Glenn Begley listing six questions to ask when worried about unreplicable work in biology: Were experiments performed blinded? (Even animal studies should be blinded when it comes to the recording and interpretation of the data—do you hear that, Mark Hauser?) Were basic experiments repeated? (“If reports fail […]The post “Six red flags for suspect work” appeared first on Statistical Modeling, Causal Inference,…

Read more »

Theory gets too Much Respect, and Measurement Doesn’t get Enough (60-Second Lecture Video and Transcription)

September 20, 2013
By

In the last post, I told you about Penn's "60-Second Lecture". Mine is now completed, and we had a good time. Watch the video, and you'll have a good laugh at the unflattering opening shot, complete with a barking dog in the background, and the blindin...

Read more »

The gradient of the bivariate normal cumulative distribution

September 20, 2013
By
The gradient of the bivariate normal cumulative distribution

I'm spoiled by the internet. I've grown so accustomed to being able to instantly find an answer to any query—no matter how obscure—that I am surprised when I don't find what I am looking for. The other day I was trying to find a mathematical result: a formula for the [...]

Read more »


Subscribe

Email:

  Subscribe