Belly Button Biodiversity: The End Game

May 30, 2013
By
Belly Button Biodiversity: The End Game

In the previous installment of this saga, I admitted that my predictions had completely failed, and I outlined the debugging process I began.  Then the semester happened, so I didn't get to work on it again until last week.It turns out that there ...

Read more »

PLATO, an Alternative to PLINK

May 30, 2013
By
PLATO, an Alternative to PLINK

Since the near beginning of genome-wide association studies, the PLINK software package (developed by Shaun Purcell’s group at the Broad Institute and MGH) has been the standard for manipulating the large-scale data produced by these studies.  O...

Read more »

There are no outliers

May 30, 2013
By

Matt Brigg’s comment on outliers in his post Tyranny of the mean: Coontz used the word “outliers”. There are no such things. There can be mismeasured data, i.e. incorrect data, say when you tried to measure air temperature but your thermometer fell into boiling water. Or there can be errors in recording the data; transposition […]

Read more »

Infill asymptotics and sprawl asymptotics

May 30, 2013
By
Infill asymptotics and sprawl asymptotics

Anirban Bhattacharya, Debdeep Pati, Natesh Pillai, and David Dunson write: Penalized regression methods, such as L1 regularization, are routinely used in high-dimensional applications, and there is a rich literature on optimality properties under sparsity assumptions. In the Bayesian paradigm, sparsity is routinely induced through two-component mixture priors having a probability mass at zero, but such [...]The post Infill asymptotics and sprawl asymptotics appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Chance to ask me a question this Friday

May 30, 2013
By

I will be at Book Expo this Friday signing books at the McGraw-Hill booth. If you're in NYC, drop by and say hi between 11 and 12. Yes, it's a new book! The title is Numbersense: How to Use Big...

Read more »

Chance to ask me a question this Friday

May 30, 2013
By

I will be at Book Expo this Friday signing books at the McGraw-Hill booth. If you're in NYC, drop by and say hi between 11 and 12. Yes, it's a new book! The title is Numbersense: How to Use Big Data to Your Advantage (link). If you read my blogs, you already know where I'm going with this. How can we be smart consumers of data analyses in a world…

Read more »

Using simulation to estimate the power of a statistical test

May 30, 2013
By
Using simulation to estimate the power of a statistical test

The power of a statistical test measures the test's ability to detect a specific alternate hypothesis. For example, educational researchers might want to compare the mean scores of boys and girls on a standardized test. They plan to use the well-known two-sample t test. The null hypothesis is that the [...]

Read more »

K. Staley: review of Error & Inference

May 30, 2013
By
K. Staley: review of Error & Inference

K. W. Staley Associate Professor Department of Philosophy, Saint Louis University (Almost) All about error BOOK REVIEW Metascience (2012) 21:709–713 DOI 10.1007/s11016-011-9618-1 Deborah G. Mayo and Aris Spanos (eds): Error and inference: Recent exchanges on experimental reasoning, reliability, objectivity, and rationality. New York: Cambridge University Press, 2010, xvii+419 pp The ERROR’06 (experimental reasoning, reliability, objectivity, […]

Read more »

What statistics should do about big data: problem forward not solution backward

May 29, 2013
By

There has been a lot of discussion among statisticians about big data and what statistics should do to get involved. Recently Steve M. and Larry W. took up the same issue on their blog. I have been thinking about this … Continue reading →

Read more »

Another one of those “Psychological Science” papers (this time on biceps size and political attitudes among college students)

May 29, 2013
By

Paul Alper writes: Unless I missed it, you haven’t commented on the recent article of Michael Bang Peterson [with Daniel Sznycer, Aaron Sell, Leda Cosmides, and John Tooby]. It seems to have been reviewed extensively in the lay press. A typical example is here. This review begins with “If you are physically strong, social science [...]The post Another one of those “Psychological Science” papers (this time on biceps size and…

Read more »

SAS Dominates Analytics Job Market; R up 42%

May 29, 2013
By
SAS Dominates Analytics Job Market; R up 42%

I’m continuing to gather and analyze data to update The Popularity of Data Analysis Software. In this installment I cover the latest employment figures. Employment is important to us all, so what software skills are employers seeking? A thorough answer … Continue reading →

Read more »

The 3D Trajectories of the Tennis Ball during the Final ATP Matches

May 29, 2013
By
The 3D Trajectories of the Tennis Ball during the Final ATP Matches

Corona Perspectives [coronaperspectives.com] developed by advertising agency JWT Spain and web development studio Espada y Santa Cruz provides an interactive and 3D perspective of all the tennis ball trajectories during 3 past ATP tennis matches. The...

Read more »

Will Mu Go Out With Median

May 29, 2013
By
Will Mu Go Out With Median

True story (no really, this did actually happen).  While in grad school one of the other teaching assistants was approached by one of the students and was asked “will mu go out with median?”  The teaching assistant thought the play on words was pretty funny, laughed, and then cluelessly walked away.  All of us other grad students […]

Read more »

Why doesn’t R have a MaxDiff package?

May 28, 2013
By

Almost once every year someone asks if R has a package for running the MaxDiff procedure sold by Sawtooth.  One such inquiry recently received a reply with a link showing in some detail the R code needed to generate a balanced incomplete...

Read more »

Escalatingly uncomfortable

May 28, 2013
By

Aggressive, fizzing nonconformity. The post Escalatingly uncomfortable appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Nostalgia

May 28, 2013
By

Saw Argo the other day, was impressed by the way it was filmed in such a 70s style, sorta like that movie The Limey or an episode of the Rockford Files. I also felt nostalgia for that relatively nonviolent era. All those hostages and nobody was killed. It’s a good thing the Ayatollah didn’t have [...]The post Nostalgia appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Beatquake: the Music Listening Activity across Facebook over 90 days

May 28, 2013
By
Beatquake: the Music Listening Activity across Facebook over 90 days

Mapping Music on Facebook [facebookstories.com] by Stamen Design for Facebook shows the dynamic characteristics of the typical listening activity across Facebook. Inspired by the dynamic movement of a graphic equalizer, Beatquake maps the popularity ...

Read more »

Every One, Every Day: Media Architecture Cube Reflects Energy Usage

May 28, 2013
By
Every One, Every Day: Media Architecture Cube Reflects Energy Usage

Every One, Every Day [kuuki.com.au], designed by media production collective Kuuki, is a media architecture installation measuring 27 cubic meters that reflects the near real-time price and demand of energy in New South Wales, Australia. The cube wa...

Read more »

Python Epistemology at PyCon Taiwan

May 28, 2013
By
Python Epistemology at PyCon Taiwan

This weekend I gave a talk entitled "Python Epistemology" for PyCon Taiwan 2013.  I would have loved to be in Taipei for the talk, but sadly I was in an empty room in front of a teleconference screen.Python Epistemology: PyCon Taiwan 2013Python Ep...

Read more »

Steve Marron on “Big Data”

May 28, 2013
By
Steve Marron on “Big Data”

Steve Marron is a statistician at UNC. In his younger days he was well known for his work on nonparametric theory. These days he works on a number of interesting things including analysis of structured objects (like tree-structured data) and high dimensional theory. Steve sent me a thoughtful email the other day about “Big Data” […]

Read more »

Simplify until your fake-data check works, then add complications until you can figure out where the problem is coming from

May 28, 2013
By

I received the following email: I am trying to develop a Bayesian model to represent the process through which individual consumers make online product rating decisions. In my model each individual faces total J product options and for each product option (j) each individual (i) needs to make three sequential decisions: - First he decides [...]The post Simplify until your fake-data check works, then add complications until you can figure…

Read more »

Superimposing time series is the biggest source of silly theories

May 28, 2013
By
Superimposing time series is the biggest source of silly theories

Business Insider (link) published the following chart and declared "the end of the car age in one chart". The chart superimposed the monthly motor vehicle miles driven per capita and the labor force participation rate. This is the conclusion of...

Read more »

New heat maps in the REG procedure

May 28, 2013
By
New heat maps in the REG procedure

Has anyone noticed that the REG procedure in SAS/STAT 12.1 produces heat maps instead of scatter plots for fit plots and residual plots when the regression involves more than 5,000 observations? I wasn't aware of the change until a colleague informed me, although the change is discussed in the "Details" [...]

Read more »


Subscribe

Email:

  Subscribe