data scientist position

April 7, 2014
By
data scientist position

Our newly created Chaire “Economie et gestion des nouvelles données” in Paris-Dauphine, ENS Ulm, École Polytechnique and ENSAE is recruiting a data scientist starting as early as May 1, the call remaining open till the position is filled. The location is in one of the above labs in Paris, the duration for at least one […]

Read more »

Writing good software can have more impact than publishing in high impact journals for genomic statisticians

April 7, 2014
By

Every once in a while we see computational papers published in science journals with high impact factors.  Genomics related methods appear quite often in these journals. Several of my junior colleagues express frustration that all their papers get rejected from these journals. … Continue reading →

Read more »

The Internet and religious affiliation

April 7, 2014
By
The Internet and religious affiliation

A few weeks ago I published this paper on arXiv: "Religious affiliation, education and Internet use".  Regular readers of this blog will recognize this as the article I was writing about in July 2012, including this article.A few days ago, MIT Tec...

Read more »

How literature is like statistical reasoning: Kosara on stories. Gelman and Basbøll on stories.

April 7, 2014
By

In “Story: A Definition,” visual analysis researcher Robert Kosara writes: A story ties facts together. There is a reason why this particular collection of facts is in this story, and the story gives you that reason. provides a narrative path through those facts. In other words, it guides the viewer/reader through the world, rather than just throwing […] The post How literature is like statistical reasoning: Kosara on stories. Gelman and Basbøll…

Read more »

R Continues Its Rapid Growth

April 7, 2014
By
R Continues Its Rapid Growth

I’ve just updated the section below from The Popularity of Data Analysis Software. Note that the overall article is still under construction and all the figure numbers have changed from previous versions. Growth in Capability The capability of analytics software … Continue reading →

Read more »

On deck this week

April 7, 2014
By

Mon: How literature is like statistical reasoning: Kosara on stories. Gelman and Basbøll on stories. Tues: Understanding Simpson’s paradox using a graph Wed: Advice: positive-sum, zero-sum, or negative-sum Thurs: Small multiples of lineplots > maps (ok, not always, but yes in this case) Fri: “More research from the lunatic fringe” Sat: “Schools of statistical thoughts […] The post On deck this week appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Numbersense Pros: Cathy O’Neil talks about trust in data analysis

April 7, 2014
By
Numbersense Pros: Cathy O’Neil talks about trust in data analysis

Cathy O'Neil may need no introduction to blog readers. She's the author of the hard-hitting MathBabe blog, and she shares my passion for explaining how data analysis really works. She is co-author of the recent book Doing Data Science (link), with Rachel Schutt. Cathy has a varied career spanning academia and industry, as she explains below. *** KF: How did you pick up your impressive statistical reasoning skills? CO: Thanks…

Read more »

Point Forecast Accuracy Evaluation

April 7, 2014
By

Here's a new one for your reading pleasure. Interesting history. Minchul and I went in trying to escape the expected loss minimization paradigm. We came out realizing that we hadn't escaped, but simultaneously, that not all loss functions are created e...

Read more »

Vector and matrix norms in SAS

April 7, 2014
By
Vector and matrix norms in SAS

Did you know that SAS/IML 12.1 provides built-in functions that compute the norm of a vector or matrix? A vector norm enables you to compute the length of a vector or the distance between two vectors in SAS. Matrix norms are used in numerical linear algebra to estimate the condition […]

Read more »

Story: A Definition

April 7, 2014
By
Story: A Definition

What makes a story? What does a story do? In part one of this little series, I argued that stories and worlds are not opposites, but complements. In this part, I try to explain the differences between worlds and stories, and present a definition.

Read more »

Stationarity of ARCH processes

April 7, 2014
By

In the context of AR(1) processes, we spent some time to explain what happens when  is close to 1. if  the process is stationary, if  the process is a random walk if  the process will explode Again, random walks are extremely interesting processes,...

Read more »

Jaynes’s Bayesian view of Frequencies

April 6, 2014
By

The last post together with Christian Hennig’s comment, naturally reminded me of Jaynes and his view of frequencies. After a discussion similar to my previous post, but approached in a different way and in more depth, Jaynes states (PTLOS p. 292)...

Read more »

Looking at Measles Data in Project Tycho, part II

April 6, 2014
By
Looking at Measles Data in Project Tycho, part II

Continuing from last week, I will now look at incidence rates of measles in the US. To recap, Project Tycho contains data from all weekly notifiable disease reports for the United States dating back to 1888. These data are freely available to any...

Read more »

Seven (a-day)

April 6, 2014
By
Seven (a-day)

This week (among other things, including my Vespa breaking down twice in three days) I was busy taking part in an interview panel for a research associate position, together with colleagues in the Medical School at UCL.One of the questions we...

Read more »

An old discussion of food deserts

April 6, 2014
By
An old discussion of food deserts

I happened to be reading an old comment thread from 2012 (follow the link from here) and came across this amusing exchange: Perhaps this is the paper Jonathan was talking about? Here’s more from the thread: Anyway, I don’t have anything ...

Read more »

Advocacy graphics

April 6, 2014
By
Advocacy graphics

Note: If you are here to read about Google Flu Trends, please see this roundup of the coverage. My blog is organized into two sections: the section you are on is about data visualization; the other section concerns Big Data...

Read more »

Interpreting noise

April 6, 2014
By
Interpreting noise

When watching the TV news, or reading newspaper commentary, I am frequently amazed at the attempts people make to interpret random noise. For example, the latest tiny fluctuation in the share price of a major company is attributed to the CEO being ill. When the exchange rate goes up, the TV finance commentator confidently announces that it is a reaction to Chinese building contracts. No one ever says “The unemployment…

Read more »

Who is allowed to cheat? I.J. Good and that after dinner comedy hour….

April 5, 2014
By
Who is allowed to cheat? I.J. Good and that after dinner comedy hour….

It was from my Virginia Tech colleague I.J. Good (in statistics), who died five years ago (April 5, 2009), at 93, that I learned most of what I call “howlers” on this blog. His favorites were based on the “paradoxes” of stopping rules. (I had posted this last year here.) “In conversation I have emphasized to other statisticians, […]

Read more »

Bizarre academic spam

April 5, 2014
By

I’ve been getting these sorts of emails every couple days lately: Respected Professor Gelman I am a senior undergraduate at Indian Institute of Technology Kanpur (IIT Kanpur). I am currently in the 8th Semester of my Master of Science (Integrated) in Mathematics and Scientific Computing program. I went through some of your previous work and […] The post Bizarre academic spam appeared first on Statistical Modeling, Causal Inference, and Social…

Read more »

Making inferences about unusual population quantities.

April 5, 2014
By
Making inferences about unusual population quantities.

This post builds on my last: Alternatives to model diagnostics for statistical inference?, where I had claimed that we could make quality inferences about the best linear approximation to a quadratic relationship. The R code below implements such a scenario, to establish a framework for discussion. I have inserted comments to augment the code. set.seed(42) […]

Read more »

I was going to criticize this on blog but I’m just too tired of things like this. What’s really horrible is the news article which takes all this so seriously. My problem is not with people who run regressions and post them on the web—the more the merrier, I say—but with reputable news outlets whose editors should know better

April 4, 2014
By

A friend pointed me to this monstrosity. As an MIT grad, I’d like to think that Technology Review could do better. To elaborate a bit: A one-paragraph blurb would be fine to me, you can report that someone ran some regressions on the GSS and came up with an amusing hypothesis. That’s enough, then move […] The post I was going to criticize this on blog but I’m just too…

Read more »

There’s an App for That

April 4, 2014
By
There’s an App for That

I was looking for econometrics-related "apps" for my Android tablet. Very little of interest came up for "Econometrics", but there certainly are some nice data-related apps.Here are a few examples:FRED Economic DataIndicators OnlineCentERdataAmerica's ...

Read more »

Playing With the FRB/US Model

April 4, 2014
By
Playing With the FRB/US Model

Via both Mark Thoma and Gareth Thomas, I learned about a new initiative at the Federal Reserve Board. The details are available here.In summary:"The FRB/US model of the U.S. economy is one of several that Federal Reserve Board staff consults for f...

Read more »


Subscribe

Email:

  Subscribe