Use regression for a univariate analysis? Yes!

May 13, 2013

I've conducted a lot of univariate analyses in SAS, yet I'm always surprised when the best way to carry out the analysis uses a SAS regression procedure. I always think, "This is a univariate analysis! Why am I using a regression procedure? Doesn't a regression require at least two variables?" [...]

Living it up with computational errors

May 13, 2013

How to have a better chance of a good outcome. Making mistakes: There’s been a lot of talk recently about data analysis problems with spreadsheets. If you’ve not stuck your head out of your cave lately, then you can catch some of the discussion by doing an internet search for “Reinhart Rogoff”. There are several […] The post Living it up with computational errors appeared first on Burns Statistics.

Sunday data/statistics link roundup (5/12/2013, Mother’s Day!)

May 13, 2013

A tutorial on deep learning. I really enjoyed reading it, but I'm still trying to figure out how this differs from using non-linear logistic regression to estimate features and then doing supervised prediction with those features. Or maybe I'm just naive... Rafa on …

Light entertainment: others’ idea of fun

May 13, 2013

This chart: I think it's a line chart, not an area chart.

Teaching statistical report-writing

May 12, 2013

Teaching how to write statistical reports: It is difficult to write statistical reports, and it is difficult to teach how to write them. When statistics is taught in the traditional way, with emphasis on the underlying mathematics, the process …

OpenData Latinoamerica

May 12, 2013

Miguel Paz writes: Poderomedia Foundation and PinLatam are launching OpenDataLatinoamerica.org, a regional data repository to free data and use it on Hackathons and other activities by HacksHackers chapters and other organizations. We are doing this because the road to the future of news has been littered with lost datasets. A day or so after every [...] The post OpenData Latinoamerica appeared first on Statistical Modeling, Causal Inference, and Social Science.

awalé

May 12, 2013

Following Le Monde puzzle #810, I tried to code an R program (not reproduced here) to optimise an awalé game but the recursion was too rich for R: even with a very small number of holes and seeds in the awalé… Searching on the internet, it seems the computer simulation of a winning strategy for […]

Crime novels for economists

May 12, 2013

Following up on this post by Noah Smith on economics in science fiction, Mark Palko writes on economics in crime fiction. Just as almost all science fiction is ultimately about politics, one could say that just about all crime fiction is about economics. But if I had to pick one crime novelist with an economics [...] The post Crime novels for economists appeared first on Statistical Modeling, Causal Inference, and Social…

Reshaping data

May 12, 2013

Preparing and reshaping data is the ever-continuing task of a data analyst. Luckily we have many tools for it. The default tool in R would be reshape(), although this is so user-friendly that a reshape package has been added too. I try to use reshape()...

Animations Understood

May 11, 2013

When I first saw a graphic made from Yihui’s animation package (Xie, 2013) I was amazed at the magic and thought “I could never do that”. Passage of time… One night I found myself bored and, as usual, avoiding work. …

Actually, I have no problem with this graph

May 11, 2013

Tom Salvesen asks, is this the worst info-graphic of the year? I say, no. Nobody really cares about these numbers. It’s an amusing feature. The alternative would not be a better display of these data; the alternative would be some photo or cartoon. They’re just having fun. I wouldn’t give it any design awards but [...] The post Actually, I have no problem with this graph appeared first on Statistical Modeling,…

Reproducibility and randomness

May 11, 2013

Stéphane Tufféry and I were working this week on a chapter of a book entitled Statistical Learning in Actuarial Science. The chapter should be based on R functions, and we wanted to reproduce some outputs he previously obtained with SAS. The good thing is that even complex functions (logistic regression, regression trees, etc.) produce the same kind of outputs. But we found a problem that we could not fix: generating…

Harp, Percussion, Shakespeare, AND Bayesian Data Analysis

May 10, 2013

There will be a week-long introductory course in doing Bayesian data analysis, July 15-19 (2013), at Indiana University. There are many art, music, theater and social events happening that same week, including the International Harp Competition, the IU...

Tutorials on git/github and GNU make

May 10, 2013

If you’re not using version control, you should be. Learn git. If you’re not on github, you should be. That’s real open source. To help some colleagues get started with git and github, I wrote a minimal tutorial. There are lots of git and github resources available, but I thought I’d give just the bare […]

Machine Learning – week 3

May 10, 2013

Recall that in last week's installment of Coursera's Machine Learning class, we covered linear regression - the most fundamental way of making a prediction where the outcome is a number, like predicting the price of a stock or how much rainfall (i...

Bubble sort implemented in pure R

May 10, 2013

Please note that I did this programming purely for the learning experience. The pure R bubble sort implemented in this post is veeeeery slow for two reasons: (1) interpreted code with lots of iteration is very slow; (2) bubble sort is…

The recursion of pop-econ

May 10, 2013

Dave Berri posted the following at the Freakonomics blog: The “best” picture of 2012 was Argo. At least that’s the film that won the Oscar for best picture. According to the Oscars, the decision to give this award to Argo was made by the nearly 6,000 voting members of the Academy of Motion Picture Arts [...] The post The recursion of pop-econ appeared first on Statistical Modeling, Causal Inference, and Social…

Managing Government Information as an Asset

May 10, 2013

A new Open Data Policy of the US Government says that ‘starting this week (9th May 2013) new and modernized …

If it’s called the “The High Quality Research Act,” then ….

May 10, 2013

Among the (less technical) items sent my way over the past few days are discussions of the so-called High Quality Research Act. I’d not heard of it, but it’s apparently an outgrowth of the recent hand-wringing over junk science, flawed statistics, non-replicable studies, and fraud (discussed at times on this blog). And it’s clearly a […]

Medicare Provider Charge Data – How Are Things in Spokane?

May 9, 2013

Disclosure: I am not a health care professional, just someone interested in playing with data and ways to represent them. I am not going to interpret findings here, but will make a remark or two about how to interpret them from the few graphs...

Same old same old

May 9, 2013

In an email I sent to a colleague who’s writing about lasso and Bayesian regression for R users: The one thing you might want to add, to fit with your pragmatic perspective, is to point out that these different methods are optimal under different assumptions about the data. However, these assumptions are never true (even [...] The post Same old same old appeared first on Statistical Modeling, Causal Inference, and Social…

Mapping the Pavement Quality of Streets in L.A.

L.A. Street QualGrades [latimes.com], developed by the Los Angeles Times Data Desk, maps the pavement quality rating for each of the 68,000 street segments in L.A., the largest municipal system in the US with about 6,500 miles of paved roadway street...

YouTube Trends Map: Explore the Most Popular Videos by Location

The YouTube Trends Map [youtube.com] is a visualization of the most shared and viewed videos in various regions across the United States over the last 12 to 24 hours. It accompanies the more analytical Trends Dashboard to provide a full overview of th...
