An Economist’s Guide to Visualizing Data

March 13, 2014

Stephen Jenkins wrote: I was thinking that you and your blog readers might be interested in “An Economist’s Guide to Visualizing Data” by Jonathan Schwabish, in the most recent Journal of Economic Perspectives (which is the American Economic Association’s main “outreach” journal in some ways). I replied: Ooh, I hate this so much! This seems […] The post An Economist’s Guide to Visualizing Data appeared first on Statistical Modeling, Causal Inference,…

Read more »

Setting the right priority

March 13, 2014

On the sister blog, I wrote about a new report on the music industry lamenting that the hype over "Long Tail" retail has not really helped small artists (as a group). This was a tip sent by reader Patrick S....

Read more »

Get empowered to detect power howlers

March 13, 2014

If a test’s power to detect µ’ is low then a statistically significant result is good/lousy evidence of discrepancy µ’? Which is it? If your smoke alarm has little capability of triggering unless your house is fully ablaze, then if it has triggered, is that a strong or weak indication of a fire? Compare this insensitive […]

Read more »
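The power question in the excerpt above can be made concrete with a small calculation. The sketch below assumes a one-sided z-test of H0: mu <= 0 with known standard deviation; the function name, sample size, and alternative values are hypothetical choices for illustration and are not taken from the post.

```python
from statistics import NormalDist


def power_one_sided_z(mu_alt, sigma, n, alpha=0.05, mu_null=0.0):
    """Power of a one-sided z-test of H0: mu <= mu_null at the alternative mu_alt.

    Assumes normally distributed data with known sigma (an illustrative setup).
    """
    std_norm = NormalDist()
    # Reject H0 when the z statistic exceeds this critical value.
    z_crit = std_norm.inv_cdf(1 - alpha)
    # Under mu_alt the z statistic is centred at this shift.
    shift = (mu_alt - mu_null) * n ** 0.5 / sigma
    return 1 - std_norm.cdf(z_crit - shift)


# Hypothetical discrepancies mu' for a sample of n = 25 with sigma = 1.
for mu_alt in (0.0, 0.1, 0.5, 1.0):
    print(mu_alt, round(power_one_sided_z(mu_alt, sigma=1.0, n=25), 3))
```

At mu' equal to the null value the power reduces to alpha, and it rises toward 1 as mu' moves away from the null, which is the quantity the smoke-alarm analogy asks the reader to reason about.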

Canada et al

March 13, 2014

1. Today was the first day of our course on Bayesian methods in health economics. After my lecture introducing health economics, Chris gave two lectures on Bayesian methods and their implementation in BUGS, and then Richard talked about Baye...

Read more »

Testing for trend in ARIMA models

March 12, 2014

Today’s email brought this one: I was wondering if I could get your opinion on a particular problem that I have run into during the reviewing process of an article. Basically, I have an analysis where I am looking at a couple of time series and I wanted to know whether, over time, there was an upward trend in the series. Inspection of the raw data suggests there is, but we…

Read more »

Software Carpentry at UVA, Redux

March 12, 2014

Software Carpentry is an international collaboration backed by Mozilla and the Sloan Foundation comprising a team of volunteers that teach computational competence and basic programming skills to scientists. In addition to a suite of online lessons, ...

Read more »

Python: Numerical Descriptions of the Data

March 12, 2014

We are going to explore the basics of statistics using Python. We'll go through the following: importing the data; applying summary statistics; other measures of variability (variance and coefficient of variation); other measures of position (percentile a...

Read more »
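The measures listed in the excerpt above can be sketched with Python's standard `statistics` module. This is a minimal illustration, not the linked post's code: the `describe` helper and the sample values are hypothetical, and the data are typed in directly rather than imported from a file.

```python
import statistics


def describe(values):
    """Return a few numerical descriptions of a list of numbers."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1 denominator)
    return {
        "mean": mean,
        "median": statistics.median(values),
        "variance": statistics.variance(values),  # sample variance
        "cv": sd / mean,  # coefficient of variation, assumes mean != 0
        "quartiles": statistics.quantiles(values, n=4),  # three cut points
    }


# Hypothetical data standing in for an imported column.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
for name, value in describe(sample).items():
    print(name, value)
```

`statistics.quantiles` uses the "exclusive" method by default, so its cut points can differ slightly from percentile functions in other packages.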

More on publishing in journals

March 12, 2014

I’m postponing today’s scheduled post (“Empirical implications of Empirical Implications of Theoretical Models”) to continue the lively discussion from yesterday, What if I were to stop publishing in journals?. An example: my papers with Basbøll. Thomas Basbøll and I got into a long discussion on our blogs about business school professor Karl Weick and other […] The post More on publishing in journals appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Oh no, the Leekasso….

March 12, 2014

An astute reader (Niels Hansen, who is visiting our department today) caught a bug in my code on Github for the Leekasso. I had: lm1 = lm(y ~ leekX); predict.lm(lm1, as.data.frame(leekX2)). Unfortunately, this meant that I was getting predictions for the … Continue reading →

Read more »

Reality check on the long tail

March 12, 2014

Some time ago, there was a lot of hype about how new tech will demolish the superstar effect in entertainment sales because all the little titles in the long tail will be exposed to consumers. I recall Amazon being labeled the shiny example of a company that made profits off the long tail (as opposed to the boring top of the distribution). I still remember this graphic from Wired (link):…

Read more »

Unit root tests and ARIMA models

March 12, 2014

An email I received today: I have a small problem. I have a time series called x: - If I use the default values of auto.arima(x), the best model is an ARIMA(1,0,0). - However, I tried the functions ndiffs(x, test="adf") and ndiffs(x, test="kpss"), as the KPSS test seems to be the default value, and the number of differences is 0 for the KPSS test (consistent with the results of…

Read more »

Optimizing a function of an integral

March 12, 2014

Last week I showed how to find parameters that maximize the integral of a certain probability density function (PDF). Because the function was a PDF, I could evaluate the integral by calling the CDF function in SAS. (Recall that the cumulative distribution function (CDF) is the integral of a PDF.) [...]

Read more »
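The relationship recalled in the excerpt above, that the CDF is the integral of the PDF, can be checked numerically. The linked post works in SAS; the sketch below swaps in Python and the standard normal distribution purely for illustration, and the helper name, lower limit of -8, and step count are assumptions.

```python
import math
from statistics import NormalDist


def normal_pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)


def integrate_pdf(upper, lower=-8.0, steps=10_000):
    """Trapezoid-rule integral of the standard normal PDF from lower to upper.

    The lower limit -8 stands in for minus infinity; the tail mass below it
    is negligible at this precision.
    """
    h = (upper - lower) / steps
    total = 0.5 * (normal_pdf(lower) + normal_pdf(upper))
    for i in range(1, steps):
        total += normal_pdf(lower + i * h)
    return total * h


x = 1.0
print("numerical integral:", integrate_pdf(x))
print("closed-form CDF:   ", NormalDist().cdf(x))
```

When a closed-form CDF is available, calling it directly is both faster and more accurate than numerical integration, which is the shortcut the excerpt describes.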

Heuristics in Analytics

March 12, 2014

Last week, a book -- a real, hard-cover paper-paged book -- arrived in the mail with the title: Heuristics in Analytics: A Practical Perspective of What Influences Our Analytic World. The book wasn't a total surprise, because I had re...

Read more »

where did the normalising constants go?! [part 2]

March 11, 2014

Coming (swiftly and smoothly) back home after this wonderful and intense week in Banff, I hugged my loved ones, quickly unpacked, ran a washing machine, and then sat down to check where and how my reasoning was wrong. To start with, I experimented with a toy example in R: and (of course!) it produced the […]

Read more »

The myth of the myth of the myth of the hot hand

March 11, 2014

Phil pointed me to this paper so I thought I probably better repeat what I wrote a couple years ago: 1. The effects are certainly not zero. We are not machines, and anything that can affect our expectations (for example, our success in previous tries) should affect our performance. 2. The effects I’ve seen are […] The post The myth of the myth of the myth of the hot hand appeared…

Read more »

Less wordy R

March 11, 2014

The Swarm Lab presents a nice comparison of R and Python code for a simple (read ‘one could do it in Excel’) problem. The example works, but I was surprised by how wordy the R code was and decided to check if one could easily produce a shorter version. The beginning is pretty much the […]

Read more »

HereHere: Mapping the Concerns of NY Citizens as an Iconographic Map

March 11, 2014

Here Here [herehere.co], developed by Future Social Experiences (FuSE) Labs at Microsoft Research, expresses neighborhood-specific public data by mapping it as text labels and cartoon-like iconography. The data is based on New York City's 311 non-e...

Read more »

Sorting: Understanding How Famous Sorting Algorithms Work

March 11, 2014

There are quite a few visualizations of sorting algorithms out there, such as at sorting-algorithms.com and sortvis.org. "Sorting" [sorting.at], developed by Nokia data visualization designer Carlo Zapponi, brings some innovation to this field by tack...

Read more »

SAS, SPSS, Stata Users: Learn R from Home April 21

March 11, 2014

Has learning R been driving you a bit crazy? If so, it may be that you’re “lost in translation.” On April 21 and 23, I’ll be teaching a webinar, R for SAS, SPSS and Stata Users. With each R concept, … Continue reading →

Read more »

What if I were to stop publishing in journals?

March 11, 2014

In our recent discussion of modes of publication, Joseph Wilson wrote, “The single best reform science can make right now is to decouple publication from career advancement, thereby reducing the number of publications by an order of magnitude and then move to an entirely disjointed, informal, online free-for-all communication system for research results.” My first […] The post What if I were to stop publishing in journals? appeared first on Statistical…

Read more »

googleVis code development moved to GitHub

March 11, 2014

After nearly 4 years of developing googleVis on Google Code with SVN we decided to move to GitHub. The main reason was that Google stopped the facility of hosting pre-CRAN builds of the package for user testing. The devtools package on the other hand m...

Read more »

Machine Learning Lesson of the Day – Introduction to Linear Basis Function Models

Given a supervised learning problem of using inputs to predict a continuous target, the simplest model to use would be linear regression. However, what if we know that the relationship between the inputs and the target is non-linear, but we are unsure of exactly what form this relationship has? One way to overcome […]

Read more »
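A linear basis function model, as named in the title above, stays linear in its weights while passing the inputs through fixed non-linear functions. The sketch below is a minimal illustration under stated assumptions: it uses a polynomial basis, ordinary least squares via the normal equations, and noise-free hypothetical data. None of these choices are taken from the linked lesson.

```python
def polynomial_basis(x, degree):
    """Basis functions 1, x, x^2, ..., x^degree evaluated at x."""
    return [x ** j for j in range(degree + 1)]


def solve_linear_system(a, b):
    """Solve a @ w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * w[j] for j in range(i + 1, n))
        w[i] = (aug[i][n] - tail) / aug[i][i]
    return w


def fit_basis_model(xs, ys, degree):
    """Least-squares weights from the normal equations (Phi^T Phi) w = Phi^T y."""
    phi = [polynomial_basis(x, degree) for x in xs]
    m = degree + 1
    gram = [[sum(row[i] * row[j] for row in phi) for j in range(m)] for i in range(m)]
    rhs = [sum(row[i] * y for row, y in zip(phi, ys)) for i in range(m)]
    return solve_linear_system(gram, rhs)


def predict(weights, x):
    """Prediction is a weighted sum of the basis functions at x."""
    basis = polynomial_basis(x, len(weights) - 1)
    return sum(w * b for w, b in zip(weights, basis))


# Hypothetical noise-free data generated from a quadratic.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
ys = [1.0 + 2.0 * x - 0.5 * x ** 2 for x in xs]
weights = fit_basis_model(xs, ys, degree=2)
print([round(w, 6) for w in weights])
```

Because the model is linear in the weights, the same fitting code works for other bases (for example Gaussian or sigmoidal functions) by replacing `polynomial_basis`.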

Capturing Intraday data

March 11, 2014

I want to follow up the Intraday data post with an example of how you can capture Intraday data without too much effort by recording 1 minute snapshots of the market. I will take market snapshots from Yahoo Finance using the following function that downloads delayed market quotes with date and time stamps: Next we can […]

Read more »

