Relevance, to you or me: a response to Cairo

September 15, 2014
By
Relevance, to you or me: a response to Cairo

Alberto Cairo discussed a graphic by the New York Times on the slowing growth of Medicare spending (link). The chart on the top is published, depicting the quite dramatic flattening of the growth in average spending over the last years--average...

Read more »

My dataviz workshop starts in October

September 15, 2014
By

Course announcement is on my sister blog. This course adopts the creative writing workshop structure to teach data visualization. Think of chart making as a craft. We emphasize the importance of sketches, revisions, feedback from readers, and a critical eye. It's limited enrollment.

Read more »

1976 NBER-Census Time Series Conference

September 15, 2014
By

What a great blast from the past -- check out the program of the 1976 NBER-Census Time-Series Conference. (Thanks to Bill Wei for forwarding, via Hang Kim.)The 1976 conference was a pioneer in bridging time-series econometrics and statistics. Econ...

Read more »

Handling run-time errors in user-defined modules

September 15, 2014
By
Handling run-time errors in user-defined modules

I received the following email from a SAS/IML programmer: I am getting an error in a PROC IML module that I wrote. The SAS Log says NOTE: Paused in module NAME When I submit other commands, PROC IML doesn't seem to understand them. How can I continue the program? The […]

Read more »

How do you say π^π^π?

September 15, 2014
By
How do you say π^π^π?

Well, not that you really probably want to know how to say such an absurdly large number. However for those of you who are interested (allowing for rounding) it is:one quintillion, three hundred forty quadrillion, one hundred sixty-four trillion, one h...

Read more »

Regression with Python, pandas and StatsModels

September 14, 2014
By
Regression with Python, pandas and StatsModels

I was at Boston Data-Con 2014 this morning, which was a great event.  The organizer, John Verostek, seems to have created this three-day event single-handedly, so I am hugely impressed.Imran Malek started the day with a very nice iPython tutorial....

Read more »

One datavis for you, ten for me

September 14, 2014
By
One datavis for you, ten for me

Over the years of my graduate studies I made a lot of plots. I mean tonnes. To get an extremely conservative estimate I grep’ed for every instance of “plot\(” in all of the many R scripts I wrote over the past five years. The actual number is very likely orders of magnitude larger as 1) many […]

Read more »

If Scotland becomes a country

September 14, 2014
By
If Scotland becomes a country

On the 18th of September 2014, Scottish people will vote on secession from the United Kingdom, potentially ending a union that has existed since 1707. If Scots vote “Yes” to end the union, the United Kingdom will consist of England, Wales and Northern Ireland, while the newly created country of Scotland may look like this: […]

Read more »

Six quotes from Kaiser Fung

September 14, 2014
By

You may think you have all of the data. You don’t. One of the biggest myth of Big Data is that data alone produce complete answers. Their “data” have done no arguing; it is the humans who are making this claim. Before getting into the methodological issues, one needs to ask the most basic question. […] The post Six quotes from Kaiser Fung appeared first on Statistical Modeling, Causal Inference,…

Read more »

Trying a prefmap

September 14, 2014
By
Trying a prefmap

Preference mapping is a key technique in sensory and consumer research. It links the sensory perception on products to the liking of products and hence provides clues to the development of new, well tasting, products. Even though it is a key technique,...

Read more »

RDataMining Slides Series

September 14, 2014
By
RDataMining Slides Series

by Yanchang Zhao, RDataMining.com I have made a series of slides on R and data mining, based on my book titled R and Data Mining — Examples and Case Studies. The slides will be used at my presentations at seminars … Continue reading →

Read more »

The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

September 13, 2014
By
The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

"...a metaphor is an affair between a predicate with a past and an object that yields while protesting." Nelson Goodman (1976)It is, as if, data matrices were alive. The rows are species, and the columns are habitats. At least that seems to be the...

Read more »

The Econometrics of Temporal Aggregation – IV – Cointegration

September 13, 2014
By
The Econometrics of Temporal Aggregation – IV – Cointegration

My previous post on aggregating time series data over time dealt with some of the consequences for unit roots. The next logical thing to consider is the effect of such aggregation on cointegration, and on testing for its presence.As in the earlier disc...

Read more »

He just ordered a translation from Diederik Stapel

September 13, 2014
By
He just ordered a translation from Diederik Stapel

Fernando Martel Garcia writes: So I am applying for a DC driver’s license and needed a translation of my Spanish license to show to the DMV. I go to http://www.onehourtranslation.com/ and as I prepare to pay I see a familiar face in the bottom banner: It appears Stapel is one of their “over 15,000 dedicated […] The post He just ordered a translation from Diederik Stapel appeared first on Statistical…

Read more »

Class Announcement: 36-350, Statistical Computing, Fall 2014

September 12, 2014
By

Fourth time is charm: 36-350, Statistical Computing Instructors: Yours truly and Andrew Thomas Description: Computational data analysis is an essential part of modern statistics. Competent statisticians must not just be able to run existing program...

Read more »

Unit Root tests and Seasonally Adjusted Data

September 12, 2014
By
Unit Root tests and Seasonally Adjusted Data

We all know why it's common to "seasonally adjust" economic time series data that are recorded on a monthly, quarterly, etc. basis. Students are sometimes surprised to learn that in some countries certain such time series are reported only in seas...

Read more »

What does it take to win a major?

September 12, 2014
By
What does it take to win a major?

Prior to his epic collapse in the semis, I went to the US Open to see the G.O.A.T put on a clinic against the world's #17. In between sets, they showed a graphic on the jumbotron of each player's stats: first serve %, nets points won, unforced errors, ...

Read more »

GitHut: the Universe of Programming Languages across GitHub

September 12, 2014
By
GitHut: the Universe of Programming Languages across GitHub

GitHut [githut.info], developed by Microsoft data visualization designer Carlo Zapponi, is an interactive small multiples visualization revealing the complexity of the wide range of programming languages used across the repositories hosted on GitHub. ...

Read more »

Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014
By

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library called Parso, which also reads sas7bdat files. They have worked out most of the remaining kinks. For example, the Parso library reads sas7bdat files with compressed […]

Read more »

What is the purpose of a poem?

September 12, 2014
By

OK, let’s take a break from blogging about economics. OK, I haven’t actually been blogging so much about econ lately, but it just happens that I’m writing this on 19 July, a day after poking a stick into the hornet’s nest by posting “Differences between econometrics and statistics: From varying treatment effects to utilities, economists […] The post What is the purpose of a poem? appeared first on Statistical Modeling,…

Read more »

R: k-Means Clustering on an Image

September 12, 2014
By
R: k-Means Clustering on an Image

Enough with the theory we recently published, let's take a break and have fun on the application of Statistics used in Data Mining and Machine Learning, the k-Means Clustering.k-means clustering is a method of vector quantization, originally from signa...

Read more »

Useful for referring—9-11-2014

September 12, 2014
By
Useful for referring—9-11-2014

Some R Resources for GLMs 失联搜救中的统计数据分析 The gap between data mining and predictive models Data Mining, machine learning and statistics. useR! 2014 is underway with 16 tutorials What is Scalable Machine Learning? rlist:基于list在R中处理非关系型数据 The perfect candidate The Leek group guide to giving talks 38 Seminal Articles Every Data Scientist Should Read Deep Learning – important […]

Read more »

mysterious shiny things

September 12, 2014
By
mysterious shiny things

(Disclaimer: I’m new to Shiny, and blog posts, but I know something about geography.)  In the Shiny gallery, take a look at 2001 versus 2002. Something funny happens to Switzerland (and other European countries), in terms of the legend, it moves from Europe to the Middle East. Also, the legend color scheme switches.     […] The post mysterious shiny things appeared first on Statistical Modeling, Causal Inference, and Social…

Read more »


Subscribe

Email:

  Subscribe