There are no easy charts

February 5, 2015

Every chart, even if the dataset is small, deserves care. Long-time reader zbicyclist submits the following, which illustrates this point well. The following comments are by zbicyclist: This is from http://win.niddk.nih.gov/statistics/ -- from the National Institute of Diabetes and Kidney...

Read more »

Free online data mining and machine learning courses by Stanford University

February 5, 2015

by Yanchang Zhao, RDataMining.com. Three free online data mining and machine learning courses taught by Stanford University professors started in the past two weeks, offering excellent opportunities to learn advanced data mining and machine learning techniques. If you are … Continue reading →

Read more »

Four Different Types of Regression Residuals

February 4, 2015

When we estimate a regression model, the differences between the actual and "predicted" values for the dependent variable (over the sample) are termed the "residuals". Specifically, if the model is of the form:            ...

Read more »

Knowledge units – the atoms of statistical education

February 4, 2015

Editor's note: This idea is Brian's idea and based on conversations with him and Roger, but I just executed it. The length of academic courses has traditionally ranged between a few days for a short course to a few months for a semester-long course.  Lectures are typically either 30 minutes or one hour. Term and lecture lengths

Read more »

Microbial Genomics: the State of the Art in 2015

February 4, 2015

Current Opinion in Microbiology recently published a special issue in genomics. In an excellent editorial overview, “Genomics: The era of genomically-enabled microbiology”, Neil Hall and Jay Hinton give an overview of the state of the field in micr...

Read more »

Mark Twain (4) vs. L. Ron Hubbard

February 4, 2015

OK, first the result from yesterday’s contest, Plato (1) vs. Henny Youngman. This one was surprisingly close. Youngman got the most votes, but I gotta go with the philosopher-king. The arguments that swayed me were X’s point that Plato could do an entire talk by projecting shadows on the wall, and, especially, Keith’s connection to […] The post Mark Twain (4) vs. L. Ron Hubbard appeared first on Statistical Modeling,…

Read more »

Link: Becksploitation: The Over-Use of a Cartographic Icon

February 4, 2015

The paper Becksploitation: The Over-Use of a Cartographic Icon by Kenneth Field and William Cartwright (free pre-print PDF) in The Cartographic Journal describes Harry Beck’s famous map of the London Underground and what makes it great. It also offers a collection of misuses of the superficial structure, and critiques them. I wish we’d had papers (and titles!) … Continue reading Link: Becksploitation: The Over-Use of a Cartographic Icon

Read more »

Miscellaneous math resources

February 4, 2015

Every Wednesday I’ve been pointing out various resources on my web site. So far they’ve all been web pages, but the following are all PDF files.

Probability and statistics:
How to test a random number generator
Predictive probabilities for normal outcomes
Predictive probability interim analysis
Relating two definitions of expectation
Illustrating the error in the […]

Read more »

The plagiarist next door

February 4, 2015

In a comment on this chess-related post, Matt Gaffney pointed me to this wonderful page full of chess curiosities by Tim Krabbé. My nederlands is not what it used to be, but Krabbé has posted lots of material in English so that’s no problem. I started reading his “Open chess diary” (i.e., blog), it’s updated […] The post The plagiarist next door appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Specify the order of variables at run time in SAS

February 4, 2015

In SAS, the order of variables in a data set is usually unimportant. However, occasionally SAS programmers need to reorder the variables in order to make a special graph or to simplify a computation. Reordering variables in the DATA step is slightly tricky. There are Knowledge Base articles about how […]

Read more »

Standard error: a poem

February 4, 2015

This poem was written by David Goddard from the Monash University Department of Epidemiology and Preventive Medicine. It is reproduced here with his permission. The poem won the inaugural Monash University poetry competition and will soon be published in an anthology of contemporary poetry. For those who like this sort of thing (as I do), there is a […]

Read more »

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

Introduction One feature that I like about R is the ability to access and manipulate the outputs of many functions.  For example, you can extract the kernel density estimates from density() and scale them to ensure that the resulting density integrates to 1 over its support set. I recently needed to get a frequency table of […]

Read more »

Base R Assessment!

February 3, 2015

Test your skills with this R-powered assessment of base R knowledge! Built using the R-powered adaptive testing platform Concerto, this assessment provides a short but powerful tool for evaluating your base R understanding relative to that of your pe...

Read more »

Plato (1) vs. Henny Youngman

February 3, 2015

Here it is, our very first matchup! The Philosopher-King vs. the King of One-Liners. Plato’s got the fame, the staying power, and the #1 seed in his bracket. On the other hand, Henny knew how to hustle. Here’s Roger Ebert, as quoted on Youngman’s Wikipedia page: I once observed Henny Youngman taping a TV show […] The post Plato (1) vs. Henny Youngman appeared first on Statistical Modeling, Causal Inference,…

Read more »

R + ggplot2 Graph Catalog

February 3, 2015

Joanna Zhao’s and Jenny Bryan’s R graph catalog is meant to be a complement to the physical book, Creating More Effective Graphs, but it’s a really nice gallery in its own right. The catalog shows a series of different data visualizations, all ma...

Read more »

Canberra IAPA Seminar – Text Analytics: Natural Language into Big Data – 17 February

February 3, 2015

Topic: Text Analytics: Natural Language into Big Data
Speaker: Dr. Leif Hanlen, Technology Director at NICTA
Date: Tuesday 17 February
Time: 5.30pm for a 6pm start
Cost: Nil
Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600
Registration URL: http://www.iapa.org.au/Event/TextAnalyticsNaturalLanguageIntoBigData … Continue reading →

Read more »

BayesFactor version 0.9.10 released to CRAN

February 3, 2015

If you're running Solaris (yes, all zero of you) you'll have to wait for version 0.9.10-1, due to a small issue preventing the Solaris package from building on CRAN. The rest of you can pick up the updated version on CRAN today. See below the fold for c...

Read more »

R in Insurance 2015: Registration Opened

February 3, 2015

The registration for the third conference on R in Insurance on Monday 29 June 2015 at the University of Amsterdam has opened. This one-day conference will focus again on applications in insurance and actuarial science that use R, the lingua franca for ...

Read more »

Spelling Things Out

February 3, 2015

When visualizing data, we often strive for efficiency: show the data, nothing else. But there can be tremendous value in redundancy to make a point and drive it home. Two recent examples from news graphics illustrate this nicely. The first is this animated chart of global temperatures from 1881 to 2014. It shows more data … Continue reading Spelling Things Out

Read more »

Total survey error

February 3, 2015

Erez Shalom writes: It’s election time in Israel and every week several surveys come out trying to predict the ‘mandates’ that each party will get (out of a total of 120). These surveys are historically flakey, and no one takes the ‘sampling error’ they come with seriously, but no one has a good idea of […] The post Total survey error appeared first on Statistical Modeling, Causal Inference, and Social…

Read more »

