James Joyce (3) vs. Mary Baker Eddy

February 5, 2015
By
James Joyce (3) vs. Mary Baker Eddy

Yesterday’s winner will come as no surprise to you. My favorite argument in favor of L. Ron Hubbard came from Jameson: “We know that Hubbard has what it takes in terms of cheating things. Specifically, he’d be willing to sell his own religion down the river (ahem) for an extra hour of life. Thus, we […] The post James Joyce (3) vs. Mary Baker Eddy appeared first on Statistical Modeling,…

Read more »

Johns Hopkins Data Science Specialization Top Performers

February 5, 2015
By
Johns Hopkins Data Science Specialization Top Performers

Editor's note: The Johns Hopkins Data Science Specialization is the largest data science program in the world.  Brian, Roger, and myself  conceived the program at the beginning of January 2014 , then built, recorded, and launched the classes starting in April 2014 with the help of Ira.  Since April 2014 we have enrolled 1.76 million student and

Read more »

Early data on knowledge units – atoms of statistical education

February 5, 2015
By
Early data on knowledge units – atoms of statistical education

Yesterday I posted about atomizing statistical education into knowledge units. You can try out the first knowledge unit here: https://jtleek.typeform.com/to/jMPZQe. The early data is in and it is consistent with many of our hypotheses about the future of online education. Namely: Completion rates are high when segments are shorter You can learn something about statistics in

Read more »

Why I keep talking about “generalizing from sample to population”

February 5, 2015
By

Someone publishes some claim, some statistical comparison with “p less than .05″ attached to it. My response is: OK, you see this pattern in the sample. Do you think it holds in the population? Why do I ask this? Why don’t I ask the more standard question: Do you really think this result is statistically […] The post Why I keep talking about “generalizing from sample to population” appeared first…

Read more »

There are no easy charts

February 5, 2015
By
There are no easy charts

Every chart, even if the dataset is small, deserves care. Long-time reader zbicyclist submits the following, which illustrates this point well. The following comments are by zbicyclist: This is from http://win.niddk.nih.gov/statistics/ -- from the National Institute of Diabetes and Kidney...

Read more »

Free online data mining and machine learning courses by Stanford University

February 5, 2015
By
Free online data mining and machine learning courses by Stanford University

by Yanchang Zhao, RDataMining.com Three free online data mining and machine learning courses lectured by professors at Stanford University started in past two weeks, which provide excellent opportunities to learn advanced data mining and machine learning techniques. If you are … Continue reading →

Read more »

Four Different Types of Regression Residuals

February 4, 2015
By
Four Different Types of Regression Residuals

When we estimate a regression model, the differences between the actual and "predicted" values for the dependent variable (over the sample) are termed the "residuals". Specifically, if the model is of the form:            ...

Read more »

Four Different Types of Regression Residuals

February 4, 2015
By
Four Different Types of Regression Residuals

When we estimate a regression model, the differences between the actual and "predicted" values for the dependent variable (over the sample) are termed the "residuals". Specifically, if the model is of the form:            ...

Read more »

Knowledge units – the atoms of statistical education

February 4, 2015
By

Editor's note: This idea is Brian's idea and based on conversations with him and Roger, but I just executed it. The length of academic courses has traditionally ranged between a few days for a short course to a few months for a semester-long course.  Lectures are typically either 30 minutes or one hour. Term and lecture lengths

Read more »

Microbial Genomics: the State of the Art in 2015

February 4, 2015
By
Microbial Genomics: the State of the Art in 2015

Current Opinion in Microbiology recently published a special issue in genomics. In an excellent editorial overview, “Genomics: The era of genomically-enabled microbiology”, Neil Hall and Jay Hinton give an overview of the state of the field in micr...

Read more »

Microbial Genomics: the State of the Art in 2015

February 4, 2015
By
Microbial Genomics: the State of the Art in 2015

Current Opinion in Microbiology recently published a special issue in genomics. In an excellent editorial overview, “Genomics: The era of genomically-enabled microbiology”, Neil Hall and Jay Hinton give an overview of the state of the field in micr...

Read more »

Mark Twain (4) vs. L. Ron Hubbard

February 4, 2015
By
Mark Twain (4) vs. L. Ron Hubbard

OK, first the result from yesterday’s contest, Plato (1) vs. Henny Youngman. This one was surprisingly close. Youngman got the most votes, but I gotta go with the philosopher-king. The arguments that swayed me were X’s point that Plato could do an entire talk by projecting shadows on the wall, and, especially, Keith’s connection to […] The post Mark Twain (4) vs. L. Ron Hubbard appeared first on Statistical Modeling,…

Read more »

Link: Becksploitation: The Over-Use of a Cartographic Icon

February 4, 2015
By
Link: Becksploitation: The Over-Use of a Cartographic Icon

The paper Becksploitation: The Over-Use of a Cartographic Icon by Kenneth Field and William Cartwright (free pre-print PDF) in The Cartographic Journal describes the Harry Beck’s famous map of the London Underground and what makes it great. It also offers a collection of misuses of the superficial structure, and critiques them. I wish we’d had papers (and titles!) … Continue reading Link: Becksploitation: The Over-Use of a Cartographic Icon

Read more »

Miscellaneous math resources

February 4, 2015
By

Every Wednesday I’ve been pointing out various resources on my web site. So far they’ve all been web pages, but the following are all PDF files. Probability and statistics: How to test a random number generator Predictive probabilities for normal outcomes Predictive probability interim analysis Relating two definitions of expectation Illustrating the error in the […]

Read more »

The plagiarist next door

February 4, 2015
By

In a comment on this chess-related post, Matt Gaffney pointed me to this wonderful page full of chess curiosities by Tim Krabbé. My nederlands is not what it used to be, but Krabbé has posted lots of material in English so that’s no problem. I started reading his “Open chess diary” (i.e., blog), it’s updated […] The post The plagiarist next door appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Specify the order of variables at run time in SAS

February 4, 2015
By
Specify the order of variables at run time in SAS

In SAS, the order of variables in a data set is usually unimportant. However, occasionally SAS programmers need to reorder the variables in order to make a special graph or to simplify a computation. Reordering variables in the DATA step is slightly tricky. There are Knowledge Base articles about how […]

Read more »

Standard error: a poem

February 4, 2015
By
Standard error: a poem

This poem was written by David Goddard from the Monash University Department of Epidemiology and Preventive Medicine. It is reproduced here with his permission. The poem won the inaugural Monash University poetry competition and will soon be published in an anthology of contemporary poetry. For those who like this sort of thing (as I do), there is a […]

Read more »

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

Introduction One feature that I like about R is the ability to access and manipulate the outputs of many functions.  For example, you can extract the kernel density estimates from density() and scale them to ensure that the resulting density integrates to 1 over its support set. I recently needed to get a frequency table of […]

Read more »

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

Introduction One feature that I like about R is the ability to access and manipulate the outputs of many functions.  For example, you can extract the kernel density estimates from density() and scale them to ensure that the resulting density integrates to 1 over its support set. I recently needed to get a frequency table of […]

Read more »

Base R Assessment!

February 3, 2015
By
Base R Assessment!

Test your skills with this R-powered R assessment of base R knowledge! Built using the R powered adaptive testing platform Concerto, this assessment provides a short but powerful tool at evaluating your base R understanding relative to that of your pe...

Read more »

Plato (1) vs. Henny Youngman

February 3, 2015
By
Plato (1) vs. Henny Youngman

Here it is, our very first matchup! The Philosopher-King vs. the King of One-Liners. Plato’s got the fame, the staying power, and the #1 seed in his bracket. On the other hand, Henny knew how to hustle. Here’s Roger Ebert, as quoted on Youngman’s Wikipedia page: I once observed Henny Youngman taping a TV show […] The post Plato (1) vs. Henny Youngman appeared first on Statistical Modeling, Causal Inference,…

Read more »

R + ggplot2 Graph Catalog

February 3, 2015
By
R + ggplot2 Graph Catalog

Joanna Zhao’s and Jenny Bryan’s R graph catalog is meant to be a complement to the physical book, Creating More Effective Graphs, but it’s a really nice gallery in its own right. The catalog shows a series of different data visualizations, all ma...

Read more »

Canberra IAPA Seminar – Text Analytics: Natural Language into Big Data – 17 February

February 3, 2015
By
Canberra IAPA Seminar – Text Analytics: Natural Language into Big Data – 17 February

Topic: Text Analytics: Natural Language into Big Data Speaker: Dr. Leif Hanlen, Technology Director at NICTA Date: Tuesday 17 February Time: 5.30pm for a 6pm start Cost: Nil Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600 Registration URL: http://www.iapa.org.au/Event/TextAnalyticsNaturalLanguageIntoBigData … Continue reading →

Read more »


Subscribe

Email:

  Subscribe