Posts Tagged ‘ Data Analysis ’

Aborting a SAS/IML program upon encountering an error

March 3, 2014
By
Aborting a SAS/IML program upon encountering an error

A colleague sent me an interesting question: What is the best way to abort a SAS/IML program? For example, you might want to abort a program if the data is singular or does not contain a sufficient number of observations or variables. As a first attempt would be to try [...]

Read more »

The Statistics behind “Verification by Multiplicity”

March 2, 2014
By
The Statistics behind “Verification by Multiplicity”

There’s a new post up at the ninazumel.com blog that looks at the statistics of “verification by multiplicity” — the statistical technique that is behind NASA’s announcement of 715 new planets that have been validated in the data from the Kepler Space Telescope. We normally don’t write about science here at Win-Vector, but we do […] Related posts: “I don’t think that means what you think it means;” Statistics to…

Read more »

How to automatically select a smooth curve for a scatter plot in SAS

February 26, 2014
By
How to automatically select a smooth curve for a scatter plot in SAS

My last blog post described three ways to add a smoothing spline to a scatter plot in SAS. I ended the post with a cautionary note: From a statistical point of view, the smoothing spline is less than ideal because the smoothing parameter must be chosen manually by the user. [...]

Read more »

13 popular articles from 2013

January 7, 2014
By
13 popular articles from 2013

In 2013 I published 110 blog posts. Some of these articles were more popular than others, often because they were linked to from a SAS newsletter such as the SAS Statistics and Operations Research News. In no particular order, here are some of my most popular posts from 2013, organized [...]

Read more »

How to specify mosaic plot colors in SAS

November 6, 2013
By
How to specify mosaic plot colors in SAS

The mosaic plot is a graphical visualization of a frequency table. In a previous post, I showed how to use the FREQ procedure to create a mosaic plot. This article shows how to create a mosaic plot by using the MOSAICPARM statement in the graph template language (GTL). (The MOSAICPARM [...]

Read more »

Create mosaic plots in SAS by using PROC FREQ

November 4, 2013
By
Create mosaic plots in SAS by using PROC FREQ

Mosaic plots (Hartigan and Kleiner, 1981; Friendly, 1994, JASA) are used for exploratory data analysis of categorical data. Mosaic plots have been available for decades in SAS products such as JMP, SAS/INSIGHT, and SAS/IML Studio. However, not all SAS customers have access to these specialized products, so I am pleased [...]

Read more »

How to order categories in a two-way table with PROC FREQ

October 28, 2013
By
How to order categories in a two-way table with PROC FREQ

If you've ever tried to use PROC FREQ to create a frequency table of two character variables, you know that by default the categories for each variable are displayed in alphabetical order. A different order is sometimes more useful. For example, consider the following two-way table for the smoking status [...]

Read more »

The joy of data analysis

October 24, 2013
By
The joy of data analysis

Music and snow. Poke my eyes out Perhaps your immediate response is: “I’d rather poke my eyes out with a burning stick than do data analysis.” There’s a completely different reaction from a lot of people who have experienced data analysis. Music It’s not entirely clear why humans like music so much. Part of it […] The post The joy of data analysis appeared first on Burns Statistics.

Read more »

Output percentiles of multiple variables in a tabular format

October 23, 2013
By
Output percentiles of multiple variables in a tabular format

A challenge for statistical programmers is getting data into the right form for analysis. For graphing or analyzing data, sometimes the "wide format" (each subject is represented by one row and many variables) is required, but other times the "long format" (observations for each subject span multiple rows) is more [...]

Read more »

machine learning [book review]

October 20, 2013
By
machine learning [book review]

I have to admit the rather embarrassing fact that Machine Learning, A probabilistic perspective by Kevin P. Murphy is the first machine learning book I really read in detail…! It is a massive book with close to 1,100 pages and I thus hesitated taking it with me around, until I grabbed it in my bag […]

Read more »


Subscribe

Email:

  Subscribe