Posts Tagged ‘Data Analysis’

The distribution of colors for plain M&M candies

February 20, 2017

Many introductory courses in probability and statistics encourage students to collect and analyze real data. A popular experiment in categorical data analysis is to give students a bag of M&M® candies and ask them to estimate the proportion of colors in the population from the sample data. In some classes, [...] The post The distribution of colors for plain M&M candies appeared first on The DO Loop.

An easy way to run thousands of regressions in SAS

February 13, 2017

A common question on SAS discussion forums is how to repeat an analysis multiple times. Most programmers know that the most efficient way to analyze one model across many subsets of the data (perhaps each country or each state) is to sort the data and use a BY statement to [...] The post An easy way to run thousands of regressions in SAS appeared first on The DO Loop.

Counting is hard, especially when you don’t have theories

January 19, 2017

Exploring the data about movies, uncovering data issues

Ten posts from 2016 that deserve a second look

January 11, 2017

Last week I wrote about the 10 most popular articles from The DO Loop in 2016. The popular articles tend to be about elementary topics that appeal to a wide range of SAS programmers. Today I present an "editor's choice" list of technical articles that describe more advanced statistical methods […] The post Ten posts from 2016 that deserve a second look appeared first on The DO Loop.

Is "La Quinta" Spanish for "Next to Denny’s"?

January 6, 2017

“La Quinta” is Spanish for “next to Denny’s.” -- Mitch Hedberg, comedian

Mitch Hedberg's joke resonates with travelers who drive on the US interstate system because many highway exits feature both a La Quinta Inn™ and a Denny's® restaurant within a short distance of each other. But does a […] The post Is "La Quinta" Spanish for "Next to Denny's"? appeared first on The DO Loop.

The top 10 posts from The DO Loop in 2016

January 4, 2017

I wrote 105 posts for The DO Loop blog in 2016. My most popular articles were about data analysis, SAS programming tips, and elementary statistics. Without further ado, here are the most popular articles from 2016. Data Analysis and Visualization Start with a juicy set of data and an interesting […] The post The top 10 posts from The DO Loop in 2016 appeared first on The DO Loop.

Data Preparation, Long Form and tl;dr Form

December 26, 2016

Data preparation and cleaning are some of the most important steps of predictive analytics and data science tasks. They are laborious, they are where most of the errors are made, they are your last line of defense against wild data, and they hold the biggest opportunities for outcome improvement. No matter how much time you spend on them, they … Continue reading Data Preparation, Long Form and tl;dr Form

Discover power laws by log-transforming data

December 5, 2016

A recent issue of Astronomy magazine mentioned Kepler's third law of planetary motion, which states "the square of a planet's orbital period is proportional to the cube of its average distance from the Sun" (Astronomy, Dec 2016, p. 17). The article included a graph (shown at the right) that shows […] The post Discover power laws by log-transforming data appeared first on The DO Loop.

Visualize the ages of US presidents

November 14, 2016

Who was the oldest person elected president of the United States? How about the youngest? Who was the oldest when he left office? Let's look at some data. Wikipedia has a page that presents a table of the presidents of the US by age. It lists the dates for which […] The post Visualize the ages of US presidents appeared first on The DO Loop.

Sometimes you need to reverse the data before you fit a distribution

November 2, 2016

Occasionally on a discussion forum, a statistical programmer will ask a question like the following: "I am trying to fit a parametric distribution to my data. The sample has a long tail, so I have tried the lognormal, Weibull, and gamma distributions, but nothing seems to fit. Please help!!" In […] The post Sometimes you need to reverse the data before you fit a distribution appeared first on The DO…
