Posts Tagged ‘ Data Analysis ’

No major hurricanes have hit the US coast recently. Lucky us!

May 14, 2015
By
No major hurricanes have hit the US coast recently. Lucky us!

Perhaps you saw the headlines earlier this week about the fact that it has been nine years since the last major hurricane (category 3, 4, or 5) hit the US coast. According to a post on the GeoSpace blog, which is published by the American Geophysical Union (AGU), researchers ran […] The post No major hurricanes have hit the US coast recently. Lucky us! appeared first on The DO Loop.

Read more »

Create and use a permutation matrix in SAS

April 29, 2015
By
Create and use a permutation matrix in SAS

Suppose that you compute the correlation matrix (call it R1) for a set of variables x1, x2, ..., x8. For some reason, you later want to compute the correlation matrix for the variables in a different order, maybe x2, x1, x7,..., x6. Do you need to go back to the […] The post Create and use a permutation matrix in SAS appeared first on The DO Loop.

Read more »

Finding observations that match a target value

March 18, 2015
By
Finding observations that match a target value

Imagine that you have one million rows of numerical data and you want to determine if a particular "target" value occurs. How might you find where the value occurs? For univariate data, this is an easy problem. In the SAS DATA step you can use a WHERE clause or a […]

Read more »

Analyzing the first 10 million digits of pi: Randomness within structure

March 12, 2015
By
Analyzing the first 10 million digits of pi: Randomness within structure

Saturday, March 14, 2015, is Pi Day, and this year is a super-special Pi Day! This is your once-in-a-lifetime chance to celebrate the first 10 digits of pi (π) by doing something special on 3/14/15 at 9:26:53. Apologies to my European friends, but Pi Day requires that you represent dates […] The post Analyzing the first 10 million digits of pi: Randomness within structure appeared first on The DO Loop.

Read more »

Plotting multiple time series in SAS/IML (Wide to Long, Part 2)

February 27, 2015
By
Plotting multiple time series in SAS/IML (Wide to Long, Part 2)

I recently wrote about how to overlay multiple curves on a single graph by reshaping wide data (with many variables) into long data (with a grouping variable). The implementation used PROC TRANSPOSE, which is a procedure in Base SAS. When you program in the SAS/IML language, you might encounter data […]

Read more »

Plotting multiple series: Transforming data from wide to long

February 25, 2015
By
Plotting multiple series: Transforming data from wide to long

Data. To a statistician, data are the observed values. To a SAS programmer, analyzing data requires knowledge of the values and how the data are arranged in a data set. Sometimes the data are in a "wide form" in which there are many variables. However, to perform a certain analysis […]

Read more »

The advantages of using count() to get N-way frequency tables as data frames in R

The advantages of using count() to get N-way frequency tables as data frames in R

Introduction I recently introduced how to use the count() function in the “plyr” package in R to produce 1-way frequency tables in R.  Several commenters provided alternative ways of doing so, and they are all appreciated.  Today, I want to extend that tutorial by demonstrating how count() can be used to produce N-way frequency tables […]

Read more »

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

How to Get the Frequency Table of a Categorical Variable as a Data Frame in R

Introduction One feature that I like about R is the ability to access and manipulate the outputs of many functions.  For example, you can extract the kernel density estimates from density() and scale them to ensure that the resulting density integrates to 1 over its support set. I recently needed to get a frequency table of […]

Read more »

Popular posts from The DO Loop in 2014

January 2, 2015
By
Popular posts from The DO Loop in 2014

I published 118 blog posts in 2014. This article presents my most popular posts from 2014 and late 2013. 2014 will always be a special year for me because it was the year that the SAS University Edition was launched. The University Edition means that SAS/IML is available to all […]

Read more »

Exploratory Data Analysis – All Blog Posts on The Chemical Statistician

Exploratory Data Analysis – All Blog Posts on The Chemical Statistician

This series of posts introduced various methods of exploratory data analysis, providing theoretical backgrounds and practical examples.  Fully commented and readily usable R scripts are available for all topics for you to copy and paste for your own analysis!  Most of these posts involve data visualization and plotting, and I include a lot of detail and […]

Read more »


Subscribe

Email:

  Subscribe