Online resources for handling big data and parallel computing in R

May 6, 2012
By
Online resources for handling big data and parallel computing in R

by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing … Continue reading →

Read more »

Illumina stays independent, for now

May 6, 2012
By

Illumina stays independent, for now: It seems the sequencing company will remain independent for now. Roche lost its bid to replace the Illumina board on Wednesday and promptly withdrew its hostile bid. Roche plays a long game, though, and has ample e...

Read more »

reshape (from base) Explained: Part II

May 6, 2012
By
reshape (from base) Explained: Part II

Part II Explains More Complex  Wide to Long With base reshape  In part I of this base reshape tutorial we went over the basics of reshaping data with reshape.  We learned two rules that help us to be more efficient … Continue reading →

Read more »

Making an Animated Heart GIF in R

May 5, 2012
By

Messing around with animations, I created this toy example to make a growing outline of a heart in R. I have put it up as a gist on GitHub and embedded it below. It uses the animation and the ggplot2 packages--both of which make it incredibly simple to...

Read more »

Things I’ve Learned About WordPress.com

May 5, 2012
By
Things I’ve Learned About WordPress.com

I’m almost done moving this site from Google Sites to WordPress. This post describes some of the some things I’ve learned about WordPress.com. By default, WordPress.com makes your site look like a blog. I preferred it look like a web … Continue reading →

Read more »

Recently in the sister blog

May 5, 2012
By

Culture war: The rules You can only accept capital punishment if you’re willing to have innocent people executed every now and then The politics of America’s increasing economic inequality

Read more »

Recently in the sister blog

May 5, 2012
By

Culture war: The rules You can only accept capital punishment if you’re willing to have innocent people executed every now and then The politics of America’s increasing economic inequality

Read more »

Visualizing tables in ggplot2

May 5, 2012
By
Visualizing tables in ggplot2

Recently I wanted to recreate  assocplot  using  ggplot2. In the end I propose a simple way to visualize data arranged two-way tables using geom_tile.I used Titanic data set as an example combining age and sex dimensi...

Read more »

Related to z-statistics

May 5, 2012
By

Pawel Sobkowicz writes: How many zombies do you know?’ Using indirect survey methods to measure alien attacks and outbreaks of the undead, Arxiv preprint arXiv:1003.6087, 2010 I hope you would find interesting the following paper, recently posted on arXiv: Aliens on Earth. Are reports of close encounters correct?, arXiv:1203.6805 This is soooooo much better than [...]

Read more »

Related to z-statistics

May 5, 2012
By

Pawel Sobkowicz writes: How many zombies do you know?’ Using indirect survey methods to measure alien attacks and outbreaks of the undead, Arxiv preprint arXiv:1003.6087, 2010 I hope you would find interesting the following paper, recently posted on arXiv: Aliens on Earth. Are reports of close encounters correct?, arXiv:1203.6805 This is soooooo much better than [...]

Read more »

UCLA Data Fest 2012

May 5, 2012
By

The very very cool UCLA Data Fest is going on as we speak. This is a statistical analysis marathon where teams of undergrads work through the night (and day) to address an important problem through data analysis. Last year they looked at crime data fro...

Read more »

Normalizing Data

May 5, 2012
By

A frequent question for people first learning statistical procedures is how to normalize data.  Other terms for normalizing may include standardizing, scaling and centering (though the last two terms may mean something other than the first two terms).  In my … Continue reading →

Read more »

Knickgraph … ?

May 4, 2012
By
Knickgraph … ?

The Swiss Federal Statistical Office has been publishing data visualizations for more than 100 years. Its head of graphic design, Daniel von Burg, reveals some curiosities. Used for the first time in the 1897 Atlas, the Knickgraph optimizes the surface of a bar graph. Its length is proportional to the value that is being represented. …Read More

Read more »

New National Academy of Sciences Members

May 4, 2012
By

The National Academy of Sciences elected new members a few days ago. Among them are statisticians Robert Tibshirani and sociologist Stephen Raudenbush. Obviously well-deserved! (Thanks to Karl Broman.)

Read more »

Models, assumptions, and data summaries

May 4, 2012
By

I saw an analysis recently that I didn’t like. I won’t go into the details, but basically it was a dose-response inference, where a continuous exposure was binned into three broad categories (terciles of the data) and the probability of an adverse event was computed for each tercile. The effect and the sample size was [...]

Read more »

Beer with a confidence interval

May 4, 2012
By

Medalla is a Puerto Rican beer. On the side of a can it says Alcohol by volume over 4%, not more than 6%. I’d never seen a beer give a confidence interval for its alcohol content. I’d only seen point…Read more ›

Read more »

Hammer on the importance of statistics (or, as I used to know…

May 4, 2012
By

Hammer on the importance of statistics (or, as I used to know him, MC Hammer). The overlay of the video for “Can’t Touch This” really helps me understand what he’s talking about. (Thanks to Chris V. for the link.)

Read more »

Expand data by using frequencies

May 4, 2012
By
Expand data by using frequencies

A reader asked: I want to create a vector as follows. Suppose there are two given vectors x=[A B C] and f=[1 2 3]. Here f indicates the frequency vector. I hope to generate a vector c=[A B B C C C]. I am trying to use the REPEAT function [...]

Read more »

Rob Tibshirani and Andy Clark named to NAS

May 4, 2012
By
Rob Tibshirani and Andy Clark named to NAS

Rob Tibshirani and Andy Clark are now members of the National Academy of Sciences.

Read more »

R Tutorials and Learning Materials

May 4, 2012
By

We are getting ready to host an R bootcamp this summer at work and I am looking at building on materials that already exist. I just wanted to list a few here while I figure out the best ways to incorporate them. Video Tutorials:This is a fairly ne...

Read more »

New New York data research organizations

May 3, 2012
By

In a single day, New York City obtained two data analysis/statistics/machine learning organizations: Microsoft Research New York City with John Langford (machine learning), Duncan Watts (networks), and Dave Pennock (algorithmic economics). eBay technology center focusing on data – led by Chris Dixon, the co-founder of the recommendation engine company Hunch, which has recently been acquired [...]

Read more »

what’s wrong with package comment?!

May 3, 2012
By
what’s wrong with package comment?!

I spent most of the Sunday afternoon trying to understand why defining did not have the same effect as writing the line until I found there is a clash due to the comment package… The assuredly simple code produces an error message: This is quite an inconvenience as I need to compile my solution manual [...]

Read more »

News from the sister blog!

May 3, 2012
By

US National Academy of Sciences elects 84 new members (Please click through and read the whole thing.)

Read more »


Subscribe

Email:

  Subscribe