Blog Archives

Operate on the body of a file but not the header

October 14, 2014
By
Operate on the body of a file but not the header

Sometimes you need to run some UNIX command on a file but only want to operate on the body of the file, not the header. Create a file called body somewhere in your $PATH, make it executable, and add this to it:#!/bin/bashIFS= read -r headerprintf '%s\n...

Read more »

R package to convert statistical analysis objects to tidy data frames

September 16, 2014
By
R package to convert statistical analysis objects to tidy data frames

I talked a little bit about tidy data my recent post about dplyr, but you should really go check out Hadley’s paper on the subject.R expects inputs to data analysis procedures to be in a tidy format, but the model output objects that you get back are...

Read more »

UVA / Charlottesville R Meetup

September 11, 2014
By
UVA / Charlottesville R Meetup

TL;DR? We started an R Users group, awesome community, huge turnout at first meeting, lots of potential.---I've sat through many hours of meetings where faculty lament the fact that their trainees (and the faculty themselves!) are woefully ill-prepared...

Read more »

GNU datamash

September 10, 2014
By
GNU datamash

GNU datamash is a command-line utility that offers simple calculations (e.g. count, sum, min, max, mean, stdev, string coalescing) as well as a rich set of statistical functions, to quickly assess information in textual input files or from a UNIX pipe....

Read more »

Do your "data janitor work" like a boss with dplyr

August 20, 2014
By
Do your "data janitor work" like a boss with dplyr

Data “janitor-work”The New York Times recently ran a piece on wrangling and cleaning data:“For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights”Whether you call it “janitor-work,” wrangling/munging, cleaning/cleansing/scru...

Read more »

Introduction to R for Life Scientists: Course Materials

July 7, 2014
By
Introduction to R for Life Scientists: Course Materials

Last week I taught a three-hour introduction to R workshop for life scientists at UVA's Health Sciences Library.I broke the workshop into three sections:In the first half hour or so I presented slides giving an overview of R and why R is so awesome. Du...

Read more »

Bedtools tutorial from 2013 CSHL course

June 24, 2014
By
Bedtools tutorial from 2013 CSHL course

A couple of months ago I posted about how to visualize exome coverage with bedtools and R. But if you're looking to get a basic handle on genome arithmetic, take a look at Aaron Quinlan's bedtools tutorials from the 2013 CSHL course. The tutorial uses ...

Read more »

An Annotated Online Bioinformatics / Computational Biology Curriculum

June 13, 2014
By
An Annotated Online Bioinformatics / Computational Biology Curriculum

Two years ago David Searls published an article in PLoS Comp Bio describing a series of online courses in bioinformatics. Yesterday, the same author published an updated version, "A New Online Computational Biology Curriculum," (PLoS Comput Biol 10(6):...

Read more »

Collaborative lesson development with GitHub

June 2, 2014
By
Collaborative lesson development with GitHub

If you're doing any kind of scientific computing and not using version control, you're doing it wrong. The git version control system and GitHub, a web-based service for hosting and collaborating on git-controlled projects, have both become wildly popu...

Read more »

Using Volcano Plots in R to Visualize Microarray and RNA-seq Results

May 28, 2014
By
Using Volcano Plots in R to Visualize Microarray and RNA-seq Results

I've been asked a few times how to make a so-called volcano plot from gene expression results. A volcano plot typically plots some measure of effect on the x-axis (typically the fold change) and the statistical significance on the y-axis (typically the...

Read more »


Subscribe

Email:

  Subscribe