Blog Archives

Cleaning up tables

May 16, 2018
By

This post is re-published from my blog. Please see it for the latest updates from May 16, 2018. Context One of things I have to do quite often is create tables for papers and presentations. Often the “Table 1” of a paper has descriptives about the study, broken down by subgroups. For presentation purposes, it […]

Read more »

Tidying messy Excel data (tidyxl)

May 8, 2018
By
Tidying messy Excel data (tidyxl)

Reposted from Abhijit’s blog. Some <- have been replaced by = due to idiosyncracies of the WordPress platform. Well, here’s what I was dealing with: Exemplar Excel file from collaborator Notice that we have 3 header rows, first with patient IDs, second with spine region, and third with variable names (A and B, to protect […]

Read more »

Tidying messy Excel data (Introduction)

May 7, 2018
By
Tidying messy Excel data (Introduction)

[Re-posted from Abhijit’s blog] Personal expressiveness, or how data is stored in a spreadsheet When you get data from a broad research community, the variability in how that data is formatted and stored is truly astonishing. Of course there are the standardized formats that are output from machines, like Next Generation Sequencing and other automated […]

Read more »

Moving to blogdown

May 2, 2018
By
Moving to blogdown

I’ve been in the process of transferring my blog (along with creating a personal website) to blogdown, which is hosted on Github Pages and Netlify. The new blog, or rather, the continuation of this blog, will be at webbedfeet.netlify.io/posts, and it went live today. I’ll be cross-posting here for a while, at least until Tal […]

Read more »

Surprising result when exploring Rcpp gallery

July 21, 2017
By
Surprising result when exploring Rcpp gallery

I’m starting to incorporate more Rcpp in my R work, and so decided to spend some time exploring the Rcpp Gallery. One example by John Merrill caught my eye. He provides a C++ solution to transforming an list of lists into a data frame, and shows impressive speed savings compared to as.data.frame. This got me thinking about how […]

Read more »

Quirks about running Rcpp on Windows through RStudio

July 20, 2017
By

Quirks about running Rcpp on Windows through RStudio This is a quick note about some tribulations I had running Rcpp (v. 0.12.12) code through RStudio (v. 1.0.143) on a Windows 7 box running R (v. 3.3.2). I also have RTools v. 3.4 installed. I fully admit that this may very well be specific to my […]

Read more »

Finding my Dropbox in R

July 5, 2017
By

I’ll often keep non-sensitive data on Dropbox so that I can access it on all my machines without gumming up git. I just wrote a small script to find the Dropbox location on each of my computers automatically. The crucial information is available here, from Dropbox. My small snippet of code is the following: I haven’t […]

Read more »

Some thoughts on the downsides of current Data Science practice

April 19, 2017
By

Bert Huang has a nice blog talking about poor results of ML/AI algorithms in “wild” data, which echos some of my experience and thoughts. His conclusions are worth thinking about, IMO. 1. Big data is complex data. As we go out and collect more data from a finite world, we’re necessarily going to start collecting […]

Read more »

pandas “transform” using the tidyverse

April 12, 2017
By

Chris Moffit has a nice blog on how to use the transform function in pandas. He provides some (fake) data on sales and asks the question of what fraction of each order is from each SKU. Being a R nut and a tidyverse fan, I thought to compare and contrast the code for the pandas […]

Read more »

pandas “transform” using the tidyverse

April 12, 2017
By
pandas “transform” using the tidyverse

Chris Moffit has a nice blog on how to use the transform function in pandas. He provides some (fake) data on sales and asks the question of what fraction of each order is from each SKU. Being a R nut and a tidyverse fan, I thought to compare and contrast the code for the pandas […]

Read more »


Subscribe

Email:

  Subscribe