Posts Tagged ‘ Programming ’

sigr: Simple Significance Reporting

March 7, 2017
By
sigr: Simple Significance Reporting

sigr is a simple R package that conveniently formats a few statistics and their significance tests. This allows the analyst to use the correct test no matter what modeling package or procedure they use. Model Example Let’s take as our example the following linear relation between x and y: library('sigr') set.seed(353525) d <- data.frame(x= rnorm(5)) … Continue reading sigr: Simple Significance Reporting

Read more »

Step-Debugging magrittr/dplyr Pipelines in R with wrapr and replyr

March 6, 2017
By

In this screencast we demonstrate how to easily and effectively step-debug magrittr/dplyr pipelines in R using wrapr and replyr. Some of the big issues in trying to debug magrittr/dplyr pipelines include: Pipelines being large expressions that are hard to line-step into. Visibility of intermediate results. Localizing operations (in time and code position) in the presence … Continue reading Step-Debugging magrittr/dplyr Pipelines in R with wrapr and replyr

Read more »

replyr: Get a Grip on Big Data in R

March 5, 2017
By
replyr: Get a Grip on Big Data in R

replyr is an R package that contains extensions, adaptions, and work-arounds to make remote R dplyr data sources (including big data systems such as Spark) behave more like local data. This allows the analyst to more easily develop and debug procedures that simultaneously work on a variety of data services (in-memory data.frame, SQLite, PostgreSQL, and … Continue reading replyr: Get a Grip on Big Data in R

Read more »

Iteration and closures in R

February 26, 2017
By
Iteration and closures in R

I recently read an interesting thread on unexpected behavior in R when creating a list of functions in a loop or iteration. The issue is solved, but I am going to take the liberty to try and re-state and slow down the discussion of the problem (and fix) for clarity. The issue is: are references … Continue reading Iteration and closures in R

Read more »

The Zero Bug

February 21, 2017
By
The Zero Bug

I am going to write about an insidious statistical, data analysis, and presentation fallacy I call “the zero bug” and the habits you need to cultivate to avoid it. The zero bug Here is the zero bug in a nutshell: common data aggregation tools often can not “count to zero” from examples, and this causes … Continue reading The Zero Bug

Read more »

Announcing the wrapr packge for R

February 11, 2017
By
Announcing the wrapr packge for R

Recently Dirk Eddelbuettel pointed out that our R function debugging wrappers would be more convenient if they were available in a low-dependency micro package dedicated to little else. Dirk is a very smart person, and like most R users we are deeply in his debt; so we (Nina Zumel and myself) listened and immediately moved … Continue reading Announcing the wrapr packge for R

Read more »

Evolving R Tools and Practices

February 6, 2017
By
Evolving R Tools and Practices

One of the distinctive features of the R platform is how explicit and user controllable everything is. This allows the style of use of R to evolve fairly rapidly. I will discuss this and end with some new notations, methods, and tools I am nominating for inclusion into your view of the evolving “current best … Continue reading Evolving R Tools and Practices

Read more »

Books on Scala for statistical computing and data science

December 22, 2016
By
Books on Scala for statistical computing and data science

Introduction People regularly ask me about books and other resources for getting started with Scala for statistical computing and data science. This post will focus on books, but it’s worth briefly noting that there are a number of other resources available, on-line and otherwise, that are also worth considering. I particularly like the Coursera course … Continue reading Books on Scala for statistical computing and data science

Read more »

Scala for Data Science [book review]

December 22, 2016
By
Scala for Data Science [book review]

This post will review the book: Scala for Data Science, Bugnion, Packt, 2016. Disclaimer: This book review has not been solicited by the publisher (or anyone else) in any way. I purchased the review copy of this book myself. I have not received any benefit from the writing of this review. Introduction On this blog … Continue reading Scala for Data Science [book review]

Read more »

Comparative examples using replyr::let

December 22, 2016
By
Comparative examples using replyr::let

Consider the problem of “parametric programming” in R. That is: simply writing correct code before knowing some details, such as the names of the columns your procedure will have to be applied to in the future. Our latest version of replyr::let makes such programming easier. Archie’s Mechanics #2 (1954) copyright Archie Publications (edit: great news! … Continue reading Comparative examples using replyr::let

Read more »


Subscribe

Email:

  Subscribe