Posts Tagged ‘Rstats’

Data frames and tables in Scala

August 21, 2015

Introduction: To statisticians and data scientists used to working in R, the concept of a data frame is one of the most natural and basic starting points for statistical computing and data analysis. It always surprises me that data frames aren’t a core concept in most programming languages’ standard libraries, since they are essentially a … Continue reading Data frames and tables in Scala

Read more »

Calling R from Scala sbt projects using rscala

August 15, 2015

Overview: In the previous post I showed how the rscala package (which has replaced the jvmr package) can be used to call Scala code from within R. In this post I will show how to call R from Scala code. I have previously described how to do this using jvmr. This post is really just … Continue reading Calling R from Scala sbt projects using rscala

Read more »

Calling Scala code from R using rscala

August 15, 2015

Introduction: In a previous post I looked at how to call Scala code from R using a CRAN package called jvmr. This package now seems to have been replaced by a new package called rscala. Like the old package, it requires a pre-existing Java installation. Unlike the old package, however, it no longer depends on … Continue reading Calling Scala code from R using rscala

Read more »

How Predictable is the English Premier League?

May 19, 2015

The reason why football is so exciting is uncertainty. The outcome of any match or league is unknown, and you get to watch the action unfold without knowing what’s going to happen. Watching matches where you know the score is never exciting. This weekend the English Premier League season will conclude with little fanfare. Bar […]

Read more »

R style default plot for Pandas DataFrame

March 28, 2015

The default plot method for dataframes in R is to show each numeric variable in a pair-wise scatter plot. I find this to be a really useful first look at a dataset, both to see correlations and joint distributions between variables, but also to quickly diagnose potential strangeness like bands of repeating values or outliers. […]

Read more »

Calling R from Scala sbt projects

January 24, 2015

[Update: The jvmr package has been replaced by the rscala package. There is a new version of this post which replaces this one.] Overview: In previous posts I’ve shown how the jvmr CRAN R package can be used to call Scala sbt projects from R and inline Scala Breeze code in R. In this post … Continue reading Calling R from Scala sbt projects

Read more »

Inlining Scala Breeze code in R using jvmr and sbt

January 3, 2015

[Update: The CRAN package “jvmr” has been replaced by a new package “rscala”. Rather than completely re-write this post, I’ve just created a github gist containing a new function, breezeInterpreter(), which works similarly to the function breezeInit() in this post. Usage information is given at the top of the gist.] Introduction: In the previous post … Continue reading Inlining Scala Breeze code in R using jvmr and sbt

Read more »

Calling Scala code from R using jvmr

January 2, 2015

[Update: the jvmr package has been replaced by a new package called rscala. I have a new post which explains it.] Introduction: In previous posts I have explained why I think that Scala is a good language to use for statistical computing and data science. Despite this, R is very convenient for simple exploratory data … Continue reading Calling Scala code from R using jvmr

Read more »

One-way ANOVA with fixed and random effects from a Bayesian perspective

December 22, 2014

This blog post is derived from a computer practical session that I ran as part of my new course on Statistics for Big Data, previously discussed. This course covered a lot of material very quickly. In particular, I deferred introducing notions of hierarchical modelling until the Bayesian part of the course, where I feel it … Continue reading One-way ANOVA with fixed and random effects from a Bayesian perspective

Read more »

R resources

December 3, 2014

This is the third in my weekly series of posts pointing out resources on this site. This week’s topic is R. R language for programmers; Default arguments and lazy evaluation in R; Distributions in R; Moving data between R and Excel via the clipboard; Sweave: First steps toward reproducible analyses; Troubleshooting Sweave; Regular expressions in […]

Read more »

