Blog Archives

Data frames and tables in Scala

August 21, 2015
By
Data frames and tables in Scala

Introduction To statisticians and data scientists used to working in R, the concept of a data frame is one of the most natural and basic starting points for statistical computing and data analysis. It always surprises me that data frames aren’t a core concept in most programming languages’ standard libraries, since they are essentially a … Continue reading Data frames and tables in Scala

Read more »

Calling R from Scala sbt projects using rscala

August 15, 2015
By
Calling R from Scala sbt projects using rscala

Overview In the previous post I showed how the rscala package (which has replaced the jvmr package) can be used to call Scala code from within R. In this post I will show how to call R from Scala code. I have previously described how to do this using jvmr. This post is really just … Continue reading Calling R from Scala sbt projects using rscala

Read more »

Calling Scala code from R using rscala

August 15, 2015
By
Calling Scala code from R using rscala

Introduction In a previous post I looked at how to call Scala code from R using a CRAN package called jvmr. This package now seems to have been replaced by a new package called rscala. Like the old package, it requires a pre-existing Java installation. Unlike the old package, however, it no longer depends on … Continue reading Calling Scala code from R using rscala

Read more »

Index to first 50 posts

April 10, 2015
By
Index to first 50 posts

This is the 50th post to this blog. For my 25th post I provided a catalogue of my first 25 posts, and as promised then, I now provide a similar index for posts 25 to 50. 25. Catalogue of my first 25 blog posts 26. Multivariate data analysis (using R): a course and some lecture … Continue reading Index to first 50 posts

Read more »

Scala for Machine Learning [book review]

April 9, 2015
By
Scala for Machine Learning [book review]

Nicolas, Patrick R. (2014) Scala for Machine Learning, Packt Publishing: Birmingham, UK. Full disclosure: I received a free electronic version of this book from the publisher for the purposes of review. There is clearly a market for a good book about using Scala for statistical computing, machine learning and data science. So when the publisher … Continue reading Scala for Machine Learning [book review]

Read more »

Calling R from Scala sbt projects

January 24, 2015
By
Calling R from Scala sbt projects

[Update: The jvmr package has been replaced by the rscala package. There is a new version of this post which replaces this one.] Overview In previous posts I’ve shown how the jvmr CRAN R package can be used to call Scala sbt projects from R and inline Scala Breeze code in R. In this post … Continue reading Calling R from Scala sbt projects

Read more »

Inlining Scala Breeze code in R using jvmr and sbt

January 3, 2015
By
Inlining Scala Breeze code in R using jvmr and sbt

[Update: The CRAN package “jvmr” has been replaced by a new package “rscala”. Rather than completely re-write this post, I’ve just created a github gist containing a new function, breezeInterpreter(), which works similarly to the function breezeInit() in this post. Usage information is given at the top of the gist.] Introduction In the previous post … Continue reading Inlining Scala Breeze code in R using jvmr and sbt

Read more »

Calling Scala code from R using jvmr

January 2, 2015
By
Calling Scala code from R using jvmr

[Update: the jvmr package has been replaced by a new package called rscala. I have a new post which explains it.] Introduction In previous posts I have explained why I think that Scala is a good language to use for statistical computing and data science. Despite this, R is very convenient for simple exploratory data … Continue reading Calling Scala code from R using jvmr

Read more »

One-way ANOVA with fixed and random effects from a Bayesian perspective

December 22, 2014
By
One-way ANOVA with fixed and random effects from a Bayesian perspective

This blog post is derived from a computer practical session that I ran as part of my new course on Statistics for Big Data, previously discussed. This course covered a lot of material very quickly. In particular, I deferred introducing notions of hierarchical modelling until the Bayesian part of the course, where I feel it … Continue reading One-way ANOVA with fixed and random effects from a Bayesian perspective

Read more »

Statistical computing languages at the RSS

November 22, 2014
By
Statistical computing languages at the RSS

On Friday the Royal Statistical Society hosted a meeting on Statistical computing languages, organised by my colleague Colin Gillespie. Four languages were presented at the meeting: Python, Scala, Matlab and Julia. I presented the talk on Scala. The slides I presented are available, in addition to the code examples and instructions on how to run … Continue reading Statistical computing languages at the RSS

Read more »


Subscribe

Email:

  Subscribe