R

Blogs on the R software.

scala-glm: Regression modelling in Scala

June 21, 2017
By
scala-glm: Regression modelling in Scala

Introduction As discussed in the previous post, I’ve recently constructed and delivered a short course on statistical computing with Scala. Much of the course is concerned with writing statistical algorithms in Scala, typically making use of the scientific and numerical computing library, Breeze. Breeze has all of the essential tools necessary for building statistical algorithms, … Continue reading scala-glm: Regression modelling in Scala

Read more »

Data Science Tool Market Share Leading Indicator: Scholarly Articles

June 19, 2017
By
Data Science Tool Market Share Leading Indicator: Scholarly Articles

Below is the latest update to The Popularity of Data Science Software. It contains an analysis of the tools used in the most recent complete year of scholarly articles. The section is also integrated into the main paper itself. New … Continue reading →

Read more »

thinning a Markov chain, statistically

June 12, 2017
By
thinning a Markov chain, statistically

Art Owen has arXived a new version of his thinning MCMC paper, where he studies how thinning or subsampling can improve computing time in MCMC chains. I remember quite well the message set by Mark Berliner and Steve MacEachern in an early 1990’s paper that subsampling was always increasing the variance of the resulting estimators. […]

Read more »

Julia: Installation and Editors

June 12, 2017
By
Julia: Installation and Editors

If you have been following this blog, you may have noticed that I don't have any update for more than a year now. The reason is that I've been busy with my research, my work, and I promised not to share anything here until I finished my degree (Master ...

Read more »

Likelihood calculation for the g-and-k distribution

June 10, 2017
By
Likelihood calculation for the g-and-k distribution

    Hello, An example often used in the ABC literature is the g-and-k distribution (e.g. reference [1] below), which is defined through the inverse of its cumulative distribution function (cdf). It is easy to simulate from such distributions by drawing uniform variables and applying the inverse cdf to them. However, since there is no closed-form […]

Read more »

Correcting bias in meta-analyses: What not to do (meta-showdown Part 1)

June 9, 2017
By

tl;dr: Publication bias and p-hacking can dramatically inflate effect size estimates in meta-analyses. Many methods have been proposed to correct for such bias and to estimate the underlying true effect. In a large simulation study, we studied which me...

Read more »

On Formality

On Formality

As I said on the About page, I don’t like formality. There is absolutely no need to tie my educational degree to my last name. Actually I feel uncomfortable with other people calling me “Dr Xie”. Anyone can just call me by my first name.1 In my opinion, the best way to show respect to people is not through these titles (Doctor or Professor), but the true care about their…

Read more »

UK General Election 2017: May vs June

June 9, 2017
By
UK General Election 2017: May vs June

Pundits have been asking themselves over the last days how did Theresa May’s gamble fail? Although the Conservative party received 42% of the vote, Labour received 40%, and all the other parties received 18% between them. The Tories lost their par...

Read more »

Test-driving Microsoft Cognitive Toolkit in R using reticulate

June 8, 2017
By

Recently new tools for data science pop up constantly, so it is hard to keep up with the changes and choose those that promise to be useful in the long run.Recently two new solutions were announced that look very promising: reticulate package for R and...

Read more »

The Ghost Printer Behind Top-level R Expressions

The Ghost Printer Behind Top-level R Expressions

For any developers who have ever written an S3 method for the print() function, they probably know what a top-level R expression means, but this is a very confusing concept to non-developers. I have to explain this every now and then, so I decided to write a short post about it. Yesterday I saw a Github issue in the rmarkdown repository, and you can see that there are still users…

Read more »


Subscribe

Email:

  Subscribe