## GitHut: the Universe of Programming Languages across GitHub

September 12, 2014

GitHut [githut.info], developed by Microsoft data visualization designer Carlo Zapponi, is an interactive small multiples visualization revealing the complexity of the wide range of programming languages used across the repositories hosted on GitHub. ...

## Embedding RData files in Rmarkdown files for more reproducible analyses

September 12, 2014

For those of us interested in reproducible analysis, Rmarkdown is a great way of communicating our code to other researchers. RStudio, in particular, makes it very easy to create attractive HTML documents containing text, code, and figures, which can then be sent to colleagues or put on the internet for anyone to see. If you aren't using Rmarkdown for your statistical analyses, I recommend you start; you'll never go back…

## Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library called Parso, which also reads sas7bdat files. They have worked out most of the remaining kinks. For example, the Parso library reads sas7bdat files with compressed …

## What is the purpose of a poem?

September 12, 2014

OK, let’s take a break from blogging about economics. OK, I haven’t actually been blogging so much about econ lately, but it just happens that I’m writing this on 19 July, a day after poking a stick into the hornet’s nest by posting “Differences between econometrics and statistics: From varying treatment effects to utilities, economists […]

## R: k-Means Clustering on Imaging

September 12, 2014

Enough with the theory we recently published; let's take a break and have fun with an application of statistics used in data mining and machine learning: k-means clustering. k-means clustering is a method of vector quantization, originally from signa...
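
As a rough illustration of the k-means idea applied to image data, here is a minimal Python sketch of Lloyd's algorithm run on a handful of RGB pixels. It is not the code from the post: the function names, the toy pixel values, and the deterministic initialisation (real implementations usually pick starting centers at random or with k-means++) are all assumptions made for the example.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest(p, centers):
    """Index of the center closest to point p."""
    return min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))


def kmeans(points, k, iters=20):
    """Lloyd's algorithm on a list of numeric tuples (requires k <= len(points))."""
    # Simplified, deterministic initialisation: k points spread evenly by index.
    centers = [points[i * len(points) // k] for i in range(k)]
    for _ in range(iters):
        # Assignment step: group each point with its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centers)].append(p)
        # Update step: move each center to the mean of its cluster.
        new_centers = []
        for j, members in enumerate(clusters):
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centers.append(centers[j])  # leave an empty cluster's center alone
        if new_centers == centers:
            break  # assignments can no longer change
        centers = new_centers
    labels = [nearest(p, centers) for p in points]
    return centers, labels


# Toy "image": three dark pixels followed by three bright pixels (RGB).
pixels = [(10, 10, 10), (12, 8, 11), (9, 11, 10),
          (250, 240, 245), (245, 250, 240), (240, 245, 250)]
centers, labels = kmeans(pixels, k=2)
print(labels)   # cluster index per pixel
print(centers)  # the two representative colours
```

Replacing every pixel by the center of its cluster gives the colour-quantized version of the image, which is the sense in which k-means acts as vector quantization.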

## Useful for referring—9-11-2014

September 12, 2014

Some R Resources for GLMs; Statistical data analysis in the search and rescue for a missing aircraft; The gap between data mining and predictive models; Data Mining, machine learning and statistics; useR! 2014 is underway with 16 tutorials; What is Scalable Machine Learning?; rlist: working with non-relational data in R using lists; The perfect candidate; The Leek group guide to giving talks; 38 Seminal Articles Every Data Scientist Should Read; Deep Learning – important […]

## mysterious shiny things

September 12, 2014

(Disclaimer: I’m new to Shiny, and blog posts, but I know something about geography.) In the Shiny gallery, take a look at 2001 versus 2002. Something funny happens to Switzerland (and other European countries): in the legend, it moves from Europe to the Middle East. Also, the legend color scheme switches. […]

## Bayesian Cognitive Modeling Examples Ported to Stan

September 11, 2014

There’s a new intro to Bayes in town. Michael Lee and Eric-Jan Wagenmakers. 2014. Bayesian Cognitive Modeling: A Practical Course. Cambridge University Press. This book’s a wonderful introduction to applied Bayesian modeling. But don’t take my word for it: you can download and read the first two parts of the book (hundreds of pages […]

## UVA / Charlottesville R Meetup

September 11, 2014

TL;DR? We started an R Users group: awesome community, huge turnout at first meeting, lots of potential.

I've sat through many hours of meetings where faculty lament the fact that their trainees (and the faculty themselves!) are woefully ill-prepared...

## One-tailed or two-tailed

September 11, 2014

This image of a two-tailed lizard (from here, I can’t find the name of the person who took the picture) never fails to amuse me. But let us get to the question at hand . . . Richard Rasiej writes: I’m currently teaching a summer session course in Elementary Statistics. The text that I was […]

## Mathematics and Mathematical Statistics Lesson of the Day – Convex Functions and Jensen’s Inequality

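Consider a real-valued function $f(x)$ that is continuous on the interval $[x_1, x_2]$, where $x_1$ and $x_2$ are any 2 points in the domain of $f(x)$. Let $x_m = \frac{x_1 + x_2}{2}$ be the midpoint of $x_1$ and $x_2$. Then, if $f(x_m) \leq \frac{f(x_1) + f(x_2)}{2}$, then $f(x)$ is defined to be midpoint convex. More generally, let’s consider any point within the interval $[x_1, x_2]$. We can denote this arbitrary point as $x_\lambda = \lambda x_1 + (1 - \lambda) x_2$, where $0 < \lambda < 1$. […]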
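
A small numerical check can make these definitions concrete. The Python sketch below tests the midpoint condition for a function on a pair of points, and computes the gap in the finite form of Jensen's inequality (for a convex function, the weighted average of the function values is at least the function of the weighted average). The function names and test functions are hypothetical choices for illustration, not taken from the original lesson.

```python
def is_midpoint_convex(f, a, b, tol=1e-12):
    """Check f((a + b) / 2) <= (f(a) + f(b)) / 2 for one pair of points."""
    midpoint = (a + b) / 2
    return f(midpoint) <= (f(a) + f(b)) / 2 + tol


def jensen_gap(f, xs, weights):
    """Return sum_i w_i * f(x_i) - f(sum_i w_i * x_i).

    The weights are assumed to be non-negative and to sum to 1.
    For a convex f the gap is >= 0; for a concave f it is <= 0.
    """
    mean_x = sum(w * x for w, x in zip(weights, xs))
    mean_fx = sum(w * f(x) for w, x in zip(weights, xs))
    return mean_fx - f(mean_x)


square = lambda x: x * x        # convex
root = lambda x: x ** 0.5       # concave on x >= 0

print(is_midpoint_convex(square, -1.0, 3.0))
print(is_midpoint_convex(root, 0.0, 4.0))
print(jensen_gap(square, [1.0, 3.0], [0.5, 0.5]))
print(jensen_gap(root, [0.0, 4.0], [0.5, 0.5]))
```

Passing the check on a few pairs of points does not prove convexity; it only fails to find a counterexample.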

## Generalized Double Pareto Priors for Regression

September 11, 2014

This post is a review of the “GENERALIZED DOUBLE PARETO SHRINKAGE” Statistica Sinica (2012) paper by Armagan, Dunson and Lee. Consider the regression model $Y=X\beta+\varepsilon$ where we put a generalized double Pareto distribution as the prior on the regression coefficients $\beta$. The GDP distribution has density $$f(\beta|\xi,\alpha)=\frac{1}{2\xi}\left( 1+\frac{|\beta|}{\alpha\xi} \right)^{-(\alpha+1)}.$$ GDP as Scale […] The post Generalized Double Pareto Priors for Regression appeared first on Lindons Log.
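
To get a feel for the density quoted above, the following Python sketch evaluates it for a scalar coefficient and checks numerically that it integrates to approximately 1. The function names, the parameter values ($\xi = 1$, $\alpha = 3$), and the simple midpoint-rule integrator are assumptions chosen for illustration; nothing here comes from the paper's or the post's code.

```python
def gdp_density(beta, xi=1.0, alpha=1.0):
    """Generalized double Pareto density f(beta | xi, alpha) for scalar beta."""
    return (1.0 / (2.0 * xi)) * (1.0 + abs(beta) / (alpha * xi)) ** (-(alpha + 1.0))


def integrate_midpoint(f, a, b, n):
    """Midpoint-rule approximation of the integral of f over [a, b] with n cells."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


# The density peaks at zero with height 1 / (2 * xi) and is symmetric.
print(gdp_density(0.0, xi=1.0, alpha=3.0))
print(gdp_density(2.0, xi=1.0, alpha=3.0), gdp_density(-2.0, xi=1.0, alpha=3.0))

# Truncating to [-300, 300] drops only a tiny amount of tail mass when alpha = 3.
total = integrate_midpoint(lambda b: gdp_density(b, xi=1.0, alpha=3.0),
                           -300.0, 300.0, 60000)
print(total)
```

Smaller values of $\alpha$ give heavier tails, so the truncated integral would need a much wider range to get as close to 1.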

## Pi Visualized as a Public Urban Art Mural

September 10, 2014

Visualize Pi [tumblr.com] is a mural project that aimed to use popular mathematics to connect Brooklyn students to the community with a visualization of Pi. It was funded by a successful KickStarter project as proposed by visual artist Ellie Ba...

## The Key Players in the Middle East and their Relationships

September 10, 2014

Whom Likes Whom in the Middle-East? [informationisbeautiful.net] by David McCandless and UniversLab is a forced-network visualisation of key players & notable relationships in the Middle East. Next to its expressive aesthetic, the interactive feature...

## GNU datamash

September 10, 2014

GNU datamash is a command-line utility that offers simple calculations (e.g. count, sum, min, max, mean, stdev, string coalescing) as well as a rich set of statistical functions, to quickly assess information in textual input files or from a UNIX pipe....
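
For readers who want to see what such a per-column summary involves, here is a minimal Python stand-in. It is not datamash's interface or implementation: the function name `summarize_column`, the tab separator default, and the use of the sample standard deviation are assumptions made for this sketch.

```python
import statistics


def summarize_column(lines, column=0, sep="\t"):
    """Compute simple statistics for one numeric column of delimited text lines."""
    values = [
        float(line.rstrip("\n").split(sep)[column])
        for line in lines
        if line.strip()  # skip blank lines
    ]
    return {
        "count": len(values),
        "sum": sum(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        # Sample standard deviation; needs at least two values.
        "stdev": statistics.stdev(values),
    }


# Hypothetical two-column input: a label and a number, separated by tabs.
rows = ["a\t1", "b\t2", "c\t3", "d\t4"]
print(summarize_column(rows, column=1))
```

Because the function accepts any iterable of lines, it can also be fed `sys.stdin` to read from a UNIX pipe.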

## Neurostats 2014 Highlights

September 10, 2014

Last week the Neurostats 2014 workshop took place at the University of Warwick (co-organised by Adam Johansen, Nicolas Chopin, and myself). The goal was to put some neuroscientists and statisticians together to talk about neural data and what to do with it. General impressions: The type of Bayesian hierarchical modelling that Andrew Gelman has been […]

## “It’s as if you went into a bathroom in a bar and saw a guy pissing on his shoes, and instead of thinking he has some problem with his aim, you suppose he has a positive utility for getting his shoes wet”

September 10, 2014

The notion of a geocentric universe has come under criticism from Copernican astronomy. . . . A couple months ago in a discussion of differences between econometrics and statistics, I alluded to the well-known fact that everyday uncertainty aversion can’t be explained by a declining marginal utility of money. What really bothers me—it’s been bothering […]

## An exploratory technique for visualizing the distributions of 100 variables

September 10, 2014

In a previous blog post I showed how to order a set of variables by a statistic. After reshaping data, you can create a graph that contains box plots for many variables. Ordering the variables by some statistic (mean, median, variance,...) helps to differentiate and distinguish the variables. You can […]
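
The ordering step can be sketched in a few lines of Python. This is only an illustration of the idea, not the code from the post: the function name `order_by_statistic` and the toy data are hypothetical, and the result is simply the order in which the box plots would be drawn.

```python
import statistics


def order_by_statistic(data, stat=statistics.median):
    """Return variable names sorted by a summary statistic of their values.

    `data` maps each variable name to a list of numeric observations.
    """
    return sorted(data, key=lambda name: stat(data[name]))


# Toy data set with three variables.
data = {
    "x1": [5, 7, 9],
    "x2": [1, 2, 3],
    "x3": [4, 4, 10],
}

print(order_by_statistic(data))                       # by median
print(order_by_statistic(data, statistics.mean))      # by mean
print(order_by_statistic(data, statistics.variance))  # by sample variance
```

Different statistics can produce different orderings, so the choice depends on which feature of the distributions the graph should emphasize.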

## Fun with Bordered Cubes

September 10, 2014

I am interested in generating 3D reasoning items in R. To this end I have adapted some of the awesome functions built into the rgl library. My new function is 'cube'; it takes a position and automatically sizes itself as a 1x1x1 cube though t...

## A non-comprehensive list of awesome female data people on Twitter

September 9, 2014

I was just talking to a student who mentioned she didn't know Jenny Bryan was on Twitter. She is and she is an awesome person to follow. I also realized that I hadn't seen a good list of women on Twitter …

## Suspiciously vague graph purporting to show “percentage of slaves or serfs in the world”

September 9, 2014

Phillip Middleton sent this along; it’s from Peter Diamandis, who is best known for his X Prize, the “global leader in the creation of incentivized prize competitions.” Diamandis wrote: Phillip Middleton, Is technology making you work harder? Or giving you more time off? Seriously, it feels like it’s enabling me to work around the clock! […]

## You don’t have all the data

September 9, 2014

Binging on Gelman again while I was traveling. Here is a gem (link). If you ever dreamed of having "all the data", be sure to give this a read and give it a few moments' thought. Skim through the main post quickly. The good stuff is in the comments.

## Try a new way of learning dataviz; course announcement

September 9, 2014

COURSE ANNOUNCEMENT: THE ART OF DATA VISUALIZATION. Fall 2014 (Oct 6 - Nov 24, Mondays 6:30-9:30), New York University. Instructor: Kaiser Fung. Location: New York City. Learn how to make knock-out data visualization in an innovative, immersive and fun setting,...