Statistics Blogs

Coordinatized Data: A Fluid Data Specification

March 29, 2017

Authors: John Mount and Nina Zumel. It has been our experience when teaching the data wrangling part of data science that students often have difficulty understanding the conversion between row-oriented and column-oriented data formats (commonly called pivoting and un-pivoting). Real trust and understanding of this concept doesn’t …
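As a rough illustration of the pivoting and un-pivoting mentioned in this excerpt, here is a minimal sketch in plain Python. It is not taken from the linked post; the function names (`unpivot`, `pivot`), the column names (`measurement`, `value`), and the sample records are all assumptions made up for this example.

```python
def unpivot(rows, key_cols, name_col="measurement", value_col="value"):
    """Wide -> tall: emit one record per (row, non-key column) pair."""
    tall = []
    for row in rows:
        for col, val in row.items():
            if col not in key_cols:
                rec = {k: row[k] for k in key_cols}  # copy the key columns
                rec[name_col] = col                  # former column name
                rec[value_col] = val                 # former cell value
                tall.append(rec)
    return tall


def pivot(tall, key_cols, name_col="measurement", value_col="value"):
    """Tall -> wide: gather records that share a key into a single row."""
    wide = {}
    for rec in tall:
        key = tuple(rec[k] for k in key_cols)
        row = wide.setdefault(key, {k: rec[k] for k in key_cols})
        row[rec[name_col]] = rec[value_col]
    return list(wide.values())


# Hypothetical data: one row per model, one column per score.
wide_rows = [
    {"model": "a", "auc": 0.81, "accuracy": 0.74},
    {"model": "b", "auc": 0.77, "accuracy": 0.79},
]

tall_rows = unpivot(wide_rows, key_cols=["model"])
round_trip = pivot(tall_rows, key_cols=["model"])
```

In this sketch each cell is addressed by a (key, column name) pair, so the wide and tall layouts carry the same information and converting back and forth loses nothing.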

Confused by machines, or spooked by the machine-makers

March 29, 2017

This New York Times article draws attention to real trends in the financial investments industry but gets completely lost in the smoke around those pushing "machines" and "data". The trend most concerning to the investments industry is the sustained, large-scale outflow of money from "actively-managed" funds, of which mutual funds are the biggest category. The industry makes loads of money from management fees by promoting the idea that investors are…

Text as Data

March 28, 2017

"Text as data" is a vibrant and by now well-established field. (Just Google "text as data".) For an informative overview geared toward econometricians, see the new paper, "Text as Data" by Matthew Gentzkow, Bryan T. Kelly, and Matt Taddy (GKT). (Ungate...

Collaboration with New York Public Library

March 27, 2017

For many years now, the field of Data Science and Business Analytics has been booming, and hiring managers are finding a severe dearth of high-quality job-seekers. Meanwhile, a good number of people are interested in entering the field but keep bumping into walls. Hiring managers like to hire experienced people for a host of reasons, including the fear of other hiring managers poaching their trained employees. For a number…

In Praise of Two Giants of Econometrics

March 26, 2017

Two giants in our field, now deceased, are celebrated in recent Working Papers by Peter Phillips and Timo Teräsvirta. Peter's paper, titled "Tribute to T. W. Anderson", is in an issue of Econometric Theory that also includes Ted's last published res...

Debugging Pipelines in R with Bizarro Pipe and Eager Assignment

March 25, 2017

This is a note on debugging magrittr pipelines in R using Bizarro Pipe and eager assignment. The magrittr R package supplies an operator called “pipe”, which is written as “%>%”. The pipe operator is famous partly because of its extensive use in dplyr and by dplyr users. The pipe operator is …

A "Journal of Insignificant (Economic) Results"?

March 25, 2017

The Replication Network carried a guest blog post by Andrea Menclova this week. The post was titled, "Is it Time for a Journal of Insignificant Results?" I was previously unaware of the existence of such journals in Psychology, Biomedicine, and Ecology ...

Superpixels in imager

March 24, 2017

Superpixels are used in image segmentation as a pre-processing step. Instead of segmenting pixels directly, we first group similar pixels into “super-pixels”, which can then be processed further (and more cheaply). The current version of imager doesn’t implement them, but it turns out that SLIC superpixels are particularly easy to implement. SLIC […]

The Tidyverse Curse

March 23, 2017

I’ve just finished a major overhaul of my widely read article, Why R is Hard to Learn. It describes the main complaints I’ve heard from the participants in my workshops, and how those complaints can often be mitigated. Here’s the only …

You Too Can Tell Whatever Story You Want with Charts

March 23, 2017

The other day, Les Décodeurs published an article on Le Monde's website about the use of base-100 charts. I quote its commentary: This one shows, beyond dispute according to Ms. Le Pen, the decline of the French, Italian, and Spanish economies relative to German industry. Marine Le Pen thus does not hesitate to claim that the decline of French production is linked to the introduction of the single currency in 2001. The chart can be reconstructed fairly…
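For readers unfamiliar with base-100 charts, the underlying arithmetic is simply index = 100 × value / value in the base year. The sketch below is not from the linked post: the function name `rebase_to_100` is hypothetical and the two series are made-up numbers, chosen only to show how the choice of base year changes the picture.

```python
def rebase_to_100(series, base_key):
    """Rescale a {period: value} series so the base period equals 100."""
    base = series[base_key]
    if base == 0:
        raise ValueError("base value must be non-zero")
    return {period: 100.0 * value / base for period, value in series.items()}


# Made-up illustrative series (NOT real economic data).
raw = {
    "A": {2001: 80, 2010: 100, 2015: 110},
    "B": {2001: 100, 2010: 100, 2015: 105},
}

# Same data, two different base years.
from_2001 = {name: rebase_to_100(s, 2001) for name, s in raw.items()}
from_2010 = {name: rebase_to_100(s, 2010) for name, s in raw.items()}

print(from_2001["A"][2015], from_2001["B"][2015])
print(from_2010["A"][2015], from_2010["B"][2015])
```

With 2001 as the base, the gap between the two invented series in 2015 looks much larger than it does with 2010 as the base, even though the underlying numbers are identical.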
