Posts Tagged ‘ data ’

Statistics for Big Data

November 22, 2014
By
Statistics for Big Data

Doctoral programme in cloud computing for big data I’ve spent much of this year working to establish our new EPSRC Centre for Doctoral Training in Cloud Computing for Big Data, which partly explains the lack of posts on this blog in recent months. The CDT is now established, with 11 students in the first cohort, […]

Read more »

Nominal, Ordinal, Interval, Schmordinal

November 18, 2014
By
Nominal, Ordinal, Interval, Schmordinal

Everyone wants to learn about ordinal data! I have a video channel with about 40 videos about statistics, and I love watching to see which videos are getting the most viewing each day. As the Fall term has recently started … Continue reading →

Read more »

Read Before You Cite!

November 11, 2014
By
Read Before You Cite!

Note to self - file this post in the "Look Before You Leap" category!Looking at The New Zealand Herald newspaper this morning, this headline caught my eye:"How Did Sir Owen Glenn's Domestic Violence Inquiry Get $7 Billion Figure Wrong?"$7&nbs...

Read more »

I can’t believe I’m citing David Brooks on data

November 6, 2014
By

This is a first. I'm agreeing with David Brooks. Sort of. In his new NYT column titled "Death by Data" (link), Brooks disparaged the recently celebrated practice of using machine learning in electoral politics, such as trying to win elections "Obama-style" by targeting investments on the people most likely to listen to his message, and trying to craft electoral messages by testing and measuring how people react to certain words…

Read more »

The class pondering Big Data

October 23, 2014
By

Note: I'm traveling a lot lately and it is affecting my ability to post on a regular basis. It's three weeks into my chart-building workshop (link) at NYU and we are starting to discuss individual projects. One of the major...

Read more »

Pondering OCCAM data in medicine

October 9, 2014
By
Pondering OCCAM data in medicine

The New York Times Magazine has a pretty good piece about the use of OCCAM data to solve medical questions, like diagnosis and drug selection. I'm happy that it paints a balanced picture of both the promise and the pitfalls. Here are some thoughts in my head as I read this piece: Small samples coupled with small effects pose a design problem in traditional clinical trials. The subjects of the…

Read more »

Princeton’s loss of nerve

October 2, 2014
By
Princeton’s loss of nerve

I have earlier reported that Princeton's new President has initiated a review of their "grade deflation" policy that was put in almost ten years ago. As you may recall (link), grading in U.S. colleges has become a farce: at top-tier schools, getting an A means you are an average student; not getting an A is many times more informative than getting an A. The new administration at Princeton has now…

Read more »

Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014
By

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library called Parso, which also reads sas7bdat files. They have worked out most of the remaining kinks. For example, the Parso library reads sas7bdat files with compressed […]

Read more »

Trifacta, an attempt to simplify the analyst’s life

September 8, 2014
By

A LinkedIn contact and 538 reader pointed me to this demo video by Joe Hellerstein, from a Bay Area startup called Trifacta. They have a neat product that tries to automate data cleaning/processing tasks for analysts. I love that people are working on this problem. It's an area that I'm interested in getting involved in. Also, they have a sleek user interface, well thought out, and innovative. There is a…

Read more »

Background on my new 538 article on restaurant health grades

September 3, 2014
By
Background on my new 538 article on restaurant health grades

Some behind-the-scenes comments on my recent article on New York's restaurant inspection grades; it appeared on FiveThirtyEight this Tuesday. *** The Nature of Ratings This article is about the ratings of things. I devoted a considerable amount of pages to this topic in Numbersense (link) - Chapter 1 is all about the US News ranking of schools. A few key points are: All rating schemes are completely subjective. There is…

Read more »


Subscribe

Email:

  Subscribe