## When is there “hidden structure in data” to be discovered?

April 4, 2013
By

Michael Collins sent along the following announcement for a talk: Fast learning algorithms for discovering the hidden structure in data Daniel Hsu, Microsoft Research 11am, Wednesday April 10th, Interschool lab, 7th floor CEPSR, Columbia University A major challenge in machine learning is to reliably and automatically discover hidden structure in data with minimal human intervention. [...]

## The Revolution Will Be Visualized

April 4, 2013
By

In the 1970s, it was the protest songs. In the 1980s, it was the anti-war movies. Today, the protest is no longer happening in songs or movies. Today, it’s online, based on data, and using visualization. Gun Deaths It’s a very abstract and yet very clear image: something moves along a trajectory, is suddenly stopped, and drops to the ground. A gun has been fired, somebody has been killed. Periscopic’s…

## The Revolution Will Be Visualized

April 4, 2013
By

In the 1970s, it was the protest songs. In the 1980s, it was the anti-war movies. Today, the protest is no longer happening in songs or movies. Today, it’s online, based on data, and using visualization. Gun Deaths It’s a very abstract and yet very clear image: something moves along a trajectory, is suddenly stopped, and drops to the ground. A gun has been fired, somebody has been killed. Periscopic’s…

## Guest Post. Kent Staley: On the Five Sigma Standard in Particle Physics

April 4, 2013
By

Kent Staley Associate Professor Department of philosophy Saint Louis University Regular visitors to Error Statistics Philosophy may recall a discussion that broke out here and on other sites last summer when the CMS and ATLAS collaborations at the Large Hadron Collider announced that they had discovered a new particle in their search for the Higgs […]

## Where is social media being discussed?

April 3, 2013
By

Accessing APIs is pretty sweet. Even though I've only begun to scratch the surface of some popular APIs, I wanted to share my initial findings with you. I figured an interesting question is related to social media. Specifically Twitter and Facebook. My...

## ISBA on INLA [webinar]

April 3, 2013
By

If you have missed the item of information, Håvard Rue is giving an ISBA webinar tomorrow on INLA: the ISBA Webinar on INLA is scheduled for April 4th, 2013 from 8:30 - 12:30 EDT. ------------------------------------------------------- To join the online meeting (Now from mobile devices using the Cisco WebEx Meeting App) ------------------------------------------------------- 1. Go to https://www.webex.com/login/attend-a-meeting […]

## Hierarchical array priors for ANOVA decompositions

April 3, 2013
By

Alexander Volfovsky and Peter Hoff write: ANOVA decompositions are a standard method for describing and estimating heterogeneity among the means of a response variable across levels of multiple categorical factors. In such a decomposition, the complete set of main effects and interaction terms can be viewed as a collection of vectors, matrices and arrays that [...]

## Bad charts can happen to good people

April 3, 2013
By

I shouldn't be surprised by this. No sooner did I sing the praise of Significance magazine (link) than a reader sent me to some charts that are not deserving of their standard. Here is one such chart (link): Quite a...

## R 3.0.0 is released!

April 3, 2013
By

The new R 3.0.0 is out! You know the drill! Get the source code from here.

## The difference of density estimates: When does it make sense?

April 3, 2013
By

I was recently asked how to compute the difference between two density estimates in SAS. The person who asked the question sent me a link to a paper from The Review of Economics and Statistics that contains several examples of this technique (for example, see Figure 3 on p. 16 [...]

## CFP: the 11th Australasian Data Mining Conference (AusDM 2013), submission extended to 31 July

April 3, 2013
By

********************************************************************* The 11th Australasian Data Mining Conference (AusDM 2013) Canberra, Australia, 13-15 November 2013, http://ausdm13.togaware.com Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891 ********************************************************************* Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) … Continue reading →

## Beware of grep with a list

April 3, 2013
By

Another R tip: beware of as.character applied to a list. Really, beware of grep with a list: You might have thought that the result would be just 1, but grep expects a vector of character strings. If the input is not that, it uses as.character(). Since the result of that starts with "c(", grep finds […]

## apply vs for

April 3, 2013
By

It’s widely understood that, in R programming, one should avoid for loops and always try to use apply-type functions. But this isn’t entirely true. It may have been true for Splus, back in the day: As I recall, that had to do with the entire environment from each iteration being retained in memory. Here’s a […]

## x[[c(5,3)]]

April 3, 2013
By

An R tip: Did you know that x[[c(5,3)]] is the same as x[[5]][[3]]? I should make more thorough use of this. In the help file for [[: [[ can be applied recursively to lists, so that if the single index i is a vector of length p, alist[[i]] is equivalent to alist[[i1]]...[[ip]] providing all but […]

## Thoughts on statistical consulting

April 3, 2013
By

The Statistics Department at UW-Madison has a course on statistical consulting, offered each semester. I’m often asked to give a lecture, which I do in an informal way: summarizing my experiences and answering questions. I thought it might be useful write my thoughts on statistical consulting here: why, how, and difficulties. This will be a […]

## a brief on naked statistics

April 2, 2013
By

Over the last Sunday breakfast I went through Naked Statistics: Stripping the Dread from the Data. The first two pages managed to put me in a prejudiced mood for the rest of the book. To wit: the author starts with some math bashing (like, no one ever bothers to tell us about the uses of [...]

## Petty in pink

April 2, 2013
By

I have to admit that in some sense I am being petty here. But I also think that this story is actually quite interesting, so I'm posting about it.As I've mentioned elsewhere in the blog (here and here), together with some colleagues I'm working on an e...

## 100 savvy sites on statistics

April 2, 2013
By

Online Math Degrees has a page of “100 savvy sites on statistics and quantitative analysis”.  It has some that you recognize, some that you’ve hardly ever heard of. The sites are divided into groups.  I think the R section should have included R-bloggers, but what do I know? I think it’s well worth strolling through. […]The post 100 savvy sites on statistics appeared first on Burns Statistics.

## A common theme in mathematics

April 2, 2013
By
$A common theme in mathematics$

This is from a post Connected objects and a reconstruction theorem: A common theme in mathematics is to replace the study of an object with the study of some category that can be built from that object. For example, we can replace the study of a group  with the study of its category of linear representations, […]

## Estimating continuous piecewise linear regression

April 2, 2013
By

When talking about smoothing splines a simple point to start with is a continuous piecewise linear regression with fixed knots. I did not find any simple example showing how to estimate the it in GNU R so I have created a little snippet that does the j...

## Introducing the healthvis R package – one line D3 graphics with R

April 2, 2013
By

We have been a little slow on the posting for the last couple of months here at Simply Stats. That’s bad news for the blog, but good news for our research programs! Today I’m announcing the new healthvis R package … Continue reading →

## So much artistic talent

April 2, 2013
By

I saw this excellent art show the other day, and it reminded me how much artistic talent is out there. I really have no idea whassup with those all-black canvases and the other stuff you see at modern art museums, given that there’s so much interesting new stuff being created every year. I see a [...]

## Top 10 tips to get started with R

April 2, 2013
By

Be motivated. R has a steep learning curve. Find a problem you can't solve otherwise. E.g. plotting multivariate data, a statistical analysis for which an R function exists already. Download and install R. Get to know the R console. Learn how to inst...