Write a matrix in the "long form"

December 2, 2013
By
Write a matrix in the "long form"

If you write an n x p matrix from PROC IML to a SAS data set, you'll get a data set with n rows and p columns. For some applications, it is more convenient to write the matrix in a "long format" with np observations and three columns. The first [...]

Read more »

Probabilities and P-Values

December 2, 2013
By
Probabilities and P-Values

P-values seem to be the bane of a statistician’s existence.  I’ve seen situations where entire narratives are written without p-values and only provide the effects. It can also be used as a data reduction tool but ultimately it reduces the world into a binary system: yes/no, accept/reject. Not only that but the binary threshold is […]

Read more »

Evaluating Quandl Data Quality – part II

December 2, 2013
By
Evaluating Quandl Data Quality – part II

This post is a more in depth analysis of Quandl futures data vs. Bloomberg data. Since my last post Quandl has updated its futures database to 200+ contracts from 68 contracts originally. For practical reasons, I limit myself here to the initial list of 60+ contracts. I’m still comparing the “Front Month” contract between the […]

Read more »

The Border of Search

December 2, 2013
By
The Border of Search

The original proposition of a web search engine was to help you find the answer to your information need in a page or site on the web: if someone has already solved your problem, let us help you find their...

Read more »

R: Explore ARIMA(2, 2, 2) subclass family on Shiny

December 2, 2013
By
R: Explore ARIMA(2, 2, 2) subclass family on Shiny

I've been thinking that it might be better to explore the Box-Jenkins ARIMA (Autoregressive Integrated Moving-Average) three-iterative modelling on Shiny. So here is what I got, this app is intended for ARIMA(2, 2, 2) subclass family only.The app has s...

Read more »

Sunday data/statistics link roundup (12/2/13)

December 1, 2013
By

I'm in Australia for Bioinfo Summer 2013! First time in Australia and excited about the great lineup of speakers and to meet a bunch of people at the University of Adelaide.  An interesting post about how CS has become the … Continue reading →

Read more »

Visualising (not so) Big Data

December 1, 2013
By
Visualising (not so) Big Data

Facebook is a frequently used source for information. We do not know all kinds of such data queries and reutilisations……. …Continue reading →

Read more »

Visualising (not so) Big Data

December 1, 2013
By
Visualising (not so) Big Data

Facebook is a frequently used source for information. We do not know all kinds of such data queries and reutilisations……. …Continue reading →

Read more »

Separated by a common blah blah blah

December 1, 2013
By

I love reading the kind of English that English people write. It’s the same language as American but just slightly different. I was thinking about this recently after coming across this footnote from “Yeah Yeah Yeah: The Story of Modern Pop,” by Bob Stanley: Mantovani’s atmospheric arrangement on ‘Care Mia’, I should add, is something […]The post Separated by a common blah blah blah appeared first on Statistical Modeling, Causal…

Read more »

JAGS model Fe concentration in rainwater including values below detection level

December 1, 2013
By
JAGS model Fe concentration in rainwater including values below detection level

In my previous post I ignored the fact that some data are below the detection level. I would not know how to handle those in a mixed model from lme4 or nlme. However, JAGS can handle these values. Next to that I kept the usual independent variables, su...

Read more »

More Explorations with catR

December 1, 2013
By
More Explorations with catR

# For the purposes of simulating computerized adaptive tests# the R package catR is unparallelled. # catR is an excellent tool for students who are curious about# how a computerized adaptive test might work. It is also useful# for testing companie...

Read more »

Lost

November 30, 2013
By
Lost

The results of the ISBA elections have come out and unfortunately, I've been beaten to the post of programme chair for the Section on Biostatistics and Pharmaceutical Statistics.I am not sure by how much $-$ I meant to ask for more details, but I'...

Read more »

My talk at the LSHTM

November 30, 2013
By

Yesterday I gave a talk on our RDD project at the Centre for Statistical Methodology of the London School of Hygiene and Tropical Medicine. While presenting me, Karla (the organiser of the seminar) joked that I should go for a hat trick of present...

Read more »

More confusing statements of probability due to no controls

November 30, 2013
By

A recent article in USA Today is titled “Many with sudden cardiac arrest had early signs” (link). The signs include shortness of breath, faintness, chest pain, etc. Hold on to the headline because it’s the only thing believable in the entire article. The words “early signs” imply to readers that were the men to heed these warnings, they could have prevented the cardiac arrests. Think about the following two statements:…

Read more »

???

November 30, 2013
By

I received the following unsolicited email, subject line Technology and Engineering Research: Dear Editor We have done research in some of the cutting edge technology and engineering field and would like to if you will be able to write about it in your news section. Our Primarily research focus on building high performance systems that […]The post ??? appeared first on Statistical Modeling, Causal Inference, and Social Science.

Read more »

Le Monde puzzle [#842]

November 29, 2013
By
Le Monde puzzle [#842]

An easily phrased (and solved?) Le Monde mathematical puzzle that does not [really] require an R code: The five triplets A,B,C,D,E are such that and Given that find the five triplets. Adding up both sets of equations shows everything solely depends upon E1… So running an R code that checks for all possible values of […]

Read more »

Social media popularity, revisited.

November 29, 2013
By
Social media popularity, revisited.

You might remember my post on Twitter mentions in New York Times articles from a few months back. I simply accessed the NYT API and plotted a graph showing the rise in mentions over the past few years. At the time, I used R's very popular plotting libr...

Read more »

The gradual transition to replicable science

November 29, 2013
By

Somebody emailed me: I am a researcher at ** University and I have recently read your article on average predictive comparisons for statistical models published 2007 in the journal “Sociological Methodology”. Gelman, Andrew/Iain Pardoe. 2007. “Average Predictive Comparisons for Models with Nonlinearity, Interactions, and Variance Components”. Sociological Methodology 37: 23-51. Currently I am working with […]The post The gradual transition to replicable science appeared first on Statistical Modeling, Causal Inference,…

Read more »

Unusual timing shows how random mass murder can be (or even less)

November 29, 2013
By
Unusual timing shows how random mass murder can be (or even less)

This post follows the original one on the headline of the USA Today I read during my flight to Toronto last month. I remind you that the unusual pattern was about observing four U.S. mass murders happening within four days, “for the first time in at least seven years”. Which means that the difference between […]

Read more »

Was “Statistical zealots” copied from an old letter?

November 29, 2013
By

Recently Jeff Leek over at Simple Statistics posted what I though was an original discussion about Statistical Zealotry. But then I saw this newly unearthed letter from an unknown European Professor to a colleague. It’s at least 300 years old, bu...

Read more »

“Statistics is what people think math is”

November 28, 2013
By

My 5books interview (from 2011), where we talk about The Bill James Baseball Abstracts, Judgment under Uncertainty, How Animals Work, The Honest Rainmaker, and How to Talk So Kids Will Listen and Listen So Kids Will Talk. The post “Statistics is ...

Read more »

Book Review: Practical Data Analysis by Hector Cuesta

November 28, 2013
By
Book Review: Practical Data Analysis by Hector Cuesta

I have been reading this book since last week, and now I want to share my thoughts about it. I was excited to review this because I've never heard most of the tools it features, like OpenRefine, MongoDB, and MapReduce. The book has 360 pages and surpri...

Read more »

Fast Threshold Clustering Algorithm (FTCA) test

November 28, 2013
By
Fast Threshold Clustering Algorithm (FTCA) test

Today I want to share the test and implementation for the Fast Threshold Clustering Algorithm (FTCA) created by David Varadi. This implementation was developed and contributed by Pierre Chretien, I only made minor updates. Let’s first replicate the results from the Fast Threshold Clustering Algorithm (FTCA) post: The clusters are stable and match David’s results […]

Read more »


Subscribe

Email:

  Subscribe