Create a cascade chart in SAS

April 27, 2015
By
Create a cascade chart in SAS

Sometimes different communities use the same name for different objects. To a soldier, "boots" are rugged, heavy, high-top foot coverings. To a soccer (football) player, "boots" are lightweight cleats. So it is with the term "waterfall plot." To researchers in the medical field, a "waterfall plot" is a sorted bar […] The post Create a cascade chart in SAS appeared first on The DO Loop.

Read more »

Conference Report: CHI 2015

April 27, 2015
By

Last week, I had the pleasure of attending the CHI 2015 conference in Seoul, South Korea. CHI technically stands for Computer-Human Interaction, but it has become a name rather than an acronym in recent years. And CHI’s scope is very broad, it covers many areas that are not strictly part of HCI (Human-Computer Interaction – … Continue reading Conference Report: CHI 2015

Read more »

This year’s Atlantic Causal Inference Conference: 20-21 May

April 26, 2015
By

Dylan Small writes: The conference will take place May 20-21 (with a short course on May 19th) and the web site for the conference is here. The deadline for submitting a poster title for the poster session is this Friday. Junior researchers (graduate students, postdoctoral fellows, and assistant professors) whose poster demonstrates exceptional research will […] The post This year’s Atlantic Causal Inference Conference: 20-21 May appeared first on Statistical…

Read more »

Introductory Statistics for Data Science

April 26, 2015
By
Introductory Statistics for Data Science

The latest issue of Chance contains a very timely article by Nicholas Horton, Benjamin Baumer, and Hadley Wickham. It's titled, "Setting the Stage for Data Science: Integration of Data Management Skills in Introductory and Second Courses in S...

Read more »

sklearn DecisionTree plot example needs pydotplus

April 26, 2015
By

In Python, sklearn (scikit-learn)'s DecisionTree example uses pydot for plotting the generated tree: @here.But for Python 3, pydot has some issues with the string from dot_data.getvalue(), for example it will report "TypeError: startswith first arg mus...

Read more »

SPDEVPPI

April 25, 2015
By

We've just arxived our paper on efficient computation for the Expected Value of Partial Perfect Information (EVPPI) based on SPDE-INLA. The EVPPI is a decision-theoretic measure of the impact of uncertainty in some of the parameters in a mode...

Read more »

Statistical analysis on a dataset that consists of a population

April 25, 2015
By

This is an oldie but a goodie. Donna Towns writes: I am wondering if you could help me solve an ongoing debate? My colleagues and I are discussing (disagreeing) on the ability of a researcher to analyze information on a population. My colleagues are sure that a researcher is unable to perform statistical analysis on […] The post Statistical analysis on a dataset that consists of a population appeared first…

Read more »

SAS®: Getting Started with PROC IML

April 25, 2015
By

Another powerful procedure of SAS, my favorite one, that I would like to share is the PROC IML (Interactive Matrix Language). This procedure treats all objects as a matrix, and is very useful for doing scientific computations involving vectors and matrices. To get started, we are going to demonstrate and discuss the following: Creating and Shaping Matrices;Matrix Query;Subscripts;Descriptive Statistics;Set Operations;Probability Functions and Subroutine;Linear Algebra;Reading and Creating Data;Above outline is based…

Read more »

Unemployment of Europe in 2014 by NUTS 2 region

April 25, 2015
By
Unemployment of Europe in 2014 by NUTS 2 region

During the Christmas break I worked on some code to show unemployment by NUTS 2 region. At that point no 2014 data was available. When I noticed the 214 was available I dug up the code and plotted again.Data and CodeAs written, the code was made beginn...

Read more »

Random Data Sets Quickly

April 25, 2015
By
Random Data Sets Quickly

This post will discuss a recent GitHub package I’m working on, wakefield to generate random data sets. The post is broken into the following sections: Demo 1.1 Random Variable Functions 1.2 Random Data Frames 1.3 Missing Values 1.4 Default Data … Continue reading →

Read more »

Random Data Sets Quickly

April 25, 2015
By
Random Data Sets Quickly

This post will discuss a recent GitHub package I’m working on, wakefield to generate random data sets. The post is broken into the following sections: Demo 1.1 Random Variable Functions 1.2 Random Data Frames 1.3 Missing Values 1.4 Default Data … Continue reading →

Read more »

Bayesian comparison of groups using Python emcee

April 24, 2015
By
Bayesian comparison of groups using Python emcee

Prof. Brain Blais has implemented the BEST model of two groups in emcee, a Python system for MCMC sampling. See his post about it here.

Read more »

“Statistical Concepts in Their Relation to Reality” by E.S. Pearson

April 24, 2015
By
“Statistical Concepts in Their Relation to Reality” by E.S. Pearson

To complete the last post, here’s Pearson’s portion of the “triad”  “Statistical Concepts in Their Relation to Reality” by E.S. PEARSON (1955) SUMMARY: This paper contains a reply to some criticisms made by Sir Ronald Fisher in his recent article on “Scientific Methods and Scientific Induction”. Controversies in the field of mathematical statistics seem largely […]

Read more »

100 Years

April 24, 2015
By
100 Years

Comparing statistical visualizations over a period of 100 years is quite rare. The newly published Atlas of the Swiss Federal …Continue reading →

Read more »

Statistical significance, practical significance, and interactions

April 24, 2015
By
Statistical significance, practical significance, and interactions

I’ve said it before and I’ll say it again: interaction is one of the key underrated topics in statistics. I thought about this today (OK, a couple months ago, what with our delay) when reading a post by Dan Kopf on the exaggeration of small truths. Or, to put it another way, statistically significant but […] The post Statistical significance, practical significance, and interactions appeared first on Statistical Modeling, Causal…

Read more »

scale acceleration

April 23, 2015
By
scale acceleration

Kate Lee pointed me to a rather surprising inefficiency in matlab, exploited in Sylvia Früwirth-Schnatter’s bayesf package: running a gamma simulation by rgamma(n,a,b) takes longer and sometimes much longer than rgamma(n,a,1)/b, the latter taking advantage of the scale nature of b. I wanted to check on my own whether or not R faced the same […]

Read more »

Gelman speed read

April 23, 2015
By

For those who have found it tough to keep up with Andrew Gelman's prolificacy, here are some brief summaries of several recent posts: On people obsessed with proving the statistical significance of tiny effects: "they are trying to use a bathroom scale to weigh a feather—and the feather is resting loosely in the pouch of a kangaroo that is vigorously jumping up and down." (link) [I left a comment. In…

Read more »

Edmond Malinvaud: A Tribute to his Contributions in Econometrics

April 23, 2015
By
Edmond Malinvaud: A Tribute to his Contributions in Econometrics

I wrote this brief post just after Edmond Malinvaud passed away on 7 March of this year, at the age of 91. Peter Phillips' tribute to Malinvaud is a "must read" piece (see here).Like Peter, I also used Malinvaud's text when undertaking my Masters-...

Read more »

Political Attitudes in Social Environments

April 23, 2015
By

Jose Duarte, Jarret Crawford, Charlotta Stern, Jonathan Haidt, Lee Jussim, and Philip Tetlock wrote an article, “Political Diversity Will Improve Social Psychological Science,” in which the argued that the field of social psychology would benefit from the inclusion of more non-liberal voices (here I’m using “liberal” in the sense of current U.S. politics). Duarte et […] The post Political Attitudes in Social Environments appeared first on Statistical Modeling, Causal Inference,…

Read more »

What if the Washington Post did not display all the data

April 23, 2015
By
What if the Washington Post did not display all the data

Thanks to reader Charles Chris P., I was able to get the police staffing data to play around with. Recall from the previous post that the Washington Post made the following scatter plot, comparing the proportion of whites among police...

Read more »

Thinking big at Yahoo

April 22, 2015
By
Thinking big at Yahoo

I’m speaking in the “Yahoo Labs Big Thinkers” series on Friday 26 June. I hope I can live up to the title! My talk is on “Exploring the boundaries of predictability: what can we forecast, and when should we give up?”  Essentially I will start with some of the ideas in this post, and then discuss the […]

Read more »

Conjoint Analysis and the Strange World of All Possible Feature Combinations

April 22, 2015
By
Conjoint Analysis and the Strange World of All Possible Feature Combinations

The choice modeler looks over the adjacent display of cheeses and sees the joint marginal effects of the dimensions spanning the feature space: milk source, type, origin, moisture content, added mold or bacteria, aging, salting, packaging, price, and m...

Read more »

A message from the vice chairman of surgery at Columbia University: “Garcinia Camboja. It may be the simple solution you’ve been looking for to bust your body fat for good.”

April 22, 2015
By
A message from the vice chairman of surgery at Columbia University:  “Garcinia Camboja. It may be the simple solution you’ve been looking for to bust your body fat for good.”

Should Columbia University fire this guy just cos he says things like this: “You may think magic is make believe but this little bean has scientists saying they’ve found the magic weight loss cure for every body type—it’s green coffee extract.” “I’ve got the No. 1 miracle in a bottle to burn your fat. It’s […] The post A message from the vice chairman of surgery at Columbia University: “Garcinia…

Read more »


Subscribe

Email:

  Subscribe