qstats – quick and dirty statistics tool for the Unix pipeline

November 5, 2013
By
qstats – quick and dirty statistics tool for the Unix pipeline

Back when 200MB hard drives were the size of washing machines and programs had no choice but to be as efficient as possible, Unix was born. In a serendipitous twist of fate, the same programs that were borne of this era of 4MB RAM and 16 bit processors are useful to data analysts with 2,000 »more

Read more »

Optional stopping in data collection: p values, Bayes factors, credible intervals, precision

November 5, 2013
By
Optional stopping in data collection: p values, Bayes factors, credible intervals, precision

This post argues that data collection should stop when a desired degree of precision is achieved (as measured by a Bayesian credible interval), not when a critical p value is achieved, not when a critical Bayes factor is achieved, and not even when a B...

Read more »

Commissions

November 5, 2013
By
Commissions

Today, I want to explain the commission’s functionality build in to Systematic Investor Toolbox(SIT) “share” back-test. At each re-balance time the capital is allocated given the weight such that For example, if weight is 100% (i.e. fully invested) and capital = $100 and price = $10 then The period return is equal to The total […]

Read more »

Shlemiel the Software Developer and Unknown Unknowns

November 4, 2013
By
Shlemiel the Software Developer and Unknown Unknowns

The Stan meeting today reminded me of Joel Spolsky’s recasting of the Yiddish joke about Shlemiel the Painter. Joel retold it on his blog, Joel on Software, in the post Back to Basics: Shlemiel gets a job as a street painter, painting the dotted lines down the middle of the road. On the first day […]The post Shlemiel the Software Developer and Unknown Unknowns appeared first on Statistical Modeling, Causal…

Read more »

Smoothing mortality rates

November 4, 2013
By
Smoothing mortality rates

This morning, I was working with Julie, a student of mine, coming from Rennes, on mortality tables. Actually, we work on genealogical datasets from a small region in Québec, and we can observe a lot of volatiliy. If I borrow one of her graph, we get something like Since we have some missing data, we wanted to use some Generalized Nonlinear Models. So let us see how to get a…

Read more »

My talk @ the Dutch Chemometrics Symposium

November 4, 2013
By
My talk @ the Dutch Chemometrics Symposium

For same reason, Paul Eilers really liked the talk I gave on INLA at the BayesPharma workshop earlier this year and so he invited me to talk at the Dutch Chemometrics Symposium.Now: you may ask what have I got to do with chemometrics. An...

Read more »

Archival and analysis of #GI2013 Tweets

November 4, 2013
By
Archival and analysis of #GI2013 Tweets

I archived and analyzed all Tweets containing #GI2013 from the recent Cold Spring Harbor Genome Informatics meeting, using my previously described code.Friday was the most Tweeted day. Perhaps this was due to Lior Pachter's excellent keynote, "Stories ...

Read more »

Excel Chart Demo

November 4, 2013
By

As an illustration of how Sharp Statistics can integrate extra analysis tools into Excel there is now a demonstration Excel Add In that can be download and installed to produce some useful statistical plots of your data. It can produce 3 simple statist...

Read more »

Federal Reserve Bank of Philadelphia Launches Improved U.S. GDP Growth Series

November 4, 2013
By

Exciting news for empirical macroeconomics and finance: The Federal Reserve Bank of Philadelphia today released a new and improved \(GDP\) growth series, \(GDPplus\). It's an optimal blend of the BEA's expenditure-side and income-side estimates (c...

Read more »

Recently in the sister blog

November 4, 2013
By

This one’s probably the most important: Republicans on track to retain control of House in 2014 And I like this one for the headline: Impact factor 911 is a joke Here are the others: Press releases make for fishy statistics Why is the Motley Fool hyping Netflix? Our health-care system is like Coca Cola Obama […]The post Recently in the sister blog appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Forthcoming paper on the strong likelihood principle

November 4, 2013
By
Forthcoming paper on the strong likelihood principle

My paper, “On the Birnbaum Argument for the Strong Likelihood Principle” has been accepted by Statistical Science. The latest version is here. (It differs from all versions posted anywhere). If you spot any typos, please let me know (error@vt.edu). If you can’t open this link, please write to me and I’ll send it directly. As always, […]

Read more »

Create mosaic plots in SAS by using PROC FREQ

November 4, 2013
By
Create mosaic plots in SAS by using PROC FREQ

Mosaic plots (Hartigan and Kleiner, 1981; Friendly, 1994, JASA) are used for exploratory data analysis of categorical data. Mosaic plots have been available for decades in SAS products such as JMP, SAS/INSIGHT, and SAS/IML Studio. However, not all SAS customers have access to these specialized products, so I am pleased [...]

Read more »

Spatial Clustering With Equal Sizes

November 4, 2013
By
Spatial Clustering With Equal Sizes

This is a problem I have encountered many times where the goal is to take a sample of spatial locations and apply constraints to the algorithm.  In addition to providing a pre-determined number of K clusters a fixed size of elements needs to be held constant within each cluster. An application of this algorithm is […]

Read more »

Sunday data/statistics link roundup (11/3/13)

November 4, 2013
By

There has been a big knockdown-dragout battle in the blogosphere over how GTEX is doing their analysis. Read the original post here, my summary here, and the response from GTEX here. I agree that the criticism bordered on hyperbolic but also think … Continue reading →

Read more »

About blogging

November 4, 2013
By
About blogging

This is the 100th post on “Learn and Teach Statistics and Operations Research”. To celebrate, I am writing about the joys of blogging. Anyone with an internet connection can blog these days, and do! It is the procrastinator’s “dark playground” … Continue reading →

Read more »

Unusual timing shows how random mass murder can be (or not)

November 3, 2013
By
Unusual timing shows how random mass murder can be (or not)

This was one headline in the USA Today I picked from the hotel lobby on my way to Pittsburgh airport and then Toronto this morning. The unusual pattern was about observing four U.S. mass murders happening within four days, “for the first time in at least seven years”. The article did not explain why this […]

Read more »

The Rise, Fall and Rise of English Triple Crown Racing Speeds

November 3, 2013
By
The Rise, Fall and Rise of English Triple Crown Racing Speeds

A horse with a crimson “6” displayed on either side of its girth processes to the parade ring. There are people milling about, looking over “Six” as well as the other entrants for the 2013 running of the St. Leger Stakes. Alongside Si...

Read more »

The Employment Nondiscrimination Act is overwhelmingly popular in nearly every one of the 50 states

November 3, 2013
By
The Employment Nondiscrimination Act is overwhelmingly popular in nearly every one of the 50 states

The above graph shows the estimated support, by state, for the Employment Nondiscrimination Act, a gay rights bill that the Senate will be voting on this Monday. The estimates were constructed by Kate Krimmel, Jeff Lax, and Justin Phillips using multilevel regression and poststratification. Check out that graph again. The scale goes from 20% to […]The post The Employment Nondiscrimination Act is overwhelmingly popular in nearly every one of the…

Read more »

How best to compare effects measured in two different time periods?

November 3, 2013
By

I received the following email from someone who wishes to remain anonymous: My colleague and I are trying to understand the best way to approach a problem involving measuring a group of individuals’ abilities across time, and are hoping you can offer some guidance. We are trying to analyze the combined effect of two distinct […]The post How best to compare effects measured in two different time periods? appeared first…

Read more »

Statistical aspects of two-way cross-over studies

November 3, 2013
By

I ran into this presentation on Statistical aspects of two-way cross-over studies by Ing. Helmut Schütz (http://bebac.at). He presented some code and referred to the bear package. The bear package is menu driven, which is not my thing. I had...

Read more »

Bayes Theorem and Quantum Mechanics

November 3, 2013
By

I was reminded of this old post by Andrew Gelman about whether Quantum Mechanics requires a change in the axioms of probability theory. Without weighing in I’ll just point out that Bayesian Statistics is more general than Bayes Theorem and this a...

Read more »

I’ve already written next year’s April Fools post!

November 2, 2013
By

Good to have gotten that one out of the way already. (Actually, I wrote it a few months ago. This post is itself in the monthlong+ queue.) I don’t know how easy it is to search this blog by date to find the Fools posts from previous years. The ...

Read more »

I’ve already written next year’s April Fools post!

November 2, 2013
By

Good to have gotten that one out of the way already. (Actually, I wrote it a few months ago. This post is itself in the monthlong+ queue.) I don’t know how easy it is to search this blog by date to find the Fools posts from previous years. The ...

Read more »


Subscribe

Email:

  Subscribe