Correspondence Analysis in R

November 6, 2013
By
Correspondence Analysis in R

Correspondence analysis (from a layman’s perspective) is like principal components analysis for categorical data. It can be useful to discover structure in this type of data. My friend Gianmarco Alberti, an archaeologist, has put together an in depth web site … Continue reading →

Read more »

A Mitochondrial Manhattan Plot

November 6, 2013
By
A Mitochondrial Manhattan Plot

Manhattan plots have become the standard way to visualize results for genetic association studies, allowing the viewer to instantly see significant results in the rough context of their genomic position.  Manhattan plots are typically shown on a l...

Read more »

If we observed p_hat = .46, why do we use p=.5?

November 6, 2013
By

I aim to commit statistical sin. I’m going to accept the null hypothesis for no other reason than because I “failing to reject it”. Having tarnished my reputation with that, I’ll finish by ignoring the only data available and ba...

Read more »

How to specify mosaic plot colors in SAS

November 6, 2013
By
How to specify mosaic plot colors in SAS

The mosaic plot is a graphical visualization of a frequency table. In a previous post, I showed how to use the FREQ procedure to create a mosaic plot. This article shows how to create a mosaic plot by using the MOSAICPARM statement in the graph template language (GTL). (The MOSAICPARM [...]

Read more »

R: Mapping Philippine Earthquakes (October 2013)

November 6, 2013
By
R: Mapping Philippine Earthquakes (October 2013)

Last month, October 15, 2013 around 8:12 am (Philippine Time), a magnitude 7.2 earthquake hit Bohol island, detroying several infrastructures and killing hundreds of residents. The Philippine Institute of Volcanology and Seismology or PhiVolcs recorde...

Read more »

Bayesian Biostatistics 2014

November 5, 2013
By

This has to do with the ISBA Biostats section (I suppose it will be even more, if I am elected to the post of Program Chair, but I'll try and be involved even if I don't win!): the next Bayesian Biostatistics Conference has just been announced and will...

Read more »

WSJ tells stories about working women

November 5, 2013
By
WSJ tells stories about working women

Over twitter, Jeff Harrison @mrjeffharrison shouted "Story time!" at this Wall Street Journal report on a Citibank/LinkedIn survey of working men and women. "Story time" is the trick of reporting some statistics, then spinning a story that has little or nothing to do with the data just presented. This tactic is effective as some readers erroneously assume that the story is supported by the data. A good illustration is this…

Read more »

Pricing reinsurance contracts, another case study

November 5, 2013
By
Pricing reinsurance contracts, another case study

A reinsurance case study for tomorrow’s class. The goal will be to price some nonproportional reinsurance contract, for business interruption claims. Consider the following dataset, > library(gdata) > db=read.xls( + "http://perso.univ-rennes1.fr/arthur.charpentier/SIN_1985_2000-PE.xls", + sheet=1) Content type 'application/vnd.ms-excel' length 183808 bytes (179 Kb) open URL ================================================== downloaded 179 Kb As for any (standard) insurance contract, there are two parts in the pricing the expected number of claims the average cost of individual…

Read more »

How to Host a Conference on Google Hangouts on Air

November 5, 2013
By

We recently hosted the first ever Simply Statistics Unconference on the Future of Statistics. In preparing for the event, we learned a lot about how to organize such an event and frankly we wished there had been a bit more … Continue reading →

Read more »

How much do we trust a new claim that early childhood stimulation raised earnings by 42%?

November 5, 2013
By

Hal Pashler wrote in about a recent paper, “Labor Market Returns to Early Childhood Stimulation: a 20-year Followup to an Experimental Intervention in Jamaica,” by Paul Gertler, James Heckman, Rodrigo Pinto, Arianna Zanolini, Christel Vermeerch, Susan Walker, Susan M. Chang, and Sally Grantham-McGregor. Here’s Pashler: Dan Willingham tweeted: @DTWillingham: RCT from Jamaica: Big effects 20 […]The post How much do we trust a new claim that early childhood stimulation raised…

Read more »

What’s in a cronut? Let me find out

November 5, 2013
By
What’s in a cronut? Let me find out

Reader Ross S. did not join the line for this cronut, illustrating the popularity of different makers of tracking software on 1.3 million websites. Original by Analytics SEO is here. *** The biggest beef I have with this cronut is...

Read more »

Typos in BMHE

November 5, 2013
By
Typos in BMHE

No matter how many times you check and no matter how good the publishers are, I'm guessing there's no way out of getting typos in a publication, especially if it's a relatively long one, such as a book.I've just discovered a couple in BMHE, which I tho...

Read more »

Display googleVis charts within RStudio

November 5, 2013
By
Display googleVis charts within RStudio

The preview version 0.98.441 of RStudio introduced a new viewer pane to render local web content and with that it allows me to display googleVis charts within RStudio rather than in a separate browser window. I think this is a rather nice feature and...

Read more »

Finding Reversal Zone in Intraday FX prices

November 5, 2013
By
Finding Reversal Zone in Intraday FX prices

There is no holy grail in trading instead there are strategies that work for a while or in a specific market environment. The role of the analyst is therefore twofold.  First find a good trading strategy, second find the right environment for this strategy. The present post focuses on the latter. More specifically it aims […]

Read more »

qstats – quick and dirty statistics tool for the Unix pipeline

November 5, 2013
By
qstats – quick and dirty statistics tool for the Unix pipeline

Back when 200MB hard drives were the size of washing machines and programs had no choice but to be as efficient as possible, Unix was born. In a serendipitous twist of fate, the same programs that were borne of this era of 4MB RAM and 16 bit processors are useful to data analysts with 2,000 »more

Read more »

Optional stopping in data collection: p values, Bayes factors, credible intervals, precision

November 5, 2013
By
Optional stopping in data collection: p values, Bayes factors, credible intervals, precision

This post argues that data collection should stop when a desired degree of precision is achieved (as measured by a Bayesian credible interval), not when a critical p value is achieved, not when a critical Bayes factor is achieved, and not even when a B...

Read more »

Commissions

November 5, 2013
By
Commissions

Today, I want to explain the commission’s functionality build in to Systematic Investor Toolbox(SIT) “share” back-test. At each re-balance time the capital is allocated given the weight such that For example, if weight is 100% (i.e. fully invested) and capital = $100 and price = $10 then The period return is equal to The total […]

Read more »

Shlemiel the Software Developer and Unknown Unknowns

November 4, 2013
By
Shlemiel the Software Developer and Unknown Unknowns

The Stan meeting today reminded me of Joel Spolsky’s recasting of the Yiddish joke about Shlemiel the Painter. Joel retold it on his blog, Joel on Software, in the post Back to Basics: Shlemiel gets a job as a street painter, painting the dotted lines down the middle of the road. On the first day […]The post Shlemiel the Software Developer and Unknown Unknowns appeared first on Statistical Modeling, Causal…

Read more »

Smoothing mortality rates

November 4, 2013
By
Smoothing mortality rates

This morning, I was working with Julie, a student of mine, coming from Rennes, on mortality tables. Actually, we work on genealogical datasets from a small region in Québec, and we can observe a lot of volatiliy. If I borrow one of her graph, we get something like Since we have some missing data, we wanted to use some Generalized Nonlinear Models. So let us see how to get a…

Read more »

My talk @ the Dutch Chemometrics Symposium

November 4, 2013
By
My talk @ the Dutch Chemometrics Symposium

For same reason, Paul Eilers really liked the talk I gave on INLA at the BayesPharma workshop earlier this year and so he invited me to talk at the Dutch Chemometrics Symposium.Now: you may ask what have I got to do with chemometrics. An...

Read more »

Archival and analysis of #GI2013 Tweets

November 4, 2013
By
Archival and analysis of #GI2013 Tweets

I archived and analyzed all Tweets containing #GI2013 from the recent Cold Spring Harbor Genome Informatics meeting, using my previously described code.Friday was the most Tweeted day. Perhaps this was due to Lior Pachter's excellent keynote, "Stories ...

Read more »

Excel Chart Demo

November 4, 2013
By

As an illustration of how Sharp Statistics can integrate extra analysis tools into Excel there is now a demonstration Excel Add In that can be download and installed to produce some useful statistical plots of your data. It can produce 3 simple statist...

Read more »

Federal Reserve Bank of Philadelphia Launches Improved U.S. GDP Growth Series

November 4, 2013
By

Exciting news for empirical macroeconomics and finance: The Federal Reserve Bank of Philadelphia today released a new and improved \(GDP\) growth series, \(GDPplus\). It's an optimal blend of the BEA's expenditure-side and income-side estimates (c...

Read more »


Subscribe

Email:

  Subscribe