## Q-A Section 11–GWAS, Linkage Analysis

July 15, 2012
By

eQTL tries to regress each gene expression against each SNP, in order to find those regulatory elements. And eQTL uses “normal” samples, right? (by normal I mean “no disease” like those in 1000genome project) GWAS compares SNPs between normal(control) and disease(test) samples, trying to find out those higher-frequency variants enriched for diseases. linkage mapping/recombination mapping/positional [...]

## Useful for referring–7-15-2012

July 15, 2012
By

Simplicity is hard to sell Self-Repairing Bayesian Inference Praxis and Ideology in Bayesian Data Analysis In-consistent Bayesian inference Big Data Generalized Linear Models with Revolution R Enterprise Quants, Models, and the Blame Game Fun with the googleVis Package for R Topological Data Analysis The Winners of the LaTeX and Graphics Contest  Is Machine Learning Losing Impact? Machine Learning Doesn’t [...]

## Sunday Data/Statistics Link Roundup (7/15/12)

July 15, 2012
By

A really nice list of journals software/data release policies from Titus’ blog. Interesting that he couldn’t find a data/release policy for the New England Journal of Medicine. I wonder if that is because it publishes mostly clinical studie...

## Open Government Data Benchmark: FR, UK, USA

July 15, 2012
By

Finally there’s a very interesting comparison of OGD in three leading countries. qunb did it . Have a look at this presentation. 1) There are lots of duplicates on OGD platforms . 2) There are very few structured data yet . . 3) Apps are the real challenge There are different strategies fostering the developmemt …Read More

## Some decision analysis problems are pretty easy, no?

July 15, 2012
By

Cassie Murdoch reports: A 47-year-old woman in Uxbridge, Massachusetts, got behind the wheel of her car after having a bit too much to drink, but instead of wreaking havoc on the road, she ended up lodged in a sand trap at a local golf course. Why? Because her GPS made her do it—obviously! She said [...]

## The 1000 Genomes Project Community Meeting 2012

July 15, 2012
By

I attended the The 1000 Genomes Project Community Meeting 12th and 13th July 2012 at University of Michigan. Because lots of the presentations are more from the computation point of view instead of statistical point of view, I have little idea about those talks. But from my point of view, if you want me to [...]

## Bits: Betaworks Buys What’s Left of Social News Site Digg

July 15, 2012
By

Bits: Betaworks Buys What's Left of Social News Site Digg: Betaworks, a technology incubator in New York, has purchased Digg, a social news site that has slowly lost steam amid the rise of services like Twitter and Facebook. It’s a bit sad to see Di...

## Sourcing an R Script from Dropbox

July 15, 2012
By

Working on my R bootcamp materials and I thought it would be handy to get the bootcamp computers setup by sourcing an R script that will install all necessary non-core packages in it. The problem? How to deploy this script efficiently. A quick method w...

## Modern Two-Sample Tests

July 14, 2012
By
$Modern Two-Sample Tests$

When you are a student, one of the first problems you learn about is the two-sample test. So you might think that this problem is old news. But it has had a revival: there is a lot of recent research activity on this seemingly simple problem. What makes the problem still interesting and challenging is [...]

## Linear programming in R: an lpSolveAPI example

July 14, 2012
By

First of all, a shout out to R-bloggers for adding my feed to their website! Linear programming is a valuable instrument when it comes to decision making. This post shows how R in conjunction with the lpSolveAPI package, can be used to build a linear programming model and to analyse  Read more »The post Linear programming in R: an lpSolveAPI example appeared first on FishyOperations.

## Bits: Mobile App Developers Scoop Up Vast Amounts of Data, Reports Say

July 14, 2012
By

Bits: Mobile App Developers Scoop Up Vast Amounts of Data, Reports Say: As mobile apps proliferate, new details are emerging about how much data mobile advertising networks inhale and how uninformed their users might be about this. Two new reports thi...

## Ripping off a ripoff

July 14, 2012
By

I opened the newspaper today (recall that this blog is on an approximately one-month delay) to see a moderately horrifying story about art appraisers who are deterred by fear of lawsuits from expressing an opinion about possible forgeries. Maybe this trend will come to science too? Perhaps Brett Pelham will sue Uri Simonsohn for the [...]

## Computing log gamma differences

July 14, 2012
By

Statistical computing often involves working with ratios of factorials. These factorials are often too big to fit in a floating point number, and so we work with logarithms. So if we need to compute log(a! / b!), we call software…Read more ›

## Dynamical systems: Mapping chaos with R

July 13, 2012
By
$Dynamical systems: Mapping chaos with R$

Chaos. Hectic, seemingly unpredictable, complex dynamics. In a word: fun. I usually stick to the warm and fuzzy world of stochasticity and probability distributions, but this post will be (almost) entirely devoid of randomness. While chaotic dynamics are entirely deterministic, their sensitivity to initial conditions can trick the observer into seeing iid. In ecology, chaotic

July 13, 2012
By

This is not about statistics, but it’s about Emacs, which I’ve been using for a long time. This guy is an Emacs virtuoso, and the crazy thing is that he’s only been using it for 8 months! Best line: “Should I wait for the next version of Emacs?...

## Retractions, retractions: “left-wing enough to not care about truth if it confirms their social theories, right-wing enough to not care as long as they’re getting paid enough”

July 13, 2012
By

Two news items. 1. A couple people pointed me to the uncovering of another fraudulent Dutch psychology researcher—this time it was Dirk Smeesters, rather than Diederik Stapel. It’s hard to keep these guys straight—they all pretty much have the same names and do the same things. Stapel and Smeesters also seem to live in the [...]

## Staggering excess

July 13, 2012
By

Business Insider calls this stacked bar chart "staggering" (link). Maybe they are referring to its complexity. Is there a reason to include all the fine details? The details serve little purpose other than to shout at readers that there is...

## Examples and resources on association rule mining with R

July 13, 2012
By

by Yanchang Zhao, RDataMining.com The technique of association rules is widely used for retail basket analysis, as well as in other applications to find assocations between itemsets and between sets of attribute-value pairs. It can also be used for classification … Continue reading →

## Higgs-Boson and p-values: A Response to Wasserman

July 13, 2012
By

Several blogs have devoted attention to reporters' treatment of the evidence for Higgs-Boson. (For example,  others have posts here and here.) These posts criticize reporters who interpret p-values from the Higgs-Boson experiments as \(P(text{no Higgs-like particle } | text{ data})\).…

## 1-Month Reversal Strategy

July 13, 2012
By

Today I want to show a simple example of the 1-Month Reversal Strategy. Each month we will buy 20% of loosers and short sell 20% of winners from the S&P 500 index. The loosers and winners are measured by prior 1-Month returns. I will use this post to set the stage for my next post [...]

## Dennis Lindley’s “Philosophy of Statistics”

July 13, 2012
By

Yesterday’s slight detour [i] presents an opportunity to (re)read Lindley’s “Philosophy of Statistics” (2000) (see also an earlier post).  I recommend the full article and discussion. There is actually much here on which we agree. The Philosophy of Statistics Dennis V. Lindley The Statistician (2000) 49:293-319 Summary. This paper puts forward an overall view of [...]

## SAS functions for computing parameters in Erlang-C model

July 12, 2012
By

Call center management is both Arts and Sciences. While driving moral and setting up strategies is more about Arts, staffing and servicing level configuration based on call load is in the domain of Sciences. The science part of call center management ...

## Using discrete-event simulation to simulate hospital processes

July 12, 2012
By

Discrete-event simulation is a very useful tool when it comes to simulating alternative scenario’s for current of future business operations. Let’s take the following case; Patients of an outpatient diabetes clinic are complaining about long waiting times, this seems to have an adverse effect on patient satisfaction and patient retention.  Read more »The post Using discrete-event simulation to simulate hospital processes appeared first on FishyOperations.