“Patchwriting” is a Wegmanesque abomination but maybe there’s something similar that could be helpful?

November 12, 2014
By

Reading Thomas Basbøll’s blog I came across a concept I’d not previously heard about, “patchwriting,” which is defined as “copying from a source text and deleting some words, altering grammatical structures, or plugging in one synonym for another.” (See here for further discussion.) As Basbøll writes, this is simply a variant of plagiarism, indeed it’s […] The post “Patchwriting” is a Wegmanesque abomination but maybe there’s something similar that could…

Read more »

AusDM 2014 Conference Program

November 12, 2014
By
AusDM 2014 Conference Program

The Program of AusDM 2014 Conference is now available at http://ausdm14.ausdm.org/program. It features two keynote talks, one on Learning in Sequential Decision Problems by Prof Peter Bartlett from UC Berkeley, and the other on Making Sense of a Random World through … Continue reading →

Read more »

The distribution of Pythagorean triples

November 12, 2014
By
The distribution of Pythagorean triples

When I studied high school geometry, I noticed that many homework problems involved right triangles whose side lengths were integers. The canonical example is the 3-4-5 right triangle, which has legs of length 3 and 4 and a hypotenuse of length 5. The triple (3, 4, 5) is called a […]

Read more »

VIS 2014 – Tuesday

November 12, 2014
By

The big opening day of the conference, Tuesday, brought us a keynote, talks, and panels. Also, a new trend I really like: many talks end with the URL of a webpage that contains a brief summary of the paper, the PDF, and often even a link to the source code of the tool they developed. … Continue reading VIS 2014 – Tuesday

Read more »

Convergence of a Series

November 12, 2014
By
Convergence of a Series

Let us explore using simulation some of the concepts of basic asymptotic theory as presented in Wooldridge 2012, Chapter 3.Definition: A sequence of nonrandom numbers {a_N:N=1,2,...} converges to a if for all epsilon>0 there exists N_epsilon such th...

Read more »

Crowdsourcing Data Analysis 2: Gender, Status, and Science

November 12, 2014
By

Emily Robinson writes: Brian Nosek, Eric Luis Uhlmann, Amy Sommer, Kaisa Snellman, David Robinson, Raphael Silberzahn, and I have just launched a second crowdsourcing data analysis project following the success of the first one. In the crowdsourcing analytics approach, multiple independent analysts are recruited to test the same hypothesis on the same data set in whatever […] The post Crowdsourcing Data Analysis 2: Gender, Status, and Science appeared first on Statistical…

Read more »

IJF review papers

November 11, 2014
By
IJF review papers

Review papers are extremely useful for new researchers such as PhD students, or when you want to learn about a new research field. The International Journal of Forecasting produced a whole review issue in 2006, and it contains some of the most highly cited papers we have ever published. Now, beginning with the latest issue […]

Read more »

Normality Testing & Non-Stationary Data

November 11, 2014
By
Normality Testing & Non-Stationary Data

Bob Jensen emailed me about my recent post about the way in which the Jarque-Bera test can be impacted when temporally aggregated data are used. Apparently he publicized my post on the listserv for Accounting Educators in the U.S.. He also drew my...

Read more »

The history of MRP highlights some differences between political science and epidemiology

November 11, 2014
By

Responding to a comment from Thomas Lumley (who asked why MRP estimates often seem to appear without any standard errors), I wrote: In political science, MRP always seems accompanied by uncertainty estimates. However, when lots of things are being displayed at once, it’s not always easy to show uncertainty, and in many cases I simply […] The post The history of MRP highlights some differences between political science and epidemiology…

Read more »

VIS 2014 – Monday

November 11, 2014
By
VIS 2014 – Monday

IEEE VIS 2014 technically began on Saturday, with the first full day open to all attendees being Sunday. Monday continued the workshops and tutorials, and that is where we join our intrepid reporter. VIS Social Run The day started at 6:30am, when five fearless runners braved the cold and dark, and completed the inaugural VIS … Continue reading VIS 2014 – Monday

Read more »

Read Before You Cite!

November 11, 2014
By
Read Before You Cite!

Note to self - file this post in the "Look Before You Leap" category!Looking at The New Zealand Herald newspaper this morning, this headline caught my eye:"How Did Sir Owen Glenn's Domestic Violence Inquiry Get $7 Billion Figure Wrong?"$7&nbs...

Read more »

Unknown pleasures

November 11, 2014
By
Unknown pleasures

Have I missed unknown pleasures in Python by focusing on R? A comment on my blog post of last week suggested just that. Reason enough to explore Python a little. Learning another computer language is like learning another human language - it takes time...

Read more »

Mais que s’est-il passé pendant la Première Guerre Mondiale?

November 11, 2014
By
Mais que s’est-il passé pendant la Première Guerre Mondiale?

La réponse courte est que des gens sont morts. Beaucoup. Cela étant dit, on ne dit pas grand chose. On peut comparer les pyramides des âges pour mieux comprendre ce qui a pu se passer. Juste avant la guerre (en 1913), la pyramide des âges ressemblait à ça (en utilisant les données de mortality.org) > EXPO <- read.table( + "http://freakonometrics.free.fr/Exposures-France.txt", header=TRUE,skip=2) > EM=EXPO$Male > EF=EXPO$Female > Y= EXPO$Year > A= EXPO$Age…

Read more »

Munging fixed width formats in Python

November 11, 2014
By
Munging fixed width formats in Python

In a previous post, I described how to munge fixed width format data in R. I also developed Python code for the same use case, which is described in this IPython Notebook. This seems the easiest way to present this given WordPress.com’s restricti...

Read more »

“LaF”-ing about fixed width formats

November 10, 2014
By
“LaF”-ing about fixed width formats

If you have ever worked with US government data or other large datasets, it is likely you have faced fixed-width format data. This format has no delimiters in it; the data look like strings of characters. A separate format file defines which columns of data represent which variables. It seems as if the format is […]

Read more »

Reverse Regression Follow-up

November 10, 2014
By
Reverse Regression Follow-up

At the end of my recent post on Reverse Regression, I posed three simple questions - homework for the students among you, if you will. Here they are again, with brief "solutions":First recall the context. We fitted the following simple regression ...

Read more »

2nd Edition has shipped (Doing Bayesian Data Analysis)

November 10, 2014
By
2nd Edition has shipped (Doing Bayesian Data Analysis)

I am told by some readers that they have received a physical copy of the 2nd Edition of Doing Bayesian Data Analysis, but I have yet to see it myself. I hope the paper is heavy and opaque, but the book lifts the spirits and the concepts are transparent...

Read more »

Illegal Business Controls America

November 10, 2014
By

The other day I wrote: After encountering the Chicago-cops example I was going to retitle this post, “The psych department’s just another crew” in homage to the line, “The police department’s just another crew” from the rap, “Who Protects Us From You.” But, just to check, I googled that KRS-One rap and it turns out […] The post Illegal Business Controls America appeared first on Statistical Modeling, Causal Inference, and…

Read more »

On deck this week

November 10, 2014
By

Mon: Illegal Business Controls America Tues: The history of MRP highlights some differences between political science and epidemiology Wed: “Patchwriting” is a Wegmanesque abomination but maybe there’s something similar that could be helpful? Thurs: If you do an experiment with 700,000 participants, you’ll (a) have no problem with statistical significance, (b) get to call it […] The post On deck this week appeared first on Statistical Modeling, Causal Inference, and…

Read more »

Rasmus’ socks fit perfectly!

November 10, 2014
By
Rasmus’ socks fit perfectly!

Following the previous post on Rasmus’ socks, I took the opportunity of a survey on ABC I am currently completing to compare the outcome of his R code with my analytical derivation. After one quick correction [by Rasmus] of a wrong representation of the Negative Binomial mean-variance parametrisation [by me], I achieved this nice fit… […]

Read more »

Financial and statistical incentives to over-diagnose and over-treat

November 10, 2014
By

Nice article in the New York Times about the "overdiagnosis" problem in cancer screening. The particular case is thyroid cancer in South Korea. There are a number of things about any form of screening tests that one should always bear in mind: Death rate is measured as the number of deaths divided by the number of people with the disease. The latter number increases with better diagnosis techniques. Better diagnosis…

Read more »

Practical Data Science Cookbook

November 10, 2014
By
Practical Data Science Cookbook

Practical Data Science Cookbook My friends Sean Murphy, Ben Bengfort, Tony Ojeda and I recently published a book, Practical Data Science Cookbook. All of us are heavily involved in developing the data community in the Washington DC metro area, serving on the Board of Directors of Data Community DC. Sean and Ben co-organize the meetup […]

Read more »

Penn Econometrics Reading Group Materials Online

November 10, 2014
By

Locals who come to the Friday research/reading group will obviously be interested in this post, but others may also be interested in following and influencing the group's path.The schedule has been online here for a while. Starting now, it will co...

Read more »


Subscribe

Email:

  Subscribe