Posts Tagged ‘ Uncategorized ’

Data handcuffs

February 10, 2016
By

A few years ago, if you asked me what the top skills I got asked about for students going into industry, I'd definitely have said things like data cleaning, data transformation, database pulls, and other non-traditional statistical tasks. But as companies have progressed from the point of storing data to actually wanting to do something with

Read more »

Sample with replacement and unequal probability in SAS

February 10, 2016
By
Sample with replacement and unequal probability in SAS

How do you sample with replacement in SAS when the probability of choosing each observation varies? I was asked this question recently. The programmer thought he could use PROC SURVEYSELECT to generate the samples, but he wasn't sure which sampling technique he should use to sample with unequal probability. This […] The post Sample with replacement and unequal probability in SAS appeared first on The DO Loop.

Read more »

Leek group guide to reading scientific papers

February 9, 2016
By

The other day on Twitter Amelia requested a guide for reading papers I love @jtleek’s github guides to reviewing papers, writing R packages, giving talks, etc. Would love one on reading papers, for students. — Amelia McNamara (@AmeliaMN) February 5, 2016   So I came up with a guide which you can find here: Leek

Read more »

Phd positions in Probabilistic Machine Learning at #AaltoPML group Finland

February 9, 2016
By
Phd positions in Probabilistic Machine Learning at #AaltoPML group Finland

There are PhD positions in our Probabilistic Machine Learning group at Aalto, Finland, and altogether 15 positions in Helsinki ICT network. Apply here The most interesting topic in the call is supervised by Prof. Samuel Kaski at AaltoPML (and you may collaborate with me too :) We are looking for PhD candidates interested in probabilistic […] The post Phd positions in Probabilistic Machine Learning at #AaltoPML group Finland appeared first…

Read more »

Read data into vectors or into a matrix: Which is better?

February 8, 2016
By
Read data into vectors or into a matrix: Which is better?

In the SAS/IML language, you can read data from a SAS data set into a set of vectors (each with their own name) or into a single matrix. Beginning programmers might wonder about the advantages of each approach. When should you read data into vectors? When should you read data […] The post Read data into vectors or into a matrix: Which is better? appeared first on The DO Loop.

Read more »

Speaking at DataPhilly February 2016

February 5, 2016
By
Speaking at DataPhilly February 2016

The next DataPhilly meetup will feature a medley of machine-learning talks, including an Intro to ML from yours truly. Check out the speakers list and be sure to RSVP. Hope to see you there! Thursday, February 18, 2016 6:00 PM to 9:00 PM Speakers: Corey Chivers Randy Olson Austin Rochford Corey Chivers (Penn Medicine) Abstract: […]

Read more »

Rolling statistics in SAS/IML

February 3, 2016
By
Rolling statistics in SAS/IML

Last week I showed how to use PROC EXPAND to compute moving averages and other rolling statistics in SAS. Unfortunately, PROC EXPAND is part of SAS/ETS software and not every SAS site has a license for SAS/ETS. For simple moving averages, you can write a DATA step program, as discussed […] The post Rolling statistics in SAS/IML appeared first on The DO Loop.

Read more »

Summer internship positions for undergraduate students with Aki

February 2, 2016
By

There are couple cool summer internship positions for undergraduate students (BSc level) in Probabilistic Machine Learning group at Aalto (Finland) with me (Aki) and Samuel Kaski. Possible research topics are related to Bayesian inference, machine learning, Stan, disease risk prediction, personalised medicine, computational biology, contextual information retrieval, information visualization, etc. Application deadline 18 February. See more […] The post Summer internship positions for undergraduate students with Aki appeared first on Statistical…

Read more »

A menagerie of messed up data analyses and how to avoid them

February 1, 2016
By
A menagerie of messed up data analyses and how to avoid them

Update: I realize this may seem like I'm picking on people. I really don't mean to, I have for sure made all of these mistakes and many more. I can give many examples, but the one I always remember is the time Rafa saved me from "I got a big one here" when I made

Read more »

Group processing in SAS: The NOTSORTED option

February 1, 2016
By
Group processing in SAS: The NOTSORTED option

Novice SAS programmers quickly learn the advantages of using PROC SORT to sort data, followed by a BY-group analysis of the sorted data. A typical example is to analyze demographic data by state or by ZIP code. A BY statement enables you to produce multiple analyses from a single procedure […] The post Group processing in SAS: The NOTSORTED option appeared first on The DO Loop.

Read more »


Subscribe

Email:

  Subscribe