SAS

Blogs on the SAS software

An Economic Approach for a Class of Dimensionality Reduction Techniques

July 30, 2010

Just back from KDD2010. Several papers at the conference interested me. On the computation side, Liang Sun et al.'s paper [1], "A Scalable Two-Stage Approach for a Class of Dimensionality Reduction Techniques", caught my eye. Liang pro...

Read more »

Implement Randomized SVD in SAS

July 13, 2010

In the 2010 SASware Ballot®, a dedicated PROC for Randomized SVD was among the options. An official SAS PROC will not be available in the immediate future, nor in older SAS releases, but it is fairly simple to implement this algorithm using ex...

Read more »

"Entrywise" Norm calculation using PROC FASTCLUS

June 26, 2010

In some data mining applications, a matrix norm has to be calculated; see, for instance, [1]. A detailed explanation of matrix norms can be found on Wikipedia. Instead of a user-written routine in a DATA step, we can obtain the "entrywise" norm via PROC FASTCLUS effi...

Read more »

Boost to tackle nonlinearity

June 1, 2010

data nonlinear;
   do x=1 to 627;
      p=(sin(x/100)+1)*0.45;
      do j=1 to 100;
         x1=x+(j-1)/100;
         if ranuni(8655645)<=p then y=1; else y=0;
         output;
         drop p j;
      end;
   end;
run;
proc rank data=nonlinear out=nonlinearrank groups=...

Read more »

K-Nearest Neighbor in SAS

May 5, 2010

K-Nearest-Neighbor, aka KNN, is a widely used data mining tool and is often called a memory-based, case-based, or instance-based method because no model is fit. A good introduction to KNN can be found in [1] or on Wikipedia. Typically, the KNN algorithm relies on a soph...

Read more »

Next Project: Regularized Logistic Regression

May 5, 2010

L1 Regularized Logistic Regression (L1 RLR) handles a large number of predictors effectively and performs variable selection at the same time. [1] indicates that L1 RLR can be implemented via the IRLS-LARS algorithm. You can tweak PROC GLMSELECT in v9.2 for this. L2 R...

Read more »

Conduct R analysis within SAS

April 30, 2010

R is attractive to statistical analysts for its ease of use and ready access to packages implementing modern methodologies. If you have IML, you can submit R commands within the SAS/IML environment; see Rick's post. Unfortunately, not all analyst...

Read more »

