Data Mining

Data mining blogs

%SVD macro with BY-Processing

December 18, 2014

For the Regularized Discriminant Analysis Cross Validation, we need to compute the SVD for each pair of \((\lambda, \gamma)\), and the factorization result will be fed to the downdating algorithm to obtain the leave-one-out variance-covariance matrix \(\hat{\...

Experimental downdating algorithm for Leave-One-Out CV in RDA

December 15, 2014

In this post, I want to demonstrate a piece of experimental code for a downdating algorithm for Leave-One-Out (LOO) Cross Validation in Regularized Discriminant Analysis [1]. In LOO CV, the program needs to calculate the inverse of \(\hat{\Sigma}_{k\v}(\la...

Control Excel via SAS DDE & Python win32com

December 15, 2014

Excel is probably the most widely used interface between humans and data. Whenever you deal with business people, Excel is the de facto tool for all things data processing. I used to only use SAS and Python for number crunching but in one of my r...

Recordings of RStudio Webinar Series on Essential Tools for Data Science with R

December 9, 2014

by Yanchang Zhao, RDataMining.com

RStudio recently ran a series of live webinars on Essential Tools for Data Science with R, but the live sessions were inconvenient for people in other time zones to attend. Fortunately, the recordings have been made available online, …

R and Data Mining – Examples and Case Studies now in Chinese

December 1, 2014

My book, R and Data Mining – Examples and Case Studies, now has a Chinese edition, translated by researchers at South China University of Technology and published by China Machine Press in September 2014. It is sold in China …

R and Data Mining Workshop at AusDM 2014, Brisbane, 27 November

November 24, 2014

R and Data Mining Workshop at AusDM 2014: http://ausdm14.ausdm.org/workshop

There will be a half-day workshop on R and Data Mining at the AusDM 2014 conference in Brisbane on Thursday afternoon, 27 November. The workshop will be composed of several sessions on …

Slides of keynote speeches, tutorials and panelist presentations at IEEE Big Data 2014

November 23, 2014

Slides of keynote speeches, tutorials and panelist presentations at the 2014 IEEE International Conference on Big Data can be found at the conference website at links below. (1) Keynote speech http://cci.drexel.edu/bigdata/bigdata2014/keynotespeech.htm – Never-Ending Language Learning, Tom Mitchell – E. Fredkin … Continue reading →

Free Stanford online course on Statistical Learning (with R) starting on 19 Jan 2015

November 21, 2014

This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and linear discriminant analysis; cross-validation and the bootstrap, model selection and regularization methods (ridge and …

AusDM 2014 Conference Program

November 12, 2014

The program of the AusDM 2014 Conference is now available at http://ausdm14.ausdm.org/program. It features two keynote talks, one on Learning in Sequential Decision Problems by Prof Peter Bartlett from UC Berkeley, and the other on Making Sense of a Random World through …

SBS documentary “The Age of Big Data”

November 8, 2014

by Yanchang Zhao, RDataMining.com

“Data is becoming a powerful and most valuable commodity in 21st century. It is leading to scientific insights and new ways of understanding human behaviour. Data can also make you rich. Very rich.” – SBS documentary …

