Data Mining

Data mining blogs

Hadoop and Neo4j

February 23, 2015
By
Hadoop and Neo4j

Hadoop is being widely used for processing big data and Neo4j is a popular open-source graph database. When doing social network analysis on big data, a “natural” thought is to use them together. Unfortunately, Neo4j cannot work directly on HDFS … Continue reading →

Read more »

CFP: 13th Australasian Data Mining Conference (AusDM 2015)

February 9, 2015
By
CFP: 13th Australasian Data Mining Conference (AusDM 2015)

The 13th Australasian Data Mining Conference (AusDM 2015) Sydney, Australia, 8-9 August 2015 co-located with SIGKDD’15 URL: http://ausdm15.ausdm.org/ Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891 The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and … Continue reading →

Read more »

UIUC free online courses on data mining starting on 9 Feb, lectured by Prof. Jiawei Han et al.

February 8, 2015
By
UIUC free online courses on data mining starting on 9 Feb, lectured by Prof. Jiawei Han et al.

by Yanchang Zhao, RDataMining.com A series of free online data mining courses will start on 9 Feb 2015, lectured by Prof. Jiawei Han and several other staff at UIUC. Prof. Han is one of the top data mining researchers around … Continue reading →

Read more »

Migrating code pieces to GitHub

February 5, 2015
By
Migrating code pieces to GitHub

One of the original reasons for this blog was to keep track of my SAS code as well as its relevant context. That was the mindset when I was a SAS analyst, but now working in professional software company, using the right tool for versioning, col...

Read more »

Free online data mining and machine learning courses by Stanford University

February 5, 2015
By
Free online data mining and machine learning courses by Stanford University

by Yanchang Zhao, RDataMining.com Three free online data mining and machine learning courses lectured by professors at Stanford University started in past two weeks, which provide excellent opportunities to learn advanced data mining and machine learning techniques. If you are … Continue reading →

Read more »

Canberra IAPA Seminar – Text Analytics: Natural Language into Big Data – 17 February

February 3, 2015
By
Canberra IAPA Seminar – Text Analytics: Natural Language into Big Data – 17 February

Topic: Text Analytics: Natural Language into Big Data Speaker: Dr. Leif Hanlen, Technology Director at NICTA Date: Tuesday 17 February Time: 5.30pm for a 6pm start Cost: Nil Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600 Registration URL: http://www.iapa.org.au/Event/TextAnalyticsNaturalLanguageIntoBigData … Continue reading →

Read more »

Stamp of Approval

January 3, 2015
By
Stamp of Approval

After getting a hint of this a few months ago, I've finally tracked down an image of a stamp that will be released this year to celebrate the invention of the World Wide Web using an image I created. Here's...

Read more »

%SVD macro with BY-Processing

December 18, 2014
By
%SVD macro with BY-Processing

For the Regularized Discriminant Analysis Cross Validation, we need to compute SVD for each pair of \((\lambda, \gamma)\), and the factorization result will be feed to the downdating algorithm to obtain leave one out variance-covariance matrix \(\hat{\...

Read more »

Experient downdating algorithm for Leave-One-Out CV in RDA

December 15, 2014
By
Experient downdating algorithm for Leave-One-Out CV in RDA

In this post, I want to demonstrate a piece of experiment code for downdating algorithm for Leave-One-Out (LOO) Cross Validation in Regularized Discriminant Analysis [1]. In LOO CV, the program needs to calculate the inverse of \(\hat{\Sigma}_{k\v}(\la...

Read more »

Control Excel via SAS DDE & Python win32com

December 15, 2014
By
Control Excel via SAS DDE & Python win32com

Excel is probably the most used interface between human and data. Whenever you are dealing with business people, Excel is the de facto means for all things about data processing. I used to only use SAS and Python for number crunching but in one of my r...

Read more »


Subscribe

Email:

  Subscribe