Blog Archives

Bayesian Classification with Gaussian Process

January 6, 2013
By
Bayesian Classification with Gaussian Process

Despite prowess of the support vector machine, it is not specifically designed to extract features relevant to the prediction. For example, in network intrusion detection, we need to learn relevant network statistics for the network defense. In consu...

Read more »

Bayesian Inference Using OpenBUGS

July 23, 2012
By
Bayesian Inference Using OpenBUGS

In our previous statistics tutorials, we have treated population parameters as fixed values, and provided point estimates and confidence intervals for them. An alternative approach is the Bayesian statistics. It treats population parameters as random...

Read more »

Significance Test for Kendall’s Tau-b

April 15, 2012
By
Significance Test for Kendall’s Tau-b

A variation of the standard definition of Kendall correlation coefficient is necessary in order to deal with data samples with tied ranks. It known as the Kendall’s tau-b coefficient and is more effective in determining whether two non-parametr...

Read more »

Support Vector Machine with GPU, Part II

October 22, 2011
By
Support Vector Machine with GPU, Part II

In our last tutorial on SVM training with GPU, we mentioned a necessary step to pre-scale the data with rpusvm-scale, and to reverse scaling the prediction outcome. This cumbersome procedure is now simplified with the latest RPUSVM. read more

Read more »

Support Vector Machine with GPU

August 28, 2011
By
Support Vector Machine with GPU

Most elementary statistical inference algorithms assume that the data can be modeled by linear parameters with a normally distributed error component. A new class of algorithms called support vector machine (SVM) remove such constraint. read more

Read more »

Kendall Rank Coefficient

December 7, 2010
By
Kendall Rank Coefficient

The correlation coefficient is a measurement of correlation between two random variables. While its computation is straightforward, it is not readily applicable to non-parametric statistics. read more

Read more »

Hierarchical Cluster Analysis

November 25, 2010
By
Hierarchical Cluster Analysis

With the distance matrix found in previous tutorial, we can use various techniques of cluster analysis for relationship discovery. For example, in the data set mtcars, we can run the distance matrix with hclust, and plot a dendrogram that displays a ...

Read more »

Type II Error

November 23, 2009
By
Type II Error

In hypothesis testing, a type II error is due to a failure of rejecting an invalid null hypothesis. The probability of avoiding a type II error is called the power of the hypothesis test, and is denoted by the quantity 1 - β . read more

Read more »

Multiple Linear Regression

September 14, 2009
By
Multiple Linear Regression

A multiple linear regression (MLR) model that describes a dependent variable y by independent variables x1, x2, ..., xp (p > 1) is expressed by the equation as follows, where the numbers α and βk (k = 1, 2, ..., p) are the parameter...

Read more »

Analysis of Variance

September 1, 2009
By
Analysis of Variance

In an experiment study, various treatments are applied to test subjects and the response data is gathered for analysis. A critical tool for carrying out the analysis is the Analysis of Variance (ANOVA). It enables a researcher to differentiate treatm...

Read more »

Subscribe

Email:

  Subscribe