The C Clustering Library

plantationscarfAI and Robotics

Nov 25, 2013 (3 years and 6 months ago)

38 views

Open source clustering software




Clustering
extension

module

for
Python
-
language




Can

be

used

in association
with

Python
-
language

to
perform

clustering
routines




Uses

C Clustering
Library




Manual

can

be

downloaded

from

following

link



The C Clustering Library

The University of Tokyo, Institute of Medical Science, Human Genome Center


“implement the most commonly used clustering methods for gene expression data analysis”

The clustering algorithms are:



Hierarchical clustering (
pairwise

centroid
-
, single
-
, complete
-
, and average
-
linkage)


k
-
means clustering


Self
-
Organizing Maps


Principal Component Analysis.


To measure the similarity or distance between gene expression data, eight

distance measures are available:



Pearson correlation


Absolute value of the Pearson correlation


Uncentered

Pearson correlation (equivalent to the cosine of the angle between two data


vectors)


Absolute
uncentered

Pearson correlation (equivalent to the cosine of the smallest angle


between two data vectors)


Spearman's rank correlation


Kendall's ¿


Euclidean distance;


City
-
block distance.