EMSS 2012 Proceeding

Unsupervised learning approach to feature selection in biological data analysis

Authors:   Witold Jacak, Karin Proell

Abstract

In this paper we present a novel method for scoring function specification and feature selection by combining unsupervised learning with supervised cross validation. Unsupervised clustering methods (k-means, one dimensional Kohonen SOM, fuzzy c-means) are used to perform a clustering of object-data for a chosen subset of input features and given number of clusters. The resulting object clusters are compared with the predefined original object classes and a matching factor (score) is calculated. This score is used as criterion function for heuristic sequential feature selection and novel cross selection algorithm.

I3M  Scientific Sponsors

I3M  Industrial Sponsors

I3M  Media Sponsors