A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification

2016-06-30
Arikan, Saziye Deniz Oguz
İyigün, Cem
Biclustering groups samples and features simultaneously in the given set of data. When biclusters are obtained from the data, clusters of samples and clusters of features that determine the partitioning of samples into the underlying clusters are also obtained. We focus on a supervised biclustering problem leading to unsupervised feature selection. We formulate this problem as an optimization model which aims to maximize classification accuracy by selecting a small subset of features. We solve the model with exact and inexact solution methods based on optimization techniques. Microarray cancer datasets are used to experiment our approach.

Suggestions

A binomial noised model for cluster validation
Toledano-Kitai, Dvora; Avros, Renata; Volkovich, Zeev; Weber, Gerhard Wilhelm; Yahalom, Orly (IOS Press, 2013-01-01)
Cluster validation is the task of estimating the quality of a given partition of a data set into clusters of similar objects. Normally, a clustering algorithm requires a desired number of clusters as a parameter. We consider the cluster validation problem of determining the optimal ("true") number of clusters. We adopt the stability testing approach, according to which, repeated applications of a given clustering algorithm provide similar results when the specified number of clusters is correct. To implemen...
A new framework of multi-objective evolutionary algorithms for feature selection and multi-label classification of video data
Karagoz, Gizem Nur; Yazıcı, Adnan; Dokeroglu, Tansel; Coşar, Ahmet (2020-06-01)
There are few studies in the literature to address the multi-objective multi-label feature selection for the classification of video data using evolutionary algorithms. Selecting the most appropriate subset of features is a significant problem while maintaining/improving the accuracy of the prediction results. This study proposes a framework of parallel multi-objective Non-dominated Sorting Genetic Algorithms (NSGA-II) for exploring a Pareto set of non-dominated solutions. The subsets of non-dominated featu...
A Methodology to develop process ontology from organizational guidelines written in natural language
Gürbüz, Özge; Demirörs, Onur; Department of Information Systems (2017)
Integrating ontologies with process modeling improves data representations and makes it easier to query, store and reuse processes at the semantics level. Therefore, in recent years, this topic has become increasingly popular. The studies in the literature have proposed methods for the integration process either to relate domain ontologies to process models or to transform process models to process ontologies. Another way to establish the integration between ontologies and process models is to develop proce...
A new electromagnetic target classification method with MUSIC algorithm
Secmen, Mustafa; Sayan, Gönül (2006-01-01)
This paper introduces a novel method for aspect invariant electromagnetic target recognition based on the use of multiple signal classification (MUSIC) algorithm to extract late-time resonant target features from the ultra-wideband scattered data. This method is mainly based on the usage of MUSIC spectra obtained from electromagnetic scattered data as the target features. This approach achieves very high accuracy rates even at very low signal-to-noise ratio (SNR) values although it needs scattered data for ...
A novel electromagnetic target recognition method by MUSIC algorithm
Secmen, Mustafa; Sayan, Gönül (2006-12-01)
This paper introduces a novel method for aspect invariant electromagnetic target recognition based on the use of multiple signal classification (MUSIC) algorithm to extract late-time resonant target features from the ultra-wideband scattered data. This approach achieves very high accuracy rates even at very low signal-to-noise ratio (SNR) values although it needs scattered data for classifier design at only a few different aspects and makes use of the MUSIC algorithm in a simple and computationally efficien...
Citation Formats
S. D. O. Arikan and C. İyigün, “A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification,” 2016, vol. 9714, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/38151.