Microarray technology has been the leading research direction in medicine, pharmacology, genome studies and other related areas over the past years. This technology enables researches to simultaneously study activity expression of tens of thousands of genes. After the experimental data have been processed, arrays of numerical values of gene expressions are obtained that are the basis for receiving relevant information and new knowledge. This paper briefly overviews the basics of microarray technology as well as task classes that could be solved using microarray data. The existing approaches to clustering gene expression sets are discussed. It is shown that the fuzzy c-means clustering method appears the most appropriate for that purpose. Due to that, the problem of choosing an optimal size of fuzziness parameter arises. Three widespread techniques for solving the problem are considered and their comparative analysis is provided.
Tarca, A. L., Romero, R., Draghici, S. "Analysis of microarray experiments of gene expression profiling," American Journal of Obstetrics and Gynecology, Vol.195, pp.373-378, 2006.
Tjaden, B., Cohen, J. "A Survey of Computational Methods Used in microarray Data Interpretation," in Applied Mycology and Biotechnology, Vol. 6, Bioinformatics, D. K. Arora, R. M. Berka and G. B. Sigh, Eds. Elsevier Science, 2006, pp.161-178.
Schwämmle, V., Jensen, O. N. "A simple and fast method to determine the parameters for fuzzy c-means cluster validation," Available: http://arxiv.org/abs/1004.1307v1