Andrej Kastrin (Author)

Abstract

The high dimensionality of global gene expression profiles, where number of variables (genes) is very large compared to the number of observations (samples), presents challenges that affect generalizability and applicability of microarray analysis. Latent variable modeling offers a promising approach to deal with high-dimensional microarray data. The latent variable model is based on a few latent variables that capture most of the gene expression information. Here, we describe how to accomplish a reduction in dimension by alatent variable methodology, which can greatly reduce the number of features used to characterize microarray data. We propose a general latent variable framework for prediction of predefined classes of samples using gene expression profiles from microarray experiments. The framework consists of (i) selection of smaller number of genes that are most differentially expressed between samples, (ii) dimension reduction using hierarchical clustering, where each cluster partition is identified as latent variable, (iii) discretization of gene expression matrix, (iv) fitting the Rasch item response model for genes in each cluster partition to estimate the expression of latent variable, and (v) construction of prediction model with latent variables as covariates to study the relationship between latent variables and phenotype. Two different microarray data sets are used to illustrate a general framework of the approach. We show that the predictive performance of our method is comparable to the current best approach based on an all-gene space. The method is general and can be applied to the other high-dimensional data problems.

Keywords

No keyword data available

Data

Language: English
Year of publishing:
Typology: 1.01 - Original Scientific Article
Organization: UL FDV - Faculty of Social Sciences
Publisher: Fakulteta za družbene vede
UDC: 519.7
COBISS: 28668253 Link will open in a new window
ISSN: 1854-0023
Views: 1005
Downloads: 159
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: Unknown
URN: URN:NBN:SI
Type (COBISS): Not categorized
Pages: str. 51-67
Volume: ǂVol. ǂ6
Issue: ǂno. ǂ1
Chronology: 2009
Keywords (UDC): mathematics;natural sciences;naravoslovne vede;matematika;mathematics;matematika;mathematical cybernetics;matematična kibernetika;
ID: 1469883
Recommended works:
, diplomsko delo univerzitetnega študijskega programa
, naloge iz matematične pismenosti in problemske naloge
, no subtitle data available
, študijsko gradivo