Andrej Kastrin (Author)

Abstract

The high dimensionality of global gene expression profiles, where the number of variables (genes) is very large compared to the number of observations (samples), presents challenges that affect the generalizability and applicability of microarray analysis. Latent variable modeling offers a promising approach to dealing with high-dimensional microarray data. A latent variable model is based on a few latent variables that capture most of the gene expression information. Here, we describe how to reduce dimensionality with a latent variable methodology, which can greatly reduce the number of features used to characterize microarray data. We propose a general latent variable framework for predicting predefined classes of samples using gene expression profiles from microarray experiments. The framework consists of (i) selection of a smaller number of genes that are most differentially expressed between samples, (ii) dimension reduction using hierarchical clustering, where each cluster partition is identified as a latent variable, (iii) discretization of the gene expression matrix, (iv) fitting the Rasch item response model to the genes in each cluster partition to estimate the expression of the latent variable, and (v) construction of a prediction model with latent variables as covariates to study the relationship between latent variables and phenotype. Two different microarray data sets are used to illustrate the approach. We show that the predictive performance of our method is comparable to that of the current best approach based on an all-gene space. The method is general and can be applied to other high-dimensional data problems.
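
Steps (iii) and (iv) of the framework can be illustrated with a minimal Python sketch. It is not the author's implementation, and the abstract does not specify the discretization rule or the estimation procedure. The sketch assumes a median split for discretization and a simplified Rasch fit in which item difficulties are taken from smoothed item proportions and each sample's ability is then obtained by Newton-Raphson. The function names `binarize` and `rasch_scores`, the smoothing constant `eps`, and the toy data are all hypothetical.

```python
import math
from statistics import median


def binarize(expr):
    """Step (iii), assumed rule: 1 if a value exceeds its gene's median, else 0.

    expr is a list of genes; each gene is a list of per-sample expression values.
    """
    out = []
    for row in expr:
        m = median(row)
        out.append([1 if v > m else 0 for v in row])
    return out


def rasch_scores(items, eps=0.5, n_iter=50):
    """Step (iv), simplified: one ability estimate per sample for one gene cluster.

    items is a binary genes-by-samples matrix. Under the Rasch model,
    P(x = 1) = 1 / (1 + exp(-(theta - b))), with sample ability theta
    and item (gene) difficulty b.
    """
    n_items, n_samples = len(items), len(items[0])
    # Assumed shortcut: difficulty = negative logit of the smoothed proportion of 1s.
    diffs = []
    for row in items:
        p = (sum(row) + eps) / (n_samples + 2 * eps)
        diffs.append(-math.log(p / (1.0 - p)))
    mean_d = sum(diffs) / n_items
    diffs = [d - mean_d for d in diffs]  # center difficulties at zero

    thetas = []
    for s in range(n_samples):
        raw = sum(items[g][s] for g in range(n_items))
        # Pull extreme raw scores inward so the estimate stays finite.
        raw = min(max(raw, eps), n_items - eps)
        theta = 0.0
        for _ in range(n_iter):
            probs = [1.0 / (1.0 + math.exp(-(theta - b))) for b in diffs]
            expected = sum(probs)
            info = sum(p * (1.0 - p) for p in probs)
            # Newton-Raphson step on the likelihood equation, clamped for stability.
            theta += max(-1.0, min(1.0, (raw - expected) / info))
        thetas.append(theta)
    return thetas


# Toy cluster: 3 genes measured on 4 samples (made-up values).
cluster = [
    [2.1, 3.5, 7.9, 8.4],
    [1.0, 1.2, 6.3, 5.8],
    [0.4, 4.9, 3.1, 9.0],
]
binary = binarize(cluster)
latent = rasch_scores(binary)
```

Here `latent` holds one value per sample and would serve as a single covariate in step (v). Because the raw score is a sufficient statistic for ability in the Rasch model, samples with more "on" genes in the cluster receive higher latent values.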

Keywords

No keyword information available

Details

Language: English
Year of publication:
Typology: 1.01 - Original scientific article
Organization: UL FDV - Fakulteta za družbene vede (Faculty of Social Sciences)
Publisher: Fakulteta za družbene vede
UDC: 519.7
COBISS: 28668253
ISSN: 1854-0023

Other details

Secondary language: Unknown language
URN: URN:NBN:SI
Type of work (COBISS): Work not categorized
Pages: pp. 51-67
Volume: 6
Issue: 1
Date of issue: 2009
Keywords (UDC): mathematics; natural sciences; mathematical cybernetics
ID: 1469883