Branislav Panić (Avtor), Jernej Klemenc (Avtor), Marko Nagode (Avtor)

Povzetek

A maximum-likelihood estimation of a multivariate mixture model's parameters is a difficult problem. One approach is to combine the REBMIX and EM algorithms. However, the REBMIX algorithm requires the use of histogram estimation, which is the most rudimentary approach to an empirical density estimation and has many drawbacks. Nevertheless, because of its simplicity, it is still one of the most commonly used techniques. The main problem is to estimate the optimum histogram-bin width, which is usually set by the number of non-overlapping, regularly spaced bins. For univariate problems it is usually denoted by an integer value; i.e., the number of bins. However, for multivariate problems, in order to obtain a histogram estimation, a regular grid must be formed. Thus, to obtain the optimum histogram estimation, an integer-optimization problem must be solved. The aim is therefore the estimation of optimum histogram binning, alone and in application to the mixture model parameter estimation with the REBMIX&EM strategy. As an estimator, the Knuth rule was used. For the optimization algorithm, a derivative based on the coordinate-descent optimization was composed. These proposals yielded promising results. The optimization algorithm was efficient and the results were accurate. When applied to the multivariate, Gaussian-mixture-model parameter estimation, the results were competitive. All the improvements were implemented in the rebmix R package.

Ključne besede

histogram;diskretna optimizacija;ocena parametrov;EM;REBMIX;mešani model;integer optimization;parameter estimation;mixture model;

Podatki

Jezik: Angleški jezik
Leto izida:
Tipologija: 1.01 - Izvirni znanstveni članek
Organizacija: UL FS - Fakulteta za strojništvo
UDK: 004.4(045)
COBISS: 22207235 Povezava se bo odprla v novem oknu
ISSN: 2227-7390
Matična publikacija: Mathematics
Št. ogledov: 462
Št. prenosov: 268
Ocena: 0 (0 glasov)
Metapodatki: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Ostali podatki

Sekundarni jezik: Slovenski jezik
Sekundarne ključne besede: histogram;diskretna optimizacija;ocena parametrov;EM;REBMIX;mešani model;
Vrsta dela (COBISS): Članek v reviji
Strani: f. 1-30
Letnik: ǂVol. ǂ8
Zvezek: ǂiss. ǂ7
Čas izdaje: Jul. 2020
DOI: 10.3390/math8071090
ID: 11893707