Branislav Panić (Author), Jernej Klemenc (Author), Marko Nagode (Author)

Abstract

A maximum-likelihood estimation of a multivariate mixture model's parameters is a difficult problem. One approach is to combine the REBMIX and EM algorithms. However, the REBMIX algorithm requires the use of histogram estimation, which is the most rudimentary approach to an empirical density estimation and has many drawbacks. Nevertheless, because of its simplicity, it is still one of the most commonly used techniques. The main problem is to estimate the optimum histogram-bin width, which is usually set by the number of non-overlapping, regularly spaced bins. For univariate problems it is usually denoted by an integer value; i.e., the number of bins. However, for multivariate problems, in order to obtain a histogram estimation, a regular grid must be formed. Thus, to obtain the optimum histogram estimation, an integer-optimization problem must be solved. The aim is therefore the estimation of optimum histogram binning, alone and in application to the mixture model parameter estimation with the REBMIX&EM strategy. As an estimator, the Knuth rule was used. For the optimization algorithm, a derivative based on the coordinate-descent optimization was composed. These proposals yielded promising results. The optimization algorithm was efficient and the results were accurate. When applied to the multivariate, Gaussian-mixture-model parameter estimation, the results were competitive. All the improvements were implemented in the rebmix R package.

Keywords

histogram;diskretna optimizacija;ocena parametrov;EM;REBMIX;mešani model;integer optimization;parameter estimation;mixture model;

Data

Language: English
Year of publishing:
Typology: 1.01 - Original Scientific Article
Organization: UL FS - Faculty of Mechanical Engineering
UDC: 004.4(045)
COBISS: 22207235 Link will open in a new window
ISSN: 2227-7390
Parent publication: Mathematics
Views: 462
Downloads: 268
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: Slovenian
Secondary keywords: histogram;diskretna optimizacija;ocena parametrov;EM;REBMIX;mešani model;
Type (COBISS): Article
Pages: f. 1-30
Volume: ǂVol. ǂ8
Issue: ǂiss. ǂ7
Chronology: Jul. 2020
DOI: 10.3390/math8071090
ID: 11893707