master's thesis
Abstract
Quantifying uncertainty is a vital part of every statistical study. There are many different methods, but in the hands of an inexperienced user, most of them can lead to big mistakes in the interpretation.
Bootstrap is a favorable method for this task because of its robustness, versatility, ease of understanding and lack of stringent distributional assumptions.
But even after 40 years of existence, it is not clear if this general method is accurate enough to substitute the traditional methods specialized for specific parameters of interest.
To answer this, we designed an extensive simulation study that assesses the methods' confidence interval prediction for six different parameters on samples of multiple sizes, generated from seven diverse distributions.
We chose the double bootstrap as the best general bootstrap method, additionally recommending the standard bootstrap for confidence intervals for extreme percentiles.
We compared the best bootstrap methods to the traditional methods and found out that for almost all of the parameters no traditional method is practically better. Moreover, bootstrap gives good predictions even on the distributions where traditional methods fail because of broken assumptions.
Our work thus suggests that estimates generated by the proposed bootstrap methods are comparable to or even better than the ones made by the traditional methods.
Keywords
uncertainty;quantifying uncertainty;confidence intervals;bootstrap;computer science;master's thesis;
Data
Language: |
English |
Year of publishing: |
2023 |
Typology: |
2.09 - Master's Thesis |
Organization: |
UL FRI - Faculty of Computer and Information Science |
Publisher: |
[U. Zrimšek] |
UDC: |
004(043.2) |
COBISS: |
178901507
|
Views: |
25 |
Downloads: |
4 |
Average score: |
0 (0 votes) |
Metadata: |
|
Other data
Secondary language: |
Slovenian |
Secondary title: |
Bootstrap kot splošen pristop k merjenju negotovosti |
Secondary abstract: |
Merjenje negotovosti je pomemben del vsake statistične raziskave. Obstaja veliko metod, vendar pa lahko ob nerazumevanju njihovega ozadja večina privede do velikih napak pri interpretaciji.
Metoda bootstrap bi lahko zaradi svoje robustnosti, vsestranskosti, preprostosti in pomanjkanja strogih predpostavk o porazdelitvi podatkov zmanjšala število takih napak.
Toda tudi po štiridesetih letih obstoja ni povsem jasno, če je ta splošna metoda dovolj natančna, da bi lahko nadomestila tradicionalne metode, specializirane za posamezne parametre.
Da bi odgovorili na to vprašanje, smo izvedli obsežno simulacijo, ki ocenjuje napovedi intervalov zaupanja za šest parametrov na vzorcih različnih velikosti, vzorčenih iz sedmih raznolikih porazdelitev.
Za najboljšo splošno metodo bootstrap smo izbrali dvojni bootstrap, dodatno pa priporočamo uporabo standardnega bootstrapa za izračun intervalov zaupanja za ekstremne percentile. Predlagane bootstrap metode smo primerjali s tradicionalnimi metodami in ugotovili, da za skoraj vse parametre ne obstaja praktično boljša tradicionalna metoda. Poleg tega nam bootstrap zagotavlja dobre napovedi celo na porazdelitvah, kjer tradicionalne metode zaradi napačnih predpostavk zatajijo.
Zaključujemo, da so ocene intervalov pridobljene s predlaganimi bootstrap metodami primerljive, če ne celo boljše od ocen tradicionalnih metod. |
Secondary keywords: |
negotovost;merjenje negotovosti;intervali zaupanja;bootstrap;magisteriji;Računalništvo;Univerzitetna in visokošolska dela; |
Type (COBISS): |
Master's thesis/paper |
Study programme: |
1000471 |
Embargo end date (OpenAIRE): |
1970-01-01 |
Thesis comment: |
Univ. v Ljubljani, Fak. za računalništvo in informatiko |
Pages: |
X, 76 str. |
ID: |
21923929 |