Undergraduate thesis
Abstract
When modelling biological processes, we often encounter patterns that are difficult to describe with existing modelling and prediction methods. This is especially true for the analysis of rhythmic expression, since a method must cope with variations in amplitude, period and phase shift, as well as with small samples that are often contaminated by human error. It is therefore important to know how existing methods perform in such cases, and which of them are more suitable and which less. This work reviews some of the existing methods and selects the best implementations among those considered according to the accuracy they achieve on five series of synthetically generated data with different parameters. Using the results, we determine which methods perform well under which conditions and which sampling conditions produce the data most suitable for analysis. Finally, we carry out a rhythmicity analysis on an example of transcriptomic data obtained from the publicly accessible GEO database.
Keywords
modelling;computational biology;systems biology;regression;circadian rhythms;computer science;computer and information science;university studies;undergraduate theses;
Data
Language: Slovenian
Year of publishing: 2020
Typology: 2.11 - Undergraduate Thesis
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [D. Miškić]
UDC: 004:57(043.2)
COBISS: 1538535619
Views: 758
Downloads: 186
Other data
Secondary language: English
Secondary title: Overview and comparison of computational approaches for the analysis of rhythmicity in gene expression data
Secondary abstract: When modelling biological processes, we often face complex patterns that cannot be adequately described by existing modelling and prediction methods. This applies in particular when the data contain rhythmic elements, as the chosen method must handle variations in amplitude, period and acrophase together with small data sets, which are often affected by human error. With this in mind, it is important to know which of the methods currently in use are best suited to such problems. In this work we focus on a few selected methods and their performance on five different synthetically generated data series. Using the results obtained, we determine which method is better suited to which conditions and performs best in most series, as well as which sampling conditions produce the most and least suitable data for further computational analyses. We demonstrate the application of the selected methods on the analysis of transcriptomic data obtained from the GEO database.
Secondary keywords: modelling;computational biology;systems biology;regression;circadian rhythms;computer science;computer and information science;diploma;
Type (COBISS): Bachelor thesis/paper
Study programme: 1000468
Embargo end date (OpenAIRE): 1970-01-01
Thesis comment: Univ. v Ljubljani, Fak. za računalništvo in informatiko
Pages: 137 pages
ID: 11410993