Diploma thesis
Abstract
The goal of this diploma thesis was to implement a sequential algorithm for finding short subsequences in genomes, to speed it up through parallelization, and to implement the parallel version on a graphics card. The sequential algorithm had to find short character sequences within a given sequence that represents a genome. For each short sequence separately, it had to compute the frequency of its occurrences and the frequency of interactions at the given positions and at randomly shifted positions. Based on these occurrences, it then had to determine which short sequences are more characteristic of particular positions in the genome. Based on data about interactions between proteins and the genome at particular positions, together with the short character sequences found, it then had to compute and statistically evaluate the frequency of interactions. We implemented the sequential algorithm in the C programming language and parallelized it on the basis of the OpenCL architecture, which allows algorithms to be implemented on graphics cards.
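The counting step described above can be illustrated with a minimal C sketch. It is not the thesis implementation: the function names (`base_code`, `kmer_index`, `count_kmers_near`, `enrichment`), the 2-bit encoding, the `flank` window, and the pseudocount ratio are all assumptions made for illustration. The shift is passed in by the caller rather than drawn at random, so the example stays deterministic.

```c
#include <assert.h>
#include <stddef.h>

/* Map a nucleotide to a 2-bit code; -1 for anything else (e.g. 'N'). */
static int base_code(char c) {
    switch (c) {
        case 'A': case 'a': return 0;
        case 'C': case 'c': return 1;
        case 'G': case 'g': return 2;
        case 'T': case 't': case 'U': case 'u': return 3;
        default: return -1;
    }
}

/* Encode the k-mer starting at s as an index in [0, 4^k).
   Returns -1 if the k-mer contains an unknown base. */
static long kmer_index(const char *s, int k) {
    long idx = 0;
    for (int i = 0; i < k; i++) {
        int c = base_code(s[i]);
        if (c < 0) return -1;
        idx = idx * 4 + c;
    }
    return idx;
}

/* For every site, count each k-mer whose start lies within +/- flank of
   (site + shift). shift = 0 gives counts at the given positions; a nonzero
   shift stands in for one "randomly shifted" control (hypothetical setup).
   counts must hold 4^k entries and be zeroed by the caller. */
static void count_kmers_near(const char *genome, size_t n, int k,
                             const long *sites, size_t n_sites,
                             long shift, long flank, unsigned *counts) {
    assert(k >= 1 && k <= 12);  /* keeps the 4^k table small */
    for (size_t s = 0; s < n_sites; s++) {
        long center = sites[s] + shift;
        for (long p = center - flank; p <= center + flank; p++) {
            /* Skip windows that fall outside the genome. */
            if (p < 0 || (size_t)p + (size_t)k > n) continue;
            long idx = kmer_index(genome + p, k);
            if (idx >= 0) counts[idx]++;
        }
    }
}

/* Assumed enrichment score: observed vs. control count with a +1
   pseudocount. The thesis' actual statistic is not given in the abstract. */
static double enrichment(unsigned observed, unsigned background) {
    return (observed + 1.0) / (background + 1.0);
}
```

In this sketch the sites are processed independently of one another, which is the kind of loop that might be mapped to OpenCL work-items. The shared `counts` table would then need atomic increments or per-work-group partial tables.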
Keywords
parallel computing; GPU; synchronization; threads; bioinformatics; proteins; DNA; RNA; genes; computer science; computer and information science; university study programme; diploma theses
Details
Language: Slovenian
Year of publication: 2015
Typology: 2.11 - Diploma thesis
Organization: UL FRI - Fakulteta za računalništvo in informatiko (Faculty of Computer and Information Science)
Publisher: [M. Pajnič]
UDC: 004:575.111(043.2)
COBISS: 1536599491
Other data
Secondary language: English
Secondary title: Implementation of parallel algorithm for k-mer enrichment analysis of genomic sequences
Secondary abstract:
The goal of this thesis was to implement a sequential algorithm that searches for subsequences in a genome. To speed up this algorithm, we designed a parallel version and implemented it on a graphics card. The sequential algorithm had to search for predefined subsequences in a genome that was represented as a sequence of characters. For each subsequence, it had to calculate the frequencies of sequence occurrences and the frequencies of interactions at predefined positions and at randomly modified positions in the genome. Based on these frequencies, it had to identify sequences that were more frequent at certain locations in a given genome. Based on data about protein-RNA interactions at certain locations in the genome, and on the character sequences found, the algorithm had to calculate and statistically evaluate the frequencies of interactions. The sequential algorithm was implemented in the C programming language, while the parallelization was implemented on the OpenCL architecture.
Secondary keywords: parallel computing; GPU; synchronization; threads; bioinformatics; proteins; DNA; RNA; genes; computer science; computer and information science; diploma
File type: application/pdf
Type of work (COBISS): Diploma thesis
Study programme: 1000468
Embargo end (OpenAIRE): 1970-01-01
Comment on the material: Univ. v Ljubljani, Fak. za računalništvo in informatiko
Pages: 55 pp.
ID: 9043470