Jezik: | Slovenski jezik |
---|---|
Leto izida: | 2012 |
Tipologija: | 2.11 - Diplomsko delo |
Organizacija: | UL FRI - Fakulteta za računalništvo in informatiko |
Založnik: | [M. Vončina] |
UDK: | 004.738.12(043.2) |
COBISS: | 9058132 |
Št. ogledov: | 35 |
Št. prenosov: | 1 |
Ocena: | 0 (0 glasov) |
Metapodatki: |
Sekundarni jezik: | Angleški jezik |
---|---|
Sekundarni naslov: | A presentation of web news from multiple sources |
Sekundarni povzetek: | We built a website, where visitors can find and read current news from Slovenia from multiple sources. We presented news articles in groups of similar news to shorten the time to find important news and to spare visitors browsing of several websites. To achieve this we built a database of news and news processor. We developed a system to read and parse news from multiple sources, news normalization with lemmatization, weighting of words in the news and presenting the news using a vector space model. We used our model to calculate similarity between news, which enabled us to clusters similar news. We built a prototype website to display relevant news clusters. |
Sekundarne ključne besede: | text similarity;categorization;lemmatization;cosine coefficient;news;computer science;diploma; |
Vrsta datoteke: | application/pdf |
Vrsta dela (COBISS): | Diplomsko delo |
Komentar na gradivo: | Univ. v Ljubljani, Fak. za računalništvo in informatiko |
Strani: | 32 str. |
ID: | 24063159 |