diplomsko delo

Abstract

Predstavitev spletnih novic iz več virov

Keywords

podobnost besedil;kategorizacija;lematizacija;kosinusna podobnost besedil;novice;računalništvo;visokošolski strokovni študij;diplomske naloge;

Data

Language: Slovenian
Year of publishing:
Typology: 2.11 - Undergraduate Thesis
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [M. Vončina]
UDC: 004.738.12(043.2)
COBISS: 9058132 Link will open in a new window
Views: 35
Downloads: 1
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: English
Secondary title: A presentation of web news from multiple sources
Secondary abstract: We built a website, where visitors can find and read current news from Slovenia from multiple sources. We presented news articles in groups of similar news to shorten the time to find important news and to spare visitors browsing of several websites. To achieve this we built a database of news and news processor. We developed a system to read and parse news from multiple sources, news normalization with lemmatization, weighting of words in the news and presenting the news using a vector space model. We used our model to calculate similarity between news, which enabled us to clusters similar news. We built a prototype website to display relevant news clusters.
Secondary keywords: text similarity;categorization;lemmatization;cosine coefficient;news;computer science;diploma;
File type: application/pdf
Type (COBISS): Undergraduate thesis
Thesis comment: Univ. v Ljubljani, Fak. za računalništvo in informatiko
Pages: 32 str.
ID: 24063159
Recommended works:
, diplomsko delo
, diplomsko delo
, diplomsko delo