Modeliranje sopojavitev besed z metodami strojnega učenja

diplomsko delo

Ruben Sipoš (Avtor), Janez Demšar (Mentor), Dunja Mladenić (Komentor)

Povzetek

Modeliranje sopojavitev besed z metodami strojnega učenja

Ključne besede

strojno učenje;n-terke besed;trojke;izračun trojk;modeliranje sopojavitev besed;posploševanje konceptov;računalništvo;univerzitetni študij;diplomske naloge;

Podatki

Jezik:	Slovenski jezik
Leto izida:	2009
Tipologija:	2.11 - Diplomsko delo
Organizacija:	UL FRI - Fakulteta za računalništvo in informatiko
Založnik:	[R. Sipoš]
UDK:	004(043.2)
COBISS:	7141972
Št. ogledov:	953
Št. prenosov:	213
Ocena:	0 (0 glasov)
Metapodatki:

Ostali podatki

Sekundarni jezik:	Angleški jezik
Sekundarni naslov:	[Modelling words co-occurrence with machine learning]
Sekundarni povzetek:	Advances in machine learning and increasing computing power are providing new possibilities for data processing and knowledge acquisition. One of the key questions in automatic text analysis is how to acquire semantic information. A possible approach is to model semantics using word co-occurrence. In the context of this work we have developed an approach which enables us to build models, represented as triples consisting of subject, predicate and object, based on word n-grams. We used Google n-grams constructed on the basis of their index of web pages. Special attention was also given to how to efficiently process this quantity of data, because it is one of the largest datasets of this type. Also, we provide justification for choosing representation using triples and describe how to efficiently compute triples because current approaches are time consuming We propose a new procedure for construction of models of word co-occurrences. Each model represents a set of triples using more general concepts. We also give the results of evaluation, which indicate the potential usefulness of our results. We conclude with some interesting ideas for further research.
Sekundarne ključne besede:	machine learning;word n-grams;triples;triple extraction;modeling word co-occurrences;concept abstraction;computer science;diploma;
Vrsta datoteke:	application/pdf
Vrsta dela (COBISS):	Diplomsko delo
Komentar na gradivo:	Univerza v Ljubljani, Fakulteta za računalništvo in informatiko
Strani:	50 str.
ID:	23868211

Slovenski jezik

English language

Priporočena dela:

Modeliranje sopojavitev besed z metodami strojnega učenja

2009, diplomsko delo

Analiza poškodb pri športnih plesalcih z metodami strojnega učenja

2023, diplomsko delo

Anonimizacija sodnih odločb z metodami strojnega učenja

2020, diplomsko delo

Ocenjevanje zanesljivosti posameznih klasifikacij z lokalnimi metodami

2009, diplomsko delo

Konstrukcija krivulj preživetja iz cenzuriranih podatkov z metodami strojnega učenja

2008, diplomsko delo