Undergraduate thesis

Abstract

In this thesis, we investigate how to improve the commonsense reasoning abilities of the language models ChatGPT and SloBERTa by integrating commonsense knowledge from the SloATOMIC knowledge base. We first present the problem domain and describe the field. We then present the technology used and the data preparation procedure. We compare results on the SI-NLI dataset with and without additional sentences from SloATOMIC, and we also compare results on a smaller subset of manually corrected data. Finally, we describe the difficulties encountered and outline possible further improvements.

Keywords

commonsense reasoning;language models;SloATOMIC;SI-NLI;knowledge graphs;university studies;diploma theses;

Data

Language: Slovenian
Year of publishing:
Typology: 2.11 - Undergraduate Thesis
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [J. Gospodarič]
UDC: 004.85:81'322(043.2)
COBISS: 197359619

Other data

Secondary language: English
Secondary title: Natural language inference using commonsense reasoning database
Secondary abstract: In this thesis, we will explore the enhancement of the commonsense reasoning capabilities of the language models ChatGPT and SloBERTa by integrating commonsense knowledge from the SloATOMIC database. We will begin by presenting the problem domain and describing the field. Then, we will introduce the technology used and the data preparation process. We will compare the results on the SI-NLI dataset with and without the additional sentences from SloATOMIC. Additionally, we will compare the results on a smaller subset of manually corrected data. Finally, we will describe the challenges encountered and present possible further improvements.
Secondary keywords: commonsense reasoning;language models;SloATOMIC;SI-NLI;knowledge graphs;computer and information science;diploma;computational linguistics;computer science;university and higher-education theses;
Type (COBISS): Bachelor thesis/paper
Study programme: 1000468
Thesis comment: Univ. of Ljubljana, Fac. of Computer and Information Science
Pages: 34 pp.
ID: 24275735