Language: | Slovenian |
---|---|
Year of publishing: | 2015 |
Typology: | 2.11 - Undergraduate Thesis |
Organization: | UL FRI - Faculty of Computer and Information Science |
Publisher: | [Ž. Pušnik] |
UDC: | 004.85:81'322.2(043.2) |
COBISS: | 1536476611 |
Views: | 1858 |
Downloads: | 613 |
Average score: | 0 (0 votes) |
Metadata: |
Secondary language: | English |
---|---|
Secondary title: | Using deep convolutional neural networks on natural language problems |
Secondary abstract: | The thesis examines the learning of language problems with convolutional neural networks. Convolutional neural networks were developed for machine vision. We used them to classify short abstracts and to learn a comma placement in Slovenian language. We programmed our convolutional neural network in programming language python with Theano library. Our work is based on existing research. We describe adaptation of datasets to our model. Several experiments were conducted and we compared lemmatization versus stemming and vector representation of text versus byte array representation. The best results were obtained with text quantized with 1 to m encoding. Comma placing results are comparable with other machine learning approaches. |
Secondary keywords: | machine learning;natural language processing;neural network;neuron;convolution;convolutional neural network;classification;classification model;classificator;classification accuracy;language;text;comma;lemma;stemm;momentum;gradient;gradient descent;backpropagation;learning rate;momentum rate;dataset;text corpus;attribute;computer science;computer and information science;diploma; |
File type: | application/pdf |
Type (COBISS): | Bachelor thesis/paper |
Study programme: | 1000468 |
Embargo end date (OpenAIRE): | 1970-01-01 |
Thesis comment: | Univ. v Ljubljani, Fak. za računalništvo in informatiko |
Pages: | 47 str. |
ID: | 8890380 |