diploma thesis
Robi Markač (Author), Marko Bajec (Mentor)

Abstract

This thesis addresses the development of a prototype system for voice control of an application on an Android device. Speech recognition services available online (including those for Slovenian) are mostly paid offerings from large technology companies and mostly require an internet connection. Open-source speech recognition systems are poorly documented and support only major world languages such as English and German. The thesis therefore carries out and describes the complete process of developing Slovenian speech recognition on an Android device using the open-source CMU Sphinx toolkit, which works without an internet connection. Using the CMU Sphinx tools, an acoustic model was developed for a limited set of Slovenian commands, based on a single speaker. This acoustic model was then integrated into a simple demonstration Android application, in which recognition of Slovenian commands was implemented with the PocketSphinx library. Test results were extremely successful and showed fast and accurate recognition of voice commands.

Keywords

CMU Sphinx;PocketSphinx;speech recognition;voice commands;Slovenian language;computer and information science;university study programme;diploma theses;

Data

Language: Slovenian
Year of publishing:
Typology: 2.11 - Undergraduate Thesis
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [R. Markač]
UDC: 004.934:004.5(043.2)
COBISS: 1538347203
Views: 748
Downloads: 191

Other data

Secondary language: English
Secondary title: Slovenian voice control of android device
Secondary abstract: This diploma thesis focuses on the development of a prototype system for voice control of an application on an Android device. Most speech recognition services available online are paid offerings from large technology companies and mostly require an internet connection. Open-source speech recognition systems are poorly documented and support only major world languages such as English and German. The thesis therefore carries out and describes the whole process of developing speech recognition for the Slovenian language on an Android device using the open-source CMU Sphinx tools, which operate without an internet connection; this had not previously been addressed. Using the CMU Sphinx tools, an acoustic model was developed for a limited set of Slovenian commands, based on a single speaker. This acoustic model was then integrated into a simple demonstration Android application, in which recognition of Slovenian commands was implemented using the PocketSphinx library. Test results were extremely successful and showed fast and accurate recognition of voice commands.
Secondary keywords: CMU Sphinx;PocketSphinx;speech recognition;voice command recognition;Slovenian language;computer and information science;diploma;
Type (COBISS): Bachelor thesis/paper
Study programme: 1000468
Embargo end date (OpenAIRE): 1970-01-01
Thesis comment: University of Ljubljana, Faculty of Computer and Information Science
Pages: 71 pp.
ID: 11222861