diplomsko delo
David Pintarič (Avtor), Božidar Potočnik (Mentor), Martin Šavc (Komentor)

Povzetek

V diplomskem delu se ukvarjamo s problemom prepoznavanja aktivnosti osebe iz zaporedja slik, pri čemer prepoznavo poskušamo izboljšati z upoštevanjem časovne komponente. To dosežemo z uporabo povratnih nevronskih mrež. Omejili smo se na naslednje aktivnosti: oseba ni v ravnovesju, se pripogiba, stoji, sedi, leži, hitro hodi, počasi hodi in pada. Pregledali smo obstoječe postopke prepoznavanja, preučili povratne nevronske mreže, pripravili množico podatkov, zasnovali algoritem, izvedli eksperimente in na koncu analizirali rezultate. Rezultati na 25 označenih videoposnetkih so pri uporabi povratne nevronske mreže pokazali 83,24 % povprečno natančnost pri uporabi tipa zaporedje v vektor in 75,53 % povprečno natančnost pri uporabi tipa zaporedje v zaporedje. Kljub temu da so dobljeni rezultati boljši od tistih, kjer ne upoštevamo časovne komponente, ugotavljamo, da povratne nevronske mreže zaradi računske zahtevnosti niso vedno najboljša izbira.

Ključne besede

računalniški vid;povratna nevronska mreža;pomnilna celica LSTM;pomnilna celica GRU;globoko učenje;detekcija oseb;prepoznavanje aktivnosti osebe;diplomske naloge;

Podatki

Jezik: Slovenski jezik
Leto izida:
Tipologija: 2.11 - Diplomsko delo
Organizacija: UM FERI - Fakulteta za elektrotehniko, računalništvo in informatiko
Založnik: [D. Pintarič]
UDK: 004.8:004.93(043.2)
COBISS: 22912790 Povezava se bo odprla v novem oknu
Št. ogledov: 755
Št. prenosov: 198
Ocena: 0 (0 glasov)
Metapodatki: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Ostali podatki

Sekundarni jezik: Angleški jezik
Sekundarni naslov: Person activity recognition from image sequence using deep recurrent neural networks
Sekundarni povzetek: The diploma thesis deals with the problem of person activity recognition from a sequence of images, while trying to improve recognition by taking into account the temporal data component. This is achieved through the use of recurrent neural networks. The focus was limited to the following activities: a person is out of balance, bending, standing, sitting, lying down, walking fast, walking slowly and falling. The existing identification methods were reviewed, the recurrent neural networks were examined, a large dataset was prepared, an algorithm was designed, experiments were conducted and finally the results were analysed. The results on the 25 labeled videos showed an 83.24% average accuracy rate when using a sequence-to-vector type recurrent neural network and a 75.53% average accuracy rate when using a sequence-to-sequence type of a recurrent neural network. Although the results obtained are better than those where the temporal data component is disregarded, it can be concluded that recurrent neural networks, due to the computational complexity, are not always the best choice.
Sekundarne ključne besede: computer vision;recurrent neural network;LSTM cell;GRU cell;deep learning;human object recognition;human activity recognition;person activity recognition;
Vrsta dela (COBISS): Diplomsko delo/naloga
Komentar na gradivo: Univ. v Mariboru, Fak. za elektrotehniko, računalništvo in informatiko, Računalništvo in informacijske tehnologije
Strani: VII, 46 str.
ID: 11220665