Final thesis of the first-cycle Research and Development programme in Mechanical Engineering
Abstract
Machine learning is increasingly used in all disciplines, including mechanical engineering. This thesis presents a machine learning algorithm for classifying scientific and professional texts. It also explains how the text is prepared before classification, a step that strongly affects the final performance of the algorithm. The texts used in the thesis were obtained from the COBISS website and are scientific articles whose co-authors are employed at the Faculty of Mechanical Engineering, University of Ljubljana. We classified the articles into ten categories defined by the International Academy for Production Engineering (CIRP). The thesis presents the categories in which the authors most often published and the words that occurred most frequently.
Keywords
diploma theses;machine learning;text classification;web scraping;data mining;python;
Data
Language: |
Slovenian |
Year of publishing: |
2018 |
Typology: |
2.11 - Undergraduate Thesis |
Organization: |
UL FS - Faculty of Mechanical Engineering |
Publisher: |
[M. Jeršin] |
UDC: |
004.85:519.2(043.2) |
COBISS: |
16401947 |
Views: |
774 |
Downloads: |
266 |
Average score: |
0 (0 votes) |
Other data
Secondary language: |
English |
Secondary title: |
Classification of research publications from the domain of mechanical engineering using machine learning |
Secondary abstract: |
Machine learning is increasingly used today, including in mechanical engineering. This thesis presents a machine learning algorithm for classifying the text of research publications. The preparation of the text is shown step by step; this preparation is important before the actual classification because it affects the later performance of the algorithm. The texts used are research publications whose co-authors are employees of the Faculty of Mechanical Engineering, University of Ljubljana. We classified the publications into ten classes defined by the International Academy for Production Engineering (CIRP). The most commonly used categories and the most frequently used words are presented. |
Secondary keywords: |
machine learning;text classification;web scraping;data mining;python; |
Type (COBISS): |
Final paper |
Study programme: |
0 |
Embargo end date (OpenAIRE): |
1970-01-01 |
Thesis comment: |
Univ. of Ljubljana, Faculty of Mechanical Engineering |
Pages: |
XIV, 25 leaves, [6] leaves of appendices |
ID: |
10959149 |