Language: | Slovenian |
---|---|
Year of publishing: | 2019 |
Typology: | 2.09 - Master's Thesis |
Organization: | UL FRI - Faculty of Computer and Information Science |
Publisher: | [S. Janežič] |
UDC: | 004(043.2) |
COBISS: | 1538419395 |
Views: | 642 |
Downloads: | 178 |
Average score: | 0 (0 votes) |
Metadata: |
Secondary language: | English |
---|---|
Secondary title: | Label inference for data clusters in point-based visualizations |
Secondary abstract: | Two-dimensional point-based visualizations of multidimensional data may reveal data structures and clusters that require further interpretation. We present an approach that can automatically annotate the clusters in these visualisations. Our method extends the existing procedure for automatic annotation of two-dimensional representations of text documents and enables it for general attribute-value data. We propose to finds groups of points on scatterplot visualisations and assign them labels that describe a group’s characteristics in a language of the attributes of the original data. The approach uses DBSCAN clustering algorithm to find groups of points in the scatterplots. Statistical tests are used to determine labels for each of the groups. The proposed approach also features an interactive exploration of arbitrary subgroups manually chosen by the user. We analyze three datasets to demonstrate the usefulness of our approach. We show that the proposed method is sufficiently fast to support interactive analysis and that the group annotations found by our approach are meaningful. |
Secondary keywords: | scatterplots;cluster annotation;explanation of clustering results;clustering;computer science;computer and information science;master's degree; |
Type (COBISS): | Master's thesis/paper |
Study programme: | 1000471 |
Embargo end date (OpenAIRE): | 1970-01-01 |
Thesis comment: | Univ. v Ljubljani, Fak. za računalništvo in informatiko |
Pages: | 52 str. |
ID: | 11255889 |