Title | Effective implementation of basic operations for information retrieval |
Publication Type | Journal Article |
Year of Publication | 2012 |
Authors | Szymanski, J |
Journal | Journal of Digital Information Management |
Volume | 10 |
Issue | 6 |
Pagination | 389 - 398 |
Date Published | 2012 |
Keywords | Documents Categorization, Information retrieval, PCA, SOM, Text clustering |
Abstract | In this article we describe an approach to the parallel implementation of elementary operations for textual data categorization. In the experiments we evaluate parallel computation of similarity matrices and of the k-means algorithm. The test datasets were prepared as graphs built from Wikipedia articles connected by links. We also present an approach to computing eigenvector-eigenvalue pairs for visualization of the datasets. The implemented basic operations (computing the similarity matrix, data clustering, and spectral analysis) have been used in our system for visualizing Wikipedia categories on a SOM, as well as in a system for categorizing search results in Wikipedia. |
URL | http://www.scopus.com/inward/record.url?eid=2-s2.0-84874861039&partnerID=40&md5=f0cc1565c8fce706ec0125e688e6474a |
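
The abstract names parallel computation of a similarity matrix as one of the basic operations. The sketch below illustrates the general idea only: it assumes cosine similarity over term-count vectors and a thread pool from the Python standard library, neither of which is confirmed by the record above. The function names and the sample vectors are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_row(i, vectors):
    """Row i of the similarity matrix; each row is independent of the others."""
    return [cosine(vectors[i], v) for v in vectors]


def similarity_matrix(vectors, workers=4):
    """Compute the rows concurrently; pool.map returns them in row order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda i: similarity_row(i, vectors),
                             range(len(vectors))))


# Hypothetical term-count vectors for three documents (assumption, not paper data).
docs = [[1, 0, 2], [2, 0, 4], [0, 3, 0]]
S = similarity_matrix(docs)
```

Because each row depends only on the input vectors, the rows can be distributed across workers without coordination. In CPython, threads will not speed up this arithmetic because of the global interpreter lock; the sketch shows only how the work decomposes, and a process pool or a native implementation would be needed for an actual speedup.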