Title | Using BFA with wordnet based model for web retrieval |
Publication Type | Journal Article |
Year of Publication | 2006 |
Authors | Snášel, V, Moravec, P, Pokorný, J |
Journal | Journal of Digital Information Management |
Volume | 4 |
Issue | 2 |
Pagination | 107 - 111 |
Date Published | 2006 |
Keywords | Latent semantic indexing, Ontology, Semantic web, Vector model, Web information retrieval, WordNet |
Abstract | In the area of information retrieval, the dimension of document vectors plays an important role. We may need to find a few words or concepts, which characterize the document based on its contents, to overcome the problem of the "curse of dimensionality", which makes indexing of high-dimensional data problematic. To do so, we earlier proposed a Wordnet and Wordnet+LSI (Latent Semantic Indexing) based model for dimension reduction. While LSI works on the whole collection, another procedure of feature extraction (and thus dimension reduction) exists, using binary factorization. The procedure is based on the search of attractors in Hopfield-like associative memory. Being applied to textual data the procedure conducted well and even more it showed sensitivity to the context in which the words were used. In this paper, we suggest that the binary factorization may benefit from the Wordnet filtration. |
URL | http://www.scopus.com/inward/record.url?eid=2-s2.0-44649186320&partnerID=40&md5=84c389a9e2d0c1be0cd8ecdb6b1534ec |