Using BFA with wordnet based model for web retrieval

TitleUsing BFA with wordnet based model for web retrieval
Publication TypeJournal Article
Year of Publication2006
AuthorsSnášel, V, Moravec, P, Pokorný, J
JournalJournal of Digital Information Management
Volume4
Issue2
Pagination107 - 111
Date Published2006
KeywordsLatent semantic indexing, Ontology, Semantic web, Vector model, Web information retrieval, WordNet
Abstract

In the area of information retrieval, the dimension of document vectors plays an important role. We may need to find a few words or concepts, which characterize the document based on its contents, to overcome the problem of the "curse of dimensionality", which makes indexing of high-dimensional data problematic. To do so, we earlier proposed a Wordnet and Wordnet+LSI (Latent Semantic Indexing) based model for dimension reduction. While LSI works on the whole collection, another procedure of feature extraction (and thus dimension reduction) exists, using binary factorization. The procedure is based on the search of attractors in Hopfield-like associative memory. Being applied to textual data the procedure conducted well and even more it showed sensitivity to the context in which the words were used. In this paper, we suggest that the binary factorization may benefit from the Wordnet filtration.

URLhttp://www.scopus.com/inward/record.url?eid=2-s2.0-44649186320&partnerID=40&md5=84c389a9e2d0c1be0cd8ecdb6b1534ec

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)

Collaborative Partner

Collaborative Partner