Query-by-multiple-examples using support vector machines

TitleQuery-by-multiple-examples using support vector machines
Publication TypeJournal Article
Year of Publication2009
AuthorsZhang, D, Lee, WS
JournalJournal of Digital Information Management
Volume7
Issue4
Pagination202 - 210
Date Published2009
KeywordsInformation retrieval, Machine learning, Supportvector machine, Text classification
Abstract

We identify and explore an Information Retrieval paradigm called Query-By-Multiple-Examples (QBME) where the information need is described not by a set of terms but by a set of documents. Intuitive ideas for QBME include using the centroid of these documents or the well-known Rocchio algorithm to construct the query vector. We consider this problem from the perspective of text classification, and find that a better query vector can be obtained through learning with Support Vector Machines (SVMs). For online queries, we show how SVMs can be learned from one-class examples in linear time. For offline queries, we show how SVMs can be learned from positive and unlabeled examples together in linear or polynomial time, optimising some meaningful multivariate performance measures. The effectiveness and efficiency of the proposed approaches have been confirmed by our experiments on four real-world datasets.

URLhttp://www.scopus.com/inward/record.url?eid=2-s2.0-77953247724&partnerID=40&md5=8fea0823a5d70e478bf228f315a5ef89

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)

Collaborative Partner

Collaborative Partner