Framework for mixed entity resolving system using unsupervised clustering

TitleFramework for mixed entity resolving system using unsupervised clustering
Publication TypeJournal Article
Year of Publication2010
AuthorsOn, B-W, Lee, I
JournalJournal of Digital Information Management
Volume8
Issue6
Pagination362 - 368
Date Published2010
KeywordsMixed entity resolution, Unsupervised clustering
Abstract

During web search, confusion can happen due to homonym when users use non-unique values as a search term of an entity. Especially, when parts of names of an entity were used as its identifier, we call a mixed entity resolution problem whose goal is to clear out the erroneous entities. For example, if only last name is used as an identifier, we cannot distinguish "Vannessa Bush" from "George Bush." Mixed entity resolution problem is common among Web pages data. In this paper, to resolve aforementioned mixed entities on the Web, we propose a prototypical system which includes a web service based interface, unsupervised clustering scheme, and cluster ranking algorithms. In particular, since the correct number of clusters is often unknown, we study a state-of-the-art unsupervised clustering solution based on propagation of pairwise similarities of entities. Experimental results show that our approach outperforms main competing solution.

URLhttp://www.scopus.com/inward/record.url?eid=2-s2.0-79960657815&partnerID=40&md5=b7024f58cda537138350a05e3d88ac8c

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)

Collaborative Partner

Collaborative Partner