Rule based autonomous citation mining with tierl

TitleRule based autonomous citation mining with tierl
Publication TypeJournal Article
Year of Publication2010
AuthorsAfzal, MT, Maurer, H, Balke, W-T, Kulathuramaiyer, N
JournalJournal of Digital Information Management
Volume8
Issue3
Pagination196 - 204
Date Published2010
KeywordsCitation index, Citation mining, Digital libraries, Information extraction
Abstract

Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author's influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essential, manual citation management can be extremely costly. Automatic citation mining on the other hand is a non-trivial task mainly due to non-conforming citation styles, spelling errors and the difficulty of reliably extracting text from PDF documents. In this paper we propose a novel rule-based autonomous citation mining technique, to address this important task. We define a set of common heuristics that together allow to improve the state of the art in automatic citation mining. Moreover, by first disambiguating citations based on venues, our technique significantly enhances the correct discovery of citations. Our experiments show that the proposed approach is indeed able to overcome limitations of current leading citation indexes such as ISI Web of Knowledge, Citeseer and Google Scholar.

URLhttp://www.scopus.com/inward/record.url?eid=2-s2.0-79951786283&partnerID=40&md5=4fbea6129a6c4c727c3c0618ccbe6087

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)

Collaborative Partner

Collaborative Partner