Full metadata record
DC Field | Value | Language
dc.creator | Garcia, Cristiano Mesquita | -
dc.creator | Pereira, Armando Honorio | -
dc.creator | Pereira, Denilson Alves | -
dc.identifier.citation | GARCIA, C. M.; PEREIRA, A. H.; PEREIRA, D. A. A framework to collect and extract publication lists of a given researcher from the web. International Journal of Web Engineering and Technology, [S. l.], v. 12, n. 3, p. 234-252, 2017. | pt_BR
dc.description.abstract | Researchers usually publish their publication lists on the web. Collecting and extracting them can be of great value to research funding agencies and to applications such as academic network analysis and ranking systems. Because of the wide variety of citation styles and different web page formats, it is not straightforward to develop an automatic system to collect and extract researchers' publication lists. In this paper, we describe the method used by our framework to collect and extract publication lists. It is composed of two tools, named Raposa - Citation Extractor, and Tucano - Publication Lists Collector. Raposa uses a method that identifies regions in the web page containing citations and the delimiters separating them. Tucano collects publication lists by submitting queries to a web search engine. Experimental results show that our framework obtains 93.5% of F1 measure for collecting publication lists, which is a better value when compared to Google Scholar. | pt_BR
dc.source | International Journal of Web Engineering and Technology | pt_BR
dc.subject | Citation extractor | pt_BR
dc.subject | Publication lists collector | pt_BR
dc.subject | Web search engine | pt_BR
dc.subject | Extrator de citação | pt_BR
dc.subject | Coleta de listas de publicação | pt_BR
dc.subject | Motor de busca | pt_BR
dc.title | A framework to collect and extract publication lists of a given researcher from the web | pt_BR
Appears in Collections: DCC - Artigos publicados em periódicos

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.