Article
Enriching an authority file of scientific conferences with information extracted from the web
Publisher
Science Publications
Abstract
Authority files record the variant forms used to refer to the same entity, and they are very useful in digital libraries. However, collecting data and keeping an authority file up to date is not a trivial task. This paper proposes an approach for enriching a publication venue authority file by extracting information on conferences from their web pages. Collecting additional data is important for improving the effectiveness of information retrieval and data disambiguation tools, such as those that measure the quality of a scientific publication based on bibliometrics (e.g., Journal Impact Factor). Most applications use only basic citation metadata, such as author names and the titles of works and publication venues. However, data external to the publication, contained in the publication venue's web page, can be very useful in the disambiguation task. Our approach includes steps for querying a web search engine, classifying the documents returned in the result sets, and extracting information from the relevant pages. We evaluated two methods for classifying documents, one based on genre and content and one based on content only. The experiments show good results in tracing the history of conference editions, with data such as the URL and year of each edition and the dates on which conference names changed.
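As a rough illustration of the final extraction step only, the sketch below builds a year-to-URL history from pages that are assumed to have already been retrieved and classified as relevant. It is not the authors' method: the function names, the year range, and the rule of preferring the page title over the URL are all assumptions made for this example.

```python
import re
from typing import Dict, List, Optional, Tuple

# Assumption: an edition year is a four-digit number from 1950 to 2099
# that is not part of a longer run of digits.
YEAR_PATTERN = re.compile(r"(?<!\d)(19[5-9]\d|20\d{2})(?!\d)")


def extract_year(text: str) -> Optional[int]:
    """Return the first plausible edition year found in text, or None."""
    match = YEAR_PATTERN.search(text)
    return int(match.group(1)) if match else None


def build_edition_history(pages: List[Tuple[str, str]]) -> Dict[int, str]:
    """Map each edition year to the first URL seen for that year.

    pages is a list of (url, title) pairs that a hypothetical upstream
    classifier has already marked as conference pages.
    """
    history: Dict[int, str] = {}
    for url, title in pages:
        # Prefer the title; fall back to the URL when the title has no year.
        year = extract_year(title) or extract_year(url)
        if year is not None and year not in history:
            history[year] = url
    return dict(sorted(history.items()))
```

Called with pairs such as `("http://example.org/conf2015/", "Example Conf 2015")`, the function returns a dictionary keyed by year, and pages with no recognizable year are skipped.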
Provenance
Submitted by André Calsavara (andre.calsavara@biblioteca.ufla.br) on 2018-07-17T12:50:53Z
No. of bitstreams: 0
Approved for entry into archive by André Calsavara (andre.calsavara@biblioteca.ufla.br) on 2018-07-27T11:50:24Z (GMT) No. of bitstreams: 0
Made available in DSpace on 2018-07-27T11:50:24Z (GMT). No. of bitstreams: 0 Previous issue date: 2017
Citation
JESUS, H. A. de; PEREIRA, D. A. Enriching an authority file of scientific conferences with information extracted from the web. Journal of Computer Science, [S. l.], v. 13, n. 4, p. 68-77, 2017.
