Please use this identifier to cite or link to this item:
http://repositorio.unicamp.br/jspui/handle/REPOSIP/341518
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.CRUESP | UNIVERSIDADE ESTADUAL DE CAMPINAS | pt_BR |
dc.identifier.isbn | 978-3-030-15719-7 | pt_BR |
dc.contributor.authorunicamp | Rocha, Anderson de Rezende | - |
dc.type | Artigo | pt_BR |
dc.title | Open-set web genre identification using distributional features and nearest neighbors distance ratio | pt_BR |
dc.contributor.author | Pritsos, Dimitrios | - |
dc.contributor.author | Rocha, Anderson | - |
dc.contributor.author | Stamatatos, Efstathios | - |
dc.subject | Gênero | pt_BR |
dc.subject | Algoritmos | pt_BR |
dc.subject.otherlanguage | Gender | pt_BR |
dc.subject.otherlanguage | Algorithms | pt_BR |
dc.description.abstract | Web genre identification can boost information retrieval systems by providing rich descriptions of documents and enabling more specialized queries. The open-set scenario is more realistic for this task as web genres evolve over time and it is not feasible to define a universally agreed genre palette. In this work, we bring to bear a novel approach to web genre identification underpinned by distributional features acquired by doc2vec and a recently-proposed open-set classification algorithm—the nearest neighbors distance ratio classifier. We present experimental results using a benchmark corpus and a strong baseline and demonstrate that the proposed approach is highly competitive, especially when emphasis is given on precision | pt_BR |
dc.relation.ispartof | Lecture notes in computer science | pt_BR |
dc.relation.ispartofabbreviation | Lect. notes comput. sci. | pt_BR |
dc.publisher.city | Berlim | pt_BR |
dc.publisher.country | Alemanha | pt_BR |
dc.publisher | Springer | pt_BR |
dc.date.issued | 2019 | - |
dc.date.monthofcirculation | Apr. | pt_BR |
dc.language.iso | eng | pt_BR |
dc.description.volume | 11438 | pt_BR |
dc.description.firstpage | 3 | pt_BR |
dc.description.lastpage | 11 | pt_BR |
dc.rights | Fechado | pt_BR |
dc.source | SCOPUS | pt_BR |
dc.identifier.issn | 0302-9743 | pt_BR |
dc.identifier.eissn | 1611-3349 | pt_BR |
dc.identifier.doi | 10.1007/978-3-030-15719-7_1 | pt_BR |
dc.identifier.url | https://link.springer.com/chapter/10.1007/978-3-030-15719-7_1 | pt_BR |
dc.date.available | 2020-05-15T15:56:57Z | - |
dc.date.accessioned | 2020-05-15T15:56:57Z | - |
dc.description.provenance | Submitted by Susilene Barbosa da Silva (susilene@unicamp.br) on 2020-05-15T15:56:57Z No. of bitstreams: 0. Added 1 bitstream(s) on 2020-08-27T19:17:56Z : No. of bitstreams: 1 2-s2.0-85064856679.pdf: 369979 bytes, checksum: 67e99302f016e6c77f2de163916b1044 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2020-05-15T15:56:57Z (GMT). No. of bitstreams: 0 Previous issue date: 2019 | en |
dc.identifier.uri | http://repositorio.unicamp.br/jspui/handle/REPOSIP/341518 | - |
dc.description.conferencenome | SAMBA : SIPAIM – Miccai biomedical workshop, biomedical information processing and analysis - a Latin American perspective | pt_BR |
dc.contributor.department | Departamento de Sistemas de Informação | pt_BR |
dc.contributor.unidade | Instituto de Computação | pt_BR |
dc.subject.keyword | Open-set classification | pt_BR |
dc.subject.keyword | Distributional features | pt_BR |
dc.identifier.source | 2-s2.0-85064856679 | pt_BR |
dc.creator.orcid | 0000-0002-4236-8212 | pt_BR |
dc.type.form | Artigo de evento | pt_BR |
Appears in Collections: | IC - Artigos e Outros Documentos |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2-s2.0-85064856679.pdf | 361.31 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.