Skip to Main content Skip to Navigation
Conference papers

IRISA System for Entity Detection and Linking at CLEF HIPE 2020

Abstract : This note describes IRISA's system for the task of named entity processing on historical newspapers in French. Following a standard entity detection and linking pipeline, our system implements three steps to solve the named entity linking task. Named Entity Recognition (NER) is first performed to identify the entity mentions in a document based on a Conditional Random Fields classifier. Candidate entities from Wikidata are then generated for each mention found, using simple search. Finally, every mention is linked to one of its candidate entities in a so-called linking step leveraging various string metrics and the semantic structure of Wikidata to improve on the linking decisions.
Document type :
Conference papers
Complete list of metadatas

Cited literature [6 references]  Display  Hide  Download

https://hal.inria.fr/hal-02943717
Contributor : Cheikh Brahim El Vaigh <>
Submitted on : Tuesday, September 22, 2020 - 11:15:33 AM
Last modification on : Thursday, September 24, 2020 - 3:13:47 AM

File

paper_185_proc-1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02943717, version 2

Citation

Cheikh Brahim El Vaigh, Guillaume Le Noé-Bienvenu, Guillaume Gravier, Pascale Sébillot. IRISA System for Entity Detection and Linking at CLEF HIPE 2020. CEUR Workshop Proceedings, Sep 2020, Thessaloniki, Greece. ⟨hal-02943717v2⟩

Share

Metrics

Record views

62

Files downloads

74