IRISA System for Entity Detection and Linking at CLEF HIPE 2020 - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

IRISA System for Entity Detection and Linking at CLEF HIPE 2020

Résumé

This note describes IRISA's system for the task of named entity processing on historical newspapers in French. Following a standard entity detection and linking pipeline, our system implements three steps to solve the named entity linking task. Named Entity Recognition (NER) is first performed to identify the entity mentions in a document based on a Conditional Random Fields classifier. Candidate entities from Wikidata are then generated for each mention found, using simple search. Finally, every mention is linked to one of its candidate entities in a so-called linking step leveraging various string metrics and the semantic structure of Wikidata to improve on the linking decisions.
Fichier principal
Vignette du fichier
paper_185_proc-1.pdf (230.56 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02943717 , version 1 (21-09-2020)
hal-02943717 , version 2 (22-09-2020)

Identifiants

  • HAL Id : hal-02943717 , version 2

Citer

Cheikh Brahim El Vaigh, Guillaume Le Noé-Bienvenu, Guillaume Gravier, Pascale Sébillot. IRISA System for Entity Detection and Linking at CLEF HIPE 2020. CEUR Workshop Proceedings, Sep 2020, Thessaloniki, Greece. ⟨hal-02943717v2⟩
192 Consultations
113 Téléchargements

Partager

Gmail Facebook X LinkedIn More