Avaliação de recursos computacionais para o português

Translated title of the contribution: Evaluating computational resources for Portuguese

Matilde Goncalves, Luisa Coheur, Jorge Baptista, Ana Mineiro

Research output: Contribution to journalArticlepeer-review

2 Downloads

Abstract

There are several tools for the Portuguese language. However, and due to different choices at the basis of these tools' behaviour (different preprocessing, different labels, etc.), it becomes difficult to have an idea of each one's comparative performance. In this work, we propose an evaluation of tools, publicly available and free, that perform the tasks of Part-of-Speech Tagging and Named Entity Recognition, for the Portuguese language. We evaluate twelve different models for the first task and eight for the second. All the resources used in this evaluation (mapping tables between labels, testing corpora, etc.) will be made available, allowing to replicate/fine-tune the results here presented. We also present a qualitative analysis of two dependency parsers. To the best of our knowledge, no recent work that considers the recent available tools, was carried out for the Portuguese language.
Translated title of the contributionEvaluating computational resources for Portuguese
Original languagePortuguese
Pages (from-to)51-68
Number of pages18
JournalLinguamatica
Volume12
Issue number2
DOIs
Publication statusPublished - Dec 2020

Keywords

  • Dependency parsing
  • Evaluation of resources
  • Named entity recognition
  • Natural language processing
  • Part-of-speech tagging
  • Portuguese language

Fingerprint

Dive into the research topics of 'Evaluating computational resources for Portuguese'. Together they form a unique fingerprint.

Cite this