• JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
 
  Bookmark and Share
 
 
Master's Dissertation
DOI
https://doi.org/10.11606/D.55.2020.tde-19022020-151305
Document
Author
Full name
Marcelo Caetano Martins Muniz
Institute/School/College
Knowledge Area
Date of Defense
Published
São Carlos, 2004
Supervisor
Committee
Nunes, Maria das Graças Volpe (President)
Oliveira Junior, Osvaldo Novais de
Vale, Oto Araujo
Title in Portuguese
A construção de recursos lingüístico-computacionais para o português do Brasil: o projeto Unitex-PB
Keywords in Portuguese
Não disponível
Abstract in Portuguese
A escassez de recursos linguístico-computacionais é um dos maiores entraves para o avanço das pesquisas, e consequente desenvolvimento de sistemas, na área de Processamento de Língua Natural (PLN) no Brasil. Este trabalho documenta a construção de uma série recursos lingúístico-computacionais para português brasileiro seguindo os formalismos utilizados pela ferramenta de processamento de corpus Unitex. Foram construídos léxicos computacionais, regras de resolução de ambiguidades e bibliotecas para acesso a léxicos compactados, assim como algumas ferramentas para validar esses recursos. Os desafios encontrados durante todo o processo são discutidos nessa dissertação.
Title in English
Not available
Keywords in English
Not available
Abstract in English
The lack of computational linguistic resources represents one of the major challenges to the development and research activities related to Natural Language Processing. This work documents the project and development of various computational linguistic resources that support the Brazilian Portuguese language according to the formal methodology used by the corpus processing system called Unitex. The delivered resources include computational lexicons, rules to solve ambiguity, libraries to access compressed lexicons, and additional tools to validate those resources. Some aspects about the main challenges encountered during the course of this project are also addressed.
 
WARNING - Viewing this document is conditioned on your acceptance of the following terms of use:
This document is only for private use for research and teaching activities. Reproduction for commercial use is forbidden. This rights cover the whole data about this document as well as its contents. Any uses or copies of this document in whole or in part must include the author's name.
Publishing Date
2020-02-19
 
WARNING: Learn what derived works are clicking here.
All rights of the thesis/dissertation are from the authors
CeTI-SC/STI
Digital Library of Theses and Dissertations of USP. Copyright © 2001-2024. All rights reserved.