skip to main content

Analyzing natural language inference from a rigorous point of view

Salvatore, Felipe De Souza

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2020-12-14

Acesso online. A biblioteca também possui exemplares impressos.

  • Título:
    Analyzing natural language inference from a rigorous point of view
  • Autor: Salvatore, Felipe De Souza
  • Orientador: Finger, Marcelo; Hirata Junior, Roberto
  • Assuntos: Classificação De Texto; Inferência Em Linguagem Natural; Processamento De Linguagem Natural; Víes Em Modelos De Aprendizado De Máquina; Bias In Deep Learning; Natural Language Inference; Natural Language Processing; Text Classification
  • Notas: Tese (Doutorado)
  • Descrição: Natural language inference (NLI) is the task of determining the entailment relationship between a pair of sentences. We are interested in the problem of verifying whether the deep learning models current used in NLI satisfy some logical properties. In this thesis, we focus on two properties: i) the capacity of solving deduction problems based on some specific logical forms (e.g., Boolean coordination, quantifiers, definite description, and counting operators); and ii) the property of having the same conclusion from equivalent premises. For each one of these properties we develop a new evaluation procedure. For i) we offer a new synthetic dataset that can be used both for inference perception and inference generation; and for ii) we propose a null hypothesis test constructed to represent the different manners that the inclusion of sentences with the same meaning can affect the training of a machine learning model. Our results show that although deep learning models have an outstanding performance on the majority of NLI datasets, they still lack some important inference skills such as dealing with counting operators, predicting which word can form an entailment given an specific context, and presenting the same deductions for two different text inputs with the same meaning. This indicates that despite the high prediction power of these new models, they do present some inference biases that cannot be easily removed. Future investigations are needed in order to understand the scope of this bias. It is possible that by increasing the training sample size in the fine-tuning phase, this bias can be reduced.
  • DOI: 10.11606/T.45.2020.tde-05012021-151600
  • Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística
  • Data de criação/publicação: 2020-12-14
  • Formato: Adobe PDF
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.