Analyzing natural language inference from a rigorous point of view

Salvatore, Felipe De Souza

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2020-12-14

Online access. The library also holds print copies.

Title: Analyzing natural language inference from a rigorous point of view
Author: Salvatore, Felipe De Souza
Advisor: Finger, Marcelo; Hirata Junior, Roberto
Subjects: Bias in Deep Learning; Bias in Machine Learning Models; Natural Language Inference; Natural Language Processing; Text Classification
Notes: Thesis (Doctorate)
Description: Natural language inference (NLI) is the task of determining the entailment relationship between a pair of sentences. We are interested in the problem of verifying whether the deep learning models currently used in NLI satisfy some logical properties. In this thesis, we focus on two properties: i) the capacity to solve deduction problems based on some specific logical forms (e.g., Boolean coordination, quantifiers, definite descriptions, and counting operators); and ii) the property of reaching the same conclusion from equivalent premises. For each of these properties we develop a new evaluation procedure. For i) we offer a new synthetic dataset that can be used both for inference perception and inference generation; and for ii) we propose a null hypothesis test constructed to represent the different ways in which the inclusion of sentences with the same meaning can affect the training of a machine learning model. Our results show that although deep learning models achieve outstanding performance on the majority of NLI datasets, they still lack some important inference skills, such as dealing with counting operators, predicting which word can form an entailment given a specific context, and producing the same deductions for two different text inputs with the same meaning. This indicates that despite the high predictive power of these new models, they do present some inference biases that cannot be easily removed. Further investigation is needed to understand the scope of this bias. It is possible that this bias can be reduced by increasing the training sample size in the fine-tuning phase.
DOI: 10.11606/T.45.2020.tde-05012021-151600
Publisher: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística
Creation/publication date: 2020-12-14
Format: Adobe PDF
Language: English
