skip to main content

Deep neural semantic parsing: translating from natural language into SPARQL

Luz, Fabiano Ferreira

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2019-02-07

Acesso online. A biblioteca também possui exemplares impressos.

  • Título:
    Deep neural semantic parsing: translating from natural language into SPARQL
  • Autor: Luz, Fabiano Ferreira
  • Orientador: Finger, Marcelo
  • Assuntos: Análise Semântica; Palavras Associadas; Rdf; Ontologias; Lstm; Rnn; Gramáticas; Glc; Codificação Decodificação; Sparql; Pln; Semantic Parsing; Ontology; Nlp; Grammars; Encoder Decoder; Cfg; Word Embeddings
  • Notas: Tese (Doutorado)
  • Descrição: Semantic parsing is the process of mapping a natural-language sentence into a machine-readable, formal representation of its meaning. The LSTM Encoder-Decoder is a neural architecture with the ability to map a source language into a target one. We are interested in the problem of mapping natural language into SPARQL queries, and we seek to contribute with strategies that do not rely on handcrafted rules, high-quality lexicons, manually-built templates or other handmade complex structures. In this context, we present two contributions to the problem of semantic parsing departing from the LSTM encoder-decoder. While natural language has well defined vector representation methods that use a very large volume of texts, formal languages, like SPARQL queries, suffer from lack of suitable methods for vector representation. In the first contribution we improve the representation of SPARQL vectors. We start by obtaining an alignment matrix between the two vocabularies, natural language and SPARQL terms, which allows us to refine a vectorial representation of SPARQL items. With this refinement we obtained better results in the posterior training for the semantic parsing model. In the second contribution we propose a neural architecture, that we call Encoder CFG-Decoder, whose output conforms to a given context-free grammar. Unlike the traditional LSTM encoder-decoder, our model provides a grammatical guarantee for the mapping process, which is particularly important for practical cases where grammatical errors can cause critical failures. Results confirm that any output generated by our model obeys the given CFG, and we observe a translation accuracy improvement when compared with other results from the literature.
  • DOI: 10.11606/T.45.2019.tde-01042019-101602
  • Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística
  • Data de criação/publicação: 2019-02-07
  • Formato: Adobe PDF
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.