skip to main content
Visitante
Meu Espaço
Minha Conta
Sair
Identificação
This feature requires javascript
Tags
Revistas Eletrônicas (eJournals)
Livros Eletrônicos (eBooks)
Bases de Dados
Bibliotecas USP
Ajuda
Ajuda
Idioma:
Inglês
Espanhol
Português
This feature required javascript
Search in:
Selecione a lista para navegar
Search For:
Clear Search Box
Search in:
Por título
Por assunto
Por autor
Browse vid input
Busca Simples
This feature requires javascript
Research data supporting "Towards an open-domain chatbot for language practice"
Tyen, Wen Hoi Gladys ; Brenchley, Mark ; Caines, Andrew ; Buttery, Paula
Department of Computer Science and Technology
Texto completo disponível
Citações
Citado por
Exibir Online
Detalhes
Resenhas & Tags
Mais Opções
Nº de Citações
This feature requires javascript
Enviar para
Adicionar ao Meu Espaço
Remover do Meu Espaço
E-mail (máximo 30 registros por vez)
Imprimir
Link permanente
Referência
EasyBib
EndNote
RefWorks
del.icio.us
Exportar RIS
Exportar BibTeX
This feature requires javascript
Título:
Research data supporting "Towards an open-domain chatbot for language practice"
Autor:
Tyen, Wen Hoi Gladys
;
Brenchley, Mark
;
Caines, Andrew
;
Buttery, Paula
Assuntos:
chatbot
;
dialogue system
;
language learning
;
neural text generation
;
text complexity
Notas:
https://doi.org/10.18653/v1/2022.bea-1.28
Descrição:
This dataset is a set of dialogues generated by an artificial dialogue system, along with difficulty and quality annotations. The dialogue system used is a modified version of BlenderBot 1.0 (Roller et al., 2020). In each dialogue, the system is adjusted to generate messages at a particular difficulty level (as denoted by CEFR levels). The system always responds to the previously generated message as if in a 2-person conversation. These dialogues are then shown to 10 English language examiners, who are asked to annotate the dialogues according to the difficulty and quality of the messages. They are asked to give an overall CEFR for the dialogue, as well as binary labels to each individual message denoting whether the message is grammatical, sensible, and specific to the conversation. The .json file contains a list of dictionaries, with the following keys: - "intended_cefr" refers to the CEFR level which was used when generating the dialogue. - "generation_method" refers to one of 5 methods used in Tyen et al. (2022). - "dialogue_turns" is the list of generated dialogue messages. - "cefr_annotations" contains a dictionary of CEFR levels as determined by annotators. - "grammaticality_annotations" is a list of dictionaries containing binary labels, referring to whether an annotator considered a dialogue message to be grammatical. The order of the dictionaries corresponds to the order of the dialogue messages in "dialogue_turns". - "sensibleness_annotations" is structured in the same way as "grammaticality_annotations", but instead describes whether an annotator thought the message was sensible. - "specificity_annotations" is structured in the same way as "grammaticality_annotations", but instead describes whether an annotator thought the message was specific to the conversation. For more detailed descriptions of the adjustments used in each method, as well as definitions of grammaticality, sensibleness, and specificity, please see Tyen et al. (2022) or Adiwardana et al. (2020). Tyen, G., Brenchley, M., Caines, A., & Buttery, P. (2022). Towards an open-domain chatbot for language practice. 17th Workshop on Innovative Use of NLP for Building Educational Applications. Adiwardana, D., Luong, M.-T., So, D. R., Hall, J., Thoppilan, R., Yang, Z., Kulshreshtha, A., Nemade, G., Lu, Y., & Le, Q. V. (2020). Towards a human-like open-domain chatbot. arXiv:2001.09977 Roller, S., Dinan, E., Goyal, N., Ju, D., Williamson, M., Liu, Y., ... & Weston, J. (2021, April). Recipes for Building an Open-Domain Chatbot. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (pp. 300-325). This research was supported by Cambridge University Press & Assessment. This work was performed using resources provided by the Cambridge Service for Data Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (www.csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/P020259/1), and DiRAC funding from the Science and Technology Facilities Council (www.dirac.ac.uk).
Editor:
Department of Computer Science and Technology
Idioma:
Inglês
Links
View record in Cambridge University Library$$FView record in $$GCambridge University Library
This feature requires javascript
This feature requires javascript
Voltar para lista de resultados
Anterior
Resultado
2
Avançar
This feature requires javascript
This feature requires javascript
Buscando em bases de dados remotas. Favor aguardar.
Buscando por
em
scope:(USP_VIDEOS),scope:("PRIMO"),scope:(USP_FISICO),scope:(USP_EREVISTAS),scope:(USP),scope:(USP_EBOOKS),scope:(USP_PRODUCAO),primo_central_multiple_fe
Mostrar o que foi encontrado até o momento
This feature requires javascript
This feature requires javascript