skip to main content
Resource type Show Results with: Show Results with: Index

Cross-project defect prediction with meta-Learning

Porto, Faimison Rodrigues

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Ciências Matemáticas e de Computação 2017-09-29

Online access. The library also has physical copies.

  • Title:
    Cross-project defect prediction with meta-Learning
  • Author: Porto, Faimison Rodrigues
  • Supervisor: Simão, Adenilso da Silva
  • Subjects: Predição De Defeitos Cruzada Entre Projetos; Engenharia De Software Experimental; Meta-Aprendizado; Qualidade De Software; Predição De Defeitos Em Software; Software Defect Prediction; Cross-Project Defect Prediction; Meta-Learning; Experimental Software Engineering; Software Quality Assurance
  • Notes: Tese (Doutorado)
  • Description: Defect prediction models assist tester practitioners on prioritizing the most defect-prone parts of the software. The approach called Cross-Project Defect Prediction (CPDP) refers to the use of known external projects to compose the training set. This approach is useful when the amount of historical defect data of a company to compose the training set is inappropriate or insufficient. Although the principle is attractive, the predictive performance is a limiting factor. In recent years, several methods were proposed aiming at improving the predictive performance of CPDP models. However, to the best of our knowledge, there is no evidence of which CPDP methods typically perform best. Moreover, there is no evidence on which CPDP methods perform better for a specific application domain. In fact, there is no machine learning algorithm suitable for all domains. The decision task of selecting an appropriate algorithm for a given application domain is investigated in the meta-learning literature. A meta-learning model is characterized by its capacity of learning from previous experiences and adapting its inductive bias dynamically according to the target domain. In this work, we investigate the feasibility of using meta-learning for the recommendation of CPDP methods. In this thesis, three main goals were pursued. First, we provide an experimental analysis to investigate the feasibility of using Feature Selection (FS) methods as an internal procedure to improve the performance of two specific CPDP methods. Second, we investigate which CPDP methods present typically best performances. We also investigate whether the typically best methods perform best for the same project datasets. The results reveal that the most suitable CPDP method for a project can vary according to the project characteristics, which leads to the third investigation of this work. We investigate the several particularities inherent to the CPDP context and propose a meta-learning solution able to learn from previous experiences and recommend a suitable CDPD method according to the characteristics of the project being predicted. We evaluate the learning capacity of the proposed solution and its performance in relation to the typically best CPDP methods.
  • DOI: 10.11606/T.55.2018.tde-21032018-163840
  • Publisher: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Ciências Matemáticas e de Computação
  • Creation Date: 2017-09-29
  • Format: Adobe PDF
  • Language: English

Searching Remote Databases, Please Wait