skip to main content
Tipo de recurso Ver resultados con: Ver resultados con: Sumario

Accelerating the alignment phase of Minimap2 genome assembly algorithm Using GACT-X in a commercial Cloud FPGA machine.

Teng, Carolina

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Escola Politécnica 2022-07-27

Acceso en línea

  • Título:
    Accelerating the alignment phase of Minimap2 genome assembly algorithm Using GACT-X in a commercial Cloud FPGA machine.
  • Autor: Teng, Carolina
  • Orientador: Fonseca, Fernando Josepetti
  • Materias: Computação Em Nuvem; Algorítmos; Bioinformática; Circuitos Fpga; Genômica; Minimap2; Genomics; Field Programmable Gate Arrays; Acceleration; Co-Processors; Cloud Computing; Smith- Waterman-Gotoh
  • Notas: Dissertação (Mestrado)
  • Notas locales: Programa Engenharia Elétrica
  • Descripción: Genetic sequencing can provide crucial information in medicine and in biology studies. The technologies developed in the field are advancing rapidly and the current third-generation of genome sequencers have significant improvements over the secondgeneration. In parallel to that, sequencing throughput has been increasing at an exponential rate, which, coupled with price reduction, has resulted in a leap of generation of genomic data to be processed. Transistor technology is reaching its fundamental limits, and Moores Law is becoming obsolete, so other alternatives are required to efficiently process such an amount of data. Long-reads from the third generation of sequencers are shown to be an emerging type of genetic data, with average lengths of thousands of nucleotides each. State-of-the-Art algorithm Minimap2 is able to assemble these reads into the genome that was sampled, but it is a computationally-intensive process: for the human genome size with sufficient coverage, running times can reach up to dozens of CPU hours. Hardware acceleration has been proposed as an effort to make Minimap2 more efficient, but up to the present moment, only one of its main bottlenecks, the chaining step, has been successfully accelerated on FPGA. No efficient solution has been proposed for the aligning step, implemented as the ksw function. GACT-X is a Cloud FPGA design that performs a banded SWG alignment with fixed memory, suitable for any size of input. GACT-X with tiles of size 4,000 can be 2x faster than ksw when aligning long sequences. Replacing the alignment function ksw in Minimap2 with GACT-X on a Cloud hybrid system can provide up to 1.41x acceleration on the entire execution to the software counterpart, with comparable accuracy for data that have high similarity to the reference genome. This dissertation presents all the relevant background information, the development stages and methods, the results achieved on three different datasets, and the proposed future work on this acceleration project.
  • DOI: 10.11606/D.3.2022.tde-05092022-084236
  • Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Escola Politécnica
  • Fecha de creación: 2022-07-27
  • Formato: Adobe PDF
  • Idioma: Inglés

Buscando en bases de datos remotas, por favor espere