e-QA NGS: a user-friendly tool to preprocessing data from next generation sequencing

Autores

  • Antonio Sérgio Cruz Gaia Instituto Federal de Educação, Ciência e Tecnologia do Pará
  • Mônica Silva de Oliveira Universidade Federal do Pará
  • Gislenne da Silva Moia Universidade Federal do Pará
  • Victória Cardoso dos Santos Universidade Federal do Pará
  • Jorianne Thyeska Castro Alves Universidade Federal Rural da Amazônia
  • Pablo Henrique Caracciolo Gomes de Sá Universidade Federal Rural da Amazônia
  • Adonney Allan de Oliveira Veras Universidade Federal do Pará

DOI:

https://doi.org/10.53660/201.prw203

Palavras-chave:

Web tool, Pre-processing, Assessment, NGS Data

Resumo

In the last decade, sequencing platforms have provided advances in the research developed in various omics sciences, such as genomics, transcriptomics, and metagenomics, among others. On the other hand, these platforms still face challenges, in particular the low quality of the bases at the end of the reads, which can hinder future analyses, such as the assembly process. Making it necessary to pre-process these reads where various strategies are used, such as removing low-quality bases and applying a quality filter. There are numerous software programs to help with these tasks. However, most of them execute from long and complex command lines, running on the Linux operating system, which makes it difficult for users without deep computer knowledge. This study presents e-QA NGS (http://biod.ufpa.br/eqa/), a user-friendly web tool that contributes directly to reducing the complexity of pre-processing raw reads from the sequencing process, eliminating the need for the user to install dependencies, running complex command lines, or even have a deep knowledge in the computing area.

Downloads

Não há dados estatísticos.

Referências

BANKEVICH, Anton et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. Journal of Computational Biology, v. 19, n. 5, p. 455-477, maio 2012. Disponível em: https://doi.org/10.1089/cmb.2012.0021.

BOLGER, Anthony M.; LOHSE, Marc; USADEL, Bjoern. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, v. 30, n. 15, p. 2114-2120, 1 abr. 2014. Disponível em: https://doi.org/10.1093/bioinformatics/btu170.

DEL FABBRO, C., SCALABRIN, S., MORGANTE, M., GIORGI, F.,. An Extensive Evaluation of Read Trimming Effects on Illumina NGS Data Analysis. PLoS ONE, v. 8, n. 12, p. e85024, 23 dez. 2013b. Disponível em: https://doi.org/10.1371/journal.pone.0085024.

EDWARDS, David; BATLEY, Jacqueline. Plant genome sequencing: applications for crop improvement. Plant Biotechnology Journal, v. 8, n. 1, p. 2-9, jan. 2010. Disponível em: https://doi.org/10.1111/j.1467-7652.2009.00459.x.

EL-METWALLY, Sara et al. Next-Generation Sequence Assembly: Four Stages of Data Processing and Computational Challenges. PLoS Computational Biology, v. 9, n. 12, p. e1003345, 12 dez. 2013. Disponível em: https://doi.org/10.1371/journal.pcbi.1003345.

EWELS, Philip et al. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics, v. 32, n. 19, p. 3047-3048, 16 jun. 2016. Disponível em: https://doi.org/10.1093/bioinformatics/btw354.

FALGUERAS, Juan et al. SeqTrim: a high-throughput pipeline for preprocessing any type of sequence reads. BMC Bioinformatics, v. 11, n. 1, p. 38, 2010. Disponível em: https://doi.org/10.1186/1471-2105-11-38.

FAN, Bin et al. Cuckoo Filter. In: CONEXT '14: CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, Sydney Australia. CoNEXT '14: Conference on emerging Networking Experiments and Technologies. New York, NY, USA: ACM, 2014. ISBN 9781450332798. Disponível em: https://doi.org/10.1145/2674005.2674994.

GOECKS, Jeremy et al. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biology, v. 11, n. 8, p. R86, 2010. Disponível em: https://doi.org/10.1186/gb-2010-11-8-r86.

GOODWIN, Sara; MCPHERSON, John D.; MCCOMBIE, W. Richard. Coming of age: ten years of next-generation sequencing technologies. Nature Reviews Genetics, v. 17, n. 6, p. 333-351, 17 maio 2016. Disponível em: https://doi.org/10.1038/nrg.2016.49.

GUREVICH, Alexey; SAVELIEV, Vladislav; VYAHHI, Nikolay; TESLER, Glenn. QUAST: quality assessment tool for genome assemblies. Bioinformatics, v. 29, n. 8, p. 1072-1075, 19 fev. 2013.

Disponível em: https://doi.org/10.1093/bioinformatics/btt086.

HENSON, Joseph; TISCHLER, German; NING, Zemin. Next-generation sequencing and large genome assemblies. Pharmacogenomics, v. 13, n. 8, p. 901-915, jun. 2012. Disponível em: https://doi.org/10.2217/pgs.12.72.

LI, Bingshan et al. QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data. BioMed Research International, v. 2013, p. 1-4, 2013. Disponível em: https://doi.org/10.1155/2013/865181.

RAMÍREZ, Fidel et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research, v. 44, W1, p. W160—W165, 13 abr. 2016. Disponível em: https://doi.org/10.1093/nar/gkw257.

REID, Jeffrey G. et al. Launching genomics into the cloud: deployment of Mercury, a next generation sequence analysis pipeline. BMC Bioinformatics, v. 15, n. 1, 29 jan. 2014. Disponível em: https://doi.org/10.1186/1471-2105-15-30.

REUTER, Jason A.; SPACEK, Damek V.; SNYDER, Michael P. High-Throughput Sequencing Technologies. Molecular Cell, v. 58, n. 4, p. 586-597, maio 2015. Disponível em: https://doi.org/10.1016/j.molcel.2015.05.004.

VAN DIJK, Erwin L. et al. Ten years of next-generation sequencing technology. Trends in Genetics, v. 30, n. 9, p. 418-426, set. 2014. Disponível em: https://doi.org/10.1016/j.tig.2014.07.001.

Downloads

Publicado

2023-03-13

Como Citar

Gaia, A. S. C., de Oliveira, M. S., Moia, G. da S., dos Santos, V. C., Alves, J. T. C., de Sá, P. H. C. G., & Veras , A. A. de O. (2023). e-QA NGS: a user-friendly tool to preprocessing data from next generation sequencing. Peer Review, 5(3), 91–105. https://doi.org/10.53660/201.prw203

Edição

Seção

Artigos