Title: Impact of different mapping tools on detection of small RNAs in bacterial outer membrane vesicles

Bojana Banović Đeri1*, Sofija Nešić1, Ana Pantelić1, Jelena Samardžić1, Dragana Nikolić1

1University of Belgrade, Institute of Molecular Genetics and Genetic Engineering, Laboratory for Plant Molecular Biology, Vojvode Stepe 444a, Belgrade, Serbia

bojanabanovic [at] imgge.bg.ac.rs

Abstract

Bacterial small RNAs (sRNAs) represent a highly diverse RNA class ranging from 8 to 200 nucleotides in length, originating from the bacterial chromosome, plasmids or phages. After syntheses sRNAs can remain inside the bacterial cell, be secreted or packed into outer membrane vesicles (OMV), enabling various intra- and inter-kingdom interactions. Different sRNAs biotypes display differences in structure, mechanism of action and level of regulation (i.e. transcription, translation, mRNA stability, etc.), but could be broadly grouped in: trans-acting sRNAs (bind to target mRNAs) and cis-encoded sRNAs (or antisense RNA that may interact not only with mRNAs, but also with proteins and DNA).

Even though the advancement of high-throughput sequencing technology led to a burst of knowledge on small RNAs complexity and diversity, there are still specific challenges related to sRNA-seq data analysis that need to be resolved. Two main challenges, associated to short length of many bacterial sRNA biotypes, are: (i) to discriminate between functional sRNAs synthesized by bacterial cell and degradation fragments produced by sample preparation and (ii) to detect functional sRNAs displaying sequence variation. While loss of very small sized sRNAs could easily be overcome by cutting-off only the specific adapter sequences that were used in sRNA library preparation, providing a proper mapping still remains a strenuous task.

The aim of this study was to test five different mapping tools that are widely used in NGS data analysis (bbmap, bowtie2, bwa, minimap2 and segemehl) for their performances in mapping of bacterial OMV sRNA-seq data to bacterial reference genome. For this test publicly available NCBI sRNA-seq dataset from OMVs of Aliivibrio fischeri (PRJNA629425) was used, as it contained sRNAs of different length and biotype and because A.fischeri reference genome and annotation were available (PRJNA12986). We evaluated five mappers using alignment and assignment rates as well as computational time. Alignment rate was calculated as the ratio of aligned and input reads, while the assignment rate was calculated as the ratio of assigned and aligned reads. Finally, totals of detected sRNAs biotypes were compared between different mappers. The statistical analysis was performed in R (version 4.3.0) and performance metrics are discussed.

Keywords: small RNAs, outer membrane vesicles, mapping

Acknowledgements: This work was funded by the project “ExplOMV”, ID 7744906, Program IDEAS, Science Fund of the Republic of Serbia

Comments are closed.