{"id":2890,"date":"2023-06-14T11:47:53","date_gmt":"2023-06-14T10:47:53","guid":{"rendered":"https:\/\/archive.belbi.bg.ac.rs\/2023\/?post_type=abstract&#038;p=2890"},"modified":"2023-06-14T18:37:23","modified_gmt":"2023-06-14T17:37:23","slug":"seeking-an-optimal-variant-calling-pipeline-for-medical-genetics","status":"publish","type":"abstract","link":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/","title":{"rendered":"Seeking an optimal variant calling pipeline for medical genetics"},"content":{"rendered":"\n<p>Yury A. Barbitoff<sup>1*<\/sup>, Alexandra Panteleeva<sup>1<\/sup>, Alexander V. Predeus<sup>1<\/sup><\/p>\n\n\n\n<p class=\"affiliation-para\"><sup>1<\/sup>Institute of Bioinformatics Research and Education, Belgrade, Serbia<\/p>\n\n\n\n<p>barbitoff [at] bioinf.institute<\/p>\n\n\n\n<p><strong>Abstract<\/strong><\/p>\n\n\n\n<p class=\"abstract-para\">Accurate and comprehensive variant discovery is extremely important for rare disease diagnostics using next-generation sequencing (NGS) methods. Over the recent years, a plethora of methods have been developed for short variant calling from NGS data, and the most recent tools extensively use machine learning algorithms for both variant discovery and filtering. In our study, we took an effort to systematically evaluate the performance of different pipelines for short variant calling in the human genome.<\/p>\n\n\n\n<p class=\"abstract-para\">To perform such a systematic comparison, we collected a large dataset of both \u201cgold standard\u201d (provided by the Genome In A Bottle (GIAB) consortium) and in-house whole-exome sequencing (WES) and whole-genome sequencing (WGS) datasets. (a total of 20 different datasets was used). We tested all combinations of 4 popular short read aligners (BWA, Bowtie2, Isaac, and Novoalign) and 9 novel and well-established variant calling and filtering methods (Freebayes, Clair3, DeepVariant, Genome Analysis ToolKit (GATK), Octopus, Strelka2). We also used several different tools for preprocessing of short reads.<\/p>\n\n\n\n<p class=\"abstract-para\">Our analysis showed negligible effects of adapter trimming on the accuracy of short variant calling. Among read aligners, Bowtie2 performed significantly worse than other tools, suggesting it should not be used for medical variant calling. For pipelines based on BWA, Isaac, and Novoalign, the accuracy of variant discovery mostly depended on the variant caller and not the read aligner. DeepVariant consistently showed the best performance and the greatest robustness compared to all other tested variant callers. We have also compared the consistency of variant calls in GIAB and non-GIAB samples. With few important caveats, best-performing tools have shown little evidence of overfitting.&nbsp;<\/p>\n\n\n\n<p class=\"abstract-para\">Taken together, our study showed that modern strategies for NGS data analysis allow for high accuracy of genetic variant discovery within coding regions of the human genome. However, there is still a need for development of new library preparation and variant calling methods to enhance variant discovery in the challenging regions of the human genome.<\/p>\n\n\n\n<p class=\"abstract-para\"><strong>Keywords:<\/strong> pipeline, variant calling, human genetics, medical genetics <\/p>\n\n\n\n<p class=\"abstract-para\"><strong>Acknowledgement:<\/strong> We thank JetBrains Ltd. for providing financial support and computing resources for the project.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Yury A. Barbitoff<sup>1*<\/sup>, Alexandra Panteleeva<sup>1<\/sup>, Alexander V. Predeus<sup>1<\/sup><\/p>\n<p class=\"affiliation-para\"><sup>1<\/sup>Institute of Bioinformatics Research and Education, Belgrade, Serbia<\/p>\n<p> <a class=\"continue-reading-link\" href=\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/\"><span>Continue reading<\/span><i class=\"crycon-right-dir\"><\/i><\/a> <\/p>\n","protected":false},"author":162,"featured_media":0,"template":"","categories":[8],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.0 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023<\/title>\n<meta name=\"robots\" content=\"noindex, follow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023\" \/>\n<meta property=\"og:description\" content=\"Yury A. Barbitoff1*, Alexandra Panteleeva1, Alexander V. Predeus11Institute of Bioinformatics Research and Education, Belgrade, Serbia Continue reading\" \/>\n<meta property=\"og:url\" content=\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/\" \/>\n<meta property=\"og:site_name\" content=\"BelBi 2023\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-14T17:37:23+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/\",\"url\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/\",\"name\":\"Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023\",\"isPartOf\":{\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#website\"},\"datePublished\":\"2023-06-14T10:47:53+00:00\",\"dateModified\":\"2023-06-14T17:37:23+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Seeking an optimal variant calling pipeline for medical genetics\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#website\",\"url\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/\",\"name\":\"BelBi 2023\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#organization\",\"name\":\"Belgrade Bioinformatics Conference\",\"url\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-content\/uploads\/2023\/02\/145_97_171.png\",\"contentUrl\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-content\/uploads\/2023\/02\/145_97_171.png\",\"width\":278,\"height\":500,\"caption\":\"Belgrade Bioinformatics Conference\"},\"image\":{\"@id\":\"https:\/\/archive.belbi.bg.ac.rs\/2023\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023","robots":{"index":"noindex","follow":"follow"},"og_locale":"en_US","og_type":"article","og_title":"Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023","og_description":"Yury A. Barbitoff1*, Alexandra Panteleeva1, Alexander V. Predeus11Institute of Bioinformatics Research and Education, Belgrade, Serbia Continue reading","og_url":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/","og_site_name":"BelBi 2023","article_modified_time":"2023-06-14T17:37:23+00:00","twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/","url":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/","name":"Seeking an optimal variant calling pipeline for medical genetics - BelBi 2023","isPartOf":{"@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#website"},"datePublished":"2023-06-14T10:47:53+00:00","dateModified":"2023-06-14T17:37:23+00:00","breadcrumb":{"@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/abstract\/seeking-an-optimal-variant-calling-pipeline-for-medical-genetics\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/archive.belbi.bg.ac.rs\/2023\/"},{"@type":"ListItem","position":2,"name":"Seeking an optimal variant calling pipeline for medical genetics"}]},{"@type":"WebSite","@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#website","url":"https:\/\/archive.belbi.bg.ac.rs\/2023\/","name":"BelBi 2023","description":"","publisher":{"@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/archive.belbi.bg.ac.rs\/2023\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#organization","name":"Belgrade Bioinformatics Conference","url":"https:\/\/archive.belbi.bg.ac.rs\/2023\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#\/schema\/logo\/image\/","url":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-content\/uploads\/2023\/02\/145_97_171.png","contentUrl":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-content\/uploads\/2023\/02\/145_97_171.png","width":278,"height":500,"caption":"Belgrade Bioinformatics Conference"},"image":{"@id":"https:\/\/archive.belbi.bg.ac.rs\/2023\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/abstract\/2890"}],"collection":[{"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/abstract"}],"about":[{"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/types\/abstract"}],"author":[{"embeddable":true,"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/users\/162"}],"wp:attachment":[{"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/media?parent=2890"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/categories?post=2890"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.belbi.bg.ac.rs\/2023\/wp-json\/wp\/v2\/tags?post=2890"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}