Find in Library
Search millions of books, articles, and more
Indexed Open Access Databases
Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome
oleh: Ghausia Begum, Ammar Albanna, Asma Bankapur, Nasna Nassir, Richa Tambi, Bakhrom K. Berdiev, Hosneara Akter, Noushad Karuvantevida, Barbara Kellam, Deena Alhashmi, Wilson W. L. Sung, Bhooma Thiruvahindrapuram, Alawi Alsheikh-Ali, Stephen W. Scherer, Mohammed Uddin
Format: | Article |
---|---|
Diterbitkan: | MDPI AG 2021-02-01 |
Deskripsi
The advent of long-read sequencing offers a new assessment method of detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorders (ASD) cases where pathogenic variants fail to be found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing on an Emirati family having three children with ASD using long and short-read sequencing technology. A series of analytical pipelines were established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, we observed that long-read sequencing technology (987 variants) detected a significantly higher number of SVs when compared to variants detected using short-read technology (509 variants) (<i>p</i>-value < 1.1020 × 10<sup>−57</sup>). Further comparison showed 97.9% of long-read sequencing variants were spanning within the 1–100 kb size range (<i>p</i>-value < 9.080 × 10<sup>−67</sup>) and impacting over 5000 genes. Moreover, long-read variants detected 604 non-coding RNAs (<i>p</i>-value < 9.02 × 10<sup>−9</sup>), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has shown to be a reliable technology in detecting SVs impacting complex elements of the genome.