ABSTRACT
Describing, prioritizing, and selecting alleles from large sequencing experiments remains technically challenging. SeqAnt (https://seqant.emory.edu) is the first online, cloud-based application that makes these tasks accessible for non-programmers, even for terabyte-sized experiments containing thousands of whole-genome samples. It rapidly describes the alleles found within submitted VCF files, and then indexes the results in a natural-language search engine, which enables users to locate alleles of interest in milliseconds using normal English phrases. Our results show that SeqAnt decreases processing time by orders of magnitude and that its search engine can be used to precisely identify alleles by phenotype, genomic structure, and population genetics characteristics.
- Yang, H. & Wang, K 2015. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR. Nat Proc 10, 1556--1556 (2015)Google ScholarCross Ref
- McLaren, W. et al. 2016. The Ensembl Variant Effect Predictor. BMC Bioinformatics 17, 102 (2016).Google Scholar
Index Terms
- SeqAnt: Cloud-Based Whole-Genome Annotation and Search
Recommendations
Homology prediction refinement and reconstruction of gene content and order of ancestral bacterial genomes
BCB '10: Proceedings of the First ACM International Conference on Bioinformatics and Computational BiologyWe present a systematical methodology to refine orthologs identification generated by 3rd party de novo prediction programs and reconstruction of ancestral bacteria genome with this information by a neighboring gene pairs (NGPs) based method. The ...
eQTL networks unveil enriched mRNA master integrators downstream of complex disease-associated SNPs
Display Omitted We study the downstream transcriptome systems properties of 6432 SNPs of 574 complex diseases.eQTL network analyses unveil that downstream mRNAs are targeted by an excess number of SNPs.Thus multiple complex diseases associated SNPs tend ...
Enabling large-scale next-generation sequence assembly with Blacklight
A variety of extremely challenging biological sequence analyses were conducted on the XSEDE large shared memory resource Blacklight, using current bioinformatics tools and encompassing a wide range of scientific applications. These include genomic ...
Comments