skip to main content
10.1145/2795218.2795221acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
short-paper

Data Like This: Ranked Search of Genomic Data Vision Paper

Authors Info & Claims
Published:31 May 2015Publication History

ABSTRACT

High-throughput genetic sequencing produces the ultimate "big data": a human genome sequence contains more than 3B base pairs, and more and more characteristics, or annotations, are being recorded at the base-pair level. Locating areas of interest within the genome is a challenge for researchers, limiting their investigations. We describe our vision of adapting "big data" ranked search to the problem of searching the genome. Our goal is to make searching for data as easy for scientists as searching the Internet.

References

  1. Agrawal, R. and Srikant, R. 2003. Searching with numbers. IEEE TKDE. 15, 4 (Aug. 2003), 855--870. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ahrens, J.P. et al. 2011. Data-intensive science in the US DOE. CISE. 13, 6 (Dec. 2011), 14--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Altschul, S.F. et al. 1997. Gapped BLAST and PSI-BLAST. Nucleic acids res. 25, 17 (1997), 3389--3402.Google ScholarGoogle Scholar
  4. Cafarella, M.J. et al. 2008. Webtables: exploring the power of tables on the web. VLDB. 1, 1 (2008), 538--549. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. CURSOR: http://cursor.businesscatalyst.com/index.html. Accessed: 2015-02-23.Google ScholarGoogle Scholar
  6. Krzywinski, M. et al. 2009. Circos: An information aesthetic for comparative genomics. Genome Research. 19, 9 (Sep. 2009), 1639--1645.Google ScholarGoogle ScholarCross RefCross Ref
  7. Maier, D. et al. 2012. Navigating oceans of data. Scientific and Statistical Database Management (2012), 1--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Martin Sanchez, F. et al. 2013. Exposome informatics. J. of Am. Medical Informatics Ass. 21, 3 (Nov. 2013), 386--390.Google ScholarGoogle Scholar
  9. Megler, V.M. 2014. Ranked Similarity Search of Scientific Datasets (PhD Dissertation). Portland State University.Google ScholarGoogle Scholar
  10. Megler, V.M. and Maier, D. 2015. Are Datasets Like Documents?. IEEE TKDE. 27, 1 (Jan. 2015), 32--45.Google ScholarGoogle Scholar
  11. Robinson, J.T. et al. 2011. Integrative Genomics Viewer. Nature Biotechnology. 29, (2011), 24--26.Google ScholarGoogle Scholar
  12. UCSC Genome Browser: http://genome.ucsc.edu/.Google ScholarGoogle Scholar
  13. Venetis, P. et al. 2011. Recovering semantics of tables on the web. Proceedings of VLDB. 4, 9 (2011), 528--538. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Weidman, S. and Arrison, T. 2009. Steps toward large-scale data integration in the sciences. NRC/NAGoogle ScholarGoogle Scholar

Index Terms

  1. Data Like This: Ranked Search of Genomic Data Vision Paper

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ExploreDB '15: Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web
          May 2015
          37 pages
          ISBN:9781450337403
          DOI:10.1145/2795218

          Copyright © 2015 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 31 May 2015

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper
          • Research
          • Refereed limited

          Acceptance Rates

          ExploreDB '15 Paper Acceptance Rate6of10submissions,60%Overall Acceptance Rate11of21submissions,52%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader