Management of Genotyping-Related Documents by Integrated Use of Semantic Tagging

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 6990))

Abstract

Molecular biology laboratories have a widespread need for software systems that support the internal management of data and documents. A typical case is represented by genotyping procedures, which produce a large number of documents whose content may constitute a potentially important knowledge base. Exploiting this information requires a proper classification of the elements in the knowledge base, which can be effectively achieved using concepts and tools from research on the Semantic Web. In particular, genotyping-related documents can be handled through a DMS (Document Management System) that is also able to deal with semantic metadata, e.g., in the form of tags. The use of semantic tagging at this operating level is currently hampered by the lack of proper tools. In this paper, based on experience from a practical case, we present an integrated approach to managing relevant genotyping documents and dealing with their semantic tagging. A preliminary study of the test procedure workflow is crucial to understanding the document production processes. The semantic annotation employed makes use of terms taken from domain ontologies in the biomedical field. The annotation tool must be seamlessly integrated into the supporting DMS; the tool's flexibility and usability keep the overhead of the annotation process low, paving the way for widespread adoption of semantic tagging for genotyping-related documents.

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Bechini, A., Giannini, R. (2011). Management of Genotyping-Related Documents by Integrated Use of Semantic Tagging. In: Hameurlain, A., Küng, J., Wagner, R., Böhm, C., Eder, J., Plant, C. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems IV. Lecture Notes in Computer Science, vol 6990. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23740-9_2

  • DOI: https://doi.org/10.1007/978-3-642-23740-9_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23739-3

  • Online ISBN: 978-3-642-23740-9

  • eBook Packages: Computer Science, Computer Science (R0)
