Abstract
Enormous volume of DNA sequences of organisms are continuously being discovered by genome sequencing projects around the world. The task of identifying biological function prediction for the DNA sequences is a key activity in genome projects. This task is done in the annotation phase, which is divided into automatic and manual. The automatic annotation has the objective of finding, for each DNA sequence identified in the project, similar sequences among millions, stored in public databases, by using approximated pattern matching algorithms. The manual annotation is done by the biologists, that use the results produced by the automatic annotation, and their knowledge and experience, to decide the function prediction to each DNA sequence. In this way, the biologists guarantee accuracy and correctness to each sequence function prediction. This work presents a new version of BioAgents, a multiagent system (MAS) for supporting manual annotation. The system simulates the biologists’ knowledge and experience for annotating DNA sequences in genome sequencing projects. The MAS cooperative approach, allows to create different specialized intelligent agents that, working together, suggest proper manual annotation. BioAgents was defined with a three-layer architecture using the JADE framework with a ruler-based engine (JESS). We have done experiments with real data from three different genome sequencing projects: Paracoccidioides brasilienses fungus, Paullinia cupana (guaraná) plant and Anaplasma marginale rickettsia. The produced results were encouraging, which prove the usefulness of BioAgents.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Anaplasma marginalis St. Maries, http://www.ncbi.nlm.nih.gov/sites/entrez?Db=genome&Cmd=ShowDetailView&T
Clusters of Orthologous Groups of proteins (COG), http://www.ncbi.nlm.nih.gov/COG/
Eclipse SDK, http://www.eclipse.org
kog database, http://www.ncbi.nlm.nih.gov/COG/grace/shokog.cgi
nr database, http://www.ncbi.nlm.nih.gov/blast/blast_databases.shtml
Framework BioJava, http://biojava.org/wiki/Main_Page
GeneOntology (GO), http://www.geneontology.org/
Genome Project Anaplasma, https://www.biomol.unb.br/anaplasma/servlet/IndexServlet
Genome Project Guaraná, https://dna.biomol.unb.br/GR/
Genome Project Jararaca, https://helix.biomol.unb.br/jararaca/servlet/IndexServlet
Genome Project Pb, https://dna.biomol.unb.br/Pb-eng/
The IGS Annotation Engine, http://ae.igs.umaryland.edu/
Java Agent DEvelopment Framework - JADE, http://jade.tilab.com
Java Expert System Shell - JESS, http://www.jessrules.com/jess/index.shtml
Java Language, http://java.sun.com
nr-genbank, http://www.ncbi.nlm.nih.gov/Genbank/
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215(3), 403–410 (1990)
Bellifemine, F., Caire, G., Poggi, A., Rimassa, G.: JADE - a white paper. White Paper 3, TILAB - Telecom Italia Lab (September 2003)
Bellifemine, F., Caire, G., Trucco, T., Rimassa, G.: Jade Programmer’s Guide (June 2007), http://jade.tilab.com/doc/programmersguide.pdf
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Wheeler, D.L.: Genbank. Nucleic Acids Res. 36 (Database issue) (January 2008)
Decker, K., Zheng, X., Schmidt, C.: A multi-agent system for automated genomic annotation. In: AGENTS 2001: Proceedings of the 5th international conference on Autonomous agents, pp. 433–440. ACM, New York (2001)
Ding, L., Sabo, A., Berkowicz, N., Meyer, R.R., Shotland, Y., Johnson, M.R., Pepin, K.H., Wilson, R.K., Spieth, J.: EAnnot: A genome annotation tool using experimental evidence. Genome Research 14(12), 2503–2509 (2004)
do Nascimento, L.V., Bazzan, A.L.C.: An agent-based system for re-annotation of genomes. In: III Brazilian Workshop on Bioinformatics (WOB), pp. 41–48 (2004)
dos Santos, C.T., Bazzan, A.L.C.: Using the A3C system for annotation of keywords - a case study. In: III Brazilian Workshop on Bioinformatics (WOB), pp. 175–178 (2004)
Hill, E.F.: Jess in Action: Java Rule-Based Systems. Manning Publications Co., Greenwich (2003)
Lima, R.S.: Sistema multiagente para anotação manual em projetos de sequenciamento de genomas. Master’s thesis, Department of Computer Science, University of Brasília (2007), http://monografias.cic.unb.br/dspace/handle/123456789/28/browse-title
Lima, R.S., Ralha, C.G., Walter, M.E.M.T., Brígido, M.M.: A multiagent system to help manual annotation on genome sequencing projects. In: IWGD 2005: Proceedings of the International Workshop on Genomic Databases (2005), http://www.biowebdb.org/iwgd05/proceedings/multiagent-system.pdf
Lima, R.S., Ralha, C.G., Walter, M.E.M.T., Schneider, H.W., Pereira, A.G.F., Brígido, M.M.: BioAgents: A multiagent system for manual annotation on genome sequencing projects. In: IWGD 2007: Proceedings of the International Workshop on Genomic Databases (2007), http://bsb2007.inf.puc-rio.br/index.php?pg=home
Lima, R.S., Ralha, C.G., Walter, M.E.M.T., Schneider, H.W., Pereira, A.G.F., Brígido, M.M.: BioAgents: Um sistema multiagente para anotação manual em projetos de seqüenciamento de genomas. In: ENIA 2007: 6th Brazilian Meeting on Artificial Intelligence, Brazil, pp. 1302–1310 (2007), http://www.sbc.de9.ime.eb.br/
Liolios, K., Tavernarakis, N., Hugenholtz, P., Kyrpides, N.: The Genomes On Line Database (GOLD) v.2: a monitor of genome projects worldwide. Nucleic Acids Research 34, 332–334 (2006) (Database-Issue)
Pearson, W.R., Lipman, D.J.: Improved tools for biological sequence comparison. Proceedings of the National Academy of Sciences of the USA 85, 2444–2448 (1988)
Weiss, G.: Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. The MIT Press, Cambridge (July 2000)
Wooldridge, M.: Introduction to MultiAgent Systems. John Wiley & Sons, Chichester (June 2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ralha, C.G., Schneider, H.W., da Fonseca, L.O., Walter, M.E.M.T., Brígido, M.M. (2008). Using BioAgents for Supporting Manual Annotation on Genome Sequencing Projects. In: Bazzan, A.L.C., Craven, M., Martins, N.F. (eds) Advances in Bioinformatics and Computational Biology. BSB 2008. Lecture Notes in Computer Science(), vol 5167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85557-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-85557-6_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85556-9
Online ISBN: 978-3-540-85557-6
eBook Packages: Computer ScienceComputer Science (R0)