Abstract
The paper presents a Grid Service allowing to detect and extract the longest common sub-spectrum among a set of mass spectrometry spectra data. The system uses a novel pattern extraction algorithm named LCSS (Longest Common Spectra SubString) that adapts a very popular string matching technique based on Suffix Trees to spectra data. The basic LCSS algorithm made available as a Grid Service is used to implement a pattern extraction workflow on mass spectrometry dataset. First experimental results are presented.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aebersold, R., Mann, M.: Mass spectrometry-based proteomics. Nature 422, 198–207 (2003)
Cannataro, M., Guzzi, P., Mazza, T., Tradigo, G., Veltri, P.: Preprocessing of mass spectrometry proteomics data on the grid. In: CBMS 2005, pp. 549–554. IEEE Press, Los Alamitos (2005)
Cannataro, M., Guzzi, P., Mazza, T., Tradigo, G., Veltri, P.: Using ontologies for preprocessing and mining spectra data on the grid. Future Generation Comp. Syst. 23(1), 55–60 (2007)
Alliance Globus. The globus project, http://www.globus.org/
Gopalakrishnan, V., William, E., Ranganathan, S., Bowser, R., Cudkowic, M.E., Novelli, M., Lattazi, W., Gambotto, A., Day, B.W.: Proteomic data mining challenges in identification of disease-specific biomarkers from variable resolution mass spectra. In: Proceedings of SIAM Bioinformatics Workshop 2004, Buena Vista, FL, April 2004, pp. 1–10 (2004)
Gusfield, D.: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cannataro, M., Veltri, P. (2006). A Grid Service Based on Suffix Trees for Pattern Extraction from Mass Spectrometry Proteomics Data. In: Min, G., Di Martino, B., Yang, L.T., Guo, M., Rünger, G. (eds) Frontiers of High Performance Computing and Networking – ISPA 2006 Workshops. ISPA 2006. Lecture Notes in Computer Science, vol 4331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11942634_68
Download citation
DOI: https://doi.org/10.1007/11942634_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49860-5
Online ISBN: 978-3-540-49862-9
eBook Packages: Computer ScienceComputer Science (R0)