DOI: 10.1145/2464576.2482725
tutorial

Correlation of microarray probes give evidence for mycoplasma contamination in human studies

Published: 06 July 2013

Abstract

At least 473 Affymetrix HG-U133 +2 Homo sapiens probes match one or more species of mycoplasma. Analysis of published data from thousands of human GeneChips finds correlations between Homo sapiens studies carried out by different microbiology laboratories in different countries, which suggests that contamination with mycoplasma is the common factor. This highlights the need for experts in evolutionary computation to apply due diligence before relying on public medical datasets. Caveat emptor, even if the data are free!
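
As a rough illustration of the kind of check the abstract describes, the sketch below computes Pearson correlations between probe intensity profiles across arrays and reports strongly correlated pairs. It is not the author's pipeline: the probe names, the toy intensities, the helper names `pearson` and `correlated_pairs`, and the 0.8 threshold are all assumptions made for this example.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / sqrt(vx * vy)


def correlated_pairs(intensities, threshold=0.8):
    """Return (probe_a, probe_b, r) for every pair with r >= threshold.

    `intensities` maps a probe name to its values across arrays.
    The threshold is an illustrative assumption, not a published value.
    """
    pairs = []
    for a, b in combinations(sorted(intensities), 2):
        r = pearson(intensities[a], intensities[b])
        if r >= threshold:
            pairs.append((a, b, r))
    return pairs


# Made-up log-intensities for three hypothetical probes over five arrays.
toy = {
    "probe_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "probe_B": [2.1, 3.9, 6.2, 8.0, 9.9],
    "probe_C": [5.0, 1.0, 4.0, 2.0, 3.0],
}
result = correlated_pairs(toy)
```

In this toy data only `probe_A` and `probe_B` rise and fall together, so they are the only pair reported. On real GeneChip data, probes that match mycoplasma sequence and correlate with one another across unrelated studies would be the candidates worth a closer look.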

Cited By

  • (2014) Analysis of discordant Affymetrix probesets casts serious doubt on idea of microarray data reutilization. BMC Genomics 15:S12. DOI: 10.1186/1471-2164-15-S12-S8. Online publication date: 19-Dec-2014

Published In

GECCO '13 Companion: Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
July 2013
1798 pages
ISBN:9781450319645
DOI:10.1145/2464576
  • Editor: Christian Blum
  • General Chair: Enrique Alba
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. dna gene expression
  2. evolutionary algorithm
  3. homo sapiens genome reference consortium grch37.p5 h_sapiens_37.5_asm

Qualifiers

  • Tutorial

Conference

GECCO '13
GECCO '13: Genetic and Evolutionary Computation Conference
July 6 - 10, 2013
Amsterdam, The Netherlands

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%
