Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8401))

  • 3578 Accesses

Abstract

The past decade has seen explosive growth in digitized medical data. This trend offers medical practitioners an unparalleled opportunity to identify effectiveness of treatments for patients using summary statistics and to offer patients more personalized medical treatments based on predictive analytics. To exploit this opportunity, statisticians and computer scientists need to work and communicate effectively with medical practitioners to ensure proper measurement data, collection of sufficient volumes of heterogeneous data to ensure patient privacy, and understanding of probabilities and sources of errors associated with data sampling. Interdisciplinary collaborations between scientists are likely to lead to the development of more effective methods for explaining probabilities, possible errors, and risks associated with treatment options to patients. This chapter introduces some online resources to help medical practitioners with little or no background in summary and predictive statistics learn basic statistical concepts and implement data analysis on their personal computers using R, a high-level computer language that requires relatively little training. Readers who are only interested in understanding basic statistical concepts may want to skip the subsection on R.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Report on BridgeHead’s 2010 Data Management Healthcheck Survey Results, http://www.bridgeheadsoftware.com/resources/category/reports/ (accessed February 19, 2014)

  2. The BridgeHead Software 2011 International Healthcheck Data Management Survey, http://www.bridgeheadsoftware.com/resources/category/reports/ (accessed February 19, 2014)

  3. IBM Big Data Website, http://www-01.ibm.com/software/data/bigdata/industry-healthcare.html#2 (accessed February 19, 2014)

  4. Creswell, J.: A digital shift on health data swells profits in an industry. NY Times on-line, http://ww.nytimes.com/2013/02/20/business/a-digital-shift-on-health-data-swells-profits.html?pagewanted=all&_r=0 (February 19, 2013)

  5. Kerr, W., Lau, E., Owens, G., Trefler, A.: The future of medical diagnostics: large digitized databases. Yale Journal of Biology and Medicine 85(3), 363–377 (2012), http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3447200/

    Google Scholar 

  6. Holzinger, A., Dehmer, M., Jurisica, I.: Knowledge discovery and interactive data mining in bioinformatics – state-of-the-art, future challenges and research directions. BMC Bioinformatics 15(suppl. 6) (2014)

    Google Scholar 

  7. Hood, L., Rowen, L., Galas, D., Aitchison, J.: Systems biology at the Institute of Systems Biology. Briefings in Functional Genomics and Proteomics 7(4), 239–248 (2008)

    Article  Google Scholar 

  8. Gibbs, W.: Medicine gets up close and personal. Nature, http://www.nature.com/news/medicine-gets-up-close-and-personal-1.14702 (accessed February 11, 2014)

  9. Holzinger, A.: Human-computer interaction and knowledge discovery (HCI-KDD): What is the benefit of bringing those two fields to work together? In: Cuzzocrea, A., Kittl, C., Simos, D.E., Weippl, E., Xu, L. (eds.) CD-ARES 2013. LNCS, vol. 8127, pp. 319–328. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  10. The R project for statistical computing, http://www.r-project.org/ (accessed February 11, 2014)

  11. Holzinger, A.: Biomedical Informatics: Computational Sciences meets Life Sciences. BoD, Norderstedt (2012)

    Google Scholar 

  12. Wikipedia, http://en.wikipedia.org/wiki/Cloud_computing (February 11, 2014)

  13. Wikipedia, http://en.wikipedia.org/wiki/Massive_Open_Online_Course (February 11, 2014)

  14. Merriam-Webster on-line, http://www.merriam-webster.com/dictionary/statistics

  15. Wikipedia, http://en.wikipedia.org/wiki/Summary_statistics (February 11, 2014)

  16. Wikipedia, http://en.wikipedia.org/wiki/Statistical_inference (February 11, 2014)

  17. Merriam-Webster, http://www.merriam-webster.com/dictionary/correlation (February 11, 2014)

  18. Wolfram MathWorldTM, http://mathworld.wolfram.com/Probability.html

  19. Evfimievski, A., Grandison, T.: Privacy-preserving data mining. In: Ferraggine, V., Doorn, J., Rivero, L. (eds.) Handbook of Research on Innovations in Database Technologies and Applications, pp. 527–536. IGI Global, Hershey (2009)

    Chapter  Google Scholar 

  20. Aggarwal, C., Yu, P.: A general survey of privacy-preserving data mining models and algorithms. In: Aggarwal, C. Yu, P. (eds.), pp. 11–52. Springer, New York (2008)

    Google Scholar 

  21. Pasierb, K., Kajdanowicz, T., Kazienko, P.: Privacy-preserving data mining, sharing and publishing. Journal of Medical Informatics and Technologies 18, 69–76 (2011)

    Google Scholar 

  22. Vaidya, J., Clifton, C., Zhu, M.: Privacy and data mining. In: Vaidya, J., Clifton, C., Zhu, M. (eds.) Privacy Preserving Data Mining, ch. 1. Springer, NY (2006)

    Google Scholar 

  23. Coursera website, http://www.coursera.org

  24. MIT OpenCourseware website, ocw.mit.edu/index.htm

  25. CMU’s Open Learning Initiative, oli.cmu.edu

  26. Udacity website, http://www.udacity.com

  27. Chase, W., Bown, F.: General Statistics, 3rd edn. John Wiley and Sons, NY (1997)

    Google Scholar 

  28. Kerns, G.: Introduction to Probability and Statistics Using R. G. Jay Kerns publisher (2010), http://www.amazon.com , incomplete, rough draft available on-line, cran.r-project.org/web/packages/IPSUR/vignettes/IPSUR.pdf (accessed February 19, 2014)

  29. Der, G., Everitt, J.: Applied Medical Statistics using SAS. CRC Press, Boca Raton (2013)

    Google Scholar 

  30. IBM SPSS, http://www.ibm.com/software/analytics/spss/products/modeler/

  31. StatSoft STATISTICA, http://www.statsoft.com/Solutions/Healthcare

  32. Athena Health Electronic Medical Record (EMR), http://www.athenahealth.com/Clinicals

  33. Wikipedia entry for S, http://en.wikipedia.org/wiki/S_%28programming_language%29

  34. Becker, R.: A brief history of S, kabah.lcg.unam.mx/~lcollado/R/resources/history_of_S.pdf (accessed February 20, 2014)

  35. Try R, http://tryr.codeschool.com/

  36. Code School, http://www.codeschool.com/

  37. O’Reilly, http://oreilly.com/

  38. Computing for Data Analysis, Coursera, http://www.coursera.org/course/compdata

  39. Data Analysis, Coursera, http://www.coursera.org/course/dataanalysis

  40. Core Concepts in Data Analysis, Coursera, http://www.coursera.org/course/datan

  41. Apple, http://www.apple.com

  42. Prinzel, Y.: How Steve Jobs’ health problems have impacted Applestock over the years. Covestor.com (January 11, 2011), http://investing.covestor.com/2011/01/how-steve-jobs-health-problems-have-impacted-apple-stock-over-the-years-aapl

  43. Ovide, S.: Amazon CEO Jeff Bezos evacuated for kidney stone attack. Wall Street Jour. (January 11, 2011), blogs.wsj.com/digits/2014/01/04/amazon-ceo-gives-kidney-stones-zero-stars/

  44. Herper, M.: From fitbits to clinical studies: how big data could change medicine. Forbes on-line (December 16, 2013), http://www.forbes.com/sites/matthewherper/2013/12/16/from-fitbits-to-clinical-studies-how-big-data-could-change-medicine/

  45. Athena Health website, http://www.athenahealth.com/our-company/about-us/medical-practice-management.php

  46. Netlib, http://www.netlib.org/

  47. Dongarra, J., Grosse, E.: Distribution of mathematical software via electronic mail. Comm. of the ACM 30(5), 403–407 (1987)

    Article  Google Scholar 

  48. Sang-Hun, C.: Disgraced cloning expert convicted in South Korea. NY Times (October 26, 2009), http://www.nytimes.com/2009/10/27/world/asia/27clone.html?ref=hwangwoosuk&_r=0

  49. Agence France-Presse, Tokyo: Japan looks into claim ‘falsified’ data was used in Alzheimer’s drug study (January 10, 2014), http://www.scmp.com/news/asia/article/1402542/japan-looks-claim-falsified-data-was-used-alzheimers-drug-study

  50. Takenaka, K., Kelland, K.: Japanese scientist urges withdrawal of own ‘breakthrough’ stem cell research. InterAksyon.com (March 11, 2014), http://www.interaksyon.com/article/82452/japanese-scientist-urges-withdrawal-of-own-breakthrough-stem-cell-research

  51. J-ADNI homepage, http://www.j-adni.org/etop.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kobayashi, M. (2014). Resources for Studying Statistical Analysis of Biomedical Data and R. In: Holzinger, A., Jurisica, I. (eds) Interactive Knowledge Discovery and Data Mining in Biomedical Informatics. Lecture Notes in Computer Science, vol 8401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-43968-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-43968-5_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-43967-8

  • Online ISBN: 978-3-662-43968-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics