Why Does Synthesized Data Improve Multi-sequence Classification?

van Tulder, Gijs; de Bruijne, Marleen

doi:10.1007/978-3-319-24553-9_65

Gijs van Tulder¹⁷ &
Marleen de Bruijne^17,18

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9349))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

10k Accesses
30 Citations

Abstract

The classification and registration of incomplete multi-modal medical images, such as multi-sequence MRI with missing sequences, can sometimes be improved by replacing the missing modalities with synthetic data. This may seem counter-intuitive: synthetic data is derived from data that is already available, so it does not add new information. Why can it still improve performance? In this paper we discuss possible explanations. If the synthesis model is more flexible than the classifier, the synthesis model can provide features that the classifier could not have extracted from the original data. In addition, using synthetic information to complete incomplete samples increases the size of the training set.

We present experiments with two classifiers, linear support vector machines (SVMs) and random forests, together with two synthesis methods that can replace missing data in an image classification problem: neural networks and restricted Boltzmann machines (RBMs). We used data from the BRATS 2013 brain tumor segmentation challenge, which includes multi-modal MRI scans with T1, T1 post-contrast, T2 and FLAIR sequences. The linear SVMs appear to benefit from the complex transformations offered by the synthesis models, whereas the random forests mostly benefit from having more training data. Training on the hidden representation from the RBM brought the accuracy of the linear SVMs close to that of random forests.

Download to read the full chapter text

Chapter PDF

HeMIS: Hetero-Modal Image Segmentation

Medical Imaging Based Diagnosis Through Machine Learning and Data Analysis

XmoNet: A Fully Convolutional Network for Cross-Modality MR Image Inference

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Little, R.J.A., Rubin, D.B.: Statistical analysis with missing data, 2nd edn. Wiley, New York (2002)
MATH Google Scholar
Fischl, B., Salat, D.H., van der Kouwe, A.J.W., Makris, N., Ségonne, F., Quinn, B.T., Dale, A.M.: Sequence-independent segmentation of magnetic resonance images. NeuroImage 23, S69–S84 (2004)
Google Scholar
Johansson, A., Karlsson, M., Nyholm, T.: CT substitute derived from MRI sequences with ultrashort echo time. Medical Physics 38(5) (2011)
Google Scholar
Johansson, A., Garpebring, A., Asklund, T., Nyholm, T.: CT substituties derived from MR images reconstructed with parallel imaging. Medical Physics 41 (2014)
Google Scholar
Eilertsen, K., Vestad, L.N.T.A., Geier, O., Skretting, A.: A simulation of MRI based dose calculations on the basis of radiotherapy planning CT images. Acta Oncologica 47(7), 1294–1302 (2008)
Article Google Scholar
Kapanen, M., Tenhunen, M.: T1/T2*-weighted MRI provides clinically relevant pseudo-CT density data for the pelvic bones in MRI-only based radiotherapy treatment planning. Acta Oncologica (Stockholm, Sweden) 52(3), 612–618 (2013)
Article Google Scholar
Larsson, A., Johansson, A., Axelsson, J., Nyholm, T., Asklund, T., Riklund, K., Karlsson, M.: Evaluation of an attenuation correction method for PET/MR imaging of the head based on substitute CT images. Magnetic Resonance Materials in Physics, Biology and Medicine 26(1), 127–136 (2013)
Article Google Scholar
Hofmann, M., Steinke, F., Scheel, V., Charpiat, G., Farquhar, J., Aschoff, P., Brady, M., Schölkopf, B., Pichler, B.J.: MRI-based attenuation correction for PET/MRI: a novel approach combining pattern recognition and atlas registration. Journal of Nuclear Medicine 49(11), 1875–1883 (2008)
Article Google Scholar
Hofmann, M., Pichler, B., Schölkopf, B., Beyer, T.: Towards quantitative PET/MRI: a review of MR-based attenuation correction techniques. European Journal of Nuclear Medicine and Molecular Imaging 36(suppl. 1), March 2009
Google Scholar
Iglesias, J.E., Konukoglu, E., Zikic, D., Glocker, B., Van Leemput, K., Fischl, B.: Is synthesizing MRI contrast useful for inter-modality analysis? In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013, Part I. LNCS, vol. 8149, pp. 631–638. Springer, Heidelberg (2013)
Chapter Google Scholar
Roy, S., Carass, A., Prince, J.: A compressed sensing approach for MR tissue contrast synthesis. In: Székely, G., Hahn, H.K. (eds.) IPMI 2011. LNCS, vol. 6801, pp. 371–383. Springer, Heidelberg (2011)
Chapter Google Scholar
Li, R., Zhang, W., Suk, H.-I., Wang, L., Li, J., Shen, D., Ji, S.: Deep learning based imaging data completion for improved brain disease diagnosis. In: Golland, P., Hata, N., Barillot, C., Hornegger, J., Howe, R. (eds.) MICCAI 2014, Part III. LNCS, vol. 8675, pp. 305–312. Springer, Heidelberg (2014)
Google Scholar
Menze, B.H., Jakab, A., Bauer, S., et al.: The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). IEEE Transactions on Medical Imaging (2014)
Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation Learning: A Review and New Perspectives. Technical report, Université de Montréal (2012)
Google Scholar
Hinton, G.E.: A Practical Guide to Training Restricted Boltzmann Machines. Technical report, University of Toronto (2010)
Google Scholar
Tieleman, T.: Training restricted Boltzmann machines using approximations to the likelihood gradient. In: ICML (2008)
Google Scholar
Bergstra, J., et al.: Theano: A CPU and GPU Math Compiler in Python. In: Proceedings of the Python for Scientific Computing Conference, SciPy (2010)
Google Scholar
Pedregosa, F., et al.: Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Biomedical Imaging Group Rotterdam, Erasmus MC University Medical Center, Rotterdam, The Netherlands
Gijs van Tulder & Marleen de Bruijne
Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
Marleen de Bruijne

Authors

Gijs van Tulder
View author publications
You can also search for this author in PubMed Google Scholar
Marleen de Bruijne
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

TU München, Garching, Germany
Nassir Navab
Lehrstuhl Informatik 5, University of Erlangen-Nuremberg, Erlangen, Germany
Joachim Hornegger
Brigham and Women's Hospital, Boston, Massachusetts, USA
William M. Wells
University of Sheffield, Sheffield, Suffolk, United Kingdom
Alejandro Frangi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

van Tulder, G., de Bruijne, M. (2015). Why Does Synthesized Data Improve Multi-sequence Classification?. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds) Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science(), vol 9349. Springer, Cham. https://doi.org/10.1007/978-3-319-24553-9_65

Download citation

DOI: https://doi.org/10.1007/978-3-319-24553-9_65
Published: 18 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24552-2
Online ISBN: 978-3-319-24553-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics