Abstract
Microwave-based breast cancer detection has been proposed as a complementary approach to compensate for some drawbacks of existing breast cancer detection techniques. Among the existing microwave breast cancer detection methods, machine learning-type algorithms have recently become more popular. These focus on detecting the existence of breast tumours rather than performing imaging to identify the exact tumour position. A key component of the machine learning approaches is feature extraction. One of the most widely used feature extraction method is principle component analysis (PCA). However, it can be sensitive to signal misalignment. This paper proposes feature extraction methods based on time-frequency representations of microwave data, including the wavelet transform and the empirical mode decomposition. Time-invariant statistics can be generated to provide features more robust to data misalignment. We validate results using clinical data sets combined with numerically simulated tumour responses. Experimental results show that features extracted from decomposition results of the wavelet transform and EMD improve the detection performance when combined with an ensemble selection-based classifier.
Similar content being viewed by others
References
Abdelnour AF, Selesnick IW (2001) Nearly symmetric orthogonal wavelet bases. In: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp 1–6. IEEE, Salt Lake City, USA
Ahmed N, Natarajan T, Rao KR (1974) Discrete cosine transform. IEEE Trans Comput 100(1):90–93
AlShehri SA, Khatun S, Jantan AB, Raja Abdullah RSA, Mahmud R, Awang Z (2011) Experimental breast tumor detection using NN-based UWB imaging. Prog Electromagn Res 111:447–465
Bajaj V, Pachori RB (2012) Classification of seizure and nonseizure EEG signals using empirical mode decomposition. IEEE Trans Inf Technol Biomed 16(6):1135–1142
Bourqui J, Sill JM, Fear EC (2012) A prototype system for measuring microwave frequency reflections from the breast. Int J Biomed Imaging 2012:9
Byrne D, Craddock I (2015) Time-domain wideband adaptive beamforming for radar breast imaging. IEEE Trans Antennas Propag 63(4):1725–1735
Byrne D, O’Halloran M, Glavin M, Jones E (2011) Breast cancer detection based on differential ultrawideband microwave radar. Prog Electromagn Res M (PIER-M) 20:231–242
Caorsi S, Gragnani GL, Pastorino M (1993) Reconstruction of dielectric permittivity distributions in arbitrary 2-D inhomogeneous biological bodies by a multiview microwave numerical method. IEEE Trans Med Imaging 12 (2):232–239
Chew HG, Bogner RE, Lim CC (2001) Dual ν-support vector machine with error rate and training size biasing. In: International conference on acoustics, speech, and signal processing (ICASSP). Salt Lake City, USA, pp 1269–1272
Conceicao RC, Medeiros H, O’Halloran M, Rodriguez-Herrera D, Flores-Tapia D, Pistorius S (2013) Initial classification of breast tumour phantoms using a UWB radar prototype. In: Proceedings of International Conference Electromagnetics in Advance Application (ICEAA), pp. 720–723. Singapore
Conceição RC, Medeiros H, O’Halloran M, Rodriguez-Herrera D, Flores-Tapia D, Pistorius S (2014) SVM-based classification of breast tumour phantoms using a UWB radar prototype system. In: Proceeding URSI general assembly science symposium (GASS), pp 1–4. Beijing, China
Daubechies I (1992) Ten lectures on wavelets. SIAM
Davis SK, Tandradinata H, Hagness S, Van Veen BD (2005) Ultrawideband microwave breast cancer detection: a detection-theoretic approach using the generalized likelihood ratio test. IEEE Trans Biomed Eng 52 (7):1237–1250
Davis SK, Van Veen BD, Hagness S, Kelcz F (2008) Breast tumor characterization based on ultrawideband microwave backscatter. IEEE Trans Biomed Eng 55(1):237–246
Debye PJW (1929) Polar molecules. The Chemical Catalog CO., New York
Haar A (1910) Zur theorie der orthogonalen funktionensysteme. Math Ann 69(3):331–371
Hagness S, Leininger K, Booske J, Okoniewski M (2000) Dielectric characterization of human breast tissue at microwave frequencies. In: Proceedings 2nd international world congress microwave and radio frequency processing. Orlando, FL
Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen NC, Tung CC, Liu HH (1998) The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. Proc Roy Soc London A Math, Phys Eng Sci 454:903–995
Kingsbury N (2001) Complex wavelets for shift invariant analysis and filtering of signals. Appl Comput Harmon Anal 10(3):234–253
Klemm M, Craddock I, Leendertz J, Preece A, Gibbins D, Shere M, Benjamin R (2010) Clinical trials of a UWB imaging radar for breast cancer. In: European conference antennas propagation (EuCAP), pp 1–4, Barcelona, Spain
Kosmas P, Rappaport CM (2006) FDTD-based time reversal for microwave breast cancer detection-localization in three dimensions. IEEE Trans Microw Theory 54(4):1921–1927
Li S, Zhou W, Yuan Q, Geng S, Cai D (2013) Feature extraction and recognition of ictal EEG using EMD and SVM. Comput Biol Med 43(7):807–816
Li X, Hagness S (2001) A confocal microwave imaging algorithm for breast cancer detection. IEEE Microw Wirel Compon 11(3):130–132
Li Y, Porter E, Santorelli A, Popović M, Coates M (2017) Microwave breast cancer detection via cost-sensitive ensemble classifiers: phantom and patient investigation. Biomed Signal Process Control 31:366–376
Li Y, Santorelli A, Laforest O, Coates M (2015) Cost-sensitive ensemble classifiers for microwave breast cancer detection. In: International conference on acoustics, speech, and signal processing (ICASSP). Brisbane, Australia
Lim HB, Nhung NTT, Li EP, Thang ND (2008) Confocal microwave imaging for breast cancer detection: delay-multiply-and-sum image reconstruction algorithm. IEEE Trans Biomed Eng 55(6):1697–1704
Mallat S (2009) A wavelet tour of signal processing (Third Edition), 3rd edn. Academic Press, Boston
Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Machi Intell 11(7):674–693
Meaney PM, Fanning MW, Raynolds T, Fox CJ, Fang Q, Kogel CA, Poplack SP, Paulsen KD (2007) Initial clinical experience with microwave breast imaging in women with normal mammography. Acad Radiol 14(2):207–218
O’Halloran M, Jones E, Glavin M (2010) Quasi-multistatic MIST beamforming for the early detection of breast cancer. IEEE Trans Biomed Eng 57(4):830–840
Porter E, Coates M, Popović M. (2016) An early clinical study of time-domain microwave radar for breast health monitoring. IEEE Trans Biomed Eng 63(3):530–539
Porter E, Kirshin E, Santorelli A, Coates M, Popovic M (2013) Time-domain multistatic radar system for microwave breast screening. IEEE Antennas Wirel Propag 12:229–232
Porter E, Kirshin E, Santorelli A, Popović M (2016) Microwave breast screening in the time domain: identification and compensation of measurement-induced uncertainties. Prog Electromagn Res B (PIER-B) 55:115–130
Canadian Cancer Society’s Advisory Committee on Cancer Statistics (2015) Canadian cancer statistics 2015. Canadian Cancer Society, Toronto
Santorelli A, Kirshin E, Porter E, Popović M, Schwartz J (2013) Improved calibration for an experimental time-domain microwave imaging system. In: European conference on antennas and propagation (EuCAP). Gothenburg, Sweden
Scott C (2007) Performance measures for Neyman-Pearson classification. IEEE Trans Inf Theory 53:2852–2863
Selesnick IW, Baraniuk RG, Kingsbury NC (2005) The dual-tree complex wavelet transform. IEEE Signal Process Mag 22(6):123–151
Shea JD, Kosmas P, Hagness S, Van Veen BD (2010) Three-dimensional microwave imaging of realistic numerical breast phantoms via a multiple-frequency inverse scattering technique. Med Phys 37(8):4210–4226
Sohn VY, Arthurs ZM, Sebesta JA, Brown TA (2008) Primary tumor location impacts breast cancer survival. The Am J Surg 195(5):641–644
Subasi A (2007) EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst Appl 32(4):1084–1093
Sugitani T, Kubota SI, Kuroki SI, Sogo K, Arihiro K, Okada M, Kadoya T, Hide M, Oda M, Kikkawa T (2014) Complex permittivities of breast tumor tissues obtained from cancer surgeries. Appl Phys Lett 104(25):253,702
Thomas M, Das MK, Ari S (2015) Automatic ECG arrhythmia classification using dual tree complex wavelet based features. AEU-Int J Electron Commun 69(4):715–721
Tzanetakis G, Essl G, Cook P (2001) Audio analysis using the discrete wavelet transform. In: Proceedings conference acoustics music theory application skiathos, Greece
Acknowledgements
Hongchao Song was sponsored by the China Scholarship Council Project of the National Nature Science Foundation of China No. 61671077 and No. 61671264, and the Postgraduate Innovation Fund of SICE, BUPT, 2015. Yunpeng Li was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC 260250). The authors gratefully acknowledge the McGill Microwave Breast Cancer Detection group, led by Prof. Milica Popović and Prof. Mark Coates, for providing experimental data and advice.
Author information
Authors and Affiliations
Corresponding author
Appendix:
Appendix:
1.1 A.1 Tumour response simulation
We adopted the strategy outlined in [24] to simulate tumour responses for each volunteer, based on the transmitted pulses from the antennas and the dielectric properties of breast tissue. For one antenna pair and a tumour position p 0, the frequency domain representation of the tumour response R t(p 0,ω) was modelled as:
where R(ω) is the frequency domain representation of received signal, \(d^d_{im}\) and \(d^d_{br}\) are the lengths of the direct path for this antenna pair through the immersion medium (ultrasound gel) and breast tissue, respectively. \(d^t_{im}\) and \(d^t_{br}\) are the lengths of the shortest path between the antenna pair via the tumour position p 0, in the immersion medium and the breast tissue, respectively. Γ is a constant that can be used to introduce additional attenuation in the tumour response. In this paper, we concentrate on the case Γ = 1. k i m and k b r are the wavenumbers for the immersion medium and breast tissue, respectively, and these have the following expressions:
Here, 𝜖 i m is the relative permittivity of the immersion medium, c is the speed of light, and 𝜖 b r is the average breast tissue relative permittivity. The latter is specified by the Debye model [15]:
where 𝜖 0 = 8.854 × 10−12 F/m is the permittivity of free space, and 𝜖 ∞ is the dielectric constant of the material at infinite frequency. Δ𝜖 = 𝜖 s − 𝜖 ∞ , and 𝜖 s is the static dielectric constant. The pole relaxation constant is τ and the static conductivity is σ s . The model parameters are chosen to approximate the dielectric properties of breast tissue.
We refer readers to [24] for a more complete description and discussion of the tumour response simulation procedure. We notice that when Γ = 1 and the tumour is close to the direct path between the transmitting antenna and the receiving antenna, the amplitude of simulated tumour response may be too large. We introduce a hard threshold to constrain the ratio of the amplitude between the tumour response and the received signal.
We do stress that this is not a model that assumes homogeneous breast tissue. By including the received signal R(w), rather than the transmitted signal, the model incorporates distortions and delays caused by inhomogeneous tissue. In Fig. 11, we present one example of a tumour response-injected signal.
1.2 A.2 Parameter values
As discussed in [24], signals collected by the antenna pairs located on the opposite sides of the breast can be highly distorted and vary significantly among different volunteer visits. Thus, we discarded signals from any antenna pair whose median peak amplitude of all the training data is less than a threshold of 20 mV. The length of original signal is 1024 and the sample rate is 40GHz [31]. We first windowed data from 60 samples before the peak amplitude of measurement from antenna pair A1A2 measurement. Since antenna A1 and A2 are one of the closest antenna pairs, we do not expect any tumour responses to occur before the peak amplitude of the measurement from A1A2. Scans from the same antenna pair were then aligned based on maximal correlations with a reference scan and were windowed between the 61st sample to 604th sample, corresponding to an actual time period of 13.6 ns. A window of 544 samples allows the discrete wavelet transform of 5 levels and covers all regions of significant tumour responses as shown in [24]. The maximum ratio of the amplitudes between the simulated tumour response and the received signal was set to 0.3.
The candidate values of the 2ν-SVM hyper-parameters used for cross validation are listed in Table 4. We tested the detection performance with the γ value chosen from the candidate set γ = {2−15, 2−13,..., 25} using a small subset of the data, and observed that the ensemble classifier almost always chose the γ values of {2−1, 21}. We further tested a range of fixed γ values between 2−2 and 22 and observed that they lead to similar performance. To reduce the computational cost during training, we set γ = 1 for the experiments detailed here.
Thus, there are 1 × 18 × 18 = 324 different 2ν-SVM hyper-parameter combinations. These were used to produce a model library consisting of M × 324 base models, where M is the number of retained antenna pairs in each data set. The ensemble classifier selected 100 base models, choosing those with the smallest Neyman-Pearson measure when applied to the training data, to perform classification on the test data.
The mother wavelet used in the DWT was the Daubechies 5 (db5) [12]. The filter used in the first step of the DTCWT was the Farras filter [1], and Kingsbury Q-shift filters[19] with six taps were used for subsequent stages. 5-level decomposition was performed for DWT and DTCWT as we observed that the magnitudes of the decomposition results at higher levels were too small to contain useful information. This leads to 24 features for each antenna pair as we compute statistics for outputs from both high-pass filter and low-pass filter in the last decomposition level. We performed 5-level EMD decomposition for the same reason, and this leads to 20 EMD-based features for one antenna pair.
The magnitudes of the features derived from decomposition results from higher levels were often significantly smaller in magnitude. For the measurements which were collected by the same antenna pair, we rescaled each feature of the training data into the range [0, 1]. The testing data was scaled by the same ratio. The process was repeated for each antenna pair.
Rights and permissions
About this article
Cite this article
Song, H., Li, Y. & Men, A. Microwave breast cancer detection using time-frequency representations. Med Biol Eng Comput 56, 571–582 (2018). https://doi.org/10.1007/s11517-017-1712-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-017-1712-0