Raman spectroscopy and machine learning for the classification of breast cancers

Lihao Zhang, Chengjian Li, Di Peng, Xiaofei Yi, Shuai He, Fengxiang Liu, Xiangtai Zheng, Wei E. Huang, Liang Zhao*, Xia Huang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

91 Citations (Scopus)

Abstract

Breast cancer is a major health threat for women. The drug responses associated with different breast cancer subtypes have obvious effects on therapeutic outcomes; therefore, the accurate classification of breast cancer subtypes is critical. Breast cancer subtype classification has recently been examined using various methods, and Raman spectroscopy has emerged as an effective technique that can be used for noninvasive breast cancer analysis. However, the accurate and rapid classification of breast cancer subtypes currently requires a great deal of effort and experience with the processing and analysis of Raman spectra data. Here, we adopted Raman spectroscopy and machine learning techniques to simplify and accelerate the process used to distinguish normal from breast cancer cells and classify breast cancer subtypes. Raman spectra were obtained from cultured breast cancer cell lines, and the data were analyzed by two machine learning algorithms: principal component analysis (PCA)–discriminant function analysis (DFA) and PCA–support vector machine (SVM). The accuracies with which these two algorithms were able to distinguish normal breast cells from breast cancer cells were both greater than 97%, and the accuracies of breast cancer subtype classification for both algorithms were both greater than 92%. Moreover, our results showed evidence to support the use of characteristic Raman spectral features as cancer cell biomarkers, such as the intensity of intrinsic Raman bands, which increased in cancer cells. Raman spectroscopy combined with machine learning techniques provides a rapid method for breast cancer analysis able to reveal differences in intracellular compositions and molecular structures among subtypes.

Original languageEnglish
Article number120300
JournalSpectrochimica Acta - Part A: Molecular and Biomolecular Spectroscopy
Volume264
DOIs
Publication statusPublished - 21 Aug 2021

Keywords

  • Breast cancer
  • Cancer diagnosis
  • Cancer subtype classification
  • Machine learning
  • Raman spectroscopy

Cite this