Non-invasive bladder cancer detection: identification of a urinary volatile biomarker panel using GC-MS metabolomics and machine learning

Â. Carapito*, V. S. Fernandes Ferreira, A. C. Silva Ferreira, A. Teixeira-Marques, R. Henrique, C. Jerónimo, A. C. A. Roque, F. Carvalho, J. Pinto, P. Guedes de Pinho*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Downloads

Abstract

Early detection of bladder cancer (BC) remains a major clinical challenge due to the limitations of current diagnostic methods, which are often invasive, expensive, or insufficiently sensitive, particularly for early-stage disease. Metabolomics approaches, when integrated with machine learning (ML) techniques, offer a powerful platform for identifying novel, non-invasive biomarkers. In this study, urinary volatile organic compounds (VOCs) were analysed from 87 BC patients and 90 age- and sex-matched cancer-free controls using headspace solid-phase microextraction coupled with gas chromatography–mass spectrometry (HS-SPME/GC-MS). Of the 90 VOCs identified, 27 were selected and used to train five ML algorithms—random forest (RF), support vector machine (SVM), partial least squares-discriminant analysis (PLS-DA), extreme gradient boosting (XGBoost), and k-nearest neighbors (k-NN). Model performance was evaluated using cross-validation and an independent validation set, with metrics including area under the curve (AUC), sensitivity, specificity, and accuracy. RF achieved the highest performance using all 27 features (AUC = 0.913; sensitivity, specificity, and accuracy = 85 %). After feature selection, an eight-VOC panel improved performance on the validation set (AUC = 0.872; sensitivity = 89 %, specificity = 92 %, accuracy = 91 %). The panel included ketones, aldehydes, a short fatty alcohol, and a phenol compound—seven elevated in BC, and one (acetone) decreased. This panel outperformed FDA-approved urinary assays and closely matched the specificity of urine cytology. These findings underscore the promise of VOC-based urinary biomarkers, in combination with ML, for the non-invasive detection of BC. Further large-scale validation studies are essential to confirm diagnostic utility and enable clinical translation.
Original languageEnglish
Article number128749
Number of pages10
JournalTalanta
Volume297
DOIs
Publication statusPublished - 1 Jan 2026

Keywords

  • Bladder cancer
  • Early detection
  • Gas chromatography-mass spectrometry
  • Machine learning
  • Metabolomics
  • Urinary biomarkers
  • Volatile organic compounds

Fingerprint

Dive into the research topics of 'Non-invasive bladder cancer detection: identification of a urinary volatile biomarker panel using GC-MS metabolomics and machine learning'. Together they form a unique fingerprint.

Cite this