Classification of the urinary metabolome using machine learning and potential applications to diagnosing interstitial cystitis

Main Article Content

Feng Tong
Muhammad Shahid
Peng Jin
Sungyong Jung
Won Hwa Kim
Jayoung Kim

Keywords

interstitial cystitis, biomarker, urine, metabolomics, machine learning, artificial algorithm

Abstract

With the advent of artificial intelligence (AI) in biostatistical analysis and modeling, machine learning can potentially be applied into developing diagnostic models for interstitial cystitis (IC). In the current clinical setting, urologists are dependent on cystoscopy and questionnaire-based decisions to diagnose IC. This is a result of a lack of objective diagnostic molecular biomarkers. The purpose of this study was to develop a machine learning-based method for diagnosing IC and assess its performance using metabolomics profiles obtained from a prior study. To develop the machine learning algorithm, two classification methods, support vector machine (SVM) and logistic regression (LR), set at various parameters, were applied to 43 IC patients and 16 healthy controls. There were 3 measures used in this study, accuracy, precision (positive predictive value), and recall (sensitivity). Individual precision and recall (PR) curves were drafted. Since the sample size was relatively small, complicated deep learning could not be done. We achieved a 76%–86% accuracy with leave-one-out cross validation depending on the method and parameters set. The highest accuracy achieved was 86.4% using SVM with a polynomial kernel degree set to 5, but a larger area under the curve (AUC) from the PR curve was achieved using LR with a l1-norm regularizer. The AUC was greater than 0.9 in its ability to discriminate IC patients from controls, suggesting that the algorithm works well in identifying IC, even when there is a class distribution imbalance between the IC and control samples. This finding provides further insight into utilizing previously identified urinary metabolic biomarkers in developing machine learning algorithms that can be applied in the clinical setting.

Metrics

Metrics Loading ...
Abstract 145 | HTML Downloads 38 PDF Downloads 86

References

1. Hanno P, Keay S, Moldwin R, Van Ophoven A. International Consultation on IC - Rome, September 2004/Forging an International Consensus: progress in painful bladder syndrome/interstitial cystitis. Report and abstracts. Int Urogynecol J Pelvic Floor Dysfunct. 2005;16 Suppl 1:S2-S34. PMID: 15883858
2. Nordling J, Anjum FH, Bade JJ, Bouchelouche K, Bouchelouche P, Cervigni M, et al. Primary evaluation of patients suspected of having interstitial cystitis (IC). Eur Urol. 2004;45(5):662-9. PMID: 15082211
3. Hanno PM, Burks DA, Clemens JQ, Dmochowski RR, Erickson D, Fitzgerald MP, et al. AUA guideline for the diagnosis and treatment of interstitial cystitis/bladder pain syndrome. J Urol. 2011;185(6):2162-70. PMID: 21497847
4. Urinology Think Tank Writing G. Urine: Waste product or biologically active tissue? Neurourol Urodyn. 2018;37(3):1162-8. PMID: 29464759
5. Wen H, Lee T, You S, Park SH, Song H, Eilber KS, et al. Urinary metabolite profiling combined with computational analysis predicts interstitial cystitis-associated candidate biomarkers. J Proteome Res. 2015;14(1):541-8. PMID: 25353990
6. Kind T, Cho E, Park TD, Deng N, Liu Z, Lee T, et al. Interstitial Cystitis-Associated Urinary Metabolites Identified by Mass-Spectrometry Based Metabolomics Analysis. Sci Rep. 2016;6:39227. PMID: 27976711
7. Shahid M, Lee MY, Yeon A, Cho E, Sairam V, Valdiviez L, et al. Menthol, a unique urinary volatile compound, is associated with chronic inflammation in interstitial cystitis. Sci Rep. 2018;8(1):10859. PMID: 30022124
8. Cahan EM, Hernandez-Boussard T, Thadaney-Israni S, Rubin DL. Putting the data before the algorithm in big data addressing personalized healthcare. NPJ Digit Med. 2019;2:78. PMID: 31453373
9. Tolles J, Meurer WJ. Logistic regression: relating patient characteristics to outcomes. Jama. 2016;316(5):533-4. PMID: 27483067
10. Cortes C, Vapnik V. Support-vector networks. Machine learning. 1995;20(3):273-97.
11. Platt J. Sequential minimal optimization: A fast algorithm for training support vector machines. 1998.
12. Lin Y, Lee Y, Wahba G. Support vector machines for classification in nonstandard situations. Machine learning. 2002;46(1-3):191-202.
13. Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological). 1996;58(1):267-88.
14. Ng AY, editor Feature selection, L 1 vs. L 2 regularization, and rotational invariance. Proceedings of the twenty-first international conference on Machine learning; 2004.
15. Vehtari A, Gelman A, Gabry J. Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and computing. 2017;27(5):1413-32.
16. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. 2019;25(1):24-9. PMID: 30617335
17. Miotto R, Wang F, Wang S, Jiang X, Dudley JT. Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform. 2018;19(6):1236-46. PMID: 28481991
18. Zampieri G, Vijayakumar S, Yaneske E, Angione C. Machine and deep learning meet genome-scale metabolic modeling. PLoS Comput Biol. 2019;15(7):e1007084. PMID: 31295267
19. Bordbar A, Monk JM, King ZA, Palsson BO. Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet. 2014;15(2):107-20. PMID: 24430943
20. Cuperlovic-Culf M. Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling. Metabolites. 2018;8(1). PMID: 29324649
21. Angermueller C, Parnamaa T, Parts L, Stegle O. Deep learning for computational biology. Mol Syst Biol. 2016;12(7):878. PMID: 27474269
22. Min S, Lee B, Yoon S. Deep learning in bioinformatics. Brief Bioinform. 2017;18(5):851-69. PMID: 27473064
23. Vamathevan J, Clark D, Czodrowski P, Dunham I, Ferran E, Lee G, et al. Applications of machine learning in drug discovery and development. Nat Rev Drug Discov. 2019;18(6):463-77. PMID: 30976107
24. Jing Y, Bian Y, Hu Z, Wang L, Xie XQ. Deep Learning for Drug Design: an Artificial Intelligence Paradigm for Drug Discovery in the Big Data Era. AAPS J. 2018;20(3):58. PMID: 29603063
25. Klauschen F, Muller KR, Binder A, Bockmayr M, Hagele M, Seegerer P, et al. Scoring of tumor-infiltrating lymphocytes: From visual estimation to machine learning. Semin Cancer Biol. 2018;52(Pt 2):151-7. PMID: 29990622
26. Baptista D, Ferreira PG, Rocha M. Deep learning for drug response prediction in cancer. Brief Bioinform. 2020. PMID: 31950132
27. Tolios A, De Las Rivas J, Hovig E, Trouillas P, Scorilas A, Mohr T. Computational approaches in cancer multidrug resistance research: Identification of potential biomarkers, drug targets and drug-target interactions. Drug Resist Updat. 2020;48:100662. PMID: 31927437
28. Wong NC, Lam C, Patterson L, Shayegan B. Use of machine learning to predict early biochemical recurrence after robot-assisted prostatectomy. BJU Int. 2019;123(1):51-7. PMID: 29969172
29. Madhukar NS, Khade PK, Huang L, Gayvert K, Galletti G, Stogniew M, et al. A Bayesian machine learning approach for drug target identification using diverse data types. Nat Commun. 2019;10(1):5221. PMID: 31745082
30. Saeed K, Rahkama V, Eldfors S, Bychkov D, Mpindi JP, Yadav B, et al. Comprehensive Drug Testing of Patient-derived Conditionally Reprogrammed Cells from Castration-resistant Prostate Cancer. Eur Urol. 2017;71(3):319-27. PMID: 27160946

Most read articles by the same author(s)