TY - JOUR
T1 - Identification of ZMYND19 as a novel biomarker of colorectal cancer: RNA-sequencing and machine learning analysis
AU - Khalili-Tanha, Ghazaleh
AU - Mohit, Reza
AU - Asadnia, Alireza
AU - Khazaei, Majid
AU - Dashtiahangar, Mohammad
AU - Maftooh, Mina
AU - Nassiri, Mohammadreza
AU - Hassanian, Seyed Mahdi
AU - Ghayour-Mobarhan, Majid
AU - Kiani, Mohammad Ali
AU - Ferns, Gordon A.
AU - Batra, Jyotsna
AU - Nazari, Elham
AU - Avan, Amir
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Colorectal cancer (CRC) is the third most common cause of cancer-related deaths. The five-year relative survival rate for CRC is estimated to be approximately 90% for patients diagnosed with early stages and 14% for those diagnosed at an advanced stages of disease, respectively. Hence, the development of accurate prognostic markers is required. Bioinformatics enables the identification of dysregulated pathways and novel biomarkers. RNA expression profiling was performed in CRC patients from the TCGA database using a Machine Learning approach to identify differential expression genes (DEGs). Survival curves were assessed using Kaplan–Meier analysis to identify prognostic biomarkers. Furthermore, the molecular pathways, protein–protein interaction, the co-expression of DEGs, and the correlation between DEGs and clinical data have been evaluated. The diagnostic markers were then determined based on machine learning analysis. The results indicated that key upregulated genes are associated with the RNA processing and heterocycle metabolic process, including C10orf2, NOP2, DKC1, BYSL, RRP12, PUS7, MTHFD1L, and PPAT. Furthermore, the survival analysis identified NOP58, OSBPL3, DNAJC2, and ZMYND19 as prognostic markers. The combineROC curve analysis indicated that the combination of C10orf2 -PPAT- ZMYND19 can be considered as diagnostic markers with sensitivity, specificity, and AUC values of 0.98, 1.00, and 0.99, respectively. Eventually, ZMYND19 gene was validated in CRC patients. In conclusion, novel biomarkers of CRC have been identified that may be a promising strategy for early diagnosis, potential treatment, and better prognosis.
AB - Colorectal cancer (CRC) is the third most common cause of cancer-related deaths. The five-year relative survival rate for CRC is estimated to be approximately 90% for patients diagnosed with early stages and 14% for those diagnosed at an advanced stages of disease, respectively. Hence, the development of accurate prognostic markers is required. Bioinformatics enables the identification of dysregulated pathways and novel biomarkers. RNA expression profiling was performed in CRC patients from the TCGA database using a Machine Learning approach to identify differential expression genes (DEGs). Survival curves were assessed using Kaplan–Meier analysis to identify prognostic biomarkers. Furthermore, the molecular pathways, protein–protein interaction, the co-expression of DEGs, and the correlation between DEGs and clinical data have been evaluated. The diagnostic markers were then determined based on machine learning analysis. The results indicated that key upregulated genes are associated with the RNA processing and heterocycle metabolic process, including C10orf2, NOP2, DKC1, BYSL, RRP12, PUS7, MTHFD1L, and PPAT. Furthermore, the survival analysis identified NOP58, OSBPL3, DNAJC2, and ZMYND19 as prognostic markers. The combineROC curve analysis indicated that the combination of C10orf2 -PPAT- ZMYND19 can be considered as diagnostic markers with sensitivity, specificity, and AUC values of 0.98, 1.00, and 0.99, respectively. Eventually, ZMYND19 gene was validated in CRC patients. In conclusion, novel biomarkers of CRC have been identified that may be a promising strategy for early diagnosis, potential treatment, and better prognosis.
UR - http://www.scopus.com/inward/record.url?scp=85164479297&partnerID=8YFLogxK
U2 - 10.1007/s12079-023-00779-2
DO - 10.1007/s12079-023-00779-2
M3 - Article
AN - SCOPUS:85164479297
SN - 1873-9601
VL - 17
SP - 1469
EP - 1485
JO - Journal of Cell Communication and Signaling
JF - Journal of Cell Communication and Signaling
IS - 4
ER -