Abstract
SARIFA (Stroma AReactive Invasion Front Areas) has recently emerged as a promising histopathological biomarker for colon and gastric cancer. To elucidate the underlying tumor biology, we assessed SARIFA-status in tissue specimens from The-Cancer-Genome-Atlas (TCGA) cohorts COAD (colonic adenocarcinoma) and READ (rectal adenocarcinoma). For the final analysis, 207 CRC patients could be included, consisting of 69 SARIFA-positive and 138 SARIFA-negative cases. In this external validation cohort, H&E-based SARIFA-positivity was strongly correlated with unfavorable overall, disease-specific, and progression-free survival, partly outperforming conventional prognostic factors. SARIFA-positivity was not associated with known high-risk genetic profiles, such as BRAF V600E mutations or microsatellite-stable status. Transcriptionally, SARIFA-positive CRCs exhibited an overlap with CRC consensus molecular subtypes CMS1 and CMS4, along with distinct differential gene expression patterns, linked to lipid metabolism and increased stromal cell infiltration scores (SIIS). Gene-expression-based drug sensitivity prediction revealed a differential treatment response in SARIFA-positive CRCs. In conclusion, SARIFA represents the H&E-based counterpart of an aggressive tumor biology, demonstrating a partial overlap with CMS1/4 and also adding a further biological layer related to lipid metabolism. Our findings underscore SARIFA-status as an ideal biomarker for refined patient stratification and novel drug developments, particularly given its cost-effective assessment based on routinely available H&E slides.
Similar content being viewed by others
Background
With more than 1.8 million new cases every year, colorectal cancer (CRC) is the third most common cancer, contributing extensively to the global burden of disease [1]. Even though the 5-year overall survival (OS) rate is approximately 90% for early stages, this rate rapidly decreases for later stages [2].
In clinical practice, prognosis evaluation and treatment guidance in CRC patients is based on the American Joint Committee on Cancer/Union for International Cancer Control/Tumor Node Metastasis (AJCC/UICC/TNM) classification. Besides that, additional prognostic factors such as grade, tumor budding, KRAS and BRAF mutational status, and mismatch repair (MMR) or microsatellite status have been established. However, TNM staging as well as other routinely used biomarkers are still not fully sufficient to predict the survival of stage II and III patients as these are associated with extremely divergent patient outcomes. For example, stage IIIA patients do have a better prognosis than stage IIB/IIC patients, which is described as the âsurvival paradoxâ [3]. Considering that adjuvant therapy with fluoropyrimidineâ+âoxaliplatin-based chemotherapy is recommended following surgical treatment for a really heterogeneous group of stage III as well as high-risk stage II patients [4], it is evident that there is still a lack of biomarkers sufficiently stratifying CRC patients, leading to under- or overtreatment of some patients.
Recently, gene-expression-based approaches such as consensus molecular subtyping (CMS) [5] or CINSARC signatures [6], that have impressively been shown to correlate with prognosis and partly even outperform conventional TNM staging, have been established. However, these RNA-sequencing-based methods are not easily applicable and time- as well as cost-intensive, and therefore have not found their way into the daily clinical routine yet.
To meet this urgent need for new robust and easy-to-implement biomarkers in CRC, we recently introduced Stroma AReactive Invasion Front Areas (SARIFA) as hematoxylin and eosin (H&E) based negative predictor in colon [7] as well as gastric cancer [8]. SARIFA, which is defined as the direct contact between tumor cells and adipocytes at the invasion front, shows a low interobserver variability and can be assessed fast and easily on routine H&E slides. Hence, there is no need for further testing (via e.g., immunohistochemistry or sequencing), and hence no delay in turnaround time and no additional costs (except for the pathologistâs effort).
Recently, our group provided the first evidence that SARIFA is associated with tumor-promoting tumor-adipocyte interaction [8] as well as deleterious immunologic alterations [9]. Upregulation of proteins associated with fatty acid metabolism such as FABP4 and CD36 in tumor cells at the invasive margin seems to be a key feature of SARIFAs [8], which is already known to contribute to the invasive and metastatic potential of colon cancer cells [10, 11], and could be a potential innovative therapeutic target in SARIFA-positive CRCs [12,13,14].
Besides numerous experimental studies highlighting the major role of adipocytes and lipids in cancer progression [15, 16], two independent deep-learning models have just recently identified tumor cell/adipocyte co-localization, as important. This is similar to what we define as SARIFA and has so far been an underappreciated morphological feature associated with a worse prognosis in CRC [17, 18].
Hence, it is reasonable that our H&E-based SARIFA classification represents an aggressive tumor biology, which is characterized by a distinct tumor-adipocyte interaction, potentially caused by immunologic dysregulation.
Therefore, this study aims to validate the prognostic relevance of SARIFA in CRC on The-Cancer-Genome-Atlas (TCGA) colonic (COAD) and rectal adenocarcinoma (READ) cohorts and to provide the first in-depth molecular characterization of SARIFA-positive CRCs in a well-characterized, publicly available external cohort [19].
Methods and materials
Ethics statement
The experiments in this study are in compliance with the Declaration of Helsinki and the International Ethical Guidelines for Biomedical Research Involving Human Subjects by the Council for International Organizations of Medical Sciences (CIOMS). The study has been carried out according to the âTransparent reporting of a multivariable prediction model for individual prognosis or diagnosisâ (TRIPOD) statement [20]. The overall analysis in this study has been approved by the Ethics Board at the Medical Faculty of Technical University Dresden (BO-EK-444102022). The patient sample collection in each cohort was separately approved by the respective institutional ethics boards.
Data acquisition
Whole slide images (WSI) were obtained from https://portal.gdc.cancer.gov/ for nâ=â627 colorectal (CRC) patients from TCGA cohorts COAD (colonic adenocarcinoma) and READ [19]. Patients were then histopathologically screened regarding their SARIFA-status by a pathologist with plenty of experience in assessing SARIFA-status (first author, NGR). In total, nâ=â215 could be classified into SARIFA-positive and SARIFA-negative. Molecular data are available at https://www.cbioportal.org/ for the TCGA PanCancerAtlas [21, 22]. Additional data on the datasets were partly retrieved and are available from Liu et al. [23], from Thorsson et al. [24] as well as from Malta et al. [25].
Assessment of SARIFA-status
As the digital slides at https://portal.gdc.cancer.gov and http://www.cbioportal.org/ are consistently scalable and therefore also have been used for other morphologic characterization studies [26], SARIFA-status could be reliably assessed when diagnostic WSI with the complete intestinal wall and not only superficial tumor parts were available (nâ=â215). SARIFA-positivity was defined according to our previous studies in CRCs [7, 9] as the presence of an area within the tumor invasion front where at least a single tumor gland or group of â¥5 tumor cells are directly adjacent to adipocytes (SARIFA-positive, Fig. 1) without intervening stromal reaction or inflammatory infiltrate (SARIFA-negative, Fig. 1). If a single SARIFA was present, the case was classified as SARIFA-positive. All the cases were classified by NGR. Clinical and molecular data (TCGA PanCancerAtlas) of nâ=â207 classified cases were partly analyzed and could be retrieved from http://www.cbioportal.org/ [19, 21, 22].
Statistical analysis and experimental design
Chi-squared tests were used for hypothesis testing of differences between relative frequencies. Continuous variables were compared using the Wilcoxon rank-sum test. Estimates of KaplanâMeier survival rates were compared using log-rank tests. The median follow-up was calculated using the reverse Kaplan-Meier method [27]. Relative risks were estimated by hazard ratios (HRs), obtained by Cox proportional hazard models. For genomic alterations, mRNA and protein expression, methylation data, and microbiome signatures, q-values are reported to incorporate multiple testing corrections (using a false discovery rate detection approach). Cramérâs V was also reported for association between nominal variables. p-Valuesâ<â0.05 were considered statistically significant and are reported as follows: *pâ<â0.05, **pâ<â0.01, ***pâ<â0.001, ****pâ<â0.0001. Statistical analyses were performed within http://www.cbioportal.org/ and by using R (v4.2.2; R Foundation for Statistical Computing, Vienna, Austria) or SPSS for Windows, version 24 (IBM, Armonk, NY, USA). The study design is visualized in detail in Fig. 1.
Gene and protein expression analysis
For gene expression analysis, the batch-normalized RNA-seq data generated with RSEM [28] from Illumina HiSeq_RNASeq_V2 (ârna_seq_v2_mrnaâ) data was accessed from TCGA over cBioPortal. Gene expression data were available for 207/215 (96%) TCGA samples with known SARIFA-status. Missing values were replaced with zero counts. Among the samples, 11 samples could not be considered because of negative counts due to the batch correction. Differential expression analysis was performed with DESeq2 v1.36.0 [29] with counts rounded to an integer and Wald test without covariates or with sex included as a covariate in the linear model. CMS subtyping of samples was performed with CMScaller v2.0.1 [30] with the raw counts and with Entrez gene identifiers, 5000 permutations (random seed 31415), and an FDR (false discovery rate) threshold of 0.05. In both groups, 12â14% of the samples could not be confidently assigned a CMS subtype. Functional gene set analysis was performed with GO (Panther v16.0 with GO db as of 12/2021) [31], ShinyGo v0.77 [32], and GSEA (gene set enrichment analysis) v4.3.2 [33]. Gene and protein expression data were visualized with ggplot2 v3.4.2 [34] and networkD3 v0.4. Protein expression was accessed as processed RPPA (reverse phase protein assay) values from cBioPortal and visualized with ggplot2. Drug response predictions based on gene expression were established by deploying oncoPredict [35] with training data from the GDSC2 database [36]. The prediction model was run primarily with default settings (https://cran.r-project.org/web/packages/oncoPredict/oncoPredict.pdf).
Availability of molecular and image data
The dataset(s) supporting the conclusions of this article are included within the article (and its additional files). Molecular and image data are publicly available at https://portal.gdc.cancer.gov/ and https://www.cbioportal.org/.
SARIFA-status of the cases can also be found in detail in Additional File 1.
Results
SARIFA is associated with poor outcomes in TCGA-CRC
In our classified TCGA-CRC cases (nâ=â207), overall 69 patients (33.3%) presented with SARIFA-positive CRC. While age at diagnosis, patient weight, and length of follow-up did not differ based on SARIFA-status, SARIFA-positivity in primary CRC tumor tissue was significantly associated with adverse features such as higher pT stage, lymph node and distant metastasis, higher AJCC stage, occurrences of deaths and new neoplasms post initial therapy (each p-value at least <0.01, chi-squared test). Regarding MSI (microsatellite instability), BRAF status, and TCGA subtypes based on driver mutations, no SARIFA-dependent differences were evident. Clinicopathological features of the cohort with regard to SARIFA-status are displayed in detail in Table 1.
Next, we studied survival endpoints using KaplanâMeier analysis (Fig. 2AâF). Here, SARIFA-positive CRC patients were characterized by a significantly decreased OS, progression-free survival (PFS), and disease-specific survival (DSS) throughout all T-stages and also when only considering locally advanced T3/T4 CRCs (each p-value at least **<0.01, log-rank test). In particular, KaplanâMeier curves regarding PFS separated impressively (median months PFS: SARIFA-positive: 22.62 [19.04âNA], SARIFA-negative: not reached).
To assess the value of SARIFA-status in comparison to other prognostically relevant factors, we performed Cox regression analysis (Additional File 2) including parameters such as age (>65 years), gender, pT, pN, and M status, AJCC stage and MSI scores (Mantis & Sensor). Upon univariate analysis regarding OS, higher age, presence of distant metastasis, higher AJCC stage, and SARIFA-positivity were significantly associated with worse outcomes, with distant metastasis and SARIFA-positivity having the highest HRs of 3.12 and 2.43, respectively (HR: distant metastasis 95% CI: 1.64â5.97, ***<0.001; SARIFA 95% CI: 1.411â4.212). Regarding DSS and PFS, SARIFA-positivity was also significantly associated with adverse outcomes (DSS, HR: 4.58 95% CI: 1.82â11.50, pâ=â0.001; PFS, HR: 3.66 95% CI: 2.13â6.30, ***pâ<â0.001). Consecutively, we performed multivariate regression analysis, including only the parameters that were statistically significant in univariate analysis (Fig. 2GâI). Here, SARIFA-positivity remained highly prognostic with regards to all three endpoints (OS, HR: 2.5 95% CI: 1.22â4.14, **pâ=â0.009; DSS, HR 3.87 95% CI: 1.22â12.31 *pâ=â0.022; PFS 3.56 95% CI: 1.89â6.69, ***pâ<â0.001). With SARIFA-status showing consistently higher HRs (however, with partly overlapping CIs) than conventional biomarkers, we confirm SARIFA as possibly superior, solely H&E-based biomarker that potentially outperforms prognostic biomarkers that are currently used to guide treatment decisions in CRC patients.
SARIFA is not associated with distinct genetic alterations
As survival outcomes differ dramatically between SARIFA-positive and SARIFA-negative CRCs, we aimed to explore if genetic alterations drive these differences and took the in-depth characterization published by Liu et al. in their landmark study on molecular characteristics of gastrointestinal adenocarcinomas into account [23]. In line with our previous findings based on immunohistochemistry-based TCGA molecular subtyping in GC [8] or on small next-generation-sequencing (NGS) panel approaches in CRC [9], we could not find any significant SARIFA-dependent differences on the genomic level (no significant sample-level enrichments, no significant differences in tumor mutational burden, fraction genome altered or aneuploidy score; no significant differences in indel mutation density, SNV (single nucleotide variant) mutation density and total mutation density as well as the fraction of genome with subclonal SCNAs (somatic copy number alterations) and duplicated alleles, all pâ>â0.05). Genomic alterations regarding the most relevant genes in CRC are paradigmatically visualized as Oncoprint in Fig. 3. In particular, SARIFA-positivity was not associated with known high-risk features such as BRAF V600E mutations (SARIFA-positive 10.4% vs. 8.0% SARIFA-negative, pâ=â0.27) or MSS (MSI Sensor score as well as MSI Mantis score, pâ>â0.05; MSI, microsatellite instable; MSS microsatellite stable). Regarding molecular subtypes (CIN, GS, MSI, and POLE) and hypermethylation category (CIMP-H, CIMP-L, and non-CIMP; CIMP: CpG island methylator phenotype), also no significant differences between SARIFA-positive and SARIFA-negative CRCs could be observed (molecular subtypes: pâ=â0.650, hypermethylation category: pâ=â0.441).
Analyzing the data within cBioPortal [21, 22], we could not observe any differences regarding DNA methylation data, and only one bacteria species (Sutterella ssp.) was observed, which seems upregulated in SARIFA-positive cases (****pâ=â6.0009eâ6, qâ=â8.448eâ3).
SARIFA displays a characteristic gene expression signature and differential protein expression
In contrast to genetic alterations, SARIFA-status in CRCs was associated with distinct changes in gene expression on the mRNA and protein levels. Differential gene expression analysis with the transcriptome profiles of the 196 CRC samples with available transcriptomic data and SARIFA-status revealed a broad dysregulation of gene expression (1896 genes/~9.6% with qâ<â0.05 and no LFC threshold; 731 genes/~3% with qâ<â0.01 and no LFC threshold; LFC: log fold change), with the major proportion of differentially expressed genes (approximately two-thirds) up-regulated in SARIFA-positive cases (Fig. 3C). Differential gene expression was similar when adjusting for sex (803 genes, ~4.1% with qâ<â0.05 and no LFC threshold, sex information available in 189 cases), while we did not find significant differential gene expression in a random sample permutation control with balanced SARIFA-negative and SARIFA-positive cases (33 genes, ~0.1% with qâ<â0.05 and no LFC threshold, mean of the results of three random permutations with equal SARIFA proportions and similar sex proportion as in the SARIFA analysis). Detailed results of differential gene expression analysis can be found in the supplementary (Additional File 3). Gene ontology enrichment analysis revealed enrichment of extracellular matrix, proteoglycans, and signaling pathways (Additional File 4). 27 genes were significantly up or downregulated more than 1.5-fold (22 up/5 down, Waldâs test against a null hypothesis of 0.585 LFC), among which were FABP4 and CD36, which we previously identified as differentially expressed at SARIFAs in gastric cancer [8]. These 27 genes showed significantly enriched molecular interaction (PPI enrichment p-value: 3.23eâ11, STRING db), expression in adipocytes and adipose tissue (TISSUES db), association with the extracellular space (Gene ontology and COMPARTMENTS db) and with the PPAR/AMPK signaling and adipocyte signaling pathways (GO, KEGG and WikiPathways db) (Fig. 4C), suggesting dysregulation of a functional network in SARIFA-positive CRCs. Interestingly, differential protein abundance analysis with reverse phase protein assay (RPPA) data demonstrated several differentially abundant proteins, which are also associated with the extracellular matrix (Fig. 3D).
Next, we investigated different gene-expression-based signatures and found several specific characteristics of SARIFA-positive CRCs. SARIFA-positive cases were associated with lower stemness (*pâ=â0.04); although there was no difference for DNA-methylation-based stemness (pâ=â0.71). Furthermore, SARIFA-positive CRCs were associated with higher stromal cell infiltration intensity (SIIS) scores [37] (Fig. 4B, ***pâ<â0.01, MannâWhitney U-test). We also investigated if SARIFA-status is associated with distinct consensus molecular subtypes (CMS) [5] based on their RNA-expression profile. CMS profiles could be confidently assigned to 180/207 samples (87%). The distribution of the CMS profiles in SARIFA-positive cases differed significantly from SARIFA-negative cases (**pâ<â0.01, hypothesis test of equal population proportions, Cramérâs V Ïcâ=â0.29), with the relative proportion of CMS4 and CMS1 increased in SARIFA-positive cases (CMS4: 53% vs. 31%; CMS1: 24% vs. 15%) and CMS2 and CMS3 decreased in SARIFA-positive cases (CMS2: 12% vs. 31%; CMS3: 12% vs. 22%) (Fig. 4A). Thus, SARIFA-positivity was significantly associated with CMS1 and CMS4 molecular subtypes, but did not strongly overlap with the CMS subtyping of colorectal tumor samples.
Besides showing enrichment of CMS1 (immune), SARIFA-positive CRCs also displayed a significantly increased expression of CD274 (PD-L1; LFC 0.91, ****pâ<â0.0001, q-valueâ=â0.0032). Both findings could be relevant for immunotherapeutic approaches, and strengthen our previous finding of an altered immune response in SARIFA-positive CRCs [9].
Furthermore, as it is well-established that hypoxic tumors such as SARIFA-positive tumors are associated with high-risk features and poor outcomes [38], we investigated the relationship between three different hypoxia scores and did not find any SARIFA-dependent changes (Buffa hypoxia score: pâ=â0.799, Ragnum hypoxia score: pâ=â0.800, Winter hypoxia score: pâ=â0.267, Wilcoxon rank sum test).
SARIFA-based gene expression pattern predicts differential therapy response
Based on the observed gene expression profiles, we further analyzed if differential gene expression leads to differences in predicted treatment responses. Therefore, we deployed oncoPredict, which is a computational tool to derive drug responses based on cell line screening data [35]. Here, indeed SARIFA-positive CRCs displayed a differential drug sensitivity (Fig. 4D as well as additional file 5). Among the 198 analyzed compounds, we could identify four drugs, which are currently used in the treatment of CRC patients, in the primary and/or in the metastatic setting (Oxaliplatin_1089, 5-Fluorouracil_1073 [5-FU], Irinotecan_1088, Lapatinib_1558). Interestingly, SARIFA-positive CRCs are predicted to be more resistant to Oxaliplatin with higher predicted IC50 values (fold change 1.045, pâ=â0.0029, qâ=â0.078). Whereas for 5-FU (fold change 0.44, pâ=â0.48, qâ=â0.14), Irinotecan (fold change 0.22, pâ=â0.29, qâ=â0.60), and Lapatinib (fold change 0.20, pâ=â0.83, qâ=â0.91) no significant differences could be observed. Moreover, SARIFA-positive CRCs seem, in line with their partial overlap with CMS4, more sensitive to Dasatinib, which is an FDA (U.S. Food and Drug Administration) approved tyrosine kinase inhibitor against CMS4-related kinases [39] and already in clinical use for chronic myeloid leukemia (Dasatinib_1079, fold change â0.31, pâ=â0.027, qâ=â0.22). At least a similar trend could be observed for further compounds, namely JQ1_2172 (fold change â0.19, pâ=â0.067, qâ=â0.34) and XAV939 (fold change â0.14, pâ=â0.092, qâ=â0.36). JQ1 has been described as an active drug in CRC cell lines and patient-derived xenografts [40], whereas XAV939 is supposed to function via inhibition of Wnt/beta-catenin signaling, which plays a central role in CRC [41].
Discussion
Adequate patient stratification in CRC in routine diagnostic pathology, especially in TNM stages II/III [3], still remains challenging. To come up with a solution for this pressing clinical need, we established SARIFA-status as a solely H&E-based biomarker [7, 8], that could be fast and easily implemented in routine pathologic workflow straight away. Compared to tumor budding [42, 43], which is a histopathologic biomarker already in clinical use, SARIFA-status is characterized by a low interobserver variability without a need for further immunohistochemical stains or assays [7, 8]. To further characterize the prognostic relevance as well as the molecular background of SARIFA, we comprehensively investigated SARIFA as a biomarker in the openly available TCGA cohorts COAD and READ. Besides further insights into tumor biology with regards to SARIFA-status, this approach has the advantage of making our SARIFA assessment publicly available and thereby not only providing a training resource for pathologists but also serving as a starting point for further research efforts.
By deploying TCGA-CRC as the first publicly available, external validation cohort, we could again prove the association of SARIFA-positivity with known conventional high-risk features such as higher pT categories and positive lymph nodes. Furthermore, SARIFA-positivity was strongly associated with poor outcomes with regard to different endpoints, namely OS, PFS, and DSS, even within locally advanced (pT3/pT4) CRCs. SARIFA-status remained one of the strongest independent predictors with regard to all investigated endpoints upon multivariate analysis. In line with our findings, other groups just recently provided further evidence that adipocytes close to tumor cells are a morphological feature that is associated with a poor prognosis in CRC [18, 44]. Even though novel approaches with comparable performance to better stratify CRC patients based on gene-expression profiling or deep learning algorithms have recently been published [6, 18], SARIFA assessment does not rely on challenging assays or computing power but solely on H&E histopathology.
Based on extensive molecular profiling, that has been done within TCGA and further related studies, we could now prove that SARIFA and its associated poor prognosis is likely not driven by genetic changes as SARIFA-positivity was not associated with any harmful molecular changes such as deleterious BRAF V600E mutations [45] or MSS status [46], which are known to convey a poor prognosis. As our previous understanding of genetic alterations with regards to SARIFA-status in CRC was based on very limited sample numbers and only panel-based NGS sequencing [9], our current study confirms that SARIFA-positivity is not a reflection of harmful genetic alterations.
Furthermore, this is the first study that could prove that SARIFA-positive CRCs have a similar upregulation of fatty acid metabolism, just as observed in SARIFA-positive GCs. Strikingly, whereas we initially identified an upregulation of FABP4 and CD36, both closely related to lipid metabolism, specifically at SARIFAs in GC [8], we now provide the first evidence that these genes are also upregulated in RNA bulk data. Even though it seems like a limitation, that only bulk data is available for TCGA-CRC, and bulk data is unlikely to reflect SARIFA as a spatially restricted process at the invasion front, our results show that bulk RNA-seq reflects robust gene expression changes associated with SARIFA not only at the invasion front but the entire tumor as well as the tumor microenvironment.
By linking our gene expression profiles of SARIFA-negative and SARIFA-positive CRCs to the established CMS subtypes [5] for the very first time, we identified an enrichment of CMS1 (immune) and especially CMS4 (mesenchymal) CRCs within SARIFA-positive cases. Moreover, there was a pronounced upregulation of genes associated with extracellular matrix organization such as Proteoglycan 4, and higher SIIS scores in SARIFA-positive CRCs, underlining their more mesenchymal phenotype. Previous studies, partly based on deep learning algorithms, could already detect a distinct genotype-phenotype correlation between histomorphologic features and CMS subtypes, such as the absence of mucin in CMS2 or desmoplastic reaction and high-grade budding in CMS4 CRCs [47,48,49]. Therefore, SARIFA-status based on H&E histopathology could serve as an indicator for CMS subtyping without the need for further cost-intensive RNA-based assays. As the enrichment of SARIFA-positive CRCs within CMS4 cases indicates, SARIFA-positive CRCs display a more mesenchymal, stroma-associated gene expression profile. Consequently, SARIFA-positive CRCs show significantly higher SIIS than SARIFA-negative CRCs. Higher stroma cell infiltration (higher SIIS) has already been proven as a high-risk feature in CRC and conveys an intrinsic drug resistance and therefore is associated with reduced efficacy of adjuvant chemotherapy [37]. Interestingly, SARIFA-negative CRCs showed an upregulation of IGF2 (insulin-like growth factor 2), which has been recently described by Isella et al. as characteristic of CRC intrinsic subtype (CRIS) D [50]. In line with our findings in terms of prognosis, CMS4 CRCs, which have an overlap with SARIFA-positive CRCs, show the poorest prognosis [5], whereas CRIS-D CRCs, which show similarities to SARIFA-negative CRCs, seem to have the best outcomes [50].
Beyond this, SARIFA-positive CRCs were, as mentioned, characterized by an upregulation of genes associated with lipid metabolism, namely FABP4 and CD36, which are known to play an important role in CRC as [10, 14, 51] well as general cancer progression [12, 52, 53], and hence could serve as novel therapeutic targets in SARIFA-positive CRCs [54, 55].
On the protein level, SARIFA-positive CRCs exhibited pronounced upregulation of Fibronectin and Annexin A, hinting on the one hand again at the key role of extracellular matrix organization with regards to SARIFA-status [56], and on the other hand on immunomodulatory changes within SARIFA-positive CRCs [57], which supports our previous findings of an altered immune response in SARIFA-positive CRC patients [9].
Finally, we investigated the predicted differential treatment response based on gene expression signatures of SARIFA-positive CRCs. Here, we observed a differential drug sensitivity. SARIFA-positive CRCs are predicted to be more sensitive to tyrosine kinase inhibitor (TKI) Dasatinib, which is in line with the findings that the use of TKIs in mesenchymal CMS4, which partly overlaps with SARIFA-positivity, can be beneficial [39]. Gene-expression-based drug sensitivity testing also suggested that SARIFA-positive CRCs are more resistant to Oxaliplatin treatment, which is part of most CRC chemotherapy regimens [4, 58], and therefore is of high clinical relevance. This finding is also in line with the higher SIIS observed in SARIFA-positive CRCs as higher SIIS indicates less benefit from adjuvant chemotherapy, as published previously [37]. Consistent with higher SIIS and the overlap with CMS4, a very recent study by Hu et al. could show that SARIFA-positivity is associated with non-mature desmoplastic reaction (with histologically visible keloid-like collagen [intermediate/middle desmoplastic reaction] or myxoid stroma [immature desmoplastic reaction]) [59]. Interestingly, the authors could also show that non-mature desmoplastic reaction as an H&E-based biomarker can potentially guide treatment decisions [59]. Beyond validating our findings that SARIFA-positivity is closely linked to changes in the extracellular matrix organization, these results highlight the important role of histologic biomarkers reflecting changes in the tumor stroma to predict treatment response.
To conclude, SARIFA-status is an independent and adverse prognostic histopathologic biomarker that does not only show some overlap with CMS1/CMS4 subtypes and high SIIS scores but also seems to possess a strong association with lipid metabolism. Therefore, we firmly believe H&E-based SARIFA-status is the equivalent of underlying aggressive tumor biology with its own transcriptional identity, which does not rely on genomic changes. We provide here the first external validation of SARIFA-status as a novel biomarker in CRC, which is based on an openly available data set and can therefore be used as a training resource for pathologists and researchers globally. SARIFA-status could be implemented easily and without further costs in routine diagnostic pathology and should be further validated in prospective trials as our current study also provides evidence that SARIFA-positive CRCs are characterized by a differential drug sensitivity.
Data availability
The dataset(s) supporting the conclusions of this article are included within the article (and its additional files). Moreover, molecular and image data are publicly available at https://portal.gdc.cancer.gov/ and https://www.cbioportal.org/. The Cancer-Genome-Atlas (TCGA) cohorts COAD (colonic adenocarcinoma) and READ (rectal adenocarcinoma) together with the corresponding WSI to each case are publicly available [19]. SARIFA-status of the cases can also be found in detail in Additional File 1. Molecular data is available at https://www.cbioportal.org/ for the TCGA PanCancerAtlas [21, 22]. Additional data on the datasets are available from Liu et al. [23], from Thorsson et al. [24] as well as from Malta et al. [25].
References
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209â49.
Sawicki T, Ruszkowska M, Danielewicz A, Niedźwiedzka E, ArÅukowicz T, PrzybyÅowicz KE. A review of colorectal cancer in terms of epidemiology, risk factors, development, symptoms and diagnosis. Cancers (Basel). 2021;13:2025.
Kim HS, Kim KM, Lee SB, Kim GR, Han YD, Cho MS, et al. Clinicopathological and biomolecular characteristics of stage IIB/IIC and stage IIIA colon cancer: Insight into the survival paradox. J Surg Oncol. 2019;120:423â30.
André T, Boni C, Navarro M, Tabernero J, Hickish T, Topham C, et al. Improved overall survival with oxaliplatin, fluorouracil, and leucovorin as adjuvant treatment in stage II or III colon cancer in the MOSAIC trial. J Clin Oncol. 2009;27:3109â16.
Guinney J, Dienstmann R, Wang X, de Reyniès A, Schlicker A, Soneson C, et al. The consensus molecular subtypes of colorectal cancer. Nat Med. 2015;21:1350â6.
Brunac A-C, Fourquet J, Perot G, Jaffrelot M, Meilleroux J, Danjoux M, et al. CINSARC signature outperforms gold-standard TNM staging and consensus molecular subtypes for clinical outcome in stage II-III colorectal carcinoma. Mod Pathol. 2022;35:2002â10.
Martin B, Grosser B, Kempkens L, Miller S, Bauer S, Dhillon C, et al. Stroma AReactive Invasion Front Areas (SARIFA)-a new easily to determine biomarker in colon cancer-results of a retrospective study. Cancers (Basel). 2021;13:4880.
Grosser B, Glückstein M-I, Dhillon C, Schiele S, Dintner S, VanSchoiack A, et al. Stroma AReactive Invasion Front Areas (SARIFA) - a new prognostic biomarker in gastric cancer related to tumor-promoting adipocytes. J Pathol. 2022;256:71â82.
Reitsam NG, Märkl B, Dintner S, Sipos E, Grochowski P, Grosser B, et al. Alterations in natural killer cells in colorectal cancer patients with Stroma AReactive Invasion Front Areas (SARIFA). Cancers (Basel). 2023;15:994.
Tian W, Zhang W, Zhang Y, Zhu T, Hua Y, Li H, et al. FABP4 promotes invasion and metastasis of colon cancer by regulating fatty acid transport. Cancer Cell Int. 2020;20:512.
Di Franco S, Bianca P, Sardina DS, Turdo A, Gaggianesi M, Veschi V, et al. Adipose stem cell niche reprograms the colorectal cancer stem cell metastatic machinery. Nat Commun. 2021;12:1â16.
Gyamfi J, Yeo JH, Kwon D, Min BS, Cha YJ, Koo JS, et al. Interaction between CD36 and FABP4 modulates adipocyte-induced fatty acid import and metabolism in breast cancer. npj Breast Cancer. 2021;7:1â18.
Wang Y-T, Liu C-H, Zhu H-L. Fatty acid binding protein (FABP) inhibitors: a patent review (2012-2015). Expert Opin Ther Pat. 2016;26:767â76.
Drury J, Rychahou PG, He D, Jafari N, Wang C, Lee EY, et al. Inhibition of fatty acid synthase upregulates expression of CD36 to sustain proliferation of colorectal cancer cells. Front Oncol. 2020;10:1185.
Mukherjee A, Bilecz AJ, Lengyel E. The adipocyte microenvironment and cancer. Cancer Metastasis Rev. 2022;41:575â87.
Vasseur S, Guillaumond F. Lipids in cancer: a global view of the contribution of lipid pathways to metastatic formation and treatment resistance. Oncogenesis 2022;11:46.
Wulczyn E, Steiner DF, Moran M, Plass M, Reihs R, Tan F, et al. Interpretable survival prediction for colorectal cancer using deep learning. NPJ Digit Med. 2021;4:71.
Foersch S, Glasner C, Woerl A-C, Eckstein M, Wagner D-C, Schulz S, et al. Multistain deep learning for prediction of prognosis and therapy response in colorectal cancer. Nat Med. 2023;29:430â9.
Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature 2012;487:330â7.
Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162:55â63.
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6:l1.
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2:401â4.
Liu Y, Sethi NS, Hinoue T, Schneider BG, Cherniack AD, Sanchez-Vega F, et al. Comparative molecular analysis of gastrointestinal adenocarcinomas. Cancer Cell. 2018;33:721â35.e8.
Thorsson V, Gibbs DL, Brown SD, Wolf D, Bortone DS, Ou Yang T-H, et al. The immune landscape of cancer. Immunity 2018;48:812â30.e14.
Malta TM, Sokolov A, Gentles AJ, Burzykowski T, Poisson L, Weinstein JN, et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell 2018;173:338â54.e15.
Shia J, Schultz N, Kuk D, Vakiani E, Middha S, Segal NH, et al. Morphological characterization of colorectal cancers in The Cancer Genome Atlas reveals distinct morphology-molecular associations: clinical and biological implications. Mod Pathol. 2017;30:599â609.
Schemper M, Smith TL. A note on quantifying follow-up in studies of failure time. Control Clin Trials. 1996;17:343â6.
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinforma. 2011;12:323.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Eide PW, Bruun J, Lothe RA, Sveen A. CMScaller: an R package for consensus molecular subtyping of colorectal cancer pre-clinical models. Sci Rep. 2017;7:1â8.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25:25â9.
Ge SX, Jung D, Yao R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 2020;36:2628â9.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102:15545â50.
Wickham H ggplot2. Springer International Publishing;
Maeser D, Gruener RF, Huang RS. oncoPredict: an R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Brief Bioinform. 2021;22:bbab260.
Iorio F, Knijnenburg TA, Vis DJ, Bignell GR, Menden MP, Schubert M, et al. A landscape of pharmacogenomic interactions in cancer. Cell. 2016;166:740â54.
Zhou R, Wen Z, Liao Y, Wu J, Xi S, Zeng D, et al. Evaluation of stromal cell infiltration in the tumor microenvironment enable prediction of treatment sensitivity and prognosis in colon cancer. Comput Struct Biotechnol J 2022;20:2153â68.
Bhandari V, Hoey C, Liu LY, Lalonde E, Ray J, Livingstone J, et al. Molecular landmarks of tumor hypoxia across cancer types. Nat Genet. 2019;51:308â18.
Peters NA, Constantinides A, Ubink I, van Kuik J, Bloemendal HJ, van Dodewaard JM, et al. Consensus molecular subtype 4 (CMS4)-targeted therapy in primary colon cancer: a proof-of-concept study. Front Oncol. 2022;12:969855.
Wu T, Wang G, Chen W, Zhu Z, Liu Y, Huang Z, et al. Co-inhibition of BET proteins and NF-κB as a potential therapy for colorectal cancer through synergistic inhibiting MYC and FOXM1 expressions. Cell Death Dis. 2018;9:315.
Tian X-H, Hou W-J, Fang Y, Fan J, Tong H, Bai S-L, et al. XAV939, a tankyrase 1 inhibitior, promotes cell apoptosis in neuroblastoma cell lines by inhibiting Wnt/β-catenin signaling pathway. J Exp Clin Cancer Res. 2013;32:100.
Martin B, Schäfer E, Jakubowicz E, Mayr P, Ihringer R, Anthuber M, et al. Interobserver variability in the H&E-based assessment of tumor budding in pT3/4 colon cancer: does it affect the prognostic relevance? Virch Arch. 2018;473:189â97.
Koelzer VH, Zlobec I, Berger MD, Cathomas G, Dawson H, Dirschmid K, et al. Tumor budding in colorectal cancer revisited: results of a multicenter interobserver study. Virch Arch. 2015;466:485â93.
LâImperio V, Wulczyn E, Plass M, Müller H, Tamini N, Gianotti L, et al. Pathologist validation of a machine learningâderived feature for colon cancer risk stratification. JAMA Netw Open. 2023;6:e2254891.
Fariña-Sarasqueta A, van Lijnschoten G, Moerland E, Creemers G-J, Lemmens VEPP, Rutten HJT, et al. The BRAF V600E mutation is an independent prognostic factor for survival in stage II and stage III colon cancer patients. Ann Oncol. 2010;21:2396â402.
Popat S, Hubner R, Houlston RS. Systematic review of microsatellite instability and colorectal cancer prognosis. J Clin Orthod. 2005;23:609â18.
Trinh A, Lädrach C, Dawson HE, Ten Hoorn S, Kuppen PJK, Reimers MS, et al. Tumour budding is associated with the mesenchymal colon cancer subtype and RAS/RAF mutations: a study of 1320 colorectal cancers with Consensus Molecular Subgroup (CMS) data. Br J Cancer. 2018;119:1244â51.
Sirinukunwattana K, Domingo E, Richman SD, Redmond KL, Blake A, Verrill C, et al. Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning. Gut 2021;70:544â54.
Nguyen H-G, Lundström O, Blank A, Dawson H, Lugli A, Anisimova M, et al. Image-based assessment of extracellular mucin-to-tumor area predicts consensus molecular subtypes (CMS) in colorectal cancer. Mod Pathol. 2021;35:240â8.
Isella C, Brundu F, Bellomo SE, Galimi F, Zanella E, Porporato R, et al. Selective analysis of cancer-cell intrinsic transcriptional traits defines novel clinically relevant subtypes of colorectal cancer. Nat Commun. 2017;8:1â16.
Pan B, Yue Y, Ding W, Sun L, Xu M, Wang S. A novel prognostic signatures based on metastasis- and immune-related gene pairs for colorectal cancer. Front Immunol. 2023;14:1161382.
Gharpure KM, Pradeep S, Sans M, Rupaimoole R, Ivan C, Wu SY, et al. FABP4 as a key determinant of metastatic potential of ovarian cancer. Nat Commun. 2018;9:1â14.
Mukherjee A, Chiang C-Y, Daifotis HA, Nieman KM, Fahrmann JF, Lastra RR, et al. Adipocyte-induced FABP4 expression in ovarian cancer cells promotes metastasis and mediates carboplatin resistance. Cancer Res. 2020;80:1748â61.
Ruan C, Meng Y, Song H. CD36: an emerging therapeutic target for cancer and its molecular mechanisms. J Cancer Res Clin Oncol. 2022;148:1551â8.
Sun N, Zhao X. Therapeutic implications of FABP4 in cancer: an emerging target to tackle cancer. Front Pharm. 2022;13:948610.
Ye Y, Zhang R, Feng H. Fibronectin promotes tumor cells growth and drugs resistance through a CDC42-YAP-dependent signaling pathway in colorectal cancer. Cell Biol Int. 2020;44:1840â9.
Araújo TG, Mota STS, Ferreira HSV, Ribeiro MA, Goulart LR, Vecchi L. Annexin A1 as a regulator of immune response in cancer. Cells. 2021;10. https://doi.org/10.3390/cells10092245.
André T, Boni C, Mounedji-Boudiaf L, Navarro M, Tabernero J, Hickish T, et al. Oxaliplatin, fluorouracil, and leucovorin as adjuvant treatment for colon cancer. N. Engl J Med. 2004;350:2343â51.
Hu Q, Wang Y, Yao S, Mao Y, Liu L, Li Z, et al. Desmoplastic reaction associates with prognosis and adjuvant chemotherapy response in colorectal cancer: a multicenter retrospective study. Cancer Res Commun. 2023;3:1057â66.
Acknowledgements
We thank all patients who participated in the analyzed studies. We also thank all researchers who contributed to the publicly available datasets used in this study.
Funding
JNK is supported by the German Federal Ministry of Health (DEEP LIVER, ZMVI1-2520DAT111) and the Max-Eder-Program of the German Cancer Aid (grant #70113864), the German Federal Ministry of Education and Research (PEARL, 01KD2104C), and the German Academic Exchange Service (SECAI, 57616814). This research was funded/supported by the National Institute for Health and Care Research (NIHR, NIHR213331), Leeds Biomedical Research Center. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care. NGR is supported by the Manfred-Stolte Foundation (gastrointestinal pathology research). Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
NGR, JNK and BM conceptualized the study. NGR did the histopathological review. NGR, VG, CMLL, HSM and BG contributed to the data acquisition process, analyzed and interpreted the data. NGR and VG performed bioinformatics analyses. NGR, VG, CMLL and BM wrote the paper. Finally, all authors critically revised the paper and approved the final version for submission.
Corresponding author
Ethics declarations
Competing interests
JNK reports consulting services for Owkin, France, Panakeia, UK, and DoMore Diagnostics, Norway, and has received honoraria for lectures by MSD, Eisai, and Fresenius; not related to this study. BM has received compensation for travel expenses and fees for advisory board activities by AstraZeneca, Boehringer Ingelheim, Merck, MSD, BMS, Bayer, and Novartis, not related to this study. The other authors have no conflict of interest to declare.
Ethics approval and consent to participate
The experiments in this study are in compliance with the Declaration of Helsinki and the International Ethical Guidelines for Biomedical Research Involving Human Subjects by the Council for International Organizations of Medical Sciences (CIOMS). Moreover, the study has been carried out according to the âTransparent reporting of a multivariable prediction model for individual prognosis or diagnosisâ (TRIPOD) statement [20]. Additionally, the overall analysis in this study has been approved by the Ethics Board at the Medical Faculty of Technical University Dresden (BO-EK-444102022). The patient sample collection in each cohort was separately approved by the respective institutional ethics boards.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Reitsam, N.G., Grozdanov, V., Löffler, C.M.L. et al. Novel biomarker SARIFA in colorectal cancer: highly prognostic, not genetically driven and histologic indicator of a distinct tumor biology. Cancer Gene Ther 31, 207â216 (2024). https://doi.org/10.1038/s41417-023-00695-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41417-023-00695-y
This article is cited by
-
The relationship between Stroma AReactive Invasion Front Areas (SARIFA), Warburg-subtype and survival: results from a large prospective series of colorectal cancer patients
Cancer & Metabolism (2024)
-
SARIFA as a new histopathological biomarker is associated with adverse clinicopathological characteristics, tumor-promoting fatty-acid metabolism, and might predict a metastatic pattern in pT3a prostate cancer
BMC Cancer (2024)
-
The Concept of Stroma AReactive Invasion Front Areas (SARIFA) as a new prognostic biomarker for lipid-driven cancers holds true in pancreatic ductal adenocarcinoma
BMC Cancer (2024)
-
Converging deep learning and human-observed tumor-adipocyte interaction as a biomarker in colorectal cancer
Communications Medicine (2024)
-
The SARIFA biomarker in the context of basic research of lipid-driven cancers
npj Precision Oncology (2024)