DriverDB is an integrative cancer omics database that combines somatic mutation, RNA expression, miRNA expression, protein expression, methylation, copy number variation, and clinical data with curated annotations and published bioinformatics algorithms for driver gene and driver event identification. Featured in the 2014, 2016, 2020, and 2024 Nucleic Acids Research Database Issues, DriverDB applies state-of-the-art computational methods to characterize cancer drivers across molecular layers.
DriverDB provides three major analytical modules:
1.1 Cancer Module Overview
The Cancer module summarizes driver gene and driver event predictions for a user-selected cancer type by integrating multi-omics data — including somatic mutations, RNA expression, miRNA expression, protein expression, copy number variation (CNV), methylation, and clinical information — through published bioinformatics algorithms and curated annotation sources. This module provides a cancer-centric overview of dysregulated molecular features and highlights candidate driver genes, their regulatory mechanisms, and their functional significance across molecular layers.
For mutation, CNV, and methylation, a Survival Relevance tab evaluates whether identified driver genes are associated with patient survival using multiple analysis methods, including Cox regression, cure model, and machine learning-based approaches. For multi-omics analysis, additional machine learning results are provided, including prognostic signature identification, Kaplan–Meier survival plots, predictive performance plots, and a gene-level summary of survival associations across omics types, endpoints, and algorithms.
1.2 Dataset Selection: Browse by Cancer Type
DriverDBv4 provides analysis across 70 cancer datasets, including 33 TCGA cancer types and additional datasets from resources such as CPTAC and ICGC. Use the selection panel to choose the dataset you want to explore.
A. Tissue Type (Optional)
Filter available datasets by tissue origin to quickly locate cancers related to a specific anatomical site.B. Related Dataset
Select the specific cancer dataset you wish to analyze. Each dataset label includes its data source (e.g., TCGA-US, ICGC-KR), allowing users to choose cohorts most relevant to their research.
C. Submit
After making your selections, click Submit to load driver gene summaries and molecular features for the chosen cancer type. All downstream tabs, including Mutation, CNV, Methylation, Survival, miRNA, and Multi-Omics, will display results based on the selected dataset.
1.3 Overview of Result Tabs
The Cancer module contains several results tabs, each summarizing driver evidence derived from a different omics layer:1.4 Cancer Summary
1.4.1 Overview
The Cancer Summary tab provides an integrated overview of potential driver genes and miRNA drivers for the selected cancer type. It aggregates multi-omics driver evidence—including mutation, CNV, methylation, expression, miRNA regulation, and survival relevance—and connects them through known biological networks such as protein–protein interactions (PPIs), gene–miRNA interactions, and synergistic survival associations.
This section contains two main components:Together, these views help users quickly identify influential driver genes, their regulatory relationships, and cross-omics support.
1.4.2 Summary Network
Purpose
The Summary Network visualizes relationships between driver genes and miRNA drivers in the selected cancer type, providing an integrated view of multi-omics driver events and their functional or regulatory connections.
The network integrates the following data sources:Nodes
Driver gene nodes are displayed as circular nodes divided into four quadrants, each corresponding to an omics feature:Each quadrant is colored according to its omics status when that feature is altered in the selected cancer type; refer to the Node Legend for color definitions. When a quadrant's omics feature shows no significant alteration, that quadrant is white. The overall appearance of the node therefore reflects the combination of omics alterations present for that gene — a fully colored node indicates alterations across all four omics layers, while a predominantly white node indicates few or no detected alterations. A red star within a node indicates genes identified by multi-omics integration tools.
miRNA driver nodes are displayed as yellow nodes and represent miRNAs identified as regulatory or dysregulated in the selected cancer type.
Unconnected nodes are omitted from the network for clarity.

Edges
Each edge indicates a known or predicted biological relationship between two nodes.
Unconnected nodes are removed to reduce visual clutter and highlight biologically relevant clusters.
Interaction Guide
The Summary Network is fully interactive:
Selecting and Highlighting
1.4.3 Driver Summary Table
The Driver Summary Table provides an integrated overview of potential cancer driver genes identified across the cancer projects associated with the selected cancer type. Each row represents a gene and summarizes the evidence supporting its potential driver role across multiple molecular data types and established cancer-gene databases.
The table indicates whether each gene is included in the Cancer Gene Census (CGC) or the Network of Cancer Genes 6.0 (NCG6.0), and reports driver evidence derived from mutation, copy number variation, DNA methylation, RNA expression, and miRNA regulation analyses. The RNA column shows whether the gene is upregulated or downregulated, while the miRNA column lists the miRNAs associated with regulation of the gene. The multiomics column reports the number of omics data types that identify the gene as a potential driver.
Together, these features allow users to compare candidate driver genes, assess the breadth of molecular evidence supporting each gene, and identify genes supported by multiple omics layers or curated cancer-gene resources.
Column Description:1.5 Cancer Mutation
1.5.1 Overview
The Cancer Mutation section identifies and visualizes mutation-based driver genes and their survival relevance in the selected cancer type. Results are organized into two tabs: Driver Genes and Survival Relevance.
The Driver Genes tab focuses on identifying mutation driver genes using multiple published computational tools. Driver genes are genes whose mutations are believed to confer a selective growth advantage in cancer. The degree of consensus across tools provides a measure of confidence in each gene's driver role. This tab contains two components:Together, the Driver Genes and Survival Relevance tabs help users identify which genes are supported as mutation drivers by computational tools, which are associated with patient survival, and which show synergistic survival effects in combination with other molecular features.
1.5.2 Driver genes
Mutation Driver Summary by ToolsPurpose
This panel summarizes how many genes are identified by varying numbers of mutation driver–detection tools.
Stronger consensus across tools indicates stronger evidence supporting a gene’s driver role.
Components
Distribution of Mutation Driver Genes by Tool Support (Left Plot)Mutation Summary Table (Right Table)

Purpose
This section visualizes mutation patterns for the top 30 mutation driver genes, helping users examine:It contains two interactive components.
Components
Mutation Impact Distribution of Top 30 Driver Genes (Left Plot)Tool Support for Top 30 Driver Genes (Right Plot)
The plot displays a bar chart where each bar represents a gene, with the height of the bar indicating the number of mutation tools that identified that gene as a mutation driver. Genes that are supported by a greater number of tools suggest higher-confidence driver roles, as consensus across multiple computational methods provides stronger evidence for their functional importance in cancer development.
1.5.3 Survival relevance
Overall Summary
The bar charts and Venn diagrams summarize the number and overlap of survival-related genes identified from the selected omics data across four survival endpoints and four survival analysis methods.
The four survival endpoints include overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI). The four survival analysis methods include Cox univariate regression, Cox multivariate regression adjusted for clinical covariates, cure model analysis, and machine learning (ML)-based analysis. The ML-based analysis includes Lasso, Random Forest, and I-Boost; a gene is counted as survival-related by ML if it is identified by at least one of the three algorithms. For more information about these methods, please refer to FAQ4.
Survival gene summary table
The Survival Gene Summary table lists survival-related genes identified in the selected cancer type based on the selected omics data type: RNA, mutation, copy number variation (CNV), or methylation. Results are organized into four tabs corresponding to the four survival endpoints: overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI).
Each endpoint-specific table includes the following columns: Gene Symbol, Cox Uni, Cox Multi (Clinical), Cure Model, Machine Learning, and Number of Algorithms.
For Cox Uni, Cox Multi (Clinical), and Cure Model, values represent log2 hazard ratios, where log2 transformation is applied to center the scale symmetrically around zero for easier comparison of risk directions. Positive values are shown in red, indicating higher risk, while negative values are shown in blue, indicating lower risk. Stronger color intensity represents a larger absolute log2 hazard ratio. Blank cells indicate that the gene did not reach statistical significance or was not tested under that method.
For Machine Learning, genes identified by at least one of the three machine learning algorithms — Lasso, Random Forest, or I-Boost — are marked with '+'. A blank cell indicates the gene was not identified as survival-related by any of the three methods.
The Number of Algorithms column indicates how many of the four analysis methods — Cox Univariate, Cox Multivariate (Clinical), Cure Model, and Machine Learning — identified the gene as survival-related, on a scale of 1 to 4. Higher values suggest more consistent evidence of survival relevance across methods. Users can reorder the table by clicking on any column name. See FAQ4 for algorithm descriptions and references.
Synergistic survival analysis
The Synergistic Survival Analysis section evaluates whether pairs of genes or molecular features show combined survival effects within the user-selected cancer type. The analysis supports cross-omics interactions among RNA expression, mutation, copy number variation (CNV), and methylation, depending on the selected omics type and available data. Currently, synergistic survival analysis is available for overall survival (OS) only.
Users can filter the results by selecting a gene set, including All, CGC, or NCG, and by selecting the hazard ratio direction, including All, HR > 1, or HR < 1. The gene set resources include the Cancer Gene Census (CGC) and the Network of Cancer Genes (NCG 6.0).
The result table lists detailed information for each synergistic survival interaction, including cancer type, interaction type, gene symbols, omics levels, hazard ratio, and adjusted p-value. The adjusted p-value is used to determine significant synergistic interactions, whereas the Kaplan–Meier plots display the corresponding unadjusted log-rank p-values for visualization. Users can reorder the table by clicking on any column name. Selecting or toggling a gene–omic pair in the table generates the corresponding Kaplan–Meier survival plots below.
The Kaplan–Meier plots display survival differences among patient groups defined by the selected cross-omics interaction. The left plot shows the unadjusted Kaplan–Meier survival curves, while the right plot shows adjusted survival curves generated using ggadjustedcurves() from the survminer package, when available. The gene symbols, omics layers, and survival analysis values are shown above each plot. The x-axis represents survival time from the initial cancer diagnosis, with survival curves displayed for the first 5 years of follow-up, and the y-axis represents survival probability. Users can hover over the curves to view detailed survival information and click the legend to show or hide individual curves.
Synergistic interactions are identified based on the combined survival effect of two survival-related molecular features from different omics layers. A synergistic pair is reported when the combined hazard ratio is greater than 1.5-fold compared with each individual omics feature and the log-rank p-value is less than 0.05.
Patient group stratification
Patient groups are defined according to the combined omics states of the two paired features. The grouping method depends on the omics type:
Abbreviation definitions
The group labels in the Kaplan–Meier plots represent the combined omics states of two genes or molecular features. The order of gene 1 and gene 2 follows the interaction type shown in the table.
1.6 Cancer CNV
1.6.1 Overview
The Cancer CNV section visualizes genes exhibiting significant copy number variation (CNV) gain or loss in the selected cancer type, and evaluates whether CNV status of genes is associated with patient survival. Results are organized into two tabs: CNV Drivers and Survival Relevance.
At the top of the tab, users may choose between two CNV driver–detection modes:
Switching between modes allows users to compare tool-specific versus multi-tool consensus CNV drivers. The selected mode applies across both tabs.
The CNV Drivers tab summarizes CNV driver evidence across patient samples, chromosomes, and pathway enrichments, helping users explore CNV-expression relationships and CNV-driven biological mechanisms. This tab contains three components:Together, the CNV Drivers and Survival Relevance tabs help users identify which genes show significant copy number alterations supported by computational tools, which are associated with patient survival, and which show synergistic survival effects in combination with other molecular features.
1.6.2 Driver genes
Visualization of Top 30 CNV Driver GenesThis panel presents CNV gain, loss, and neutral patterns for the top 30 CNV driver genes in the selected cancer type.
CNV Gain and Loss Distribution of Top 30 Genes (Top Chart)
The plot displays a bar chart summarizing the percentage of samples exhibiting copy number variation (CNV) changes across the top 30 CNV driver genes, with each bar color-coded to show CNV gain (pink), CNV loss (green), and no CNV change (blue). Users can hover over any bar segment to view the exact percentages of gain, loss, and neutral CNV states for each gene. Genes with high gain percentages may represent potential oncogenes, while those with high loss percentages may be tumor suppressor candidates, whereas genes with balanced or low CNV changes may indicate lower CNV-driven relevance in cancer development.
CNV Patterns of Top 30 Genes Across Cancer Samples (Bottom Heatmap)
The heatmap displays copy number variation (CNV) data with rows representing the top 30 CNV driver genes and columns representing individual patient samples, where each cell is color-coded to indicate CNV gain (pink), CNV loss (green), or no CNV event (blue). Additional summary panels provide complementary information: the left panel (A) shows total CNV gain/loss percentages per gene, the top bar chart (B) displays total CNV events per sample, and the right bar chart (C) presents total CNV events per gene. Rows dominated by green or pink indicate consistent CNV-driven alterations in specific genes, while samples with tall bars in the top chart may represent CNV-heavy tumor genomes, and genes showing both high CNV frequency and strong tool support from the summary table emerge as strong CNV driver candidates.
This section explores chromosomal distribution and functional enrichment of CNV-associated genes.
Chromosomal Locus Enrichment of CNV-Associated Genes (Left Plot)
The plot displays each gene as a red dot positioned according to its chromosomal coordinates across the genome, with hovering over any dot revealing detailed information including the chromosome, position, gene symbol, and correlation value between CNV and expression. Dense clusters of dots indicate chromosomal regions enriched for CNV events, while genes showing high CNV–expression correlation may reflect dosage-sensitive drivers where copy number changes directly influence gene expression levels and potentially contribute to cancer development.
Locus Enrichment Summary Table (Right Table)
The table displays pathways or functional categories that are enriched among CNV-affected genes, helping users identify biological processes potentially disrupted by copy number variation events. Enrichment of pathways such as cell cycle regulation, DNA repair, or receptor tyrosine kinase (RTK) signaling may highlight key CNV-driven mechanisms underlying cancer development and progression.
This table provides gene-level CNV statistics, including significance metrics, sample proportions, CNV amplitude, and CNV–expression associations, offering a comprehensive overview of copy number variation patterns across genes. Genes with significant gain or loss (low p-value or FDR) and high sample proportions represent strong CNV candidates, while positive CNV–expression correlations indicate copy-number–driven expression changes where genomic alterations directly influence gene expression levels. Combining this table with the heatmap helps confirm consistent CNV patterns across patients and strengthens the evidence for identifying clinically relevant CNV-driven genes.
1.6.3 Survival relevance
Overall Summary
The bar charts and Venn diagrams summarize the number and overlap of survival-related genes identified from the selected omics data across four survival endpoints and four survival analysis methods.
The four survival endpoints include overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI). The four survival analysis methods include Cox univariate regression, Cox multivariate regression adjusted for clinical covariates, cure model analysis, and machine learning (ML)-based analysis. The ML-based analysis includes Lasso, Random Forest, and I-Boost; a gene is counted as survival-related by ML if it is identified by at least one of the three algorithms. For more information about these methods, please refer to FAQ4.

Survival gene summary table
The Survival Gene Summary table lists survival-related genes identified in the selected cancer type based on the selected omics data type: RNA, mutation, copy number variation (CNV), or methylation. Results are organized into four tabs corresponding to the four survival endpoints: overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI).
Each endpoint-specific table includes the following columns: Gene Symbol, Cox Uni, Cox Multi (Clinical), Cure Model, Machine Learning, and Number of Algorithms.
For Cox Uni, Cox Multi (Clinical), and Cure Model, values represent log2 hazard ratios, where log2 transformation is applied to center the scale symmetrically around zero for easier comparison of risk directions. Positive values are shown in red, indicating higher risk, while negative values are shown in blue, indicating lower risk. Stronger color intensity represents a larger absolute log2 hazard ratio. Blank cells indicate that the gene did not reach statistical significance or was not tested under that method.
For Machine Learning, genes identified by at least one of the three machine learning algorithms — Lasso, Random Forest, or I-Boost — are marked with +. A blank cell indicates the gene was not identified as survival-related by any of the three methods.
The Number of Algorithms column indicates how many of the four analysis methods — Cox Univariate, Cox Multivariate (Clinical), Cure Model, and Machine Learning — identified the gene as survival-related, on a scale of 1 to 4. Higher values suggest more consistent evidence of survival relevance across methods. Users can reorder the table by clicking on any column name. See FAQ4 for algorithm descriptions and references.
Synergistic survival analysis
The Synergistic Survival Analysis section evaluates whether pairs of genes or molecular features show combined survival effects within the user-selected cancer type. The analysis supports cross-omics interactions among RNA expression, mutation, copy number variation (CNV), and methylation, depending on the selected omics type and available data. Currently, synergistic survival analysis is available for overall survival (OS) only.
Users can filter the results by selecting a gene set, including All, CGC, or NCG, and by selecting the hazard ratio direction, including All, HR > 1, or HR < 1. The gene set resources include the Cancer Gene Census (CGC) and the Network of Cancer Genes (NCG 6.0).
The result table lists detailed information for each synergistic survival interaction, including cancer type, interaction type, gene symbols, omics levels, hazard ratio, and adjusted p-value. The adjusted p-value is used to determine significant synergistic interactions, whereas the Kaplan–Meier plots display the corresponding unadjusted log-rank p-values for visualization. Users can reorder the table by clicking on any column name. Selecting or toggling a gene–omic pair in the table generates the corresponding Kaplan–Meier survival plots below.
The Kaplan–Meier plots display survival differences among patient groups defined by the selected cross-omics interaction. The left plot shows the unadjusted Kaplan–Meier survival curves, while the right plot shows adjusted survival curves generated using ggadjustedcurves() from the survminer package, when available. The gene symbols, omics layers, and survival analysis values are shown above each plot. The x-axis represents survival time from the initial cancer diagnosis, with survival curves displayed for the first 5 years of follow-up, and the y-axis represents survival probability. Users can hover over the curves to view detailed survival information and click the legend to show or hide individual curves.
Synergistic interactions are identified based on the combined survival effect of two survival-related molecular features from different omics layers. A synergistic pair is reported when the combined hazard ratio is greater than 1.5-fold compared with each individual omics feature and the log-rank p-value is less than 0.05.
Patient group stratification
Patient groups are defined according to the combined omics states of the two paired features. The grouping method depends on the omics type:
Abbreviation definitions
The group labels in the Kaplan–Meier plots represent the combined omics states of two genes or molecular features. The order of gene 1 and gene 2 follows the interaction type shown in the table.
1.7 Cancer Methylation
1.7.1 Overview
The Cancer Methylation section visualizes genes exhibiting significant hypermethylation or hypomethylation in the selected cancer type, and evaluates whether methylation status of genes is associated with patient survival. Results are organized into two tabs: Methylation Drivers and Survival Relevance.
At the top of the section, users may choose between two methylation driver-detection modes:
Switching between modes allows users to compare single-tool versus consensus methylation drivers. The selected mode applies across both tabs.
The Methylation Drivers tab provides an overview of methylation driver distributions across patient samples and chromosomal locations, helping users explore methylation-expression relationships and identify epigenetically driven gene dysregulation. This tab contains three components:Together, the Methylation Drivers and Survival Relevance tabs help users identify which genes show significant epigenetic alterations supported by computational tools, which are associated with patient survival, and which show synergistic survival effects in combination with other molecular features.
1.7.2 Driver genes
Visualization of Top 30 Methylation Driver GenesThis panel summarizes the methylation status of the top 30 methylation driver genes and shows how hypermethylation/hypomethylation patterns appear across samples.
Methylation Status of Top 30 Genes (Top Bar Chart)
The plot displays a bar chart summarizing the proportion of samples showing hypermethylation (pink), hypomethylation (green), and no methylation change (blue) across the top 30 methylation driver genes, with each bar representing a single gene. Users can hover over any bar to view the exact percentages of hyper-, hypo-, and unmethylated samples for that gene. Genes with high hypermethylation may involve promoter silencing or epigenetic downregulation that reduces gene expression, while genes with high hypomethylation may indicate enhancer activation or derepression leading to increased expression, whereas balanced patterns may suggest context-specific or mixed methylation states that vary across different tumor samples or subtypes.
Methylation Patterns Across Cancer Samples (Bottom Heatmap)
The heatmap displays methylation data with rows representing the top 30 methylation driver genes and columns representing individual patient samples, where cell colors indicate hypermethylation (pink) or hypomethylation (green). Additional summary bars provide complementary information: the left panel (A) shows total methylation percentage per gene, the top bar chart (B) displays total methylation events per sample, and the right bar chart (C) presents total methylation events per gene. Genes with predominantly pink rows are consistently hypermethylated across patients, while those with predominantly green rows show consistent hypomethylation, and tall bars in the top chart indicate samples with high methylation burden. Comparison with expression data through correlation analysis helps identify methylation-driven expression changes, revealing epigenetic mechanisms that influence gene activity in cancer.
This section maps methylation-associated genes to their chromosomal positions and evaluates pathway enrichment.
Chromosomal Locus Enrichment of Methylation-Associated Genes (Left Plot)
The plot displays each methylation-associated gene as a red dot positioned according to its chromosomal coordinates across the genome, with hovering over any dot revealing the chromosome, genomic position, gene symbol, and correlation value between methylation and expression. Clusters of dots may indicate epigenetically altered chromosomal regions where multiple genes experience coordinated methylation changes, while positive correlation values suggest that methylation changes strongly influence gene expression, such as hypermethylation leading to downregulation or hypomethylation resulting in upregulation. Genes with high correlation values may represent functional methylation drivers where epigenetic modifications play a critical role in regulating gene activity and contributing to cancer phenotypes.
Locus Enrichment Summary Table (Right Table)
The table displays pathways enriched among methylation-affected genes, helping users identify biological processes impacted by epigenomic dysregulation in cancer. Enrichment in pathways such as DNA repair, immune regulation, or cell differentiation may indicate core mechanisms that are altered via methylation changes, revealing how epigenetic modifications contribute to tumor development, progression, and immune evasion.
This table summarizes methylation statistics for each gene identified as a methylation driver in the selected cancer type, integrating results from MethylMix and ELMER outputs and including methylation proportions, probe-level data, and correlations with gene expression. Strong hyper- or hypomethylation percentages combined with significant adjusted p-values indicate robust methylation drivers, while a strong negative correlation typically suggests promoter hypermethylation reducing expression and a positive correlation may indicate intragenic methylation effects that enhance gene activity. Genes showing consistent patterns across both tools (MethylMix ∩ ELMER) represent high-confidence methylation drivers, as convergent evidence from multiple computational methods strengthens the reliability of epigenetic alterations as key regulatory mechanisms in cancer.
1.7.3 Survival relevance
Overall Summary
The bar charts and Venn diagrams summarize the number and overlap of survival-related genes identified from the selected omics data across four survival endpoints and four survival analysis methods.
The four survival endpoints include overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI). The four survival analysis methods include Cox univariate regression, Cox multivariate regression adjusted for clinical covariates, cure model analysis, and machine learning (ML)-based analysis. The ML-based analysis includes Lasso, Random Forest, and I-Boost; a gene is counted as survival-related by ML if it is identified by at least one of the three algorithms. For more information about these methods, please refer to FAQ4.

Survival gene summary table
The Survival Gene Summary table lists survival-related genes identified in the selected cancer type based on the selected omics data type: RNA, mutation, copy number variation (CNV), or methylation. Results are organized into four tabs corresponding to the four survival endpoints: overall survival (OS), progression-free interval (PFI), disease-specific survival (DSS), and disease-free interval (DFI).
Each endpoint-specific table includes the following columns: Gene Symbol, Cox Uni, Cox Multi (Clinical), Cure Model, Machine Learning, and Number of Algorithms.
For Cox Uni, Cox Multi (Clinical), and Cure Model, values represent log2 hazard ratios, where log2 transformation is applied to center the scale symmetrically around zero for easier comparison of risk directions. Positive values are shown in red, indicating higher risk, while negative values are shown in blue, indicating lower risk. Stronger color intensity represents a larger absolute log2 hazard ratio. Blank cells indicate that the gene did not reach statistical significance or was not tested under that method.
For Machine Learning, genes identified by at least one of the three machine learning algorithms — Lasso, Random Forest, or I-Boost — are marked with +. A blank cell indicates the gene was not identified as survival-related by any of the three methods.
The Number of Algorithms column indicates how many of the four analysis methods — Cox Univariate, Cox Multivariate (Clinical), Cure Model, and Machine Learning — identified the gene as survival-related, on a scale of 1 to 4. Higher values suggest more consistent evidence of survival relevance across methods. Users can reorder the table by clicking on any column name. See FAQ4 for algorithm descriptions and references.
Synergistic survival analysis
The Synergistic Survival Analysis section evaluates whether pairs of genes or molecular features show combined survival effects within the user-selected cancer type. The analysis supports cross-omics interactions among RNA expression, mutation, copy number variation (CNV), and methylation, depending on the selected omics type and available data. Currently, synergistic survival analysis is available for overall survival (OS) only.
Users can filter the results by selecting a gene set, including All, CGC, or NCG, and by selecting the hazard ratio direction, including All, HR > 1, or HR < 1. The gene set resources include the Cancer Gene Census (CGC) and the Network of Cancer Genes (NCG 6.0).
The result table lists detailed information for each synergistic survival interaction, including cancer type, interaction type, gene symbols, omics levels, hazard ratio, and adjusted p-value. The adjusted p-value is used to determine significant synergistic interactions, whereas the Kaplan–Meier plots display the corresponding unadjusted log-rank p-values for visualization. Users can reorder the table by clicking on any column name. Selecting or toggling a gene–omic pair in the table generates the corresponding Kaplan–Meier survival plots below.
The Kaplan–Meier plots display survival differences among patient groups defined by the selected cross-omics interaction. The left plot shows the unadjusted Kaplan–Meier survival curves, while the right plot shows adjusted survival curves generated using ggadjustedcurves() from the survminer package, when available. The gene symbols, omics layers, and survival analysis values are shown above each plot. The x-axis represents survival time from the initial cancer diagnosis, with survival curves displayed for the first 5 years of follow-up, and the y-axis represents survival probability. Users can hover over the curves to view detailed survival information and click the legend to show or hide individual curves.
Synergistic interactions are identified based on the combined survival effect of two survival-related molecular features from different omics layers. A synergistic pair is reported when the combined hazard ratio is greater than 1.5-fold compared with each individual omics feature and the log-rank p-value is less than 0.05.
Patient group stratification
Patient groups are defined according to the combined omics states of the two paired features. The grouping method depends on the omics type:
Abbreviation definitions
The group labels in the Kaplan–Meier plots represent the combined omics states of two genes or molecular features. The order of gene 1 and gene 2 follows the interaction type shown in the table.
1.8 Cancer miRNA
1.8.1 Overview
The Cancer miRNA section visualizes regulatory relationships between differentially expressed (DE) genes and miRNAs in the selected cancer type. It integrates both experimentally validated interactions and computationally predicted miRNA–target relationships to help users identify miRNA regulators, target genes, and expression patterns associated with carcinogenesis. Note: When Validated is selected, interactions are shown if they are either experimentally validated (solid lines) or meet the selected minimum prediction support (dotted lines).
This section contains three components:1.8.2 miRNA-Gene Interaction Network
Purpose
This interactive network displays validated and predicted interactions between genes and miRNAs, enabling users to explore regulatory mechanisms that may contribute to cancer development or progression.
Nodes
Edges
Two types of miRNA–gene interactions are shown:Note: All interactions appear as dotted lines in the visualization. The distinction between validated and predicted interactions is determined by whether the 'Validated' checkbox is enabled, not by line style. Edges grow denser as evidence increases (validated + multi-tool predictions).
Interaction Guide Selecting Nodes

1.8.3 Visualization of Differentially Expressed Genes and miRNAs
Purpose
This heatmap displays the expression profiles of differentially expressed (DE) genes and DE miRNAs across tumor and normal samples, allowing users to compare regulatory patterns at the expression level and examine relationships between miRNA regulators and their target genes.
Heatmap Display
The heatmap presents rows representing DE miRNAs and/or DE genes and columns representing individual patient samples, with a color scale where red indicates higher expression and blue indicates lower expression. Sample labels distinguish between TP (dark blue, tumor samples) and NT (light blue, normal samples), while clustering dendrograms show similarities among samples (top) and similarities among genes/miRNAs (right), revealing co-expression patterns and sample groupings.
Visualization Modes
Users can switch between three visualization modes: DE miRNA (displaying only DE miRNAs), DE gene (displaying only DE genes), or DE miRNA + DE gene (combined view). These modes help users examine upregulated miRNAs versus their target genes, identify opposing expression trends such as miRNA upregulation with corresponding gene downregulation, and detect co-expression clusters among miRNAs or genes that suggest coordinated regulatory mechanisms.
Interpretation Tips
Inverse expression patterns, where miRNA expression is high and target gene expression is low, may indicate miRNA-mediated repression as a functional regulatory mechanism. Co-clustering of genes or miRNAs suggests shared regulatory pathways or common biological functions, while DE miRNAs that align with network interactions identified in other analyses highlight strong regulatory candidates with potential functional significance in cancer development or progression.
1.8.4. Gene–miRNA Correlation Summary Table
This table provides quantitative measures of gene–miRNA regulatory relationships using correlation analysis, validation data, and prediction support to assess the strength and reliability of regulatory interactions. Negative correlations often indicate miRNA-mediated repression where miRNA upregulation corresponds with target gene downregulation, while positive correlations may suggest co-regulation or indirect regulatory mechanisms involving intermediate factors. High prediction-tool support combined with validated status and strong correlation values indicate high-confidence interactions that are likely functionally relevant, and users can cross-check this table with network edges and heatmap expression patterns to confirm consistent regulatory relationships across multiple analytical approaches.
1.9 Cancer Multi-omics
1.9.1 Overview
The Cancer Multi-Omics section visualizes driver genes identified through multi-omics integration tools and evaluates their survival relevance in the selected cancer type. By combining evidence across mutations, CNV, methylation, mRNA expression, and miRNA regulation, this section highlights genes supported by multiple molecular layers and explores their biological functions, tool support, distribution across omics categories, and prognostic significance.
Users may filter results by gene set:
1.9.2 Multi-Layer Relationship Diagram of Multi-Omics Drivers and Biological Functions
This section presents a diagram illustrating hierarchical relationships from the cancer type → omics layers → multi-omics driver genes → Gene Ontology (GO) functions, showing how integrative driver events connect molecular alterations to biological processes. A summary table below the diagram lists detailed gene-specific and GO-specific results.
1.9.3 Distribution of Multi-Omics Drivers Across Omics Layers
Purpose
This section summarizes how many tools identify each gene as a multi-omics driver and how those drivers are distributed across omics categories.
It consists of two complementary plots:Tool Support Across Omics Layers (Left Heatmap)
This heatmap displays tool support across omics layers, with rows representing multi-omics driver genes, columns representing different omics layers, and cells showing the number of tools that identified each gene within that specific omic layer, where hovering over any cell reveals the exact number of supporting tools. Darker cells indicate stronger multi-tool evidence for that particular omic layer, suggesting robust detection across computational methods, while genes with support across multiple omics layers may represent high-confidence integrative drivers that are dysregulated through multiple molecular mechanisms. Missing or light-colored cells suggest omics-specific drivers where the gene shows alterations predominantly in one molecular layer rather than across multiple platforms.
Top Genes by Support Count (Right Bar Chart)
This bar chart displays an ordered list of genes ranked by the total number of supporting tools, with the x-axis showing tool counts and the y-axis displaying gene symbols, where bar colors differentiate the omics categories contributing to each gene's overall support. Users can hover over any bar to view the number of tools per omic layer contributing to each gene's score, providing detailed breakdowns of evidence sources. Genes with the longest bars are most consistently supported across computational tools and represent the strongest driver candidates, while multicolored bars indicate multi-layered evidence across different omics platforms suggesting integrative dysregulation, and single-color bars represent omics-specific drivers that show alterations predominantly within one molecular layer.

1.9.4 Cross-Tool Comparison of Multi-Omics Driver Detection
Purpose
This section compares the coverage and consistency of different multi-omics driver-identification tools across omics layers.
It contains:Proportion of Genes Identified by Each Tool (Left Heatmap)
This heatmap displays multi-omics identification tools on the y-axis and omics layers on the x-axis, with each cell showing the proportion of genes identified by a specific tool for a given omic layer, where hovering reveals exact proportion values. High-proportion cells reveal tool specialization or sensitivity toward certain omics categories, indicating that some tools are particularly effective at detecting drivers within specific molecular layers, while tools with balanced proportions across multiple omics may provide more integrative coverage and capture dysregulation across diverse biological mechanisms.
Gene Counts by Tool Support Level (Right Bar Chart)
This bar chart displays the distribution of tool support across genes, with the x-axis showing the number of tools supporting a gene and the y-axis showing the number of genes at each support level, where hovering reveals the exact count of genes supported by each tool count. A right-skewed distribution, where more genes are supported by many tools, indicates strong cross-tool consensus and robust identification of driver genes across computational methods, while a left-skewed distribution suggests tool divergence where few genes are consistently detected across platforms. Genes supported by more tools typically represent high-confidence multi-omics drivers, as convergent evidence from multiple analytical approaches strengthens their credibility as functionally relevant cancer-associated genes.

1.9.5 Machine Learning Results
The machine learning result table summarizes significant prognostic signatures identified by machine learning algorithms for the selected cancer type. Each row represents a significant result for a specific survival endpoint and algorithm combination. Results are organized across four omics data types — RNA expression, mutation, CNV, and methylation — each available in a separate tab.
The table includes the following columns:Users can reorder the table by clicking on any column name. Selecting a row displays the corresponding Kaplan–Meier survival plot and predictive performance plot in the Signature Results panel below.
1.9.6 Signature Results
The Signature Results panel displays the prognostic signature identified by the selected machine learning algorithm and survival endpoint for the selected cancer type. Users can select a machine learning algorithm — Lasso, Random Forest, or I-Boost — and a survival endpoint from the left menu. The corresponding signature gene table, Kaplan–Meier survival plot, and predictive performance plot are displayed on the right. For detailed algorithm descriptions and reference links, please refer to FAQ4.
For Lasso and Random Forest, ROC curves evaluate the predictive performance of the signature at different survival time points. The x-axis represents the false-positive rate and the y-axis represents the true-positive rate. Users can hover over the curves to view the false-positive rate, true-positive rate, and cutoff value at each point. ROC curves for different survival times can be shown or hidden by clicking the corresponding legend labels.
For I-Boost, a cumulative hazard plot is displayed instead of ROC curves. This plot shows the cumulative hazard over time for the high- and low-risk groups. Higher cumulative hazard values indicate a greater accumulated risk of the survival event occurring up to that time point. The x-axis represents survival time from initial cancer diagnosis and the y-axis represents cumulative hazard. Users can hover over the curves to view detailed information, and curves can be shown or hidden by clicking the corresponding legend labels.
1.9.7 Multi-Omics Survival Gene Summary
The Multi-Omics Survival Gene Summary panel provides an overview of survival-related genes identified by Lasso, Random Forest, and I-Boost across omics data types and survival endpoints for the selected cancer type.

Bar charts
The three bar charts at the top summarize the distribution of significant survival-related genes from different perspectives. Hover over each bar to view the exact gene count.
(A) Significant genes by omics type: Shows the number of significant survival-related genes identified from each omics data type — RNA expression, mutation (MUT), CNV, and methylation (MET). This chart helps users assess which omics layer contributes the most survival-related features in the selected cancer type.
(B) Significant genes by survival endpoint: Shows the number of significant survival-related genes associated with each survival endpoint — OS, PFI, DSS, and DFI. This chart helps users compare the breadth of survival associations across endpoints.
(C) Significant genes by algorithm: Shows the number of significant survival-related genes identified by each machine learning algorithm — Lasso, Random Forest, and I-Boost. This chart helps users assess whether results are consistent across algorithms or driven predominantly by one method.
Survival gene table (D)
The table below the bar charts lists all survival-related genes identified across omics types, endpoints, and algorithms. Each row represents a unique omics-gene combination, so a gene identified across multiple omics types appears as separate rows.
The table includes the following columns:
Molecular: the omics data type from which the feature is derived — RNA expression, mutation (MUT), CNV, or methylation (MET).
Gene: the gene symbol of the molecular feature.
Count: the total number of + marks in that row, reflecting how many endpoint-algorithm combinations identified the gene as survival-related. Higher counts indicate more consistent survival relevance across endpoints and algorithms.
Endpoint columns: each survival endpoint — OS, PFI, DSS, and DFI — is represented as a column group with three sub-columns corresponding to Lasso, Random Forest, and I-Boost. A + indicates that the gene was identified as survival-related by that algorithm under that endpoint. A blank cell indicates the gene was not identified under that combination.
2.1 Gene Module Overview
The Gene module provides a comprehensive, multi-omics overview of a user-selected gene across multiple cancer types. By integrating expression, mutation, CNV, methylation, survival analysis, miRNA regulation, protein expression, and multi-omics driver evidence, this module helps users understand how a gene behaves across the cancer landscape and how its molecular alterations may relate to patient outcomes.
2.2 Input Selection
To begin, choose how you want to search for the gene:
After entering the gene, click Submit to generate all downstream results.
2.3 Overview of Result Tabs
The Gene module contains several results tabs, each summarizing multi-omics evidence and functional insights for a selected gene across different cancer types:2.4 Gene Summary
Gene Overview
The Gene Overview page provides a high-level overview of multi-omics evidence for the selected gene across projects, cohorts, and tissues. These visualizations help users quickly identify where the gene shows notable molecular changes, how consistently those changes appear across cohorts, and whether tissue-specific patterns are present.
The summary bar plots and boxplots provide global cross-cohort summaries for the selected gene, while the heatmap shows project-level results for the selected tissue or organ. Bar plots summarize the proportion of projects or cohorts with significant categorical results, such as RNA, CNV, and methylation. Boxplots summarize count-based results, such as mutation driver tool counts and miRNA counts.
Detailed information on the computational algorithms and tools used in DriverDBv5 can be found in FAQ4.
Color definitions, statistical cutoffs, and asterisk criteria are described in the next Visualization Color & Asterisk Reference section.
Select a tissue or organ from the body diagram to update the heatmap and view tissue-specific results across RNA, mutation, CNV, methylation, and miRNA.
Visualization Color & Asterisk Reference
2.5 Gene RNA
2.5.1 Overview
The RNA panel visualizes expression patterns and survival associations for the selected gene across multiple cancer types.
Users can explore RNA expression results using three grouping options: sample type, mutation class, and tumor stage. Each grouping option includes both a Pan-Cancer View and a Cancer-Specific View, allowing users to compare expression patterns across cancer types or examine expression details within a selected cancer type.
When results are grouped by sample type, users can click on tissues or organs in the body diagram to display relevant expression results. This view summarizes RNA expression z-scores — standardized expression values indicating how far a sample's expression deviates from the mean across samples — across matching cancer projects. For mutation class and tumor stage views, results are displayed according to the selected grouping category and are not affected by the body diagram selection.
In the Cancer-Specific View, results are shown as boxplots embedded within violin plots, where the boxplot summarizes the median and interquartile range and the violin shape illustrates the full distribution of expression values across samples.
Each of the three grouping options includes a dedicated Survival Map and Survival Analysis section. These evaluate whether RNA expression of the selected gene is associated with patient survival using the same patient grouping — high versus low expression — regardless of which grouping option is selected. As a result, the survival results are consistent across the three grouping options and reflect the overall expression level of the gene rather than the specific grouping metric displayed above. Survival results are available for TCGA cohorts only.
Each grouping option section is organized as follows:2.5.2 Expression by Sample Type
This tab displays expression across all available sample types (e.g., NT, TP, TM, TRBM, TBM).
Organ-specific Project Expression by Sample Type
This panel displays z-score expression distributions of the selected gene across all projects/cohorts of a selected organ/tissue (x-axis), grouped and colored by sample type.
Cancer-Specific View: Expression by Sample Type
Shows gene expression distributions of the selected gene within a specific cancer project, grouped by sample type.
2.5.3 Expression by Mutation Class
This tab evaluates expression levels across mutation impact categories, including:Pan-Cancer View: Expression by Mutation Class
Boxplots display TPM expression across cancer types, grouped by mutation class.
User can:
Cancer-Specific View: Expression by Mutation Class
Shows log₁₀(TPM) expression distributions of the selected gene within a specific cancer project, grouped by mutation class.
Users first select an organ or tissue from the left panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by violin plots with embedded boxplots (D) showing the expression distribution of the selected gene across available mutation classes within that project. A summary table (E) is displayed below the plots, reporting pairwise comparisons between mutation classes including p-values that indicate whether gene expression differs significantly between the compared groups.
The log₁₀(TPM) transformation is applied to expression values to reduce the influence of outliers and improve visualization across samples with widely varying expression levels.
Use the sample type controls to show or hide specific groups. Hover over individual dots to view sample-level details, or hover over the violin or boxplot areas to view distribution statistics, including the maximum, upper fence, Q3, median, Q1, lower fence, and minimum.

2.5.4 Expression by Tumor Stage
This tab examines expression variation across clinical tumor stages (I–IV).
Pan-Cancer View: Expression by Tumor Stage
Boxplots show TPM expression across cancer types, grouped by tumor stage.
Cancer-Specific View: Expression by Tumor Stage
Shows log₁₀(TPM) expression distributions of the selected gene within a specific cancer project, grouped by tumor stage.
Users first select an organ or tissue from the left panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by violin plots with embedded boxplots w the plots, reporting pairwise comparisons between tumor stages including p-values that indicate whether gene expression differs significantly between the compared groups.
The log₁₀(TPM) transformation is applied to expression values to reduce the influence of outliers and improve visualization across samples with widely varying expression levels.
Use the sample type controls to show or hide specific groups. Hover over individual dots to view sample-level details, or hover over the violin or boxplot areas to view distribution statistics, including the maximum, upper fence, Q3, median, Q1, lower fence, and minimum.

2.5.5 Survival Map & Survival Analysis
Each omics module includes dedicated Survival Map and Survival Analysis sections that evaluate the prognostic relevance of the selected gene using the molecular data available in that module. Depending on the module, the analysis may be based on RNA expression, mutation, copy number variation (CNV), or DNA methylation.
Detailed information on interpreting the survival results for all omics modules, including the Survival Map and Survival Analysis panels, is provided in Gene-Survival.
2.6 Gene Mutation
2.6.1 Overview
The Mutation interface visualizes mutation patterns and statistics of the selected gene across multiple cancer types, with mutations mapped along the protein sequence and aligned with functional protein domains.
Users can explore three mutation-level summaries: Mutation Rate, Mutation Percent, and Exon Distribution. Mutation Rate reflects the frequency of mutations per sample, while Mutation Percent reflects the proportion of samples carrying at least one mutation in the selected gene. Exon Distribution summarizes how mutations are distributed across the exonic regions of the gene. Each summary includes both a Pan-Cancer View and a Cancer-Specific View, allowing users to compare mutation patterns across cancer types or examine mutation details within a selected cancer type. Mutation hotspots — positions where mutations cluster more frequently than expected — can be identified by examining the distribution of mutations along the protein coordinates.
Each of the three mutation summaries includes a dedicated Survival Map and Survival Analysis section. These evaluate whether mutation status of the selected gene is associated with patient survival using the same patient grouping — mutated versus wild-type — regardless of which mutation summary is selected. As a result, the survival results are consistent across the three summaries and reflect the overall mutation status of the gene rather than the specific mutation-level metric displayed above. Survival results are available for TCGA cohorts only.
Each mutation summary section is organized as follows:This structure is consistent across all three mutation summaries: Mutation Rate, Mutation Percent, and Exon Distribution.
2.6.2 Mutation Rate
Pan-Cancer View: Mutation Rate Heatmap
This view integrates multiple coordinated panels to show where mutations occur along the protein and how frequently they appear across cancer types, using mutation rate as the metric.
ComponentsThis heatmap displays projects as rows and protein positions as columns, with cell color indicating the mutation rate at each specific position. Users can hover over cells to view the cancer type, protein position, and mutation rate, allowing them to identify protein regions with recurrent mutation hotspots across multiple cancer types.
B. Protein Region Impact Bar PlotThis plot aggregates mutation rates per protein region, with bars stacked by impact level (High, Moderate, or Low) to show the relative contribution of different mutation severities. Hovering over bar segments reveals the region name, impact category, and mutation rate, demonstrating which functional regions of the protein accumulate the highest mutation load.
C. Dataset-Level Mutation Burden Bar PlotThis bar chart displays the overall mutation rate per dataset or cancer type, with bars stacked by mutation impact level to show the distribution of mutation severities. Users can hover to view the dataset, tissue type, impact level, and mutation rate, providing a quick comparison of which cancers have the heaviest mutation burden for the selected gene.
D. Dataset & Tissue LegendThis companion panel lists the tissue type, project ID, and cancer type for each dataset included in the analysis, with each color corresponding to a specific tissue type to help users interpret the color-coding used throughout the visualization.
E. Protein Domain Annotation TrackThis track shows annotated protein domains from Pfam or InterPro databases, displaying the domain name, protein coordinate range, and functional description (accessible via hover). This annotation aligns functional domains with mutation hotspots, helping users understand whether mutations cluster in functionally important regions of the protein.
F. Exon Annotation TrackThis track displays exon boundaries aligned to protein coordinates, with each exon shown as a distinct colored block to illustrate the genomic structure underlying the protein sequence and how mutations map to specific exons.
G&H. LegendsTwo legends accompany the visualization: a mutation rate legend providing a continuous color scale for heatmap intensity, and an impact legend showing categorical colors for High, Moderate, and Low mutation impacts to help users interpret the color-coding throughout all components.

Cancer-Specific View: Mutation Rate Bar Chart
Displays the mutation rate of the selected gene across protein positions within a selected cancer project, with mutations categorized by predicted impact level.
Users first select an organ or tissue from the left panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by the bar chart (D), where each bar represents a protein position and the height reflects the proportion of samples carrying a mutation at that position, expressed as a rate.
Mutation impacts are stacked within each bar into three categories — High, Moderate, and Low — to illustrate the impact composition at each protein position. This allows users to identify not only mutation-enriched regions along the protein sequence but also whether mutations at those positions are predominantly high-impact or low-impact.
Use the legend (E) to show or hide specific impact categories for focused comparisons. Hover over individual bars to view the protein position, mutation rate, impact category, and cancer project.

2.6.3 Mutation Percent
Pan-Cancer View: Mutation Percent Heatmap
This visualization is structurally identical to the Mutation Rate view but uses mutation percentage—the proportion of mutated samples in each dataset—rather than mutation rate.
ComponentsA. Pan-Cancer Mutation Hotspot Heatmap
This heatmap displays projects as rows and protein positions as columns, with cell color indicating the mutation percentage at each specific position. Users can hover over cells to view the cancer type, protein position, and mutation percentage, revealing which protein positions are most frequently mutated across patient samples in different cancer types.
B. Protein Region Impact Bar Plot
This plot shows mutation percentage per protein region, with bars stacked by mutation impact level to display the relative contribution of High, Moderate, and Low impact mutations. Hovering over bar segments reveals the region name, impact level, and mutation percentage, highlighting protein regions with high prevalence of mutations among patients and indicating which functional domains are most commonly affected.
C. Dataset-Level Mutation Burden Bar Plot
This bar chart displays mutation percentage per dataset or cancer type, with bars stacked by impact level to show the distribution of mutation severities. Users can hover to view the dataset, tissue type, impact level, and mutation percentage, identifying cancer types where mutations in the gene are widespread across patient populations.
D. Dataset & Tissue Legend
Color-coded tissue and dataset identifiers for interpreting the heatmap rows.
E. Protein Domain Annotation Track
This track displays Pfam and InterPro protein domains aligned to protein coordinates, showing the domain name, coordinate range, and functional details accessible through hovering, allowing users to determine whether mutations cluster within functionally important protein domains.
F. Exon Annotation Track
This track displays exon boundaries aligned to protein structure, with each exon shown as a distinct block to illustrate how the genomic organization corresponds to the protein sequence and mutation positions.
G&H. Legends
Two legends accompany the visualization: a mutation percent legend providing a color scale for mutation proportions, and an impact legend showing colors for High, Moderate, and Low mutation impact categories to help users interpret the color-coding throughout all components.

Cancer-Specific View: Mutation Percent Bar Chart
Displays the mutation percentage of the selected gene across protein positions within a selected cancer project, with mutations categorized by predicted impact level.
Users first select an organ or tissue from the left panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by the bar chart (D), where each bar represents a protein position and the height reflects the proportion of samples carrying a mutation at that position, expressed as a percentage.
Mutation impacts are stacked within each bar into three categories — High, Moderate, and Low — to illustrate the impact composition at each protein position. This allows users to identify not only mutation-enriched regions along the protein sequence but also whether mutations at those positions are predominantly high-impact or low-impact.
Use the legend (E) to show or hide specific impact categories for focused comparisons. Hover over individual bars to view the protein position, mutation percentage, impact category, and cancer project.
2.6.4 Exon Distribution
Pan-Cancer View: Exon Mutation Distribution
ComponentsA. Mutation Count by Exon
Shows the number of mutations per exon across all cancer types. X-axis = exon number; Y-axis = mutation count. Bars are stacked by mutation impact (High, Moderate, Low, Modifier). Hover to view exon number, impact, and mutation count.
B. Mutation Percentage by Exon
Displays the proportion of mutated samples per exon. X-axis = exon number; Y-axis = mutation percentage. Hover to view exon number, impact, and mutation percentage.
C. Protein Domain Panel
Annotated functional domains with Pfam ID, InterPro ID, position, and description. Hover for details on each domain.
D. Exon Annotation Track
Each colored block represents an exon aligned to the protein coordinate axis.
E. Impact Legend

Cancer-Specific View: Exon Mutation Distribution
Displays the exon-level mutation distribution of the selected gene within a selected cancer project, with mutations categorized by predicted impact level.
Users first select the visualization metric — mutation count or mutation percentage — from panel (A) to determine whether the bar chart displays the absolute number of mutations or the proportion of samples carrying a mutation per exon. Users then select an organ or tissue from panel (B), which filters the available cancer projects to those associated with the selected organ or tissue, and select a specific cancer project from the filtered list (C).
Once selections are made, the corresponding project description (D) is displayed at the top, followed by the bar chart (E) showing the exon-level mutation distribution for the selected cancer project and metric. Each bar represents an exon, and mutations are stacked within each bar into three impact categories — High, Moderate, and Low — to illustrate the impact composition at each exon. This allows users to identify which exons harbor the highest mutation load or frequency and whether mutations within those exons are predominantly high-impact or low-impact.
Use the legend (F) to show or hide specific impact categories for focused comparisons. Hover over individual bars to view the exon number, impact category, and mutation count or percentage.

2.6.5 Survival Map & Survival Analysis
Each omics module includes dedicated Survival Map and Survival Analysis sections that evaluate the prognostic relevance of the selected gene using the molecular data available in that module. Depending on the module, the analysis may be based on RNA expression, mutation, copy number variation (CNV), or DNA methylation.
Detailed information on interpreting the survival results for all omics modules, including the Survival Map and Survival Analysis panels, is provided in Gene-Survival.
2.7 Gene CNV
2.7.1 Overview
The Copy Number Variation interface visualizes CNV patterns of the selected gene across multiple cancer types and explores how copy number changes relate to gene expression and patient survival.
This interface integrates results from two complementary CNV analysis tools that operate at different levels of analysis. iGC identifies significant copy number gains and losses for the selected gene across cancer types, providing a gene-level summary of CNV status. DIGGIT identifies genes whose copy number alterations are significantly correlated with downstream gene expression changes, inferring potential CNV driver genes — that is, genes whose copy number changes may confer a functional advantage by altering the expression of downstream targets. Together, these tools allow users to explore both the CNV status of the selected gene and its potential functional consequences.
Users can explore CNV gain or loss significance across cancer types, examine how copy number changes correlate with gene expression levels, and identify cancers where the selected gene may act as a CNV driver.
The Survival section evaluates whether copy number variation of the selected gene is associated with patient survival. Survival results include a Survival Map summarizing associations across cancer types and endpoints, and detailed survival analyses based on multiple analysis methods. Survival results are available for TCGA cohorts only.
This panel includes five sections:2.7.2 Pan-Cancer View: Copy Number Variation Overview
This visualization summarizes CNV gain, loss, and neutral states for the selected gene across available cancer projects, based on results from iGC and DIGGIT. The display format depends on the number of available projects: when five or more projects are available, results are shown as a bar chart; when fewer than five projects are available, results are shown as a pie chart.
For projects displayed as a bar chart, the visualization consists of three coordinated components. The CNV driver panel at the top indicates how many CNV tools support the gene as a CNV driver in each cancer type. Light grey represents identification by iGC only, while dark grey indicates identification by both iGC and DIGGIT, with darker shading suggesting higher cross-tool confidence. The main panel displays sample proportion bars for each cancer type, with bar segments color-coded to show the proportion of samples exhibiting CNV gain, CNV loss, or no CNV change. Bar height reflects the percentage of samples in each CNV state. Users can hover over any bar to view detailed information, including cancer type, CNV status, tool support information, and proportional values.
When fewer than five projects are available, the same CNV information is displayed as a pie chart instead of a bar chart. In this format, each pie segment represents the proportion of samples with CNV gain, CNV loss, or no CNV change for the selected gene. Hovering over each segment provides detailed information such as the CNV status, sample proportion, and available tool support.
Together, these views help users determine which cancer projects show frequent CNV gains or losses, whether the selected gene is supported as a CNV driver by iGC, DIGGIT, or both, and which cancers may exhibit CNV-driven expression changes.
2.7.3 Cancer-Specific View: CNV Distribution and Correlation
This visualization helps users assess how copy number changes relate to gene expression levels within a selected cancer project, displaying the correlation between CNV segment mean and expression alongside group-level comparisons across CNV status categories.
Users first select an organ or tissue from panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by a 2×2 grid of visualization panels below.
D.CNV–Expression Correlation Scatter Plot
Displays the correlation between CNV segment mean (x-axis) and gene expression (y-axis, TPM on a log₁₀ scale) for individual samples, where each point represents one sample. Points are colored by CNV status — red for gain, blue for loss, green for neutral, and grey for normal. Gene expression is displayed on a log₁₀ scale to accommodate the wide dynamic range of TPM values across samples. Hover over any point to view the segment mean and expression value for that sample.
E.Expression by CNV Status Boxplot
Summarizes gene expression (TPM, log₁₀ scale) across CNV status groups — gain, loss, neutral, and normal — allowing users to compare expression levels between groups. Hover over the boxplot areas to view summary statistics, including the maximum, upper fence, Q3, median, Q1, lower fence, and minimum.
F.Segment Mean by CNV Status Boxplot
Summarizes the segment mean distribution for each CNV status category, where segment mean is calculated as log₂(copy number / 2). Values near 0 (between −0.3 and 0.3) indicate diploid or no copy number change; values above 0.3 indicate copy number gain; and values below −0.3 indicate copy number loss. Hover over the boxplot areas to view the same summary statistics as panel (E).
G.Correlation Summary Panel
Displays the correlation coefficient and p-value describing the strength and statistical significance of the relationship between CNV segment mean and gene expression. A positive correlation indicates that higher segment mean values — reflecting copy number gain — are associated with higher expression levels. A negative correlation indicates that lower segment mean values — reflecting copy number loss — are associated with lower expression levels.
H.Legend
The legend displays the CNV status color coding — red for gain, blue for loss, green for neutral, and grey for normal. Use the legend to show or hide specific CNV status groups across all panels for focused comparison.

2.7.4 CNV Summary Table
This table provides detailed tool-specific CNV statistics for the selected gene across cancer types, including correlation metrics that reveal the relationship between copy number alterations and gene expression. The Spearman correlation coefficient (ρ) indicates the direction and strength of this relationship: positive ρ values suggest that CNV gains are associated with higher expression levels, while negative ρ values indicate that CNV losses correspond to lower expression, and the accompanying p-value indicates the statistical significance of these correlations, helping users identify cancer types where CNV alterations functionally drive expression changes.
2.7.5 Survival Map & Survival Analysis
Each omics module includes dedicated Survival Map and Survival Analysis sections that evaluate the prognostic relevance of the selected gene using the molecular data available in that module. Depending on the module, the analysis may be based on RNA expression, mutation, copy number variation (CNV), or DNA methylation.
Detailed information on interpreting the survival results for all omics modules, including the Survival Map and Survival Analysis panels, is provided in Gene-Survival.
2.8 Gene Methylation
2.8.1 Overview
The Methylation interface visualizes DNA methylation patterns and survival associations for the selected gene across multiple cancer types, and explores how methylation status relates to gene expression.
This interface integrates results from two complementary methylation tools that answer different questions at different levels of analysis. MethylMix classifies genes as hypermethylated or hypomethylated by comparing their methylation distributions to a reference normal tissue, providing a gene-level summary of aberrant methylation across cancer types, providing a gene-level summary of aberrant methylation across cancer types. ELMER identifies specific CpG probes whose methylation level is inversely associated with nearby gene expression changes, providing a probe-level view of functionally relevant methylation events. Together, these tools allow users to explore both the methylation state of the selected gene and its potential regulatory consequences.
The Survival section evaluates whether methylation status of the selected gene is associated with patient survival. Survival results include a Survival Map summarizing associations across cancer types and endpoints, and detailed survival analyses based on multiple analysis methods. Survival results are available for TCGA cohorts only.
This panel includes five sections:2.8.2 Pan-Cancer View: Methylation Status Overview
This visualization summarizes DNA methylation results for the selected gene across available cancer projects, integrating results from MethylMix and ELMER.
The methylation driver panel at the top indicates how many methylation tools detected the selected gene as significant in each cancer project. Lighter gray indicates that one tool detected significance, while darker gray indicates that both tools detected significance, suggesting stronger cross-tool support.
Below the driver panel, methylation states are summarized for each cancer project. The display format depends on the number of available projects: when five or more projects are available, results are shown as a bar chart; when fewer than five projects are available, results are shown as a pie chart. Colors indicate the proportion of samples classified as hypermethylated, hypomethylated, or showing no methylation change.
Users can hover over bars or pie segments to view additional details, including the cancer type, methylation status, tool-specific results from MethylMix and ELMER, and sample proportion. This view helps users identify cancer projects with prominent hypermethylation or hypomethylation patterns and assess whether methylation-driven dysregulation of the selected gene is supported by one or both methylation tools.
2.8.3 Cancer-Specific View: Methylation Distribution and Correlation
This visualization helps users assess how methylation status relates to gene expression levels within a selected cancer project, displaying the correlation between beta value and expression alongside group-level comparisons across methylation status categories.
Users first select an organ or tissue from panel (A), which filters the available cancer projects to those associated with the selected organ or tissue. Users then select a specific cancer project from the filtered list (B). Once a project is selected, the corresponding project description (C) is displayed at the top, followed by a 2×2 grid of visualization panels below.
D.Methylation–Expression Correlation Scatter Plot
Displays the correlation between beta value (x-axis) and gene expression (y-axis, TPM on a log₁₀ scale) for individual samples, where each point represents one sample. The beta value reflects the ratio of methylated to total probe intensity, ranging from 0 (unmethylated) to 1 (fully methylated). Points are colored by methylation status — red for hypermethylated, green for hypomethylated, blue for neutral, and grey for normal. Gene expression is displayed on a log₁₀ scale to accommodate the wide dynamic range of TPM values across samples. Hover over any point to view the beta value, expression value, and methylation status for that sample.
E.Expression by Methylation Status Boxplot
Summarizes gene expression (TPM, log₁₀ scale) across methylation status groups — hypermethylated, hypomethylated, neutral, and normal — allowing users to compare expression levels between groups. Hover over the boxplot areas to view summary statistics, including the maximum, upper fence, Q3, median, Q1, lower fence, and minimum.
F.Beta Value by Methylation Status Boxplot
Summarizes the beta value distribution for each methylation status category, allowing users to examine how beta values differ across hypermethylated, hypomethylated, neutral, and normal groups. Hover over the boxplot areas to view the same summary statistics as panel (E).
G.Correlation Summary Panel
Displays the Spearman correlation coefficient (ρ) and p-value describing the strength and statistical significance of the relationship between beta value and gene expression. A negative correlation indicates that hypermethylation is associated with lower expression, consistent with gene silencing. A positive correlation indicates that hypomethylation is associated with higher expression.
H.Legend
The legend displays the methylation status color coding — red for hypermethylated, green for hypomethylated, blue for neutral, and grey for normal. Use the legend to show or hide specific methylation status groups across all panels for focused comparison.

2.8.4 Methylation Summary Table
This table lists methylation statistics across cancer types, integrating results from MethylMix (gene-level) and ELMER (probe-level).
2.8.5 Survival Map & Survival Analysis
Each omics module includes dedicated Survival Map and Survival Analysis sections that evaluate the prognostic relevance of the selected gene using the molecular data available in that module. Depending on the module, the analysis may be based on RNA expression, mutation, copy number variation (CNV), or DNA methylation.
Detailed information on interpreting the survival results for all omics modules, including the Survival Map and Survival Analysis panels, is provided in Gene-Survival.
2.9 Gene miRNA
2.9.1 Overview
The Gene miRNA module visualizes and quantifies regulatory relationships between the selected gene and its associated miRNAs across multiple cancer types.
Interactions are compiled from 12 miRNA–target prediction tools and experimentally validated interactions recorded in miRTarBase, with flexible filtering to refine predicted vs. validated relationships.
2.9.2 Gene-miRNA Interaction Network
This interactive network displays predicted and experimentally validated regulatory interactions between the selected gene and miRNAs, integrating multiple data sources to provide comprehensive evidence for post-transcriptional regulation.
Data Sources
The network integrates three types of data: prediction tools (12 total) that identify interactions using multiple independent algorithms such as TargetScan, miRanda, RNAhybrid, DIANA-microT, and miRDB; validation data from miRTarBase containing experimentally confirmed interactions; and additional miRNA regulatory information from Chung et al. (Nucleic Acids Research, 2017) and YM500v3, which was used to identify negative correlations between driver genes and miRNAs in earlier DriverDB releases.
Network Representation
The network displays nodes and edges that represent genes, miRNAs, and their regulatory relationships. Nodes are color-coded with the selected gene shown in green and miRNAs predicted or validated to interact with the gene shown in yellow. All interactions are displayed as dotted lines. The type of interaction shown (validated, predicted, or both) depends on which checkboxes are enabled: experimentally validated interactions are sourced from miRTarBase, while predicted interactions are computationally derived with support based on the number of prediction tools that agree on the interaction.
Filtering Options
Users can refine the network using three filtering categories: gene source (CGC from Cancer Gene Census, NCG from Network of Cancer Genes, or all genes), minimum prediction support (≥6 tools, ≥8 tools, or ≥10 tools, where higher thresholds ensure stronger computational agreement), and a validation filter to show only validated interactions.
Interactions
The network provides interactive features including a search bar to enter gene or miRNA names and locate them within the network, node clicking to highlight only the selected node and its connected partners, and clicking empty space to reset to the full view.
Interpretation
This network allows users to identify miRNAs with high prediction support, identify miRNAs with experimental validation, explore potential regulatory repression mechanisms including interactions previously observed across cancers, and distinguish between predicted versus validated miRNA regulation of the selected gene. Together, the network provides a visual map of gene–miRNA interactions supported by both computational and experimental evidence, helping researchers understand post-transcriptional regulatory mechanisms that may influence gene expression in cancer.

2.9.3 Gene-miRNA Correlation Table
This table provides quantitative expression-based evidence supporting each predicted or validated interaction between the selected gene and miRNAs, displaying three correlation metrics that reflect gene–miRNA co-expression patterns across samples: Pearson correlation with p-value, Spearman correlation with p-value, and Kendall correlation with p-value. Negative correlations suggest miRNA-mediated repression of the gene, which is consistent with canonical miRNA targeting mechanisms where increased miRNA expression leads to decreased target gene expression, while the accompanying p-values indicate the statistical significance of the correlation strength and help users assess the reliability of each regulatory relationship.
2.10 Gene Protein
2.10.1 Overview
The Gene Protein module visualizes protein-level variation of the selected gene across cancers and examines how protein abundance relates to mRNA expression and post-translational modifications (PTMs).
Analyses are organized into three tabs:All analyses support interactive exploration, including sample-level tooltips, togglable groups, and mRNA–protein scatter plots.
2.10.2 Clinical Stages
This tab evaluates how protein expression and mRNA–protein associations vary across clinical tumor stages.
Protein Expression by Clinical Stage (Pan-Cancer)
Purpose:Visualizes protein expression levels of the selected gene across all TCGA cancer types, grouped by stage.
Plot Features:
mRNA–Protein Correlation by Clinical Stage (Pan-Cancer)
Purpose:Assesses whether mRNA abundance explains protein expression patterns across cancers within each stage group.
Plot Features:
Cancer-Specific: Stage-Specific Protein Expression
This visualization shows protein expression patterns within a selected cancer type grouped by tumor stage, displaying a violin plot with stage groups (I–IV) on the x-axis and protein expression levels on the y-axis, where users can toggle stages using the legend and hover over dots to view sample-level information or hover over violin areas to see statistical summaries including median, quartiles, and fences. An accompanying statistical table compares pairs of stages with columns showing Group 1, Group 2, p-value, significance level, and sample counts, indicating whether stage-specific differences are statistically significant (p < 0.05) and helping users determine if protein expression changes progressively across disease stages or shows distinct patterns at specific stages of cancer development.
2.10.3 Mutation Classes
This tab evaluates how protein expression varies across mutation impact categories and how mutation classes influence mRNA–protein correlations.
Mutation impact groups:Protein Expression by Mutation Class (Pan-Cancer)
Purpose:Visualizes protein expression across cancers grouped by mutation impact level.
Plot Features:Allows users to assess whether specific impact classes (e.g., high-impact mutations) correspond to altered protein levels.

mRNA–Protein Correlation by Mutation Class (Pan-Cancer)
Purpose:Examines mRNA–protein concordance across mutation-defined sample groups.
Features:
Cancer-Specific: Mutation-Impact–Specific Protein Expression
This visualization shows protein expression differences within a selected cancer type grouped by mutation class, displaying a violin plot with mutation impact class on the x-axis and protein expression levels on the y-axis, where users can hover for statistical summaries and sample information and toggle impact classes using the legend. An accompanying statistical table provides pairwise comparisons between impact classes with columns showing Group 1, Group 2, p-value, significance level, and sample counts. This analysis helps identify whether high-impact mutation carriers show altered protein levels relative to other mutation groups, revealing whether mutations influence not only gene expression at the transcript level but also at the protein level, which may have more direct functional consequences for cancer phenotypes.
2.10.4 PTM Sites
This tab evaluates how post-translational modifications (PTMs)—specifically phosphorylation sites—modify the relationship between mRNA and protein expression.
mRNA-Protein Correlation by PTM Site (Pan-Cancer)
Purpose:Assesses how PTMs (e.g., pY1068, pY1173) influence mRNA–protein coupling across cancers.
Plot Features:
2.10.5 Survival
Purpose
This analysis evaluates whether total protein abundance or site-specific post-translational modification abundance is associated with patient survival across selected cancer cohorts.
When phosphorylation data are available, results are presented separately for:Comparing total-protein and PTM-site results can reveal site-specific prognostic associations that may not be apparent from overall protein abundance.
Analysis Workflow
Users first select one of three survival-analysis methods:For the Cure Model, users currently select only the cancer type. The survival endpoint is overall survival (OS), follow-up includes all available years, and abundance stratification is based on the median abundance of the analyzed protein feature.
Survival Endpoint
The available survival endpoints are:Endpoint availability may vary among cancer cohorts.
Follow-up Time
Users may analyze:For a 5-year analysis, patients who remain event-free beyond 60 months should normally be administratively censored at 60 months rather than excluded. The plot x-axis and risk estimates then represent outcomes during the first five years of follow-up.
Sample Stratification
For each total-protein or PTM-site feature, patients with valid abundance and survival data are divided into abundance groups before plotting the survival curves.
Median StratificationThe median abundance value among the eligible patients is used as the cutpoint.
Median stratification is simple, reproducible, and generally produces groups of similar size. However, it may miss an association when the biologically relevant threshold is not close to the median.
Best-Cutpoint StratificationA set of candidate abundance thresholds is evaluated, and the cutpoint that produces the strongest separation between the survival groups is selected.
The optimal threshold is commonly chosen by maximizing a survival-separation statistic, such as the standardized log-rank statistic, or equivalently by minimizing the corresponding log-rank p-value within an allowed range of cutpoints.
Patients are then classified as:Candidate cutpoints should be restricted so that neither group becomes too small. Because the same dataset is used to select and test the threshold, best-cutpoint results can overestimate effect size and statistical significance. These results should therefore be interpreted as exploratory and ideally validated in an independent cohort. If adjusted p-values are implemented for the cutpoint search, this should be stated explicitly.
Cox Uni
Cox Uni evaluates one molecular feature at a time without adjustment for clinical characteristics. The feature may be total protein abundance or the abundance of an individual PTM site.
The Cox model estimates the relative hazard for the High group compared with the Low group. A hazard ratio greater than 1 indicates a higher event rate in the High group, whereas a hazard ratio below 1 indicates a lower event rate.
Kaplan-Meier PlotThe Kaplan–Meier plot displays the observed survival probability over time for the High and Low abundance groups.
Greater separation between the curves indicates a larger difference in survival experience. The log-rank p-value tests whether the survival distributions differ between the groups.
The Kaplan–Meier plot is an unadjusted comparison and does not account for clinical covariates.
Cumulative Hazard PlotThe cumulative hazard plot displays the accumulated event hazard over time for the same High and Low groups.
A curve that rises more rapidly indicates faster accumulation of risk. Separation between the curves suggests different event rates between the abundance groups.
The cumulative hazard plot complements the Kaplan–Meier plot but should not be interpreted as the instantaneous hazard at a particular time point.
InterpretationA significant result suggests that the selected protein or PTM-site abundance is associated with the selected survival endpoint in an unadjusted analysis. It does not establish that the feature is independent of tumor stage, age, or other clinical factors.
Cox Multi (clinical)
Cox Multi fits a multivariable Cox proportional hazards model to evaluate the association between a protein or PTM feature and survival after accounting for available clinical covariates.
Depending on the cohort and data availability, covariates may include variables such as age, sex, tumor stage, grade, or other relevant clinical characteristics. The exact variables included should be listed with the results.
Kaplan-Meier PlotThe Kaplan–Meier plot displays the observed, unadjusted survival experience of the High and Low abundance groups.
Although it is shown alongside the multivariable analysis, the Kaplan–Meier curve itself does not adjust for clinical covariates. Clinical adjustment is provided by the Cox model and summarized in the forest plot.
If the application instead generates model-predicted curves from the multivariable Cox model, the figure should be labelled adjusted survival curve, rather than Kaplan–Meier plot.
Forest Plot The forest plot summarizes the hazard ratio and 95% confidence interval for:The dashed vertical reference line at HR = 1 represents no association. A confidence interval that crosses 1 indicates that the effect is not statistically distinguishable from no association at the corresponding confidence level.
InterpretationIf the protein or PTM abundance group remains statistically significant after clinical adjustment, the result supports an association with survival that is not explained by the clinical covariates included in the model.
This should be described as an independent association within the fitted model, rather than proof that the feature is biologically or causally independent. Residual confounding may remain, and results depend on the quality and availability of the clinical variables.
Cure Model
The cure model is intended for survival settings in which a proportion of patients may experience sustained long-term survival and no longer show the same event risk as the susceptible patient population.
Unlike a standard Cox model, a cure model can separately characterize:The term “cure” is statistical and does not necessarily indicate confirmed clinical eradication of disease.
Survival PlotsCure-model results are displayed for total protein and, when available, separately for each PTM site.
The plots may show:Separation between the High and Low curves suggests group-specific survival behavior. A plateau in the late portion of a survival curve may be consistent with a long-term-surviving fraction, although a plateau can also result from limited follow-up or few patients remaining at risk.
InterpretationDifferences in the long-term component suggest that abundance may be associated with the estimated long-term-surviving fraction. Differences in the short-term component suggest an association with event timing among susceptible patients.
Comparisons between total protein and individual PTM sites can identify site-specific long-term or short-term survival patterns that are not reflected by total protein abundance
2.11 Gene Multi-omics
2.11.1 Overview
The Gene Multi-Omics interface summarizes how multiple molecular layers—including gene expression, mutation, CNV, methylation, protein, and miRNA—converge to support the user-selected gene as a potential driver across cancers.
All results reflect outputs from multi-omics integration tools, providing a cross-layer view of biological evidence.
Each section offers complementary insights into how strongly the selected gene is supported as a multi-omics driver.
2.11.2 Integrated Multi-Omics Overview
This section aggregates driver evidence from multi-omics prediction tools and displays where (cancer types) and how (omic layers) the selected gene is supported, with visualization presented through three coordinated components that together provide a comprehensive view of multi-layered evidence.
Top-Right Bar Chart — Tool Support Across Cancers
This bar chart shows how many integration tools identified the selected gene as a driver in each cancer type, with users able to hover to view the cancer type and tool support count. Taller bars indicate stronger cross-tool agreement within that particular cancer, suggesting the gene is consistently recognized as a driver across multiple computational methods in those cancer types.
Bottom-Left Bar Chart — Tool Support Across Omics Layers
This bar chart summarizes the number of supporting tools for each omic type, including mutation, CNV, methylation, expression, and miRNA, with hover functionality displaying the omics type and tool count. This component highlights whether the gene is supported across multiple molecular mechanisms, revealing the diversity of dysregulation patterns affecting the gene.
Bottom-Right Combination Matrix — Cross-Omics Evidence
This matrix displays cancer types as rows and omics layers as columns, where each solid dot indicates that the gene is identified as significant in the corresponding omic type for that cancer dataset, with filled dots representing support by at least one integrative tool. Users can hover over dots to view cancer type, omics type, and evidence details. Dense dot patterns indicate broad multi-omics support across many cancer types, wide spread across omics columns suggests convergent dysregulation of the gene through multiple molecular mechanisms, and dense rows highlight specific cancers where the gene plays a multi-omics functional role with alterations spanning multiple biological layers.

2.11.3 Omics Connectivity Network
The Omics Connectivity Network illustrates relationships between the selected gene, omics layers, and cancer projects through an interactive network view.
Nodes represent different entities, including the selected gene, omics types, and cancer projects. Edges denote associations identified by integration tools, indicating links between the gene and specific molecular alterations or cancer contexts. Cancer projects belonging to the same cancer type are displayed using the same node color, helping users recognize related datasets across projects.
Users can interact with the network by clicking or hovering over nodes to explore their connected omics layers, cancer projects, and related associations. Nodes with more connections indicate a higher degree of multi-omics influence or cross-layer integration, suggesting that the selected gene may play a broader role across multiple molecular mechanisms or cancer contexts.

2.11.4 Multi-Omics Driver Event Table
This detailed table lists all driver events supported by multi-omics tools for the selected gene, providing comprehensive information about the molecular context and reproducibility of each driver event. High nTool values indicate strong cross-tool reproducibility where multiple computational methods independently identify the same driver event, while CGC/NCG = Yes indicates independent biological support of cancer relevance through inclusion in curated cancer gene databases. The presence of multiple omics layers for a gene demonstrates multi-layer dysfunction characteristic of driver genes, where alterations span genomic, epigenomic, and transcriptomic levels. This table allows users to quickly identify the specific omics drivers and cancer contexts in which the gene is strongly supported, enabling prioritization of the most robust and biologically relevant driver events for further investigation or therapeutic targeting.
Survival Map
The Survival Map displays the survival impact of the selected gene across multiple cancer types and four survival endpoints: overall survival (OS), progression-free interval (PFI), disease-free interval (DFI), and disease-specific survival (DSS). The map supports multiple omics data types, including RNA expression, mutation, copy number variation (CNV), and methylation.
Each heatmap cell represents a combination of cancer type and survival analysis method. For Cox univariate, Cox multivariate, and cure model analyses, a colored cell indicates that the selected molecular feature is significantly associated with survival based on the hazard ratio and p-value. The selected molecular feature may represent RNA expression level, mutation status, CNV status, or methylation level, depending on the selected omics type.
The color gradient represents the hazard ratio: red indicates a hazard ratio greater than 1, suggesting higher risk, while blue indicates a hazard ratio less than 1, suggesting lower risk. The direction of risk is interpreted relative to the omics-specific reference group used in the analysis — for example, high versus low RNA expression, mutated versus wild-type status, or copy number gain or loss relative to neutral status. For methylation data, the reference group depends on the grouping method: high versus low methylation level when using beta-value median stratification, or hypermethylated versus hypomethylated status when using MethylMix-based classification.
For machine learning–based results, a colored cell indicates that the selected gene was identified as survival-related by at least one of the machine learning algorithms — Lasso, Random Forest, or I-Boost. To determine which specific algorithm identified the gene, users can refer to the detailed results in the Survival Analysis section.
Hover over a colored cell to view detailed survival information, including cancer type, survival endpoint, analysis method, omics type, grouping or stratification method, hazard ratio, log-rank p-value, and cutoff or grouping value when available.
Grouping methods depend on the selected omics type. RNA expression is stratified using the best cutoff, which identifies the threshold that maximizes survival difference between groups, or the median cutoff. Mutation data are grouped by mutated versus wild-type status. CNV data are grouped using iGC or GISTIC-based copy number calls into gain, loss, or neutral categories. For methylation data, patients are grouped using beta-value median stratification, beta-value best cutoff stratification, or MethylMix-based classification.
Note that p-values for Cox univariate, Cox univariate 5-year, and machine learning analyses are calculated using the log-rank test, while p-values for Cox multivariate and Cox multivariate 5-year analyses are calculated using the Cox proportional hazards model.
The map includes multiple survival analysis approaches: Cox univariate regression, Cox multivariate regression adjusted for clinical covariates, cure model analysis, and machine learning–based survival analysis using Lasso, Random Forest, and I-Boost.
Available survival analysis methods vary by survival endpoint. For the OS endpoint, available results include Cox univariate regression, Cox univariate regression 5-year, Cox multivariate regression adjusted for clinical covariates, Cox multivariate regression 5-year adjusted for clinical covariates, cure model short-term effect, cure model long-term effect, and machine learning–based results. For the PFI, DFI, and DSS endpoints, available results include Cox univariate regression, Cox univariate regression 5-year, Cox multivariate regression adjusted for clinical covariates, Cox multivariate regression 5-year adjusted for clinical covariates, and machine learning–based results. Cure model results are available only for the OS endpoint.
References of all survival analysis methods, including the machine learning–based approaches, are available in FAQ4. Click a colored cell to open the corresponding Kaplan–Meier plot. If the selected cell represents a machine learning result, the detailed output opens in a new tab. For figure and table manipulation, please refer to FAQ3.
Survival Analysis
The Survival Analysis panel evaluates whether the selected gene's molecular features are associated with patient prognosis. Users first select a survival analysis type:References of all survival analysis methods, including the machine learning–based approaches, are available in FAQ4.
After an analysis type is selected, the available filters and result views update accordingly.
Depending on the selected analysis framework, users can choose relevant options such as cancer type, survival endpoint, survival time, stratification method, or machine learning algorithm. The resulting plots and tables help compare survival patterns, estimate risk differences, and identify molecular features or interactions associated with patient outcomes.
Cox Univariate
The Cox Univariate results section evaluates whether the selected molecular feature of the user-selected gene is associated with patient survival in a selected cancer type. The analysis is performed using univariate Cox proportional hazards regression and Kaplan–Meier survival analysis, without adjusting for any clinical covariates.
Users can select a cancer type, survival endpoint, survival time, and grouping method from the dropdown menus. The available survival endpoints include overall survival (OS), progression-free interval (PFI), disease-free interval (DFI), and disease-specific survival (DSS). The survival time options include all-time and 5-year analyses.
Patient grouping depends on the selected omics type. For RNA expression data, patients are stratified into high- and low-expression groups using either the median cutoff or the best cutoff, which identifies the expression threshold that maximizes the log-rank test statistic across all possible cutpoints to produce the most statistically significant survival separation between groups. For mutation data, patients are grouped by mutation status — mutated or wild-type. For CNV data, patients are grouped into copy number gain, loss, or neutral categories based on copy number status defined by either iGC or GISTIC. For methylation data, patients are grouped using beta-value median stratification, beta-value best cutoff stratification, or MethylMix-based classification. For beta-value stratification methods, patients are divided into high and low methylation level groups; for MethylMix-based classification, patients are divided into hypermethylated and hypomethylated groups.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
After the selections are made, two Kaplan–Meier survival plots are displayed. The left plot shows the unadjusted survival curves based solely on the selected omics-specific patient grouping, reflecting the univariate association between the molecular feature and survival. The right plot shows covariate-adjusted survival curves generated from a separate Cox model that accounts for available clinical covariates such as age, gender, stage, or other cohort-specific variables; this plot is displayed only when sufficient clinical covariate data are available for the selected cancer type and endpoint.
Each plot displays survival analysis results above the figure, such as hazard ratio and p-value. The x-axis represents survival time starting from the initial cancer diagnosis, and the y-axis represents survival probability.
Patient groups are defined according to the selected omics type:Users can hover over the curves to view detailed survival information at specific time points. Curves can also be shown or hidden by clicking the corresponding labels in the legend. For figure and table manipulation, please refer to FAQ3. For algorithm descriptions and references, please refer to FAQ4.
Cox Multivariate (clinical)
The Cox Multivariate results section evaluates whether the selected molecular feature of the user-selected gene is independently associated with patient survival after adjusting for available clinical covariates. The analysis is performed using multivariate Cox proportional hazards regression across multiple cancer types.
Users can select a cancer type, survival endpoint, survival time, and grouping method from the dropdown menus. The available survival endpoints include overall survival (OS), progression-free interval (PFI), disease-free interval (DFI), and disease-specific survival (DSS). The survival time options include all-time and 5-year analyses.
Patient grouping depends on the selected omics type. For RNA expression data, patients are stratified into high- and low-expression groups using either the median cutoff or the best cutoff, which identifies the expression threshold that maximizes the log-rank test statistic across all possible cutpoints to produce the most statistically significant survival separation between groups. For mutation data, patients are grouped by mutation status — mutated or wild-type. For CNV data, patients are grouped into copy number gain, loss, or neutral categories based on copy number status defined by either iGC or GISTIC. For methylation data, patients are grouped using beta-value median stratification, beta-value best cutoff stratification, or MethylMix-based classification. For beta-value stratification methods, patients are divided into high and low methylation level groups; for MethylMix-based classification, patients are divided into hypermethylated and hypomethylated groups.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
After the selections are made, the results display covariate-adjusted survival curves and a forest plot. The adjusted survival curves show model-estimated survival differences among patient groups defined by the selected omics-specific grouping method, after accounting for available clinical covariates such as age, gender, stage, or other cohort-specific variables. The x-axis represents survival time from initial cancer diagnosis, and the y-axis represents survival probability.
Patient groups are defined according to the selected omics type:Users can hover over the survival curves to view detailed survival information at specific time points. Curves can also be shown or hidden by clicking the corresponding labels in the legend.
The forest plot summarizes the hazard ratios and 95% confidence intervals for the selected molecular feature and all clinical covariates included in the multivariate Cox model. Each row represents one variable, with the point estimate indicating the hazard ratio and the horizontal line indicating the confidence interval. Values greater than 1 indicate higher risk and values less than 1 indicate lower risk relative to the reference group. The forest plot helps users compare the relative association of each variable with survival after mutual adjustment. Clicking on the forest plot opens a full-sized version for closer inspection.
For figure and table manipulation, please refer to FAQ3. For algorithm descriptions and references, please refer to FAQ4.
Cure Model
The Cure Model results section evaluates whether the selected molecular feature of the user-selected gene. The cure model estimates two types of survival effects: short-term and long-term effects. The short-term effect reflects the association between the selected molecular feature and survival time among patients who remain at risk of the event. The long-term effect reflects the association between the selected molecular feature and the probability of long-term survival, or the estimated cured fraction. Short-term and long-term p-values are reported to indicate whether the selected molecular feature is significantly associated with each component of the cure model. The short-term and long-term p-values are displayed above the plot alongside the other survival statistics.
Cure model results are available only for overall survival (OS) using all-time survival data. Patient grouping depends on the selected omics type. For RNA expression data, patients are separated into high- and low-expression groups using the median cutoff. For mutation data, patients are separated into mutated and wild-type groups. For CNV data, patients are grouped into copy number gain, loss, or neutral categories based on copy number status defined by either iGC or GISTIC. For methylation data, patients are grouped using either beta-value median stratification or MethylMix-based methylation states.
Users can select a cancer type from the dropdown menu to view the corresponding cure model results. Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
After a cancer type is selected, the estimated survival curves are displayed. The values calculated by the survival analysis are shown above the plot. The x-axis represents survival time starting from the initial cancer diagnosis, and the y-axis represents survival probability.
Patient groups are defined according to the selected omics type:Users can hover over the curves to view detailed survival information at specific time points. Curves can also be shown or hidden by clicking the corresponding labels in the legend. For figure and table manipulation, please refer to FAQ3. For algorithm descriptions and references, please refer to FAQ4.
Machine Learning
Machine Learning–based survival analysis identifies molecular features associated with patient survival using three algorithms: Lasso, Random Forest, and I-Boost. Each method builds a multi-feature prognostic signature, and patients are stratified into high- and low-risk groups based on their composite signature score. Results are presented as Kaplan–Meier survival curves and ROC curves evaluating the predictive performance of each signature.
Lasso (Least Absolute Shrinkage and Selection Operator)The Lasso results section displays significant survival signatures involving the user-selected gene across 33 cancer types. Lasso is a regression-based method that selects the most relevant survival-related molecular features by shrinking less informative gene coefficients toward zero. Features with non-zero coefficients are retained to build a prognostic signature.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
For figure and table manipulation, please refer to FAQ3. For detailed algorithm descriptions and reference links, please refer to FAQ4.
Random Forest
The Random Forest results section displays significant survival signatures involving the user-selected gene across 33 cancer types. Random Forest is a machine learning method that uses many decision trees to identify molecular features that help distinguish different survival outcomes. Features are ranked based on their contribution to prediction performance.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
For figure and table manipulation, please refer to FAQ3. For detailed algorithm descriptions and reference links, please refer to FAQ4.
I-BoostThe I-Boost results section displays significant survival signatures involving the user-selected gene across 33 cancer types. I-Boost is a boosting-based machine learning method that builds a survival prediction model by combining multiple weak predictors into a stronger signature. Features are selected and assigned coefficients based on their cumulative contribution to survival prediction boosting iterations.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
For figure and table manipulation, please refer to FAQ3. For detailed algorithm descriptions and reference links, please refer to FAQ4.
Synergistic survival analysis
The Synergistic Survival Analysis section evaluates whether the selected gene shows combined survival effects with related genes or molecular features from other omics layers. The analysis supports cross-omics interactions among RNA expression, mutation, copy number variation (CNV), and methylation, depending on the selected omics type and available data. Currently, synergistic survival analysis is available for overall survival (OS) only.
The result table lists detailed information for each synergistic survival interaction, including cancer type, interaction type, gene symbols, omics levels, hazard ratio, and p-value. Users can reorder the table by clicking on any column name. Clicking a gene–omics pair in the table generates the corresponding Kaplan–Meier survival plots below; clicking the same row again hides the plots.
Full cancer type names are available in the Dataset subsection of the Help page. Please note that survival analysis is performed only on TCGA datasets. In the visualizations, the -TCGA suffix is omitted from cancer type labels; therefore, when looking up full cancer type names in the Help section, please add the -TCGA suffix to the displayed cancer type abbreviation.
Abbreviation definitions
The group labels in the Kaplan–Meier plots represent the combined omics states of two genes or molecular features. The order of gene 1 and gene 2 follows the interaction type shown in the table.
For figure and table manipulation, please refer to FAQ3.
3.1 Overview
The Customized Analysis module enables researchers to perform user-defined analyses using clinical subgroups, gene features, and survival outcomes.
Unlike the Cancer and Gene modules—which summarize fixed results—Customized Analysis allows flexible, interactive, and hypothesis-driven exploration.
Below is detailed help for the Subgroup Comparison Analyses section.
3.2 Subgroup Comparison Analyses
Subgroup Comparison Analyses evaluate how gene-level molecular features differ across clinically defined patient subgroups.
Users define subgroups using any combination of clinical parameters (e.g., stage, grade, receptor status), and analyses are performed per selected gene and dataset.
Each analysis helps uncover biology associated with disease progression, risk groups, treatment response, or other clinically important factors.
3.2.1 Expression Subgroup Comparison
This analysis assesses whether gene expression varies across clinically defined patient subgroups within a cancer dataset.
Workflow
Output: Expression Comparison Across Clinical Subgroups
Violin Plot (log₁₀ TPM)Interpretation
Together, the violin plots and comparison table help determine whether gene expression differs meaningfully across clinical categories such as:
3.2.2 Mutation Subgroup Comparison
This analysis evaluates whether mutation frequency of the selected gene differs between two clinically defined patient groups.
Workflow
Output
Mutation Contingency TableThis table summarizes how mutation events are distributed across the two subpopulations.
Fisher’s Exact Test Statistics
3.2.3 CNV Subgroup Comparison
The CNV Subgroup Comparison evaluates whether copy number variation (CNV) patterns differ between two clinically defined patient groups.
Events are categorized as Gain, Loss, or None (neutral).
Workflow
Output
CNV Contingency Tables (Four Comparisons)Interpretation
This analysis reveals whether the selected gene exhibits different CNV profiles across clinical subpopulations—for example:
3.2.4 Methylation Subgroup Comparison
The Methylation Subgroup Comparison evaluates whether DNA methylation levels (β-values) differ across clinically defined patient subgroups.
Workflow
Output
Methylation Violin Plot (β-values)Interpretation
This analysis helps determine whether epigenetic regulation of the gene differs across patient groups—for example:
3.3 Survival Analyses
Survival Analyses evaluate how mutations, gene expression levels, or miRNA expression levels influence patient outcomes within a user-defined subpopulation. Users define the patient cohort via clinical criteria and select stratification methods. This section begins with Mutation-Based Survival Analysis, followed by Expression-Based Survival Analysis and miRNA-Based Survival Analysis.
3.3.1 Mutation-Based Survival Analysis
The Mutation-Based Survival Analysis assesses whether mutations in the selected gene list are associated with survival differences in a clinically defined subpopulation.
Workflow
These two options control both the survival table and the Kaplan–Meier plots below.
Output
A. Mutation Oncoprint
This visualization provides a visual summary of mutation patterns across patients, with rows representing genes from the input list, columns representing individual patients, and cells color-coded by mutation impact where red indicates high impact, blue indicates moderate impact, and additional colors are used as applicable. Side panels provide complementary information: the left panel displays the percentage of mutated samples for each gene, while the top panel shows mutation burden or impact summary, often displayed as a combination impact score (e.g., 0–2). This visualization quickly shows which genes are frequently mutated and how mutation profiles vary across patients within the selected cohort.
B. Survival Control Panel
Located directly above the survival table and Kaplan–Meier plots, this panel includes dropdown menus for selecting the stratification method (Mutation vs. Wild type or By number of mutated genes) and time interval (All follow-up or 5-year survival). Changing these settings immediately updates the KM curves to reflect the selected analysis parameters.
C. Survival Statistics Table
This comprehensive table summarizes survival analysis results for each cancer type, gene or gene set, and survival endpoint combination. Key information includes the cancer type abbreviation, the gene(s) evaluated under the selected stratification, the outcome analyzed (OS for Overall Survival, PFI for Progression-Free Interval, DFI for Disease-Free Interval, DSS for Disease-Specific Survival), the stratification method used, which groups serve as the comparison factor versus reference, log-rank and Cox p-values for both all follow-up time and the 5-year interval, hazard ratios and their log2 transforms for both time periods, and sample counts for mutated and wild-type groups. HR values greater than 1 indicate the mutated group has worse prognosis, HR values less than 1 indicate the mutated group has better prognosis, and p-values less than 0.05 indicate significant survival differences between groups.
D. Kaplan–Meier (KM) Survival Plots
For every analysis, four Kaplan–Meier plots are generated—one for each survival endpoint: OS (Overall Survival), PFI (Progression-Free Interval), DFI (Disease-Free Interval), and DSS (Disease-Specific Survival). The KM curves automatically update based on the selected stratification method (Mutation vs. Wild type or Number of mutated genes) and time interval (All follow-up or 5-year survival). Each KM plot includes color-coded survival curves for the selected groups, survival probability over time in months, log-rank test p-value, and hover interaction to view timepoint-specific survival values. These four KM plots allow users to visually compare survival differences across mutation-defined groups for all major survival outcomes, with the multi-endpoint output being particularly useful for identifying consistent trends or endpoint-specific associations across different measures of patient prognosis.

3.3.2 Expression-based Survival Analysis
The Expression-Based Survival Analysis evaluates whether gene expression levels are associated with survival outcomes (OS, PFI, DSS, DFI) in a user-defined patient subpopulation, allowing users to stratify patients based on gene expression and examine survival differences across clinical groups.
Workflow
Output
A. Stratification Control Panel
B. Survival Table
This table summarizes survival statistics for each gene, cancer type, and survival endpoint. Key metrics include hazard ratios (HR) and p-values for both all follow-up and 5-year intervals, expression cutoff thresholds, stratification methods, and sample counts for high and low-expression groups. HR > 1 indicates high expression is associated with worse prognosis, HR < 1 indicates better prognosis, and p < 0.05 indicates significant survival differences.
C. Kaplan-Meier (KM) Survival Plots
For each survival outcome, KM curves visualize survival differences between expression-defined groups, with one curve per group (High vs Low, All-high vs Others, etc.), log-rank p-value and HR displayed on the plot, and curves automatically updating based on cutoff method, grouping method, and time interval (5-year vs all follow-up). These plots show whether expression differences translate into clinically meaningful survival divergence and help users assess the prognostic value of the selected gene(s) in the filtered patient cohort.
D. Boxplots of Gene Expression
These boxplots summarize the expression distributions of the selected gene(s) within the filtered cohort, displaying TPM or log10(TPM) values with hover functionality to view sample-level values and summary statistics including median, Q1, Q3, fences, minimum, and maximum. This visualization helps confirm that expression-defined patient groups are meaningfully different in terms of gene expression levels before analyzing survival outcomes, ensuring that stratification produces biologically distinct groups for comparison.

3.3.3 miRNA-based Survival Analysis
Workflow
Output
A. Cox Uni Results (Univariate Cox Analysis)This analysis displays two complementary visualizations when Cox Uni is selected and dropdown menus are configured. The Kaplan–Meier (KM) survival plot shows survival probability (y-axis) over time in months (x-axis) for miRNA-defined groups (e.g., High vs Low), allowing users to compare curve separation between groups to assess outcome differences and use the reported log-rank p-value to evaluate whether the group difference is statistically supported. The cumulative hazard plot shows cumulative hazard (accumulated risk of the event) over months for the same stratified groups, where steeper curves indicate risk accumulating more quickly and can be used alongside the KM plot to view group differences in terms of risk accumulation rather than survival probability. Clear separation between High versus Low groups suggests the miRNA is associated with prognosis without clinical adjustment, indicating a univariate association between miRNA expression and patient outcomes.
B. Cox Multi (Multivariate Cox Analysis with Clinical Adjustment)This analysis displays two complementary visualizations when Cox Multi is selected and dropdown menus are configured. The adjusted survival curve shows model-predicted survival probability (y-axis) over months (x-axis) for miRNA-defined groups after adjusting for available clinical covariates in that cohort, allowing users to assess whether group separation persists after clinical adjustment and provides evidence that the miRNA offers prognostic information beyond standard clinical variables within the available covariates. The forest plot displays hazard ratios (HRs) with 95% confidence intervals for the miRNA group term (e.g., High vs Low) and each included clinical covariate that was automatically selected based on cohort availability, where HR > 1 indicates higher risk (worse outcome), HR < 1 indicates lower risk (better outcome), a dashed vertical line at HR = 1 indicates no effect, and wider confidence intervals indicate greater uncertainty. A significant miRNA-group HR after adjustment suggests the miRNA is an independent prognostic factor given the included covariates, providing evidence that miRNA expression contributes prognostic value beyond traditional clinical variables.
C. Cure ModelThis analysis displays a cure-model survival curve when Cure Model is selected and dropdown menus are configured, showing cure-model–estimated survival probability (y-axis) over months (x-axis) for miRNA-defined groups. Users can compare curves to evaluate group-specific outcome differences under a model designed to capture long-term survival patterns and look for late-time plateaus that can reflect sustained survival behavior. Group separation indicates prognostic differences in a framework designed for long-term survival dynamics, which can be particularly informative when standard proportional hazards assumptions may not fully reflect the data and when a subset of patients may experience extended disease-free survival.
3.4 Multi-omics Driver Analysis
Overview
The Multi-omics Driver Analysis identifies driver events that differ between two clinically defined patient groups, integrating:It highlights which genes and pathways are most likely driving group differences (e.g., responders vs non-responders, early vs late stage) and how consistently they are supported across omics types and tools.
Use this analysis when you want to understand which genomic and epigenomic alterations underlie clinical subgroup differences.
Workflow
Output
A. Multi-layer Driver-Function Relationship Diagram & Driver Summary TableThis network-like diagram connects the selected cancer dataset, omics layers (mRNA, Mutation, CNV, Methylation), driver genes, and functional/pathway terms such as GO terms to illustrate the relationships between molecular alterations and biological functions. The structure flows hierarchically: the cancer node connects to each omics node (mRNA, Mutation, CNV, Methylation), each omics node connects to driver genes identified in that layer, and driver genes connect to GO term or function nodes representing enriched pathways or processes. Users can trace paths from clinical groups through omics alterations to driver genes and finally to biological functions, identifying which omics layers contribute most to observed group differences, which driver genes are shared across omics layers, and which biological functions and pathways are most impacted by these alterations.
Driver Summary TableThis table lists all identified drivers with their cancer type/dataset, omics layer (mRNA, Mutation, CNV, Methylation), driver gene symbol, Cancer Gene Census (CGC) status, Network of Cancer Genes (NCG) status, integration method that detected the driver, number of tools supporting this driver event (nTools), and associated Gene Ontology terms indicating pathways or functions. Higher nTools values indicate stronger cross-tool evidence for a driver event, CGC/NCG = Yes provides additional external support as a known cancer gene, and GO_term entries reveal potential biological roles and affected pathways. Together, the diagram and table summarize how multi-omics drivers are identified and what functional roles they may play in cancer biology.

This section helps evaluate how strongly each driver is supported across omics layers and computational methods through two complementary visualizations.
Omics-by-Gene Tool Support Heatmap (Left)This heatmap displays genes as rows and omics types (mRNA, Mutation, CNV, Methylation) as columns, with each cell value representing the number of tools that identified that gene as a driver in that specific omic layer. Users can hover on cells to see the exact tool count for each gene-omic combination, enabling identification of robust multi-omics drivers including genes with high support across multiple omics layers and genes supported by many tools in at least one omic category. This visualization helps prioritize genes based on the breadth and depth of computational evidence supporting their driver status.
Tools per Gene by Omics Bar Plot (Right)This bar plot displays each driver gene with bar height representing the total number of tools that support that gene as a driver, with color coding indicating the contributions from different omics layers (mRNA, Mutation, CNV, Methylation). Users can compare tool support between genes to identify the most robustly detected drivers and use the legend to toggle specific omics types on or off, allowing focused examination of particular molecular layers. Genes with high tool support across several omics types are high-confidence multi-omics drivers, as convergent evidence from multiple computational methods and molecular mechanisms strengthens the reliability of their identification as functionally important cancer genes.

This section focuses on tools rather than genes, evaluating how comprehensively tools cover omics layers and how consistent their driver identifications are across methods.
Omics-by-Tool Coverage Heatmap (Left)This heatmap displays tools as rows and omics types as columns, with each cell showing the proportion of drivers in each omic layer detected by each specific tool. Users can examine which tools have broad coverage across multiple omics types versus those with more selective detection patterns focused on particular molecular layers, and identify tools that contribute most substantially to driver identification within a specific omics type. This visualization reveals the complementary nature of different computational approaches and helps users understand which tools are most effective for detecting drivers in each molecular context.
Tool Overlap Distribution Plot (Right)This bar chart displays the number of tools (x-axis) versus the number of genes detected by that many tools (y-axis), revealing the degree of consensus among computational methods in driver identification. Genes detected by multiple tools are generally more reliable as convergent evidence from independent methods strengthens confidence in their driver status, while a right-shifted distribution with more genes supported by many tools suggests strong cross-tool consistency in the analytical pipeline. This plot helps users assess the overall reproducibility of driver detection and identify which genes have the most robust computational support across the integrated multi-omics framework.

3.5 Prognostic Signature Identification
3.5.1 Overview
The Prognostic Signature Identification analysis constructs a survival-predictive gene signature from a user-provided gene list. Using LASSO (Least Absolute Shrinkage and Selection Operator) and Random Forest models, the analysis identifies survival-associated genes, builds a multigene risk-score model, and evaluates its predictive performance through survival statistics, ROC curves, risk stratification plots, and feature-selection diagnostics.
Use this analysis if you already have a candidate gene list and want to determine:3.5.2 Workflow
1. Input a Gene List
Users may provide a list of candidate genes using either method:These genes will be used to identify survival-related markers and construct the prognostic signature.
2. Select Dataset Settings
After submitting the gene list, users configure all analysis settings on the results page.
Select a TissueFrom the filtered list, select a TCGA (or other) cancer dataset for training the prognostic signature model.
3. Select Data Type(s) for Model Construction
Users may choose one or multiple omics types used for signature construction:Selected data types define which molecular features contribute to the LASSO and Random Forest survival models.
4. Select a Survival Endpoint
Choose the patient outcome to be modeled:The endpoint determines how prognostic performance is evaluated.
5. Define Patient Subpopulation (Clinical Criteria Filter)
Filter the dataset to analyze a specific patient subpopulation by applying one or more clinical criteria.
Each criterion includes multiple groups with sample counts.
Users may combine multiple criteria to define a precise analysis cohort.
3.5.3 Results Overview
3.6 Multivariate Survival Analysis
3.6.1 Overview
In the Multivariate Survival Analysis, over one hundred clinical factors are available for selection to construct a comprehensive prognostic model. If you are interested in comparing specific candidate gene(s) with well-known clinical prognostic biomarkers, this analysis will construct a multivariate model with customized clinical factors in the CoxPH framework. The generated report includes corresponding statistical results, Kaplan–Meier plots, and point-estimated values of all factors displayed in a forest plot.
3.6.2 Workflow
1. Input a Gene List or Signature list
Users choose the input type—either gene name or signature—and provide their list of candidate genes by typing or pasting gene symbols directly into the text box. These genes will be used to identify survival-related markers and construct the prognostic signature for the selected cancer cohort.
2. Select Dataset Settings
After submitting the gene list, users configure all analysis settings on the results page.
Select a TissueChoose a broad tissue category (e.g., Breast, Lung, Colon).
This filters the available cancer datasets.
From the filtered list, select a TCGA (or other) cancer dataset for training the prognostic signature model.
3. Select Confounding Factors
This section determines how clinical variables are handled in the survival model.4. Select Data Type(s) for Model Construction
Users may choose one or multiple omics types used for signature construction:Selected data types define which molecular features contribute to the LASSO and Random Forest survival models.
5. Select a Survival Endpoint
Choose the patient outcome to be modeled:The endpoint determines how prognostic performance is evaluated.
6. Define Patient Subpopulation (Clinical Criteria Filter)
Filter the dataset to analyze a specific patient subpopulation by applying one or more clinical criteria.
Each criterion includes multiple groups with sample counts.
Users may combine multiple criteria to define a precise analysis cohort.
3.6.3 Results Overview
The analysis generates comprehensive results for each selected data type, including a summary table with statistical metrics, Kaplan–Meier plots showing survival curves for risk-stratified groups, and forest plots displaying hazard ratios with confidence intervals for all genes and clinical factors included in the final multivariate model.
Copyright© 2010-2025. All Rights Reserved. ©版權所有. Ver. 1.00.003未經允許請勿任意轉載、複製或做商業用途