Expression

Overview

This facet comprises three tabs, allowing users to explore the expression landscape of 3,432 RNA-Seq fresh frozen tumor samples (1,389 blood tumors, 888 solid tumors, and 1,155 brain tumors) using a t-SNE plot (Figure 1), gene expression violin plots organized by subtype for a gene of interest (Figure 2), gene expression overlayed on the t-SNE, or collectively within a data matrix.

tsne initial screen

tsne sample view

Figure 1: t-SNE for Blood, Brain, and Solid Samples. Mouse over data points to access metadata details for each sample. Visualization powered by D3.

gene violin plots

Figure 2: Gene Expression for MYCN. Gene expression violin plots for each sample, filtered by the gene of interest. Visualization powered by Plotly.

Note
  • All samples use the hg38 reference genome.
  • Full metadata can be accessed through our manifest.

Features for the t-SNE Plot

FeatureDescription
Subtype CategorizationSubtypes are color-coded, and a subset is labeled on the plot. These can be turned off in the 3 dot menu.
Sample SummaryClicking a data point opens a drawer with metadata and sample details.
FiltersFilters are categorized by Tumor Sample, Patient Phenotype, and Sample Preparation.
Sample SearchSearch by individual or bulk (comma-separated) sample IDs. CompBio IDs must be exact.
Lasso ToolSelect a region on the plot to retrieve a list of samples for further investigation.
Pan/ZoomZoom in or pan to examine specific regions of the plot. This will disable subtype labels.

tsne features overview

Warning
Filtering by the sunburst will auto-populate the Root and Subtype filters. These can be manually edited but will not update the sunburst.

Features for Gene Expression

FeatureDescription
Gene SandboxViolin plots for the gene of interest, filtered by root and subtypes.
Plotly FunctionsPan and zoom features on the right side of the gene sandbox do not affect filter components.
Median SortSort the gene expression sandboxes by median expression across or within individual groups.
Outlier ToggleToggle off data points to keep outliers intact for the cohort currently being filtered.

For data normalization details, refer to our Methods and Data page.

violin plots


Gene Expression Overlay on t-SNE

Users can overlay gene expression on the t-SNE plot by selecting genes of interest. Count data is normalized using Median of Ratios (MoR). More details can be found on the Methods and Data page.

gene expression toggle


Features for the Data Matrix

The data matrix displays all filtered data with sortable headers for easier exploration.

data sortable columns


Filters Explained

Tumor Sample

FilterDescription
Sample IDSearch by individual or bulk St. Jude CompBio IDs (comma-separated). Allows multi-select.
Subtype RootCustom-select a root to prompt applicable subtypes. Heme is defaulted upon loading the facet unless the sunburst is employed.
SubtypeCustom-select subtypes to view on the plot. Parent node selection enables or disables child nodes.
Subtype BiomarkerMulti-select subtype biomarkers to apply on the plot. General genes like "CTNNB1" are not accepted; users must select biomarkers from dropdown.
Sample TypeMulti-select dropdown for sample types.

Patient Phenotype

FilterDescription
SexMulti-select dropdown for biological sex.
Age at DiagnosisAdjustable scale or manual input for age in years.
RaceMulti-select dropdown for race.
EthnicityMulti-select dropdown for ethnicity.

Sample Preparation

FilterDescription
Library Selection ProtocolMulti-select dropdown for library protocol types.
PreservativeMulti-select dropdown for sample preservative types.
Warning
Some fields may have a "Not Available" option for samples where the data wasn't recorded (e.g., Race, Ethnicity, Sex).
Tip
For a subset of this data, refer to Figure 4f of McLeod et al.

To see how the data was calculated and normalized, visit our Methods and Data page.