I. Tutorial & Method

Browse

In the page Browse, users can choose one cancer type or normal tissue to view immune cell clusters with certain context. Similarly, users can click the options in table below with information button of data set.

Search

In the page Search, there are 4 sections for users to search scRNA-seq dataset, context-specific reference, marker gene and immune cell type by entering letters within cancer names and obtaining a list of possible names or choose data directly.

Tool

In the page Tool, users can submit their single-cell RNA-seq data to explore the immune cell types by selecting some interested prediction methods. Through the integrated project, we give all the results belonging to every method and the optimal annotations. Following pages would be loaded in order.

- Data upload. Users should submit a single-cell expression profile(.txt or .csv) and provide E-mail address to create task ID. As some integrated method would take a long time, we will send a message when these computations are done. To get an integrated result, users should choose some interested methods, with specifying cancer types or normal tissue and specifying whether to predict unknown, and submit.

- Quality control. After data uploading and method selection, users can define quality control conditions according to data distribution by adjust the right sliders and GO.

- Analysis results. Results of each method would be displayed in order and users can check them by clicking the button above. Only selected methods and the integrated results are available. Note that if different contexts are chosen, there would be two ensemble results.

Results of Annotation

ImmCluster provided four major analytic modules that allow users to interactively explore the annotations of immune cells.

I) Clustering and annotating the immune cell clusters

Firstly, two immune cell clustering plot, which represent clusters and annotated immune cell types of a certain method separately. Then, the annotating results for each cell can be check and downloaded in the following table and the barplot below gives the number of cells belonging to different cell types.

II) Gene expression of markers across cell clusters

Using COSGR, a method to rapidly identify highly expressed genes in single cell data, we provide a list of top 15 genes within each immune cell type ranked by the COSGR scores, which could also be downloaded. And the expression level of each gene can be selected to be shown in a violinplot. All top 15 genes’ expression were shown in a whole heatmap.

III) Functional assignment of the cell clusters

We calculate function scores of each immune clusters for cancer hallmarks, cell states and immune pathways through ssGSEA as shown in the heatmap plot. Asterisk indicates the significance of overlaps between the top 100 marker genes and the signatures of certain function, which would shown in the table.

IV) Cell-cell communications

We infer cell-cell communication based on the expression of ligands and their coupled receptors in scRNA-seq datasets by using three methods including iTALK, celltalker and ICELLNET. A table shows the communication score of each pair of ligand-receptor interactions occurred in two cell types. An overview of cell-cell communication and strong interactions ranked by the score will be displayed in a circle plot and a river plot, respectively.

Reference atlas annotation

Annotations of immune clusters from the page Browse

Method

scRNA-seq data collection

We used a list of keywords ((scRNA seq OR single-cell RNA-seq) AND (cancer OR carcinoma OR adenocarcinoma)) to search datasets from GEO, 10X Genomics, and NGDC platform, and obtained 60 datasets of human single cell RNA-seq transcriptomes. In addition, for the datasets which were already annotated in the original publications, only cells identified as immune cell types were included. If the original publications not described, we only retain cells expressing PTPRC, a general marker of immune cells. Through literature review, the immune cell types in brain cancer are different from those of other solid tumors. After removing the data of glioma and following strict quality control, a total of 986,589 immune cells were remained, including 25 immune cell types from 17 cancer types and 49 immune cell types from 9 normal tissues.

scRNA-seq data filtering

After quality control of each scRNA-seq data, samples with the number of cells that were less than 50 or datasets with the number of cells that were less than 1000 were deleted. Besides, we filtered datasets or samples that seriously affect the effectiveness during data integration and batch effect correction.

Marker atlas developing *

1.Gene is detected in at least 3 cells at last 3 counts across all cells.

2.Gene in statistically significantly higher in expression in this cluster compared to the complement set. To establish significance, we used a two-tailed Mann-Whitney U test with multiple hypothesis correction,FDR < 5%.

3.Gene has maximal average expression in this cluster.

4.Gene satisfies the max-to-second-max ratio is at least 1.1x.

ensemble integration

We develop an integrating project to annotate immune cell types with a label matrix based on voting mechanism.

For each cell, the selected methods would give annotation results which are gathered and labeled. According to the frequency of immune cell types, we record the one which has the highest credibility with a 'real' label. When more than two label a cell hold, we define it as 'unknown'. Similarly, the cells with different immune lineages are also labeled as 'unknown'.


* : Zilionis R, Engblom C, Pfirschke C, Savova V, Zemmour D, Saatcioglu HD,et al. Single-Cell Transcriptomics of Human and Mouse Lung Cancers Reveals Conserved Myeloid Populations Across Individuals and Species. Immunity (2019) 50(5):1317–1334.e10. doi: 10.1016/j.immuni.2019.03.009

Table

Cancer

Short form Full name
BCCBasal cell carcinoma
BRCABreast invasive carcinoma
CHOLCholangiocarcinoma
CRCColorectal adenocarcinoma
CTCLCutaneous T cell lymphomas
HNSCHead and Neck squamous cell carcinoma
KIRCKidney renal clear cell carcinoma
LIHCLiver hepatocellular carcinoma
NSCLCNon-small cell lung cancer
PAADPancreatic ductal adenocarcinomas
SKCMMelanoma
UVMUveal melanoma
MESOMesothelioma
HGSOChigh-grade serous ovarian cancer
PTCpapillary thyroid carcinoma
GCgastric cancer
PCaprostatic adenocarcinoma

Tissue

Short form Full name
LNGLungs
LIVLiver
OMEOmentum
SKMSkeletal muscle
TCLTransverse colon
DUODuodenum
JEJJejunum
CAECaecum
ILEIleum
SCLSigmoid colon
BLDBlood
THYThymus
JEJEPIJejunum-epithelial
JEJLPJejunum-lamina propria
PBMCPeripheral blood mononuclear cells
LLNLung-draining lymph nodes
MLNMesenteric lymph nodes
SPLSpleen
BMABone marrow

Cell type

Short form Full name
B-LymB lymphocyte
Plasmaplasma cell
B-Nainaive B cell
B-Regregulatory B cell
B-Memmemory B cell
B-Folfollicular B cell
GC Bgerminal center B cell
GC B-DZgerminal center B cell in the dark zone
GC B-LZgerminal center B cell in the light zone
NKnatural killer cell
T-LymT lymphocyte
T-Nainaive T cell
T-Proproliferative T cell
T-TRMtissue-resident memory T cell
T-EMeffect memory T cell
NKTnatural killer T cell
T-gdgamma delta T cell
CD4+ TCD4+ T cell
CD4+ T-Nainaive CD4+ T cell
CD4+ T-Memmemory CD4+ T cell
Th1T helper type 1 cell
Th2T helper type 2 cell
Th17T helper type 17 cell
T-FHfollicular T helper cell
T-Regregulatory T cell
CD8+ TCD8+ T cell
CD8+ T-Nainaive CD8+ T cell
T-Cytcytotoxic T cell
CD8+ T-Effeffector CD8+ T cell
CD8+ T-Memmemory CD8+ T cell
T-Exhexhausted T cell
Myeloidmyeloid cell
Monocytemonocyte
Macrophagemacrophage
Mac-M1M1-type macrophage
Mac-M2M2-type macrophage
DCdendritic cell
cDC1conventional type 1 dendritic cell
cDC2conventional type 2 dendritic cell
aDCactive dendritic cell
pDCplasmacytoid dendritic cell
Neutrophilsneutrophil
MASTmast cell
Progenitorprogenitor
Erythroiderythroid
Cyclingcycling cell
Pre-Bprecursor B cell
Pro-Bprogenitor B cell
GC B-IGC B-I
GC B-IIGC B-II
Plasmablastsplasmablast
ABCage-associated B cell
NK-CD16+CD16+ natural killer cell
NK-CD56bright-CD16-CD56bright CD16- natural killer cell
T&NK-CycCycling T&natural killer cell
T-CD4;CD8CD4+ T cell;CD8+ T cell
T-Eff;EM-CD4effector T cell;effect memory CD4+ T cell
T-EM;EMRA-CD8effect memory T cell;EMRA CD8+ T cell
T-Nai;CM-CD4naive T cell;center memory CD4+ T cell
T-Nai;CM-CD4-Actnaive T cell;activated center memory CD4+ T cell
T-Nai;CM-CD8naive T cell;center memory CD8+ T cell
T-RM;EM-CD8tissue-resident memory T cell;effect memory CD8+ T cell
TRM-gut-CD8tissue-resident memory_gut_CD8 T cell
TRM-Tgdtissue-resident memory_gamma delta T cell
TRM-Th1;Th17tissue-resident memory_T helper type 1 cell;T helper type 17 cell
Tgd-CRTAM+CRTAM+ gamma delta T cell
MAITmucosal associated invariant T cell
T;BT lymphocyte;B lymphocyte
MNP;BMNP;B lymphocyte
MNP;TMNP;T lymphocyte
Megakaryocytesmegakaryocyte
Mon-Clsclassical monocyte
Mon-nonClsnon-classical monocyte
Mac-Eryerythrophagocytic macrophage
Mac-Alvalveolar macrophage
Mac-IntMedintermediate macrophage
Mac-Intintestinal macrophage
DC1conventional type 1 dendritic cell
DC2conventional type 2 dendritic cell
migDCmigratory dendritic cell
ILC3type 3 innate lymphoid cell
Statistics

1.Normal tissues from healthy donor

1.1 No. of immune cells

1.2 No. of samples

1.3 No. of marker genes

2.Cancer types

2.1 No. of immune cells

2.2 No. of samples

3.3 No. of marker genes


II. Q & A

How to view tSNE and UMAP plots of background data ?

Users can get a tSNE and UMAP plot for visualizing clustering results of normal cells or cancer cells by clicking on any cancers and tissues in Browse.

Why use ImmCluster to annotate immune cell types ?

As we known,a comprehensive annotation for immune cell types after clustering are often elusive and inaccurate. What calls for special attention is that the canonical marker genes for each immune cell type are limited. Here, we collected multiple sets of cancer and normal single-cell RNA-seq transcriptome from different tissues and organs and identified multiple cell types through clustering and manual inspection. Through integrating a variety of calculation methods based on reference or marker genes for annotating the cell types of the data uploaded by users, we will map them to the most likely cell types.

How can I get the immune cell annotation results ?

After uploading a scRNA-seq expression profile and defining the parameters , users can check and download all the annotation results as long as subsequent analysis in Annotation-Analysis results.

Can I choose the interested immune cell types for the subsequent analysis ?

Users can specify one or more interested calculation methods based on reference or marker genes by choosing them in Annotation-Data upload-Choose method . When the computations have been done, the results of most suitable prediction method for the uploaded data according to the annotation results could be chosen.

What is the difference between cancer context and normal context ?

Considering the heterogeneity of tumor microenvironment and normal tissue, we characterized multiple independent reference profiles and identified specific marker atlas of immune cell types for different context.

Can I upload a bulk RNA-seq expression data ?

ImmCluster is a annotation tools for single-cell RNA-seq data, users can not provide a bulk RNA-seq expression data to ensure the accuracy of the results.

Another question.

If Users have other question, email bioinformatics2021@163.com We will give an answer as soon as we can.