Chapter 2 Introduction
2.1 Introduction of acute leukemia
Acute leukemia (AL) is a malignant disorder of hematopoiesis that arises from the clonal proliferation of leukemic blast cells. The presentation and progression of leukemia can vary greatly among different subgroups of patients. Additionally, within the same patient, leukemia cells with different characteristics can coexist. There are two distinct models for understanding cellular heterogeneity and cell-of-origin of acute leukemia: one is the genomic model, and the other is the stem-cell model. The genomic model of acute leukemia is based on the identification of genomic lesions, such as recurrent somatic mutations or fusions. These mutations/fusions are involved in several categories, including transcriptional regulation, signaling pathways, and epigenetic modifiers. However, although the genomic model can explain much of the heterogeneity in leukemia, even cells with the same mutation may have different functions, so the explanatory power of the genomic model is limited.
Another classic model is the hematopoietic stem cell (HSC) model, which suggests that leukemia is caused by a small population of self-renewing and differentiation-arrested stem cells (also known as leukemia stem cells, LSC). These leukemia stem cells possess characteristics similar to normal hematopoietic stem cells, such as the ability to self-renew and produce different types of other blood cells, while other leukemia cells are their differentiated progeny. According to this model, leukemia is a hierarchical disease in which leukemia stem cells are located at the top of the hierarchy, while more differentiated leukemia cells are at the bottom. The hematopoietic stem cell model provides a better explanation for leukemia heterogeneity, as it can explain why different types of leukemia cells can exist even within the same patient.
The differentiation trajectory of hematopoiesis in a healthy human body is extremely complex and crucial for understanding the mechanism of leukemia. In the past, methods such as bone marrow tissue sectioning and cell culture were commonly used to understand the differentiation process of hematopoietic cells. However, these methods still have limitations, such as the inability to comprehensively and accurately reflect the phenotypic and functional characteristics of hematopoietic cells at different developmental stages. In recent years, with the development of sequencing technologies, especially single-cell sequencing, new insights has been brought into the hematopoietic cell differentiation exploration.
2.2 What can HematoMap do
Here we present HematoMap, a comprehensive R package that can provide a visual representation of the cellular hematopoietic hierarchy of human bone marrow. It is a mapping tool that integrates information from single-cell based gene expression data and generate a unified model of hematopoietic cell differentiation. HematoMap allows researchers to explore the relationships between different hematopoietic cell types (such as HSC/MPP, CML and CLP) and interpret normal hematopoiesis cellular compositions. In this HematoMap project, by re-collecting, re-analyzing, and re-annotating single-cell RNA sequencing (scRNA-seq) data of normal bone marrow mononuclear cells (BMMCs), we constructed a hierarchy-based reference to interpret normal hematopoiesis cellular compositions. A likelihood score was also developed to comprehensively and accurately project leukemic blast cells onto the cellular hierarchy of normal hematopoietic differentiation. The identification of the cell-of-origin and resolution of heterogeneity of acute leukemia is an important long-term goal, as it may provide critical insights into the pathogenesis of this disease and inform the development of novel therapeutic strategies.
Using HematoMap, you can do:
1 Visual representation of the cellular composition of normal human BMMCs.
2 Visual representation of cell blockage of acute leukemia.
3 Lineage aberrancy inference using scRNA-seq of acute leukemia.
4 Lineage aberrancy inference using bulk RNA-seq of acute leukemia.
2.3 Data resources in HematoMap
The human hematopoietic hierarchy in bone marrow consists of a hierarchical system of cells that differentiate from hematopoietic stem cells (HSCs) to mature blood cells. The hierarchy includes many kinds of progenitors such as multipotent progenitor cells (MPPs), common lymphoid progenitor cells (CLPs), common myeloid progenitor cells (CMPs), megakaryocyte-erythroid progenitor cells (MEPs), and granulocyte-macrophage progenitor cells (GMPs). These progenitor cells differentiate further into more specialized cells, such as erythrocytes, megakaryocytes, granulocytes, monocytes, and lymphocytes. In HematoMap, we provided detailed annotation of hematopoietic cell differentiation based on single-cell data at different hierarchy levels.
The annotation of healthy BMMCs could be described as four levels:
Level 1: Lineages. HSPCs, Myeloids, Bcells, T/NKcells.
Level 2: Populations. HSPCs, Monocytes, Dendritic cells, Erythrocytes, pro B cells, mature B cells, CD8+ T cells, CD4+ T cells, NK cells
Level 3: Types. The 42 cell types (including 4 cell cycle related cell types): HSC/MPP, LMPP, CLP, CMP, MDP, GMP, CDP, MEP, GMP-Mono, CD14 Mono, CD16 Mono, pre-DC, pDC, cDC1, mo-DC, MKP, MK, pro-Ery1, pro-Ery2, Ery, pre-pro-B, Early pro-B, 23 Early cycling pro-B, Late pro-B, Late cycling pro-B, pre-B, Immature B, Naive B, Memory B 1, Memory B 2, CD8 Tnaive, CD8 Teff, CD8 Tex, CD8 Tdpe, CD8 Tmpe, CD4 Tnaive, CD4 Tem, CD4 Treg, NK, NK-XCL1, cycling NK, cycling NK/T.
Level 4: Sub-clusters. The 1079 cell sub-clusters, for example, HSC/MPP:0001, HSC/MPP:0002, HSC/MPP:0003.
The relationship of the four levels and the related marker for cell type identification were listed as followed.
Lineage | Population | CellType | FullName | Markers | Reference..PMID. |
---|---|---|---|---|---|
HSPCs | HSPCs | HSC/MPP | Hematopoietic stem cell/Multipotent progenitor | CD34(+), KIT(+), FLT3(-), AVP(+), CD38(-), | 28650480; |
HSPCs | HSPCs | LMPP | Lymphoid-primed multipotent progenitor | CD34(+), KIT(+), FLT3(+), AVP(+), CD38(-), | 28650480; |
HSPCs | HSPCs | CLP | Common lymphoid progenitor | CD34(+), KIT(+), FLT3(+), MME(+/-), CD7(+/-), | 28650480; 19996414; |
HSPCs | HSPCs | CMP | Common myeloid progenitor | CD34(+), KIT(+), FLT3(+), CD38(+), CSF1R(-), FCGR1A(-), | 28650480; |
HSPCs | HSPCs | MDP | Monocyte-DC progenitor | CD34(+), KIT(+), FLT3(+), CD38(+), CSF1R(+), FCGR1A(+/-), | 29166589; 28650480; 28514689; 29313948; |
HSPCs | HSPCs | GMP | Granulocyte-macrophage progenitor | CD34(+), KIT(+), FLT3(+), CD38(+), MPO(+), ELANE(+), FCGR1A(+), | 28650480; 26332381; |
HSPCs | HSPCs | CDP | Common DC progenitor | CD34(+), KIT(+), FLT3(+), CD38(+), CSF1R(+), FCGR1A(-), IL3RA(+), | 28650480; 29313948; |
HSPCs | HSPCs | MEP | Megakaryocyte-erythrocyte progenitor | CD34(+), KIT(+), FLT3(-), CD38(+), | 28650480; |
Myeloids | Monocytes | GMP-Mono | GMP-Monocytes | CD34(-), KIT(-), CD300E(-), LYZ(+), CD36(+), MPO(+), FCGR1A(+), CD4(+) | 26282340; 28514689; |
Myeloids | Monocytes | CD14 Mono | Classical(CD14+CD16−) monocyte | CD14(+), FCGR3A(-), CD300E(+), LYZ(+), ITGAX(+), FCGR1A(+), CD4(+) | 26332381; 28514689; |
Myeloids | Monocytes | CD16 Mono | Non-classical(CD14−CD16+) monocyte | CD14(-), FCGR3A(+), CD300E(+), LYZ(+), ITGAX(+), CD4(+) | 26332381; 28514689; |
Myeloids | Dendritic cells | pre-DC | Dendritic cell precursor | CD34(-), FLT3(+), AXL(+), SIGLEC6(+), ITGAX(+), | 29313948; |
Myeloids | Dendritic cells | pDC | Plasmacytoid dendritic cell | CLEC4C(+), IRF4(+), LILRA4(+), IL3RA(+), IRF8(+), IRF7(+), | 29313948; 20193018; |
Myeloids | Dendritic cells | cDC1 | Conventional dendritic cell 1 | XCR1(+), CD1C(+), ITGAX(+), IRF8(+), THBD(+), | 29313948; |
Myeloids | Dendritic cells | mo-DC | Monocyte-derived dendritic cell | MRC1(+), LYZ(+), ITGAX(+), | 29313948; |
Myeloids | Erythrocytes | MKP | Megakaryocyte progenitor | CD34(+), KIT(+), IFLT3(-), ITGA2B(+), | |
Myeloids | Erythrocytes | MK | Megakaryocyte | PF4(+), ITGA2B(+), CD9(+), | |
Myeloids | Erythrocytes | pro-Ery1 | Proerythroblast 1 | ENG(+), KLF(+), KIT(+), TFRC(+) | |
Myeloids | Erythrocytes | pro-Ery2 | Proerythroblast 2 | ENG(+), KLF(+), KIT(-), TFRC(+) | |
Myeloids | Erythrocytes | Ery | Erythroblast | ENG(-), KLF(+), TFRC(+) | |
B cells | B cells | pre-pro-B | B cell progenitor precursor | CD79A(+), IGLL1(+), RAG1(+), RAG2(+), VPREB1(+), MME(+), IL7R(+), | 35090788; 19996414; |
B cells | B cells | Early pro-B | Early B cell progenitor | CD79A(+), DNTT(+), IGLL1(+), RAG1(+), RAG2(+), VPREB1(+), | 35090788; |
B cells | B cells | Early cycling pro-B | Early cycling B cell progenitor | MKI67(+), PCNA(+), CD79A(+), DNTT(+), IGLL1(+), RAG1(+), RAG2(+), VPREB1(+), IL7R(-), | 35090788; |
B cells | B cells | Late pro-B | Late B cell progenitor | CD79A(+), DNTT(-), IGLL1(+), TCL1A(+), RAG1(+), RAG2(+), VPREB1(+), | 35090788; |
B cells | B cells | Late cycling pro-B | Late cycling B cell progenitor | MKI67(+), PCNA(+), CD79A(+), DNTT(-), IGLL1(+), TCL1A(+), RAG1(+), RAG2(+), VPREB1(+), | 35090788; |
B cells | B cells | pre-B | B cell precursor | IGLL1(+/-), TCL1A(+), RAG1(+), RAG2(+), VPREB1(+), | 35090788; |
B cells | B cells | Immature B | Immature B cell | MS4A1(+), TCL1A(+), RAG1(-), RAG2(-), VPREB1(+/-), | 35090788; |
B cells | B cells | Naive B | Naive B cell | MS4A1(+), CD27(-), IGHD(+), TCL1A(+), | 35090788; |
B cells | B cells | Memory B 1 | Memory B cell 1 | MS4A1(+), CD27(+), IGHD(+), IGHG3(+/-), | 35090788; |
B cells | B cells | Memory B 2 | Memory B cell 2 | MS4A1(-), CD27(+), IGHD(-), IGHG3(++), | |
T/NK cells | CD8+ T cells | CD8 Tnaive | Naive CD8+ T cell | CD3D(+), CD3E(+), CD8A(+), CCR7(+), IL7R(+), SELL(+) | 29630679; |
T/NK cells | CD8+ T cells | CD8 Tdpe | KLRG1+IL7R+ double-positive effector CD8+ T | CD3D(+), CD3E(+), CD8A(+), IL7R(+), CCR7(-), KLRG1(+), CD27(+) | 29625895; |
T/NK cells | CD8+ T cells | CD8 Tmpe | Memory precursor effector CD8+ T | CD3D(+), CD3E(+), CD8A(+), IL7R(+), CCR7(-), KLRG1(dim/-) | 29625895; |
T/NK cells | CD8+ T cells | CD8 Teff | Effector CD8+ T cell | CD3D(+), CD3E(+), CD8A(+), CCR7(-), GNLY(+), NKG7(+) | 34017005; |
T/NK cells | CD8+ T cells | CD8 Tex | Exhausted CD8+ T | CD3D(+), CD3E(+), CD8A(+), CCR7(-), PDCD1(+), TNFRSF9(+), LAG3(+), CD160(+), | 34017005; |
T/NK cells | CD4+ T cells | CD4 Tnaive | Naive CD4+ T cell | CD3D(+), CD3E(+), CD4(+), CCR7(+), IL7R(+), SELL(+), CD40LG(+), | 29630679; |
T/NK cells | CD4+ T cells | CD4 Tem | Memory CD4+ T cell | CD3D(+), CD3E(+), CD4(+), CCR7(-), IL7R(+), CD40LG(+), | 29630679; |
T/NK cells | CD4+ T cells | CD4 Treg | Regulatory T cell | CD3D(+), CD3E(+), CD4(+), IL2RA(+), FOXP3(+), DUSP4(+) | 34017005; |
T/NK cells | NK | NK | Natural killer cell | IL2RB(+), IL7R(-), KLRF1(+), XCL1(-), FCGR3A(+), NCAM1(+), | 30150991; |
T/NK cells | NK | NK-XCL1 | XCL1+ Natural killer cell | IL2RB(+), IL7R(-), KLRF1(+), XCL1(+), NCAM1(+), | 30150991; |
T/NK cells | NK | cycling NK | Cycling natural killer cell | MKI67(+), PCNA(+), IL2RB(+), IL7R(-), KLRF1(+), FCGR3A(+), NCAM1(+), | 30150991; |
T/NK cells | NK | cycling NK/T | Cycling natural killer/T cell | MKI67(+), PCNA(+), CD3D(+), CD3E(+), KLRF(+), IL2RB(+/-), NCAM1(+) | 30150991; |
2.4 Outline of this tutorial
The HematoMap
is an R-based package for lineage aberrancy inference based on single-cell RNA sequencing (scRNA-seq) and bulk RNA sequencing (bulk RNA-seq) data of acute leukemia. In this tutorial, we will present:
- How to install
HematoMap
.
- How to install
- Overview of workflow of
HematoMap
.
- Overview of workflow of
- scRNA-seq analysis using
HematoMap
.
- scRNA-seq analysis using
- Bulk RNA-seq analysis using
HematoMap
.
- Bulk RNA-seq analysis using