Commit 986ebd6 (verified), committed by canergen
1 Parent(s): 1d3203f

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +105 -0
README.md ADDED
@@ -0,0 +1,105 @@
+ ---
+ library_name: popV
+ license: cc-by-4.0
+ tags:
+ - biology
+ - genomics
+ - single-cell
+ - anndata_version:0.11.1
+ - popV
+ - 'tissue: Salivary_Gland'
+ ---
+
+ Popular Vote (popV) model for automated cell type annotation of single-cell RNA-seq data. We provide here pretrained models
+ for plug-in use in your own analysis.
+ Follow our [tutorial](https://github.com/YosefLab/popV/blob/main/tabula_sapiens_tutorial.ipynb) to learn how to use the model for cell type annotation.
+
+ # Model description
+
+ Tabula Sapiens is a benchmark, first-draft human cell atlas of over 1.1M cells from 28 organs of 24 normal human subjects. This work is the product of the Tabula Sapiens Consortium. Taking the organs from the same individual controls for genetic background, age, environment, and epigenetic effects, and allows detailed analysis and comparison of cell types that are shared between tissues.
+
+ **Link to CELLxGENE**:
+ Link to the [data](https://cellxgene.cziscience.com/e/f01bdd17-4902-40f5-86e3-240d66dd2587.cxg/) in the CELLxGENE browser for interactive exploration of the data and download of the source data.
+
+ **Training Code URL**:
+ Not provided by uploader.
27
+ # Metrics
28
+
29
+ We provide here accuracies for each of the experts and the ensemble model. The validation set accuracies are
30
+ computed on a 10% random subset of the data that was not used for training.
31
+
32
+ | Cell Type | N cells | celltypist | knn on bbknn | knn on harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
33
+ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
34
+ | acinar cell of salivary gland | 1219 | 0.96 | 0.93 | 0.97 | 0.96 | 0.00 | 0.91 | 0.95 | 0.96 | 0.97 |
35
+ | B cell | 515 | 1.00 | 0.99 | 0.98 | 0.99 | 0.00 | 1.00 | 1.00 | 1.00 | 1.00 |
36
+ | fibroblast | 467 | 0.97 | 0.94 | 0.97 | 0.95 | 0.00 | 0.94 | 0.95 | 0.96 | 0.98 |
37
+ | CD8-positive, alpha-beta T cell | 383 | 0.91 | 0.88 | 0.92 | 0.88 | 0.00 | 0.86 | 0.90 | 0.90 | 0.93 |
38
+ | duct epithelial cell | 384 | 0.88 | 0.73 | 0.92 | 0.85 | 0.00 | 0.72 | 0.83 | 0.88 | 0.90 |
39
+ | CD4-positive, alpha-beta T cell | 165 | 0.89 | 0.88 | 0.89 | 0.79 | 0.00 | 0.85 | 0.90 | 0.89 | 0.91 |
40
+ | endothelial cell | 158 | 0.96 | 0.90 | 0.94 | 0.92 | 0.00 | 0.95 | 0.96 | 0.96 | 0.97 |
41
+ | macrophage | 175 | 0.95 | 0.71 | 0.90 | 0.90 | 0.00 | 0.91 | 0.94 | 0.95 | 0.96 |
42
+ | basal cell | 112 | 0.93 | 0.91 | 0.95 | 0.86 | 0.00 | 0.83 | 0.89 | 0.88 | 0.91 |
43
+ | tissue-resident macrophage | 81 | 0.93 | 0.09 | 0.88 | 0.86 | 0.00 | 0.94 | 0.87 | 0.90 | 0.96 |
44
+ | plasma cell | 45 | 0.89 | 0.98 | 0.98 | 0.97 | 0.00 | 0.96 | 0.97 | 0.97 | 0.98 |
45
+ | endothelial cell of lymphatic vessel | 51 | 0.93 | 0.77 | 0.85 | 0.73 | 0.00 | 0.89 | 0.90 | 0.93 | 0.94 |
46
+ | adventitial cell | 57 | 0.88 | 0.32 | 0.77 | 0.63 | 0.00 | 0.71 | 0.74 | 0.75 | 0.88 |
47
+ | vascular associated smooth muscle cell | 44 | 0.99 | 0.99 | 0.97 | 0.99 | 0.00 | 0.98 | 0.99 | 0.98 | 0.99 |
48
+ | neutrophil | 25 | 0.91 | 0.89 | 0.94 | 0.92 | 0.00 | 0.93 | 0.96 | 0.94 | 0.94 |
49
+ | myoepithelial cell | 16 | 0.81 | 0.88 | 0.85 | 0.88 | 0.00 | 0.74 | 0.84 | 0.74 | 0.89 |
50
+ | T cell | 19 | 0.54 | 0.10 | 0.57 | 0.30 | 0.00 | 0.38 | 0.52 | 0.45 | 0.53 |
51
+ | ionocyte | 14 | 0.88 | 0.90 | 0.93 | 0.54 | 0.00 | 0.29 | 0.84 | 0.84 | 0.90 |
52
+ | endothelial cell of artery | 10 | 0.75 | 0.67 | 0.84 | 0.53 | 0.00 | 0.67 | 0.86 | 0.87 | 0.86 |
53
+ | monocyte | 10 | 0.44 | 0.33 | 0.47 | 0.25 | 0.00 | 0.56 | 0.69 | 0.72 | 0.65 |
54
+ | natural killer cell | 9 | 0.42 | 0.33 | 0.59 | 0.18 | 0.00 | 0.36 | 0.59 | 0.60 | 0.67 |
55
+ | mast cell | 5 | 0.00 | 0.67 | 0.73 | 0.29 | 0.00 | 0.62 | 0.67 | 0.44 | 0.60 |
56
+ | classical monocyte | 8 | 0.00 | 0.00 | 0.00 | 0.35 | 0.00 | 0.00 | 0.36 | 0.46 | 0.00 |
57
+ | intermediate monocyte | 6 | 0.00 | 0.00 | 0.00 | 0.20 | 0.00 | 0.17 | 0.44 | 0.36 | 0.00 |
58
+ | mature NK T cell | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.33 | 0.57 | 0.00 |
59
+ | leukocyte | 1 | 0.00 | 0.00 | 1.00 | 1.00 | 0.00 | 0.33 | 1.00 | 0.00 | 1.00 |
60
+ | pericyte | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
61
+ | salivary gland cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
62
+ | myeloid dendritic cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
63
+ | non-classical monocyte | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
64
+
+ The train accuracies are computed on the training data.
+
+ | Cell Type | N cells | celltypist | knn on bbknn | knn on harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
+ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
+ | acinar cell of salivary gland | 11018 | 0.96 | 0.93 | 0.97 | 0.97 | 0.00 | 0.92 | 0.95 | 0.96 | 0.98 |
+ | B cell | 4433 | 0.99 | 0.99 | 0.98 | 0.99 | 0.00 | 0.99 | 0.99 | 0.99 | 1.00 |
+ | fibroblast | 4000 | 0.97 | 0.93 | 0.98 | 0.97 | 0.00 | 0.95 | 0.96 | 0.97 | 0.99 |
+ | CD8-positive, alpha-beta T cell | 3682 | 0.91 | 0.84 | 0.93 | 0.91 | 0.00 | 0.87 | 0.90 | 0.91 | 0.94 |
+ | duct epithelial cell | 3505 | 0.87 | 0.75 | 0.94 | 0.90 | 0.00 | 0.78 | 0.85 | 0.88 | 0.92 |
+ | CD4-positive, alpha-beta T cell | 1721 | 0.88 | 0.86 | 0.89 | 0.86 | 0.00 | 0.88 | 0.90 | 0.91 | 0.91 |
+ | endothelial cell | 1471 | 0.94 | 0.89 | 0.94 | 0.96 | 0.00 | 0.96 | 0.97 | 0.98 | 0.98 |
+ | macrophage | 1448 | 0.93 | 0.71 | 0.95 | 0.92 | 0.00 | 0.94 | 0.97 | 0.97 | 0.98 |
+ | basal cell | 1070 | 0.92 | 0.91 | 0.96 | 0.91 | 0.00 | 0.89 | 0.93 | 0.94 | 0.96 |
+ | tissue-resident macrophage | 573 | 0.88 | 0.01 | 0.92 | 0.92 | 0.00 | 0.93 | 0.86 | 0.89 | 0.95 |
+ | plasma cell | 538 | 0.92 | 0.98 | 0.99 | 0.98 | 0.00 | 0.98 | 0.99 | 0.99 | 0.99 |
+ | endothelial cell of lymphatic vessel | 495 | 0.91 | 0.66 | 0.88 | 0.92 | 0.00 | 0.94 | 0.95 | 0.96 | 0.97 |
+ | adventitial cell | 403 | 0.83 | 0.20 | 0.84 | 0.77 | 0.00 | 0.78 | 0.79 | 0.82 | 0.95 |
+ | vascular associated smooth muscle cell | 359 | 0.96 | 0.96 | 0.97 | 0.98 | 0.00 | 0.98 | 0.98 | 0.98 | 0.97 |
+ | neutrophil | 218 | 0.93 | 0.92 | 0.97 | 0.93 | 0.00 | 0.91 | 0.98 | 0.98 | 0.97 |
+ | myoepithelial cell | 200 | 0.90 | 0.91 | 0.94 | 0.92 | 0.00 | 0.85 | 0.92 | 0.92 | 0.94 |
+ | T cell | 174 | 0.67 | 0.19 | 0.72 | 0.68 | 0.00 | 0.55 | 0.80 | 0.81 | 0.84 |
+ | ionocyte | 128 | 0.88 | 0.88 | 0.92 | 0.68 | 0.00 | 0.44 | 0.90 | 0.86 | 0.92 |
+ | endothelial cell of artery | 106 | 0.76 | 0.71 | 0.86 | 0.81 | 0.00 | 0.73 | 0.94 | 0.91 | 0.91 |
+ | monocyte | 89 | 0.48 | 0.36 | 0.62 | 0.55 | 0.00 | 0.67 | 0.87 | 0.95 | 0.88 |
+ | natural killer cell | 64 | 0.34 | 0.26 | 0.63 | 0.48 | 0.00 | 0.30 | 0.57 | 0.59 | 0.65 |
+ | mast cell | 40 | 0.00 | 0.68 | 0.73 | 0.50 | 0.00 | 0.75 | 1.00 | 1.00 | 0.99 |
+ | classical monocyte | 35 | 0.00 | 0.00 | 0.32 | 0.34 | 0.00 | 0.79 | 0.95 | 0.93 | 0.91 |
+ | intermediate monocyte | 36 | 0.00 | 0.00 | 0.51 | 0.47 | 0.00 | 0.83 | 0.89 | 0.90 | 0.99 |
+ | mature NK T cell | 20 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.46 | 0.83 | 0.68 | 0.40 |
+ | leukocyte | 5 | 0.00 | 0.00 | 1.00 | 0.73 | 0.00 | 0.27 | 0.83 | 0.91 | 1.00 |
+ | pericyte | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.24 | 1.00 | 1.00 | 0.00 |
+ | salivary gland cell | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.04 | 0.67 | 1.00 | 0.80 |
+ | myeloid dendritic cell | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.67 | 0.33 | 1.00 | 0.00 |
+ | non-classical monocyte | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.67 | 1.00 | 1.00 | 1.00 |
+
+ </details>
+
+
+ # References
+
+ Tabula Sapiens reveals transcription factor expression, senescence effects, and sex-specific features in cell types from 28 human organs and tissues, The Tabula Sapiens Consortium; bioRxiv, doi: https://doi.org/10.1101/2024.12.03.626516
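
The Metrics section of the README reports per-cell-type scores for each expert and for the consensus prediction on a random 10% hold-out, but it does not say how those scores are computed. The sketch below is one plausible reading, not popV's actual implementation. It assumes a simple majority vote for the consensus (popV's own consensus step may differ, for example by taking the cell ontology into account) and a per-class F1 score; fractional values such as 0.33 for a cell type with a single validation cell suggest that the metric is precision-aware rather than a plain per-class accuracy. The function names `holdout_indices`, `majority_vote` and `per_class_f1`, and the toy labels, are hypothetical and only for illustration.

```python
import random
from collections import Counter


def holdout_indices(n_cells, fraction=0.1, seed=0):
    """Pick a random subset of cell indices to hold out for validation."""
    rng = random.Random(seed)
    k = max(1, round(n_cells * fraction))
    return sorted(rng.sample(range(n_cells), k))


def majority_vote(expert_predictions):
    """Consensus label per cell: the label that most experts agree on.

    Assumption: plain majority vote; ties go to the expert listed first.
    """
    per_cell_votes = zip(*expert_predictions.values())
    return [Counter(votes).most_common(1)[0][0] for votes in per_cell_votes]


def per_class_f1(y_true, y_pred):
    """F1 score for every cell type that occurs in the reference labels."""
    scores = {}
    for label in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


# Toy reference labels and three hypothetical experts (not real popV output).
y_true = ["B cell", "B cell", "fibroblast", "fibroblast", "macrophage"]
experts = {
    "svm": ["B cell", "B cell", "fibroblast", "macrophage", "macrophage"],
    "xgboost": ["B cell", "fibroblast", "fibroblast", "fibroblast", "macrophage"],
    "scanvi": ["B cell", "B cell", "fibroblast", "fibroblast", "fibroblast"],
}
consensus = majority_vote(experts)

for name, predictions in {**experts, "consensus": consensus}.items():
    print(name, per_class_f1(y_true, predictions))
```

In this toy example each expert makes one mistake, but no two experts are wrong on the same cell, so the majority vote recovers every label. That is the effect the "Consensus Prediction" column is meant to capture. Cell types with 0 cells in the validation table have no cells to score, so their 0.00 entries carry no information about model quality.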