Cluster: defined cell type.
Annotation: defined cell type.
gene: gene symbol.
p_val: the p-value of the differential expression test, i.e. the probability of observing a difference at least this large if the gene were not differentially expressed. The smaller the p-value, the higher the significance.
avg_log2FC: log2 fold-change of the average expression between the two groups. Positive values indicate that the gene is more highly expressed in the first group, i.e. the current group.
pct.1: the percentage of cells in which the gene is detected in the current group.
pct.2: the percentage of cells in which the gene is detected in the other groups.
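As an illustration of how these columns can be used together, the following minimal Python sketch keeps genes that are significant and more highly expressed in the current group. The example rows, the tab-separated layout, the function name select_markers and the thresholds (max_p, min_log2fc) are all assumptions made for illustration; they are not values from our tables.

```python
import csv
import io

# Hypothetical excerpt of a marker table; all values are made up for illustration.
EXAMPLE_TSV = """Cluster\tAnnotation\tgene\tp_val\tavg_log2FC\tpct.1\tpct.2
1\tT cell\tCd3e\t1e-50\t2.1\t0.85\t0.05
1\tT cell\tActb\t0.2\t0.1\t0.99\t0.98
2\tB cell\tCd79a\t1e-40\t-1.5\t0.10\t0.60
"""

def select_markers(rows, max_p=0.01, min_log2fc=0.25):
    """Keep genes that are significant and more highly expressed in the current group.

    The default thresholds are illustrative assumptions, not recommended cutoffs.
    """
    kept = []
    for row in rows:
        significant = float(row["p_val"]) <= max_p
        enriched = float(row["avg_log2FC"]) >= min_log2fc
        if significant and enriched:
            kept.append(row["gene"])
    return kept

# Parse the tab-separated text into one dictionary per row, keyed by column name.
rows = list(csv.DictReader(io.StringIO(EXAMPLE_TSV), delimiter="\t"))
markers = select_markers(rows)
print(markers)
```

In this made-up example only Cd3e passes both filters: Actb is not significant and Cd79a has a negative avg_log2FC, meaning it is more highly expressed in the other groups.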
We read as many papers as possible to match cluster-specific genes to known cell types. If a cell type has not been described before, we call it XXX_high cells, based on its most specific marker XXX. However, the annotation might not always be accurate. We really appreciate your help in correcting annotations on the KO Landscape.
Because of the huge amount of data, the raw data we uploaded to GEO are QC-filtered and trimmed BAM files, in which the cell barcode is tagged with XC and the UMI is tagged with XM. GEO changes the storage format to SRA files. The barcodes should be included as a spot group instead of as a custom tag. We will soon provide a local server hosting the original BAM files.
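For readers working with the BAM files, the sketch below shows one way the XC and XM tags could be read from a record in SAM text form (for example, a line printed by samtools view). It relies only on the standard SAM layout, in which optional fields start at column 12 and are written as TAG:TYPE:VALUE. The function name read_barcode_tags and the example record, including its barcode and UMI sequences, are hypothetical.

```python
def read_barcode_tags(sam_line):
    """Return (cell_barcode, umi) from the XC and XM tags of one SAM record.

    Either value is None if the corresponding tag is absent.
    """
    fields = sam_line.rstrip("\n").split("\t")
    tags = {}
    # The first 11 columns are mandatory; optional TAG:TYPE:VALUE fields follow.
    for field in fields[11:]:
        tag, _type, value = field.split(":", 2)
        tags[tag] = value
    return tags.get("XC"), tags.get("XM")

# Hypothetical SAM record; the sequences are made up for illustration.
EXAMPLE_RECORD = (
    "read1\t0\tchr1\t100\t255\t10M\t*\t0\t0\tACGTACGTAC\tIIIIIIIIII"
    "\tXC:Z:AAGGCCTTAAGGCCTTAA\tXM:Z:ACGTAC"
)

cell_barcode, umi = read_barcode_tags(EXAMPLE_RECORD)
print(cell_barcode, umi)
```

Files downloaded from SRA may no longer carry these custom tags, so this sketch applies to the original BAM files only.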
The ratio might be affected by the cell digestion method as well as the gene expression profiling method. The cell number ratio identified by Microwell-seq might therefore differ from the ratio identified by other methods such as FACS.