Details
-
Type: Task
-
Status: Closed (View Workflow)
-
Priority: Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:2
-
Epic Link:
-
Sprint:Summer 7
Description
In IGBF-3838, the PBMC dataset of 2700 single cells was obtained and viewed in IGB. Almost all of the single cell RNA sequences for the genes (mentioned in the ticket) did not line up with the reference gene models. The reads are misaligned from the boundaries of the exons in the gene model. The UMI count matrix data for the PBMC dataset was used in the Seurat Guided Clustering Tutorial . To investigate further, view the the reads of the the marker genes mentioned in the tutorial in IGB and check how many are aligned with the gene model.
I was counting the reads that align within the exon boundaries of the gene model and recording it in a spreadsheet. After talking to Dr. Nowlan Freese, I realized I have been counting only reads with no gaps and did not count the ones with some gaps. The counts needs to be updated with counts of reads that may have some gaps but are still within the exon boundaries.
We also discussed about the CellRanger pipeline and understanding how the UMI counts are counted for each feature (gene) for each cell or rather what reads are counted and what is discarded.