Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:2
-
Epic Link:
-
Sprint:Fall 2, Fall 3, Fall 4, Fall 5, Fall 6
Description
Purpose: To submit the de novo assembled RNA-Seq contigs to NCBI's TSA archive.
Since our raw reads already exist as a SRA, this should be straightforward. I just need to align reads to assemblies and the upload assemblies.
Attachments
Issue Links
- is blocked by
-
IGBF-3944 Python script to remove NCBI's contaminants
-
- Closed
-
Activity
TSA submission: SUB14767088
1st step is the Fasta asembly uploads to FTP. After tthat, they ask for the metadata. Backwards to SRA and GEO.
Noter from Brandon where the final contigs are:
Removed duplicates and wrote them to their own file (Variety_dupes.fa). The new files are located in /projects/tomato_genome/fnb/dataprocessing/TSA-transcriptomeShotgunAssembly/kelsieData/FCS_tools/FCS_GX/tom_runs/fin_results
They follow this pattern (Variety_contaminant3_removed.fa), We should be ready for another submission. Python script is fix_contam3_v2.py and is in same directory