Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:1
-
Epic Link:
-
Sprint:Fall 7
Description
Task: Add the Vanessa cardui genome and annotation to IGB.
Link to genome on NCBI: https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_905220365.1/
Attachments
Issue Links
- relates to
-
IGBF-4038 Improve documentation description quickload files for painted lady genome assembly
-
- Closed
-
Below is an outline of the steps I followed to create this Quickload:
1.Use wget to obtain the .2bit file from UCSC's track hub directory, then rename it
wget https://hgdownload.soe.ucsc.edu/hubs/GCF/905/220/365/GCF_905220365.1/GCF_905220365.1.2bit mv GCF_905220365.1.2bit ilVanCard2.1.2bit2. Create genome.txt, then check that the chromosome's are ordered logically (i.e., numerically)
3. Use Vanessa cardui's taxID (171605) to get the information needed from gene2accession.gz and gene_info.gz to create the BED14 file in a later step
4. Download the RefSeqAll BED file from UCSC's table browser (Link: https://genome.ucsc.edu/cgi-bin/hgTables), then create the BED14 file using the following code:
5. Sort, gzip, and tabix the BED14 file
6. Sanity check the 2bit and BED files - Add the 2bit file as a reference, then drag/drop the BED files into IGB. Confirm that gene models are present, labeled correctly, and that no error messages are present in the Log.
7. Create a new directory in the quickload repo, then create annots.xml
8. Add V_cardui_Feb_2021 to contents.txt and .htaccess
9. Create HEADER.md