Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:2
-
Epic Link:
-
Sprint:Fall 1, Fall 7
Description
Task: Add the Hydra vulgaris genome and annotation to IGB. Current Hydra vulgaris genome version provided by ensembl: Hydra_105_v3 (Feb 2022).
Hydra vulgaris (HydraT2T_AEP)(Apr 2024) - https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_038396675.1/
Attachments
Activity
Below is an outline of the steps I followed to create the Hydra vulgaris Quickload:
1. Convert genome .fna to .2bit
gunzip GCF_038396675.1_HydraT2T_AEP_genomic.fna.gz ./faToTwoBit GCF_038396675.1_HydraT2T_AEP_genomic.fna H_vulgaris_Apr_2024.2bit
2. Create genome.txt
./twoBitInfo H_vulgaris_Apr_2024.2bit genome.txt
3. Get gene models from NCBI (.gff), then convert .gff to .bed
cd ~/Documents/Repos/genomesource ./gff3ToBedDetail.py -g ~/Downloads/genomic.gff -b ~/Downloads/H_vulgaris_Apr_2024_refGene.bed
4. Check if UCSC has any information for this genome using its txid (NCBI:txid6087) and, since if it does, compare gene names/ID's to those present in the .bed file created in the previous step
cd ~/Downloads gunzip -c gene2accession.gz | grep '^6087\t' > 6087.gene2accession.txt
5. Sort, gzip, and tabix the .bed file
sort -k1,1 -k2,2n H_vulgaris_Apr_2024_refGene.bed | bgzip > H_vulgaris_Apr_2024_refGene.bed.gz tabix -0 -s 1 -b 2 -e 3 H_vulgaris_Apr_2024_refGene.bed.gz
6. Sanity check the .bed and .2bit files - Add the .2bit file as a reference, then drag/drop the .bed file into IGB. Confirm that gene models are present, labeled correctly, and the chromosomes listed are in a logical order. Also check that no error messages are present in the Log.
7. Create annots.xml and add _H_vulgaris_ to contents.txt and .htaccess
cd ~/Documents/Repos/quickload svn mkdir H_vulgaris_Apr_2024 svn cp A_gambiae_Oct_2006/annots.xml H_vulgaris_Apr_2024 nano H_vulgaris_Apr_2024/annots.xml nano contents.txt nano .htaccess
Tested following instructions above. Everything looks good.
Closing ticket.