Details
- Type: Task
- Status: Closed
- Priority: Major
- Resolution: Done
- Affects Version/s: None
- Fix Version/s: None
- Labels: None
- Story Points: 2
- Epic Link:
- Sprint: Fall 1, Fall 7
Description
Task: Add the Capitella teleta genome and annotation to IGB.
Capitella teleta (Capca1) - https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_000328365.1/
NCBI:txid283909
Attachments
Activity
| Field | Original Value | New Value |
|---|---|---|
| Epic Link | IGBF-3823 [ 23122 ] |
| Status | To-Do [ 10305 ] | In Progress [ 3 ] |
| Description | Task: Add the Capitella teleta genome and annotation to IGB. Capitella teleta (Capca1) - https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_000328365.1/ | Task: Add the Capitella teleta genome and annotation to IGB. Capitella teleta (Capca1) - https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_000328365.1/ NCBI:txid283909 |
| Status | In Progress [ 3 ] | Needs 1st Level Review [ 10005 ] |
| Assignee | Paige Kulzer [ pkulzer ] | Nowlan Freese [ nfreese ] |
| Sprint | | Fall 1 [ 202 ] |
| Sprint | Fall 1 [ 202 ] | Fall 1, Fall 2 [ 202, 203 ] |
| Rank | | Ranked higher |
| Rank | | Ranked higher |
| Rank | | Ranked lower |
| Sprint | Fall 1, Fall 2 [ 202, 203 ] | Fall 1, Fall 3 [ 202, 204 ] |
| Sprint | Fall 1, Fall 3 [ 202, 204 ] | Fall 1, Fall 4 [ 202, 205 ] |
| Sprint | Fall 1, Fall 4 [ 202, 205 ] | Fall 1, Fall 5 [ 202, 206 ] |
| Sprint | Fall 1, Fall 5 [ 202, 206 ] | Fall 1, Fall 6 [ 202, 207 ] |
| Sprint | Fall 1, Fall 6 [ 202, 207 ] | Fall 1, Fall 7 [ 202, 208 ] |
| Rank | | Ranked higher |
| Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
| Attachment | species.txt [ 18577 ] | |
| Attachment | synonyms.txt [ 18578 ] |
| Assignee | Nowlan Freese [ nfreese ] | Paige Kulzer [ pkulzer ] |
| Status | First Level Review in Progress [ 10301 ] | To-Do [ 10305 ] |
| Status | To-Do [ 10305 ] | In Progress [ 3 ] |
| Status | In Progress [ 3 ] | Needs 1st Level Review [ 10005 ] |
| Assignee | Paige Kulzer [ pkulzer ] | Nowlan Freese [ nfreese ] |
| Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
| Status | First Level Review in Progress [ 10301 ] | Needs 1st Level Review [ 10005 ] |
| Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
| Assignee | Nowlan Freese [ nfreese ] | Paige Kulzer [ pkulzer ] |
| Status | First Level Review in Progress [ 10301 ] | Ready for Pull Request [ 10304 ] |
| Status | Ready for Pull Request [ 10304 ] | Pull Request Submitted [ 10101 ] |
| Status | Pull Request Submitted [ 10101 ] | Reviewing Pull Request [ 10303 ] |
| Status | Reviewing Pull Request [ 10303 ] | Merged Needs Testing [ 10002 ] |
| Assignee | Paige Kulzer [ pkulzer ] | Nowlan Freese [ nfreese ] |
| Status | Merged Needs Testing [ 10002 ] | Post-merge Testing In Progress [ 10003 ] |
| Assignee | Nowlan Freese [ nfreese ] | Paige Kulzer [ pkulzer ] |
| Resolution | | Done [ 10000 ] |
| Status | Post-merge Testing In Progress [ 10003 ] | Closed [ 6 ] |
| Assignee | Paige Kulzer [ pkulzer ] | Nowlan Freese [ nfreese ] |
| Assignee | Nowlan Freese [ nfreese ] | Paige Kulzer [ pkulzer ] |
Below is an outline of the steps I followed to create the Capitella teleta Quickload:
1. Convert genome .fasta to .2bit
    rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/macOSX.arm64/faToTwoBit ./
    ./faToTwoBit GCA_000328365.1_Capca1_genomic.fna C_teleta_Jan_2013.2bit
2. Create genome.txt
    rsync -aP rsync://hgdownload.soe.ucsc.edu/genome/admin/exe/macOSX.arm64/twoBitInfo ./
    ./twoBitInfo C_teleta_Jan_2013.2bit genome.txt
3. Get gene models from NCBI (.gff), then convert .gff to .bed
    git clone git@bitbucket.org:lorainelab/genomesource.git
    path+=(~/Documents/Repos/genomesource/)
    export PYTHONPATH="${PYTHONPATH}:$HOME/Documents/Repos/genomesource/"
    ./gff3ToBedDetail.py -g ~/Downloads/genomic.gff -b ~/Downloads/C_teleta_Jan_2013_refGene.bed
4. Check whether NCBI has any information for this genome using its txid (NCBI:txid283909). Conclusion: it didn't.
    brew install tnftp
    ftp ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/
    get gene2accession.gz
    quit
    gunzip -c gene2accession.gz | grep $'^283909\t' > 283909.gene2accession.txt
5. Sort, gzip, and tabix the .bed file made in step 3
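No commands were recorded for this step, so the following is only a minimal sketch of one common way to do it, not necessarily what was run here. It assumes the `bgzip` and `tabix` tools from HTSlib are installed and that the BED file from step 3 is in the current directory under the name shown; the `sort_bed` helper is a hypothetical name introduced for illustration. Note that tabix expects block-compressed (bgzip) input rather than plain gzip output.

```shell
# Assumed file name: the BED file written in step 3, copied to the current directory.
BED=C_teleta_Jan_2013_refGene.bed

# Hypothetical helper: sort by sequence name, then numerically by start position,
# which is the order tabix needs for indexing.
sort_bed() {
  sort -k1,1 -k2,2n "$1"
}

# Compress and index only if the input file and both HTSlib tools are available.
if [ -f "$BED" ] && command -v bgzip >/dev/null && command -v tabix >/dev/null; then
  sort_bed "$BED" | bgzip > "$BED.gz"   # block-compressed copy of the sorted BED
  tabix -p bed "$BED.gz"                # writes the index next to it as $BED.gz.tbi
fi
```

The resulting `.bed.gz` and `.bed.gz.tbi` files are the pair that the sanity check in the next step can be run against.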
6. Sanity check the .bed and .2bit files - Add the .2bit file as a reference, then drag/drop the .bed file into IGB. Confirm that gene models are present, labeled correctly, and the chromosomes listed are in a logical order.
7. Create annots.xml
    brew install svn
    svn checkout --username=guest --password=guest https://svn.bioviz.org/repos/genomes/quickload
    cd quickload
    svn mkdir C_teleta_Jan_2013
    svn cp A_gambiae_Oct_2006/annots.xml C_teleta_Jan_2013/
8. Add the new genome to contents.txt and .htaccess