Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:1
-
Epic Link:
-
Sprint:Spring 1
Description
Investigate UCSC's GenArk system (https://api.genome.ucsc.edu/list/genarkGenomes) and check whether it can be integrated into IGB.
Attachments
Activity
| Field | Original Value | New Value |
|---|---|---|
| Status | To-Do [ 10305 ] | In Progress [ 3 ] |
| Description | Investigate UCSC's GenArk system and check whether it can be integrated into IGB. | Investigate UCSC's GenArk system (https://api.genome.ucsc.edu/list/genarkGenomes) and check whether it can be integrated into IGB. |
| Epic Link | IGBF-1765 [ 17855 ] |
| Attachment | Trackhub-dataset-folders.png [ 18625 ] |
| Status | In Progress [ 3 ] | Needs 1st Level Review [ 10005 ] |
| Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
| Status | First Level Review in Progress [ 10301 ] | Ready for Pull Request [ 10304 ] |
| Status | Ready for Pull Request [ 10304 ] | Pull Request Submitted [ 10101 ] |
| Status | Pull Request Submitted [ 10101 ] | Reviewing Pull Request [ 10303 ] |
| Status | Reviewing Pull Request [ 10303 ] | Merged Needs Testing [ 10002 ] |
| Status | Merged Needs Testing [ 10002 ] | Post-merge Testing In Progress [ 10003 ] |
| Resolution | Done [ 10000 ] | |
| Status | Post-merge Testing In Progress [ 10003 ] | Closed [ 6 ] |
Investigated the genomes and tracks all the APIs work similar to the UCSC genomes and on top of it we can directly get the bigbed file from the hub URLs that are provided in the API response which will decrease a lot of API calls and as far as I checked all genomes in this GenArk seem to have the hubUrl, the hub page for that genome (eg: https://hgdownload.soe.ucsc.edu/hubs/GCA/000/001/905/GCA_000001905.1/) and the bigDataUrl, where the actual annotation file exists (eg: https://hgdownload.soe.ucsc.edu/hubs/GCA/000/001/905/GCA_000001905.1/bbi/GCA_000001905.1_Loxafr3.0.ncbiRefSeq.bb). Initially we can get the genomes and tracks from the API call and we can use this file to load the data into IGB for a particular track, instead of making multiple API calls and we already have a .bb file parser. So, implementation can be done and it should be easier too.
Tracks API for the selected genome: https://api.genome.ucsc.edu/list/tracks?genome=GCA_000001905.1
Things we need to check before implementing:
Nowlan Freese We can discuss these points and decide on how to move forward.