Details
-
Type: Task
-
Status: Closed (View Workflow)
-
Priority: Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: 10.1.0
-
Labels:None
-
Story Points:3
-
Epic Link:
-
Sprint:Fall 2
Description
Situation: UCSC REST data is currently organized by file "type", which often separates data into large folders of bed, genePred, etc. The UCSC genome browser website uses groupings based on the data itself, for example, Genes and Gene Predictions, Expression and Regulation, etc.
Task: Organize the UCSC REST data by the "group" value provided through the API.
Attachments
Issue Links
- relates to
-
IGBF-3689 Make folders inside the UCSC Rest folder
- Closed
Mapping the "group" values to what UCSC shows in their browser.
Note: I looked at the T2T, the hg38, and mm39 API responses. The below table should encompass most of the "group" values that are present at UCSC, though I haven't been able to find a webpage or API call that lists all of the mappings.
Unfortunately where this gets complicated is for the data that are in folders on UCSC site. Since we are passing the trackLeavesOnly=1 parameter to the API, we aren't able to see the "parent". I've included an example below where I do not pass the trackLeavesOnly=1 to the API.
The parent, T2T_Encode, has a group value of regulation. It is a folder that contains subfolders (for example T2T_Encode_Coverage) which then contain data (for example T2T_Encode_Coverage_BE2C.Control). I think the best approach would be to emulate the organization that UCSC has, that way a user would see the same organization.