Details
-
Type: Task
-
Status: Closed (View Workflow)
-
Priority: Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:1
-
Epic Link:
-
Sprint:Summer 5
Description
Situation: I've come back from the Galaxy Community Conference with a lot of ideas!
Task: Discuss the following list of topics/ideas as a team and decide which may be worth pursuing. Then, create tickets for each topic/idea we decide to pursue.
- Genome Resources: Confirm that we've looked into these resources for pulling genomes already - 1) EBI's Biodiversity Portal, 2) Genome Ark, and 3) the Vertebrate Genomes Project BioProject on NCBI.
- Copy Number: I talked to a researcher who was working with calculating copy numbers. My understanding of this is limited but she said that she's been calculating copy number in an excel spreadsheet with a formula that uses variables that we already have in the browser (number of reads in a position, gene name, etc). Is there a way to view this in IGB already (i.e., Is this the same as the number in the Selection Info box in the top right when all of the reads at a locus are highlighted by clicking and dragging over them?) If not, sounds like we could look into this and add some functionality.
- Speaking opportunity: I talked to a member of the ERGA (European Reference Genome Atlas) and he said I could give a talk later this year if we'd like. I would join their annotation committee and present about what IGB is and our quest to add more genomes/make it more accessible to more individuals.
- Embedding IGB into Galaxy: Many people came up to me and said they would like to see IGB embedded into Galaxy such that they don't have to install anything first. That's why they tend to use UCSC and IGV in Galaxy more often! Here's documentation on how IGV went about doing this: https://github.com/igvteam/igv.js/
- Viewing Hi-C Genomes: Someone suggested we should add a way to view Hi-C data in IGB! Click on the following link to see an example of how that might look in the browser: https://pygenometracks.readthedocs.io/en/latest/
- Exporting IGB to Galaxy (and vice versa): Is there a way to drag and drop data from Galaxy into IGB? I also got some interest in implementing a way to easily export IGB to Galaxy (so clicking a button in IGB that would send the genome .fasta and any data tracks/files over to Galaxy for further data analyses).
- Galaxy Training Network (GTN): I've been talking with the admin for Galaxy Australia and he'd think we would really benefit in terms of user engagement if we create a short training material for the GTN. Basically, since IGB isn't a tool in Galaxy, there's no way to search for it, and only certain file types can be used with IGB, Galaxy users are likely not finding IGB very easily. So, by creating training materials, at least when people search for IGB in Galaxy that material will come up with information about what kind of file types they can visualize in IGB and how to go about doing so.
- New App Store idea: The same woman I connected with about adding Hi-C capability to IGB also creates a tool called BREW3R (https://github.com/lldelisle/BREW3R.r). Essentially, it uses publicly available data to identify where gene annotations are too short (e.g., those genes we can see in IGB where the coverage graphs extend just past the gene model) and then create a new version of those reference annotations with longer UTR's or new exons. I think this could be a great app to include in IGB so that researchers can visualize their data, see this phenomenon happening, update their reference genome in IGB with this tool, then view re-analyse their data and visualize it again!
- Training opportunity: There's a global online training event for the Galaxy Training Network/Galaxy Training Academy. They're looking for trainers that want to teach a tutorial (such as the IGB one we might create) or contribute a new track if we don't have it done quite in time. This is 7-11 October 2024.
- Conference opportunity: There's an online conference happening in November focused on all things genome sequencing for all types of biodiversity: https://www.biodiversitygenomicsconference.org/
- Documentation check: Check that our documentation currently explains that a link from NCBI to a fasta file doesn't work in IGB – instead, users should try to use EBI or another resource where a direct link to the genome is provided. Someone approached me today saying they were having issues figuring this out and I've also run into this hitch myself.
Attachments
Activity
Paige Kulzer
created issue -
Paige Kulzer
made changes -
Field | Original Value | New Value |
---|---|---|
Epic Link |
|
Nowlan Freese
made changes -
Status | To-Do [ 10305 ] | Pull Request Submitted [ 10101 ] |
Nowlan Freese
made changes -
Status | Pull Request Submitted [ 10101 ] | Reviewing Pull Request [ 10303 ] |
Nowlan Freese
made changes -
Status | Reviewing Pull Request [ 10303 ] | To-Do [ 10305 ] |
Paige Kulzer
made changes -
Status | To-Do [ 10305 ] | In Progress [ 3 ] |
Paige Kulzer
made changes -
Status | In Progress [ 3 ] | Needs 1st Level Review [ 10005 ] |
Paige Kulzer
made changes -
Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
Paige Kulzer
made changes -
Status | First Level Review in Progress [ 10301 ] | Ready for Pull Request [ 10304 ] |
Paige Kulzer
made changes -
Status | Ready for Pull Request [ 10304 ] | Pull Request Submitted [ 10101 ] |
Paige Kulzer
made changes -
Status | Pull Request Submitted [ 10101 ] | Reviewing Pull Request [ 10303 ] |
Paige Kulzer
made changes -
Status | Reviewing Pull Request [ 10303 ] | Merged Needs Testing [ 10002 ] |
Paige Kulzer
made changes -
Status | Merged Needs Testing [ 10002 ] | Post-merge Testing In Progress [ 10003 ] |
Paige Kulzer
made changes -
Resolution | Done [ 10000 ] | |
Status | Post-merge Testing In Progress [ 10003 ] | Closed [ 6 ] |
1) I don't believe any of these have API's, but may be good resources for adding less well-known genomes not provided by UCSC and Ensembl.
2) Unclear on what data to use, but could potentially create a demo from this topic instead of adding new functionality. This would involve finding data with copy number variation, creating a coverage graph from that data, adding it to IGB, demonstrating how thresholding works, then exporting the resulting data as a BED file. (Ticket created: IGBF-3829)
3) Send them an email, here's a link to the website: https://www.biodiversitygenomicsconference.org/
4) IGB has been developed in Java, not JavaScript, so this isn't feasible. BUT, we should reach out to Galaxy and switch IGB from just a small feature that's hard to find to a full-blown tool.
5) Include this link in any future Hi-C tickets for design reference.
6) Create a tutorial showing that demonstrates how to get a genome provided from IGB into Galaxy by downloading it from the Quickload site and then adding it to Galaxy... but what happens when it's a .2bit and not a fasta?