Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-4112

2025 Update Galaxy dbkey values in IGB synonyms.txt

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      Situation: usegalaxy.org website is used frequently by IGB users. When a user views data from Galaxy in IGB we attempt to load the appropriate genome in IGB. Many of the genomes line up with and use the same nomenclature as Galaxy, but there are now many more genomes in Galaxy.

      Task: Update the IGB synonyms.txt with any missing Galaxy synonyms. See IGBF-1879 for additional details.

      Galaxy genome API: https://usegalaxy.org/api/genomes
      Galaxy Europe genome API: https://usegalaxy.eu/api/genomes
      UCSC genome API (for comparison, this should be up to date in IGB already): https://api.genome.ucsc.edu/list/ucscGenomes

        Attachments

          Issue Links

            Activity

            nfreese Nowlan Freese created issue -
            nfreese Nowlan Freese made changes -
            Field Original Value New Value
            Epic Link IGBF-1880 [ 17970 ]
            nfreese Nowlan Freese made changes -
            Link This issue relates to IGBF-1879 [ IGBF-1879 ]
            nfreese Nowlan Freese made changes -
            Description Situation: usegalaxy.org website is used frequently by IGB users. When a user views data from Galaxy in IGB we attempt to load the appropriate genome in IGB. Many of the genomes line up with and use the same nomenclature as Galaxy, but there are now many more genomes in Galaxy.

            Task: Update the IGB synonyms.txt with any missing Galaxy synonyms.

            Galaxy genome API: https://usegalaxy.org/api/genomes
            Galaxy Europe genome API: https://usegalaxy.eu/api/genomes
            UCSC genome API (for comparison, this should be up to date in IGB already): https://api.genome.ucsc.edu/list/ucscGenomes
            Situation: usegalaxy.org website is used frequently by IGB users. When a user views data from Galaxy in IGB we attempt to load the appropriate genome in IGB. Many of the genomes line up with and use the same nomenclature as Galaxy, but there are now many more genomes in Galaxy.

            Task: Update the IGB synonyms.txt with any missing Galaxy synonyms. See IGBF-1879 for additional details.

            Galaxy genome API: https://usegalaxy.org/api/genomes
            Galaxy Europe genome API: https://usegalaxy.eu/api/genomes
            UCSC genome API (for comparison, this should be up to date in IGB already): https://api.genome.ucsc.edu/list/ucscGenomes
            Hide
            pkulzer Paige Kulzer (Inactive) added a comment - - edited

            First steps:

            1. From the API, find a genome that's not supported by IGB and see what happens when viewing a Galaxy History file with that dbkey in IGB.
            2. Find a genome that's being added to IGB via UCSC's API and see what happens when viewing a Galaxy History file with that dbkey in IGB.
            3. Compare the API's between Galaxy and UCSC and determine which, if any, genomes are being included only by Galaxy. These are the genomes we'll likely need to update IGB's synonyms.txt file with.
            Show
            pkulzer Paige Kulzer (Inactive) added a comment - - edited First steps: From the API, find a genome that's not supported by IGB and see what happens when viewing a Galaxy History file with that dbkey in IGB. Find a genome that's being added to IGB via UCSC's API and see what happens when viewing a Galaxy History file with that dbkey in IGB. Compare the API's between Galaxy and UCSC and determine which, if any, genomes are being included only by Galaxy. These are the genomes we'll likely need to update IGB's synonyms.txt file with.
            pkulzer Paige Kulzer (Inactive) made changes -
            Assignee Paige Kulzer [ pkulzer ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Sprint Spring 3 [ 212 ] Spring 4 [ 213 ]
            ann.loraine Ann Loraine made changes -
            Sprint Spring 4 [ 213 ]
            ann.loraine Ann Loraine made changes -
            Rank Ranked higher
            pkulzer Paige Kulzer (Inactive) made changes -
            Sprint Spring 6 [ 215 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Rank Ranked lower
            pkulzer Paige Kulzer (Inactive) made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status In Progress [ 3 ] To-Do [ 10305 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            Hide
            pkulzer Paige Kulzer (Inactive) added a comment - - edited
            1. This opened a custom genome in IGB.
            2. This opened the correct genome in IGB.
            3. Below is a table summarizing the genomes that we're including in synonyms.txt without the correct Galaxy key:
            Genome Galaxy Key
            A_thaliana_Jan_2004 araTha1***
            A_thaliana_Apr_2008 arabidopsis_tair8
            A_thaliana_Jun_2009 arabidopsis
            H_exemplaris_Z151_Apr_2017 H_exemplaris_Z151
            Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2

            ***This key (araTha1) is already listed in synonyms.txt with a different genome. Galaxy correlates araTha1 with TAIR7, we've correlated it with TAIR9/TAIR10, and UCSC correlates it with TAIR10. Which is correct? Update: I believe TAIR10 is correct

            Show
            pkulzer Paige Kulzer (Inactive) added a comment - - edited This opened a custom genome in IGB. This opened the correct genome in IGB. Below is a table summarizing the genomes that we're including in synonyms.txt without the correct Galaxy key: Genome Galaxy Key A_thaliana_Jan_2004 araTha1*** A_thaliana_Apr_2008 arabidopsis_tair8 A_thaliana_Jun_2009 arabidopsis H_exemplaris_Z151_Apr_2017 H_exemplaris_Z151 Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2 ***This key (araTha1) is already listed in synonyms.txt with a different genome. Galaxy correlates araTha1 with TAIR7, we've correlated it with TAIR9/TAIR10, and UCSC correlates it with TAIR10. Which is correct? Update: I believe TAIR10 is correct
            Hide
            pkulzer Paige Kulzer (Inactive) added a comment - - edited

            Please see attached for a copy of the spreadsheet I compiled listing the genomes hosted by Galaxy/Galaxy EU as well as the genomes hosted by UCSC and IGB.

            Ready for review!

            Show
            pkulzer Paige Kulzer (Inactive) added a comment - - edited Please see attached for a copy of the spreadsheet I compiled listing the genomes hosted by Galaxy/Galaxy EU as well as the genomes hosted by UCSC and IGB. Ready for review!
            pkulzer Paige Kulzer (Inactive) made changes -
            pkulzer Paige Kulzer (Inactive) made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Assignee Paige Kulzer [ pkulzer ] Nowlan Freese [ nfreese ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Attachment UniqueGalaxyGenomes.html [ 18668 ]
            Attachment UniqueGalaxyGenomes.R [ 18669 ]
            nfreese Nowlan Freese made changes -
            Sprint Spring 6 [ 215 ] Spring 6, Spring 7 [ 215, 216 ]
            nfreese Nowlan Freese made changes -
            Rank Ranked higher
            Hide
            nfreese Nowlan Freese added a comment - - edited
            Genome Galaxy Key
            A_thaliana_Jan_2004 araTha1***
            • I'm not sure what to think about this one, I would say let's not change it now since that could mess up UCSC and current IGB users
            A_thaliana_Apr_2008 arabidopsis_tair8
            • let's update this in the synonyms.txt on the SVN repository
            A_thaliana_Jun_2009 arabidopsis
            • let's update this in the synonyms.txt on the SVN repository
            H_exemplaris_Z151_Apr_2017 H_exemplaris_Z151
            • I can't find this in either Galaxy API, can you double check it?
            Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2
            • In the Galaxy API I see the key as (GCF_000002525.2_ASM252v1) can you double check it?

            Paige Kulzer - I marked some above as needing to be double checked, otherwise please go ahead and add the Galaxy key values to the synonyms.txt file in the SVN repository.

            Show
            nfreese Nowlan Freese added a comment - - edited Genome Galaxy Key A_thaliana_Jan_2004 araTha1*** I'm not sure what to think about this one, I would say let's not change it now since that could mess up UCSC and current IGB users A_thaliana_Apr_2008 arabidopsis_tair8 let's update this in the synonyms.txt on the SVN repository A_thaliana_Jun_2009 arabidopsis let's update this in the synonyms.txt on the SVN repository H_exemplaris_Z151_Apr_2017 H_exemplaris_Z151 I can't find this in either Galaxy API, can you double check it? Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2 In the Galaxy API I see the key as (GCF_000002525.2_ASM252v1) can you double check it ? Paige Kulzer - I marked some above as needing to be double checked, otherwise please go ahead and add the Galaxy key values to the synonyms.txt file in the SVN repository.
            nfreese Nowlan Freese made changes -
            Assignee Nowlan Freese [ nfreese ] Paige Kulzer [ pkulzer ]
            nfreese Nowlan Freese made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            nfreese Nowlan Freese made changes -
            Status First Level Review in Progress [ 10301 ] To-Do [ 10305 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status In Progress [ 3 ] To-Do [ 10305 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Sprint Spring 6, Spring 7 [ 215, 216 ] Spring 6, Summer 1 [ 215, 218 ]
            nfreese Nowlan Freese made changes -
            Sprint Spring 6, Summer 1 [ 215, 218 ] Spring 6, Summer 3 [ 215, 220 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Sprint Spring 6, Summer 3 [ 215, 220 ] Spring 6, Summer 4 [ 215, 221 ]
            nfreese Nowlan Freese made changes -
            Link This issue relates to IGBF-4266 [ IGBF-4266 ]
            Hide
            pkulzer Paige Kulzer (Inactive) added a comment -
            Genome Galaxy Key
            A_thaliana_Apr_2008 arabidopsis_tair8
            A_thaliana_Jun_2009 arabidopsis
            Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2

            This is the final table with the changes that need to be made to IGB's synonyms.txt. Rather than making these changes via SVN, they will be made to IGB's version of synonyms.txt. That task has been captured by a ticket Dr. Freese just made (IGBF-4266) so I will now close this ticket and work on that one.

            Show
            pkulzer Paige Kulzer (Inactive) added a comment - Genome Galaxy Key A_thaliana_Apr_2008 arabidopsis_tair8 A_thaliana_Jun_2009 arabidopsis Y_lipolytica_CLIB122_Jul_2004 GCF_000002525.2 This is the final table with the changes that need to be made to IGB's synonyms.txt. Rather than making these changes via SVN, they will be made to IGB's version of synonyms.txt. That task has been captured by a ticket Dr. Freese just made ( IGBF-4266 ) so I will now close this ticket and work on that one.
            pkulzer Paige Kulzer (Inactive) made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
            pkulzer Paige Kulzer (Inactive) made changes -
            Resolution Done [ 10000 ]
            Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]

              People

              • Assignee:
                pkulzer Paige Kulzer (Inactive)
                Reporter:
                nfreese Nowlan Freese
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: