Details
-
Type: Improvement
-
Status: To-Do (View Workflow)
-
Priority: Major
-
Resolution: Unresolved
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:2
-
Epic Link:
-
Sprint:Spring 6 2021 May 31 - June 11
Description
Situation: The Synonym Lookup module includes several files that are used to compare the species names found in IGB. This includes comparing the contents.txt, species.txt, and synonyms.txt. Genomes can come from Quickloads such as IGB Quickload and also from other services, such as UCSC DAS.
IGB friendly genome names follow the pattern of G_species_MMM_YYYY. In addition, a variety can be included - G_species_variety_MMM_YYYY.
While updating genomes for IGBF-781 an edge case was discovered. The opossum genome provided by UCSC DAS has an IGB friendly name of M_domestica. This is very similar to the IGB provided genome for apple, which is M_domestica_Borkh. However, IGB appears to ignore the variety and is just comparing the M_domestica, which is incorrect. As a workaround, a variety was added to the opossum genome to keep them separated.
Task: Identify why IGB considers the two M_domestica genomes to be the same. This should not be occurring, with or without the variety included.
Attachments
Issue Links
- relates to
-
IGBF-781 Update synonyms.txt for UCSC
- Closed
Documentation on synonyms.txt in the IGB users guide
https://wiki.transvar.org/display/igbman/Use+synonyms.txt+to+link+genome+version+names+to+each+other
https://wiki.transvar.org/display/igbman/Personal+Synonyms
https://wiki.transvar.org/display/lorainelab/How+to+add+a+new+synonym+to+IGB