Details
-
Type:
New Feature
-
Status: Closed (View Workflow)
-
Priority:
Minor
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: 10.0.0 Major Release
-
Labels:
-
Story Points:3
-
Epic Link:
-
Sprint:Summer 3 2023 June 12, Summer 4 2023 June 26, Summer 5 2023 July 10
Description
Situation: Under the Advanced Search tab the Search can be set to residues. This is extremely useful as a user can search for primer locations or motifs. While the Residues search does allow for wildcards (.[]*) it does not appear to understand nucleotide symbols such as R [G/A] Y [C/T] etc. So a motif found in a paper such as CACRTS does not work correctly under the Advanced Search for Residues in IGB.
Task: Expand the logic for IGB Advanced Search for Residues so that IGB can understand Nucleotide Symbols.
R A or G
Y C or T
S G or C
W A or T
K G or T
M A or C
B C or G or T
D A or G or T
H A or C or T
V A or C or G
N any base
For example, if a user were to currently use the Advanced Search for Residues in IGB to look for the motif RYSNATCG IGB would not be able to find the motif, as IGB does not understand what RYSN refers to. New logic needs to be added to IGB so that when searching, IGB understands that R can match to either A or G, Y matches C or T, etc.
The CACRTS example is from this paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318262/
The paper is examining the Arabidopsis thaliana genome.