Details
- Type: Documentation
- Status: Closed
- Priority: Major
- Resolution: Done
- Affects Version/s: None
- Fix Version/s: None
- Labels: None
- Story Points: 2
- Epic Link:
- Sprint: Winter 1, Spring 1
Description
Situation: We currently have documentation for adding new genomes to IGB Quickload using UCSC as a data source. However, following the work on the IGB-UCSC integration, future requests to add new genomes to IGB will have to be completed using other data sources, such as NCBI. Because NCBI is an entirely new data source, and the process for downloading data and naming files will differ, we need to create new documentation.
Task:
- Read over current documentation for adding new genomes to IGB Quickload (https://docs.google.com/document/d/1WQO_HWhpfUBntsNSaQ-jdrVoR6jJcQ7ewJ_HFvpYsYA/edit?usp=sharing).
- Using our current documentation as a guide, create a new document in Google Drive, create section outlines, and transfer over as much relevant documentation as possible.
- Include a section that discusses how to name files depending on the assembly type (e.g., "ncbiRefSeq" when dealing with a RefSeq assembly, "genBank" when dealing with a GenBank (GCA) assembly).
- Include a section with an example annots.xml file that discusses how to format the description attribute depending on the assembly type (e.g., "NCBI GenBank [GenBank (GCA) assembly] [Assembly] ([Assembly date in MMM. DD, YYYY format])" when dealing with a GenBank (GCA) assembly), as well as the title attribute.
- Include an example HEADER.md file that has been remade for genomes coming from NCBI rather than UCSC.
- Include a section for creating species.txt and synonyms.txt.
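The naming and description rules in the task list can be sketched in Python. This is a minimal illustration, not the team's tooling. The function names `file_name_tag` and `genbank_description` are hypothetical. The sketch assumes the assembly type is identified by the NCBI accession prefix (GCF_ for RefSeq, GCA_ for GenBank), and that the bracketed placeholders in the description pattern stand for the accession, the assembly name, and the assembly date. The description format for RefSeq assemblies is not given in this ticket, so it is not covered.

```python
from datetime import date

# Month abbreviations are spelled out so the output does not depend on locale.
_MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
           "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def file_name_tag(accession: str) -> str:
    """Return the file-name tag for an NCBI assembly accession (hypothetical helper)."""
    if accession.startswith("GCF_"):   # RefSeq assembly
        return "ncbiRefSeq"
    if accession.startswith("GCA_"):   # GenBank assembly
        return "genBank"
    raise ValueError(f"Unrecognized assembly accession: {accession!r}")


def genbank_description(accession: str, assembly_name: str, released: date) -> str:
    """Build the annots.xml description for a GenBank (GCA) assembly.

    Assumed reading of the pattern in the task list:
    "NCBI GenBank <accession> <assembly name> (<MMM. DD, YYYY>)".
    """
    if not accession.startswith("GCA_"):
        raise ValueError("This pattern is only described for GenBank (GCA) assemblies")
    stamp = f"{_MONTHS[released.month - 1]}. {released.day:02d}, {released.year}"
    return f"NCBI GenBank {accession} {assembly_name} ({stamp})"


# Placeholder values for illustration only; these are not real assemblies.
print(file_name_tag("GCF_000000000.1"))
print(genbank_description("GCA_000000000.1", "ExampleAsm1", date(2024, 1, 5)))
# prints:
# ncbiRefSeq
# NCBI GenBank GCA_000000000.1 ExampleAsm1 (Jan. 05, 2024)
```

Raising an error on an unrecognized prefix, rather than guessing a tag, keeps a mistyped accession from silently producing a misnamed file.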
Issue Links
- relates to: IGBF-4018 Add Dama dama genome to IGB (Closed)
These are great suggestions!
I've added some more instructions to the "Tabix-index gene model file" section for manually checking the BED file for any erroneous text. I also removed the mention of archiving files from that section. Finally, in the "Deploy to IGB Quickload" section, I specified that ALL modified files (e.g., species.txt and synonyms.txt) should be included in the zipped Quickload folder for a reviewer to test.
Closing this ticket!
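The manual BED check mentioned in the comment above could be supplemented with a small script. The following is a rough sketch only. The ticket does not say what kind of erroneous text to look for, so this assumes it means lines that are not well-formed BED records (fewer than three tab-separated fields, non-numeric coordinates, or a start greater than the end). The function name `find_suspect_bed_lines` is hypothetical.

```python
from typing import Iterable, List, Tuple


def find_suspect_bed_lines(lines: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (line number, reason) pairs for lines that do not look like BED records."""
    suspects = []
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if line.startswith("#"):
            continue  # comment lines are skipped, not flagged
        fields = line.split("\t")
        if len(fields) < 3:
            suspects.append((number, "fewer than 3 tab-separated fields"))
            continue
        start, end = fields[1], fields[2]
        # Coordinates must be plain ASCII digits (non-negative integers).
        if not (start.isascii() and start.isdigit()
                and end.isascii() and end.isdigit()):
            suspects.append((number, "start or end is not a non-negative integer"))
        elif int(start) > int(end):
            suspects.append((number, "start is greater than end"))
    return suspects


# Illustrative input; with a real file, pass the open file handle instead.
example = [
    "chr1\t0\t100\tgeneA\n",
    "stray text left over from a download\n",
    "chr1\t200\t100\tgeneB\n",
]
for number, reason in find_suspect_bed_lines(example):
    print(f"line {number}: {reason}")
# prints:
# line 2: fewer than 3 tab-separated fields
# line 3: start is greater than end
```

A script like this only narrows down where to look; it does not replace reading the flagged lines by hand.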