Details
-
Type:
Documentation
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:2
-
Epic Link:
-
Sprint:Winter 1, Spring 1
Description
Situation: We currently have documentation for adding new genomes to IGB Quickload using UCSC as a data source. However, after working hard on the IGB-UCSC integration, future requests for adding new genomes to IGB will have to be completed using other data sources such as NCBI. Since this is an entirely new data source and the process for downloading data and naming files will look different, we need to create some new documentation.
Task:
- Read over current documentation for adding new genomes to IGB Quickload (https://docs.google.com/document/d/1WQO_HWhpfUBntsNSaQ-jdrVoR6jJcQ7ewJ_HFvpYsYA/edit?usp=sharing).
- Using our current documentation as a guide, create a new document in Google Drive, create section outlines, and transfer over as much relevant documentation as possible.
- Include a section that discusses how to name files depending on the assembly type (e.g., "ncbiRefSeq" when dealing with a RefSeq assembly, "genBank" when dealing with a GenBank (GCA) assembly).
- Include a section with an example annots.xml file that discusses how to format the description attribute depending on the assembly type (e.g., "NCBI GenBank [GenBank (GCA) assembly] [Assembly] ([Assembly date in MMM. DD, YYYY format])" when dealing with a GenBank (GCA) assembly), as well as the title attribute.
- Include an example HEADER.md file that has been remade for genomes coming from NCBI rather than UCSC.
- Include a section for creating species.txt and synonyms.txt.
Attachments
Issue Links
- relates to
-
IGBF-4018 Add Dama dama genome to IGB
-
- Closed
-