  1. IGB
  2. IGBF-2262

Update C elegans annotations and data sets

    Details

    • Type: Task
    • Status: To-Do
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      Follow these directions from:

      • Christian Grove cgrove@caltech.edu help@wormbase.org

      to obtain the best annotations for C. elegans:

      snip

      Hi Ann,

      You can find the latest genome assemblies for C. elegans on our C. elegans page (discoverable under the WormBase "Directory" menu):

      https://wormbase.org/species/c_elegans#32--10

      As can be seen on the UCSC genome browser site:

      https://genome.ucsc.edu/cgi-bin/hgGateway

      and in our recent user guide book chapter:

      https://www.ncbi.nlm.nih.gov/pubmed/29761466

      The ce11 genome corresponds to the "WBcel235" genome assembly, which is our current assembly as indicated on our WB C. elegans page. You should then be able to download the GFF3 for that assembly from that same page in the "Downloads" widget. I would then recommend processing the GFF file to just pull out features coming from WormBase (column 2 = "WormBase"). Once you unzip the GFF3 file, you can run the following on the command line:

      awk 'BEGIN {FS="\t"; OFS="\t"} {if ($2 == "WormBase") print}' c_elegans.PRJNA13758.WS274.annotations.gff3 > WormBase_elegans_GFF3_lines.txt

      The remaining feature types (column 3) are then:

      CDS
      antisense_RNA
      exon
      five_prime_UTR
      gene
      intron
      lincRNA
      mRNA
      miRNA
      miRNA_primary_transcript
      ncRNA
      nc_primary_transcript
      piRNA
      pre_miRNA
      pseudogenic_rRNA
      pseudogenic_tRNA
      pseudogenic_transcript
      rRNA
      scRNA
      snRNA
      snoRNA
      tRNA
      three_prime_UTR

      You can filter the lines based on what you're looking for, specifically.

      I hope that helps. Let us know if you have any questions.

      Best,

      Chris Grove
      WormBase

      end snip
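
The steps described in the email (keep rows whose column 2 is "WormBase", review the feature types in column 3, then filter on the types of interest) could be sketched as three small shell functions. This is a minimal sketch, not a WormBase-provided script: the function names and the output file name genes_only.gff3 are hypothetical, and it assumes the GFF3 file has already been unzipped and is tab-delimited.

```shell
#!/bin/sh
# Hypothetical helpers for the filtering steps described in the email.
# Function names are illustrative only; they do not come from WormBase.

# Print only rows whose source (column 2) is exactly "WormBase".
# GFF3 header/comment lines contain no tabs, so column 2 is empty
# for them and they are dropped as well.
wormbase_rows() {
    awk 'BEGIN { FS = "\t"; OFS = "\t" } $2 == "WormBase"' "$1"
}

# Print each feature type (column 3) with its row count, tab-separated.
# Intended to run on the already filtered file.
feature_type_counts() {
    cut -f3 "$1" | sort | uniq -c | awk '{ print $2 "\t" $1 }'
}

# Print only rows whose feature type (column 3) equals the second argument.
rows_of_type() {
    awk -v ftype="$2" 'BEGIN { FS = "\t" } $3 == ftype' "$1"
}

# Example usage (the last output file name is an assumption):
#   wormbase_rows c_elegans.PRJNA13758.WS274.annotations.gff3 > WormBase_elegans_GFF3_lines.txt
#   feature_type_counts WormBase_elegans_GFF3_lines.txt
#   rows_of_type WormBase_elegans_GFF3_lines.txt gene > genes_only.gff3
```

Running feature_type_counts on the filtered file should list the same feature types as the email does, which gives a quick check that the filter behaved as expected before loading anything into IGB.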

          Activity

          ann.loraine Ann Loraine created issue -
          ann.loraine Ann Loraine made changes -
          Field Original Value New Value
          Epic Link IGBF-1765 [ 17855 ]
          ann.loraine Ann Loraine made changes -
          Rank Ranked higher
          ann.loraine Ann Loraine made changes -
          Comment [ Describe what is done in "micropublication" format - see:

          * journal: https://www.micropublication.org/
          * how to contribute: https://www.micropublication.org/submit-new-article/
          * author guidelines: https://www.micropublication.org/about/for-authors/preparation-and-submission-of-a-manuscript/

          Above documentation says:

          {quote}
          microPublication Biology is currently accepting submissions from the following communities in collaboration with their respective knowledgebase:

          Caenorhabditis in collaboration with WormBase
          Drosophila in collaboration with FlyBase
          Xenopus in collaboration with Xenbase
          Arabidopsis in collaboration with TAIR
          S. cerevisiae in collaboration with SGD
          S. pombe in collaboration with PomBase
          Zebrafish in collaboration with ZFIN
          Please visit our author guidelines page for instructions. If you have trouble with our online system, you can download an article template and submit it to us at contact@micropublication.org.
          {quote}

          ]

            People

            • Assignee:
              Unassigned
              Reporter:
              ann.loraine Ann Loraine