IGB / IGBF-4103

Read mango paper on gene discovery

    Details

    • Type: Task
    • Status: Closed
    • Priority: Minor
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      Read this paper with an eye toward how we could take our reference-free efforts and do something along the same lines.

      De novo transcriptome assembly and annotation for gene discovery in avocado, macadamia and mango

      https://www.nature.com/articles/s41597-019-0350-9

        Attachments

          Activity

          robofjoy Robert Reid created issue -
          robofjoy Robert Reid made changes -
          Field Original Value New Value
          Epic Link IGBF-2993 [ 21429 ]
          bbendick Brandon Bendickson made changes -
          Status To-Do [ 10305 ] In Progress [ 3 ]
          bbendick Brandon Bendickson added a comment -

          This comment will serve as my notes for the article:

          Methods
          -Raw RNA-seq reads were pre-processed using Trimmomatic with default parameters
          -RNA-seq read quality was assessed using FastQC, and the reports were aggregated using MultiQC
          -Trinity was used for de novo transcriptome assembly, and the assemblies were validated using BUSCO
          -HISAT2 was used to map reads to the respective references
          -TransDecoder was used with default settings to predict coding regions; they selected the best open reading frame per transcript, longer than 100 amino acids
          -CD-HIT-EST was used with default parameters to reduce redundancy and produce a set of unique genes
          -BLAST was used to assign functional annotations to the unique genes; specifically, the BLASTx program was used to annotate genes against UniProtKB, which is a manually annotated, non-redundant protein sequence database
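
          A rough sketch of how the last three steps (TransDecoder, CD-HIT-EST, BLASTx) might be chained as command lines. This is not taken from the paper: the file names, the database name, the use of --single_best_only, the tabular output format, and the assumption that TransDecoder's CDS file lands in the working directory are all my own guesses for illustration. The function only builds the commands; it does not run anything.

```python
import shlex
from pathlib import Path


def annotation_commands(transcripts: str, protein_db: str) -> list[list[str]]:
    """Build (but do not run) command lines for ORF prediction, clustering, and annotation."""
    name = Path(transcripts).name
    # Assumption: TransDecoder names its CDS output after the input file
    # and writes it to the current working directory.
    cds = f"{name}.transdecoder.cds"
    unique = f"{name}.unique.cds"   # hypothetical output name
    hits = f"{name}.blastx.tsv"     # hypothetical output name
    return [
        # Find candidate ORFs (the default minimum length is 100 amino acids).
        ["TransDecoder.LongOrfs", "-t", transcripts],
        # Keep one ORF per transcript (assumed to match "best ORF per transcript").
        ["TransDecoder.Predict", "-t", transcripts, "--single_best_only"],
        # Collapse redundant nucleotide sequences using CD-HIT-EST defaults.
        ["cd-hit-est", "-i", cds, "-o", unique],
        # Translated search against a protein database, tabular output (format 6).
        ["blastx", "-query", unique, "-db", protein_db, "-outfmt", "6", "-out", hits],
    ]


if __name__ == "__main__":
    # "Trinity.fasta" and "uniprot_sprot" are placeholder names.
    for cmd in annotation_commands("Trinity.fasta", "uniprot_sprot"):
        print(shlex.join(cmd))
```

          Printing the commands first makes it easy to check them by eye before handing them to a scheduler or subprocess.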

          How can we use this on tomatoes?
          -Could go back and try validating our assemblies with BUSCO
          -Try running TransDecoder on our contigs to predict coding regions, perhaps after finding the best long hits from BLAT
          -Use CD-HIT to find our unique genes
          -Use BLASTx to annotate our genes
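
          For the BLASTx annotation step, one possible way to reduce the results to a single annotation per gene is to keep the highest-scoring hit for each query. The sketch below assumes BLAST's standard 12-column tabular output (-outfmt 6), where column 1 is the query ID, column 2 is the subject ID, and column 12 is the bit score. The function name and the IDs in the example are made up for illustration.

```python
import csv
import io


def best_hits(tabular_text: str) -> dict[str, tuple[str, float]]:
    """Return {query_id: (subject_id, bitscore)}, keeping the highest bit score per query."""
    best: dict[str, tuple[str, float]] = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        # Skip blank lines, comment lines, and rows without all 12 columns.
        if len(row) < 12 or row[0].startswith("#"):
            continue
        query, subject, bitscore = row[0], row[1], float(row[11])
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return best


if __name__ == "__main__":
    # Made-up rows, only to show the expected input shape.
    example = (
        "contig_1\tPROT_A\t90.0\t100\t10\t0\t1\t300\t1\t100\t1e-50\t180\n"
        "contig_1\tPROT_B\t95.0\t100\t5\t0\t1\t300\t1\t100\t1e-60\t200\n"
    )
    print(best_hits(example))
```

          Picking by bit score is only one option; filtering on e-value or alignment length first would be a reasonable alternative.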

          robofjoy Robert Reid added a comment -

          I think we can dismiss BUSCO in this instance, since we are only looking at a transcriptome from some pistil tissue and maybe some pollen.
          To assess completeness with BUSCO, we'd want leaf, root, flower, and all of the other tissues sequenced.

          TransDecoder, let's do it!!

          CD-HIT, heck yeah!!
          These will be new tickets.

          bbendick Brandon Bendickson made changes -
          Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
          bbendick Brandon Bendickson made changes -
          Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
          bbendick Brandon Bendickson made changes -
          Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
          bbendick Brandon Bendickson made changes -
          Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
          bbendick Brandon Bendickson made changes -
          Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
          bbendick Brandon Bendickson made changes -
          Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
          bbendick Brandon Bendickson made changes -
          Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
          bbendick Brandon Bendickson made changes -
          Resolution Done [ 10000 ]
          Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]

            People

            • Assignee:
              bbendick Brandon Bendickson
              Reporter:
              robofjoy Robert Reid
            • Votes:
              0
              Watchers:
              2

              Dates

              • Created:
                Updated:
                Resolved: