Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3143

Run RNA-Seq data processing pipeline on positive splicing control and experimental samples

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None
    • Story Points:
      3
    • Sprint:
      Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1, Fall 1 2022 Aug 15

      Description

      Data sets to process:

      Positive control: SRP328042 Data are published in this article.
      Experimental: SRP252265

      To-Do:

      • Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - DONE
      • Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - DONE
      • Make a note of the particular commands used to perform the data retrieval (see comment below)
      • Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
      • Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

      Notes:

      Methods used to create positive control RNA-Seq data from SRP328042, according to the paper:

      2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.

      A PDF copy of the protocol paper (reference 28) for RNA-Seq library synthesis is attached.

        Attachments

          Issue Links

            Activity

            ann.loraine Ann Loraine created issue -
            ann.loraine Ann Loraine made changes -
            Field Original Value New Value
            Epic Link IGBF-2993 [ 21429 ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-3135 [ IGBF-3135 ]
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: SRP328042
            Experimental:


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: SRP328042
            Experimental:


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: SRP328042
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: SRP328042
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "splicing" in "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)

            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "splicing" in "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)

            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)

            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)

            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter (probably it will be 15000)


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            ann.loraine Ann Loraine made changes -
            Link This issue blocks IGBF-3144 [ IGBF-3144 ]
            ann.loraine Ann Loraine made changes -
            Sprint Summer 4 2022 July 4 [ 150 ] Summer 4 2022 July 4, Summer 5 2022 July 18 [ 150, 151 ]
            ann.loraine Ann Loraine made changes -
            Rank Ranked higher
            ann.loraine Ann Loraine made changes -
            Link This issue is blocked by IGBF-3127 [ IGBF-3127 ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-3127 [ IGBF-3127 ]
            ann.loraine Ann Loraine made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-2947 [ IGBF-2947 ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-2970 [ IGBF-2970 ]
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter (probably it will be 15000)


            Notes:

            * Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Status In Progress [ 3 ] To-Do [ 10305 ]
            ann.loraine Ann Loraine made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

            Methods used to create positive control RNA-Seq data:

            {quote}
            2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
            {quote}
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

            Methods used to create positive control RNA-Seq data:

            {quote}
            2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
            {quote}
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

            Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

            {quote}
            2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
            {quote}
            ann.loraine Ann Loraine made changes -
            Attachment strand-specific-protocol.png [ 17253 ]
            ann.loraine Ann Loraine made changes -
            Attachment 10.1.1.1052.3871.pdf [ 17254 ]
            ann.loraine Ann Loraine made changes -
            Attachment strand-specific-protocol.png [ 17253 ]
            ann.loraine Ann Loraine made changes -
            Description Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

            Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

            {quote}
            2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
            {quote}
            Data sets to process:

            Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
            Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

            To-Do:

            * Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
            * Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
            * Make a note of the particular commands used to perform the data retrieval (see comment below)
            * Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
            * Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"


            Notes:

            * Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
            * All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
            * Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch IGBF-3143. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
            * Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

            Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

            {quote}
            2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
            {quote}

            A PDF copy of the protocol paper (reference 28) for RNA-Seq library synthesis is attached.
            ann.loraine Ann Loraine made changes -
            Comment [ Testing email function of jira: [~aloraine] ]
            ann.loraine Ann Loraine made changes -
            Status In Progress [ 3 ] To-Do [ 10305 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Sprint Summer 4 2022 July 4, Summer 5 2022 July 18 [ 150, 151 ] Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1 [ 150, 151, 152 ]
            ann.loraine Ann Loraine made changes -
            Rank Ranked higher
            ann.loraine Ann Loraine made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            ann.loraine Ann Loraine made changes -
            Assignee Molly Davis [ molly ]
            ann.loraine Ann Loraine made changes -
            Sprint Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1 [ 150, 151, 152 ] Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1, Fall 1 2022 Aug 15 [ 150, 151, 152, 153 ]
            ann.loraine Ann Loraine made changes -
            Rank Ranked higher
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-3162 [ IGBF-3162 ]
            ann.loraine Ann Loraine made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            ann.loraine Ann Loraine made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            ann.loraine Ann Loraine made changes -
            Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
            ann.loraine Ann Loraine made changes -
            Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
            ann.loraine Ann Loraine made changes -
            Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
            ann.loraine Ann Loraine made changes -
            Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
            ann.loraine Ann Loraine made changes -
            Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
            ann.loraine Ann Loraine made changes -
            Resolution Done [ 10000 ]
            Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]
            ann.loraine Ann Loraine made changes -
            Link This issue blocks IGBF-3165 [ IGBF-3165 ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-3228 [ IGBF-3228 ]

              People

              • Assignee:
                Mdavis4290 Molly Davis
                Reporter:
                ann.loraine Ann Loraine
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: