[IGBF-3143] Run RNA-Seq data processing pipeline on positive splicing control and experimental samples - JIRA UNCC

Ann Loraine created issue - 07/Jul/22 12:16 PM

Ann Loraine made changes - 07/Jul/22 12:16 PM

Field	Original Value	New Value
Epic Link		IGBF-2993 [ 21429 ]

Ann Loraine made changes - 07/Jul/22 12:16 PM

Link

This issue relates to ~~IGBF-3135~~ [ ~~IGBF-3135~~ ]

Ann Loraine made changes - 07/Jul/22 12:22 PM

Description

Data sets to process:

Positive control: SRP328042
Experimental:

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:26 PM

Description

Data sets to process:

Positive control: SRP328042
Experimental:

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: SRP328042
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:26 PM

Description

Data sets to process:

Positive control: SRP328042
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:31 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "splicing" in "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:33 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "splicing" in "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:38 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter (probably it will be 15000)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Ann Loraine made changes - 07/Jul/22 12:39 PM

Link

This issue blocks ~~IGBF-3144~~ [ ~~IGBF-3144~~ ]

Ann Loraine made changes - 18/Jul/22 8:25 AM

Sprint

Summer 4 2022 July 4 [ 150 ]

Summer 4 2022 July 4, Summer 5 2022 July 18 [ 150, 151 ]

Ann Loraine made changes - 18/Jul/22 8:25 AM

Rank

Ranked higher

Ann Loraine made changes - 20/Jul/22 11:23 AM

Link

This issue is blocked by ~~IGBF-3127~~ [ ~~IGBF-3127~~ ]

Ann Loraine made changes - 20/Jul/22 11:28 AM

Link

This issue relates to ~~IGBF-3127~~ [ ~~IGBF-3127~~ ]

Ann Loraine made changes - 20/Jul/22 8:11 PM

Status

To-Do [ 10305 ]

In Progress [ 3 ]

Ann Loraine made changes - 20/Jul/22 8:12 PM

Assignee

Ann Loraine [ aloraine ]

Ann Loraine made changes - 20/Jul/22 8:25 PM

Link

This issue relates to ~~IGBF-2947~~ [ ~~IGBF-2947~~ ]

Ann Loraine made changes - 20/Jul/22 8:29 PM

Link

This issue relates to ~~IGBF-2970~~ [ ~~IGBF-2970~~ ]

Ann Loraine made changes - 21/Jul/22 7:37 AM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup"
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter (probably it will be 15000)

Notes:

* Experimental dataset originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"

Ann Loraine made changes - 21/Jul/22 12:38 PM

Assignee

Ann Loraine [ aloraine ]

Ann Loraine made changes - 21/Jul/22 12:38 PM

Status

In Progress [ 3 ]

To-Do [ 10305 ]

Ann Loraine made changes - 25/Jul/22 1:09 PM

Status

To-Do [ 10305 ]

In Progress [ 3 ]

Ann Loraine made changes - 25/Jul/22 1:09 PM

Assignee

Ann Loraine [ aloraine ]

Ann Loraine made changes - 25/Jul/22 1:16 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Ann Loraine made changes - 25/Jul/22 1:19 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042]
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Ann Loraine made changes - 25/Jul/22 1:21 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Methods used to create positive control RNA-Seq data:

{quote}
2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
{quote}

Ann Loraine made changes - 25/Jul/22 1:22 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Methods used to create positive control RNA-Seq data:

{quote}
2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
{quote}

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

{quote}
2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
{quote}

Ann Loraine made changes - 25/Jul/22 1:39 PM

Attachment

strand-specific-protocol.png [ 17253 ]

Ann Loraine made changes - 25/Jul/22 1:39 PM

Attachment

10.1.1.1052.3871.pdf [ 17254 ]

Ann Loraine made changes - 25/Jul/22 1:39 PM

Attachment

strand-specific-protocol.png [ 17253 ]

Ann Loraine made changes - 25/Jul/22 1:40 PM

Description

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

{quote}
2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
{quote}

Data sets to process:

Positive control: [SRP328042|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP328042] Data are published in [this article|https://pubmed.ncbi.nlm.nih.gov/34359978/].
Experimental: [SRP252265|https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=study&acc=SRP252265]

To-Do:

* Obtain data in fastq format from Sequence Read Archive using fasterqdump options for paired end data - *DONE*
* Please data into directories named for the SRP number, e.g., SRP328042 and SRP252265 within a directory named "alt_splicing" under "nobackup" - *DONE*
* Make a note of the particular commands used to perform the data retrieval (see comment below)
* Create "samples" text file listing the SRR fastq files for running nf-core/rna-seq nextflow
* Run nf-core/rnaseq using proper maximum intron size parameter using "tomato.config"

Notes:

* Experimental datasets originally processed using code in https://bitbucket.org/hotpollen/rna-seq/src/master/ and https://bitbucket.org/hotpollen/flavonoid-rnaseq
* All Pollen project datasets are now in the SRA under the same project number ! (SRP252265)
* Ann is using a fork of flavonoid-rnaseq for all new code she's writing, on branch ~~IGBF-3143~~. To find her fork, go to https://bitbucket.org/hotpollen/flavonoid-rnaseq and select "forks"
* Documentation for the pipeline we are using is here: https://nf-co.re/rnaseq/3.4/usage

Methods used to create *positive control* RNA-Seq data from SRP328042, according to the paper:

{quote}
2.5.2. Preparation of RNA-Seq Library and Sequencing Total RNA was extracted utilizing Trizol reagent (Invitrogen, Waltham, MA, USA). RNA quantity and quality were determined by NanoDrop 1000 spectrophotometer (Thermo Scientific Inc., Waltham, MA, USA), 1% agarose gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Following the protocol described by [28], strand-specific RNA-Seq libraries from 3 biological replicates for each group from WW and DS anthers were prepared using 1 ng/µL of total RNA sample and sequenced by Novogene Biotech (Beijing, China) on Illumina HiSeq 4000 system (Illumina, Inc., San Diego, CA, USA) according to the manufacturer’s instructions. The raw sequence reads were deposited into NCBI Sequence Read Archive under accession the number PRJNA746070.
{quote}

A PDF copy of the protocol paper (reference 28) for RNA-Seq library synthesis is attached.

Ann Loraine made changes - 27/Jul/22 2:57 PM

Comment

[ Testing email function of jira: [~aloraine] ]

Ann Loraine made changes - 27/Jul/22 6:21 PM

Status

In Progress [ 3 ]

To-Do [ 10305 ]

Ann Loraine made changes - 27/Jul/22 6:21 PM

Assignee

Ann Loraine [ aloraine ]

Ann Loraine made changes - 01/Aug/22 8:05 AM

Sprint

Summer 4 2022 July 4, Summer 5 2022 July 18 [ 150, 151 ]

Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1 [ 150, 151, 152 ]

Ann Loraine made changes - 01/Aug/22 8:05 AM

Rank

Ranked higher

Ann Loraine made changes - 11/Aug/22 4:29 PM

Status

To-Do [ 10305 ]

In Progress [ 3 ]

Ann Loraine made changes - 11/Aug/22 4:29 PM

Assignee

Molly Davis [ molly ]

Ann Loraine made changes - 17/Aug/22 9:31 AM

Sprint

Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1 [ 150, 151, 152 ]

Summer 4 2022 July 4, Summer 5 2022 July 18, Summer 6 2022 Aug 1, Fall 1 2022 Aug 15 [ 150, 151, 152, 153 ]

Ann Loraine made changes - 17/Aug/22 9:31 AM

Rank

Ranked higher

Ann Loraine made changes - 26/Aug/22 12:47 PM

Link

This issue relates to ~~IGBF-3162~~ [ ~~IGBF-3162~~ ]