Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3258

Download and process SRP371294 RNA-Seq data

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      This data set from Arabidopsis thaliana contains six samples of FACS-sorted sperm and vegetative cells from mature pollen. The publication is here: https://pubmed.ncbi.nlm.nih.gov/36515615/

      This would be a useful reference data set for our studies, as the authors reported many differentially and alternatively spliced genes between sperm and vegetative cells harvested from mature Arabidopsis pollen.

      For this task

      • download the data as fastq files from SRA
      • align fastq files using nf-core/rnaseq vs. TAIR10 genome (see link below)
      • for alignment parameters, use original publication (referenced above) and their parameters that they used in their experiment. Every RNA-Seq to genome alignment tool requires the user to define a maximum intron size parameter. Never use the default! Customize for your species!
      • align using same maxIntron parameter reported in the methods section for the paper
      • for the above, make a new "config" file
      • create coverage graphs
      • create junction files

      Use this reference genome for alignment:

      Create the "fasta" file from the above 2bit file using blat suite tools on cluster. The program you need is 2bitToFa (I think to load it, you have to use "module load blatsuite" or something like that. Use "module avail" to find the correct module name.)

      2bitToFa command:

      twoBitToFa A_thaliana_Jun_2009.2bit A_thaliana_Jun_2009.fa
      

        Attachments

          Issue Links

            Activity

            Hide
            ann.loraine Ann Loraine added a comment -

            Added files to repository:

            • SRP371294-multiqc_report.html
            • SRP371294-salmon.merged.gene_counts.tsv
            • SRP371294.config (copy of /nobackup/tomato_genome/alt_splicing/SRP371294/Arabidopsis.config)

            Multiqc file looks fine.

            Moving to DONE.

            Show
            ann.loraine Ann Loraine added a comment - Added files to repository: SRP371294-multiqc_report.html SRP371294-salmon.merged.gene_counts.tsv SRP371294.config (copy of /nobackup/tomato_genome/alt_splicing/SRP371294/Arabidopsis.config) Multiqc file looks fine. Moving to DONE.
            Hide
            ann.loraine Ann Loraine added a comment -

            Transferring with:

            scp -J aloraine@hop.renci.org -r SRP371294.transfer 
            aloraine@lorainelab-quickload.scidas.org:/projects/igbquickload/lorainelab/www/main/htdocs/rnaseq/A_thaliana_Jun_2009/SRP371294/.
            
            Show
            ann.loraine Ann Loraine added a comment - Transferring with: scp -J aloraine@hop.renci.org -r SRP371294.transfer aloraine@lorainelab-quickload.scidas.org:/projects/igbquickload/lorainelab/www/main/htdocs/rnaseq/A_thaliana_Jun_2009/SRP371294/.
            Hide
            ann.loraine Ann Loraine added a comment -

            Deploying data to:

            /projects/igbquickload/lorainelab/www/main/htdocs/rnaseq/A_thaliana_Jun_2009/SRP371294

            On RENCI sci-das host.

            Note:

            This will not be part of the hotpollen Quickload but instead will get put into the "rnaseq" quickload.

            Show
            ann.loraine Ann Loraine added a comment - Deploying data to: /projects/igbquickload/lorainelab/www/main/htdocs/rnaseq/A_thaliana_Jun_2009/SRP371294 On RENCI sci-das host. Note: This will not be part of the hotpollen Quickload but instead will get put into the "rnaseq" quickload.
            Hide
            ann.loraine Ann Loraine added a comment -

            Copied files to /nobackup/tomato_genome/alt_splicing/SRP371294.transfer ( a location where my user has write permission )

            Number of files: 36
            Number of samples: 6
            Size: 15 Gb

            Show
            ann.loraine Ann Loraine added a comment - Copied files to /nobackup/tomato_genome/alt_splicing/SRP371294.transfer ( a location where my user has write permission ) Number of files: 36 Number of samples: 6 Size: 15 Gb
            Hide
            ann.loraine Ann Loraine added a comment -
            Show
            ann.loraine Ann Loraine added a comment - Read this: https://mason.gmu.edu/~montecin/UNIXpermiss.htm

              People

              • Assignee:
                Mdavis4290 Molly Davis
                Reporter:
                ann.loraine Ann Loraine
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: