Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3739

Re-run SRA muday 2022 timeseries data again with SL4

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      SRP460750
      Directory: /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21
      Previously we noticed that SRA had mismatched some of the data incorrectly and 16 of the sample-names were mislabeled. Dr. Reid reached out and had SRA change everything to the correct sample names. Now we must rerun the muday SRA data again on the cluster with nextflow and make sure the data is correctly labeled.
      For this task, we need to confirm and sanity-check the muday time course data that Rob recently uploaded and submitted to the Sequence Read Archive.
      If the data are good, we will replace all the existing BAM, junctions, etc. files deployed in the "hotpollen" quickload site with newly processed data.
      For this task:
      Check SRP on NCBI and review submission
      Download the data onto the cluster by using the SRP name
      Run nf-core/rnaseq pipeline
      Run our coverage graph and junctions scripts on the data
      Note that all files should now use their "SRR" names instead of the existing file names.

        Attachments

          Issue Links

            Activity

            Mdavis4290 Molly Davis created issue -
            Mdavis4290 Molly Davis made changes -
            Field Original Value New Value
            Epic Link IGBF-2993 [ 21429 ]
            Mdavis4290 Molly Davis made changes -
            Link This issue relates to IGBF-3720 [ IGBF-3720 ]
            Mdavis4290 Molly Davis made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            Mdavis4290 Molly Davis made changes -
            Rank Ranked higher
            Mdavis4290 Molly Davis made changes -
            Description muday-2022-timeseries = SRP460750 SRP460750
            Directory: /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21
            Previously we noticed that SRA had mismatched some of the data incorrectly and 16 of the sample-names were mislabeled. Dr. Reid reached out and had SRA change everything to the correct sample names. Now we must rerun the muday SRA data again on the cluster with nextflow and make sure the data is correctly labeled.
            For this task, we need to confirm and sanity-check the muday time course data that Rob recently uploaded and submitted to the Sequence Read Archive.
            If the data are good, we will replace all the existing BAM, junctions, etc. files deployed in the "hotpollen" quickload site with newly processed data.
            For this task:
            Check SRP on NCBI and review submission
            Download the data onto the cluster by using the SRP name
            Run nf-core/rnaseq pipeline
            Run our coverage graph and junctions scripts on the data
            Note that all files should now use their "SRR" names instead of the existing file names.
            Mdavis4290 Molly Davis made changes -
            Summary Rerun SRA muday 2022 timeseries data again with SL4 Re-run SRA muday 2022 timeseries data again with SL4
            Hide
            Mdavis4290 Molly Davis added a comment -
            Show
            Mdavis4290 Molly Davis added a comment - Used this document to run the pipeline: https://docs.google.com/document/d/1ig9ET-ykXF5nAX3P487cXWmZDGUlQpcwrvFXpbyP5vw/edit
            ann.loraine Ann Loraine made changes -
            Sprint Spring 10 [ 194 ] Spring 10, Summer 1 [ 194, 195 ]
            ann.loraine Ann Loraine made changes -
            Rank Ranked higher
            Hide
            Mdavis4290 Molly Davis added a comment - - edited

            Branch: https://bitbucket.org/mdavis4290/molly5-flavonoid-rnaseq/branch/IGBF-3739
            Directory: /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21/results/star_salmon
            Reviewer:
            Check that files have reasonable sizes (no "zero" size files, for example)
            Check that every "FJ.bed.gz" file has a corresponding "FJ.bed.gz.tbi" index file
            Check that every bam file has a corresponding "FJ.bed.gz" file
            Check that every bam file has a corresponding "scaled.bedgraph.gz" file
            Check that every "scaled.bedgraph.gz" has a corresponding "scaled.bedgraph.gz.tbi"

            Show
            Mdavis4290 Molly Davis added a comment - - edited Branch : https://bitbucket.org/mdavis4290/molly5-flavonoid-rnaseq/branch/IGBF-3739 Directory : /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21/results/star_salmon Reviewer : Check that files have reasonable sizes (no "zero" size files, for example) Check that every "FJ.bed.gz" file has a corresponding "FJ.bed.gz.tbi" index file Check that every bam file has a corresponding "FJ.bed.gz" file Check that every bam file has a corresponding "scaled.bedgraph.gz" file Check that every "scaled.bedgraph.gz" has a corresponding "scaled.bedgraph.gz.tbi"
            Mdavis4290 Molly Davis made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ] Robert Reid [ robertreid ]
            Hide
            robofjoy Robert Reid added a comment -

            I found 72

            bam files (~ 400MB in size)
            bai files (kb in size)
            scaled.bedgraph.gz files (40MB in size)
            more bedgraph tbi files (70k in size)

            72 bed.gz files, 40MB in size
            and 72 tbi files for these bad files

            Looks like everything is here!

            Show
            robofjoy Robert Reid added a comment - I found 72 bam files (~ 400MB in size) bai files (kb in size) scaled.bedgraph.gz files (40MB in size) more bedgraph tbi files (70k in size) 72 bed.gz files, 40MB in size and 72 tbi files for these bad files Looks like everything is here!
            robofjoy Robert Reid made changes -
            Assignee Robert Reid [ robertreid ] Molly Davis [ molly ]
            Mdavis4290 Molly Davis made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            Mdavis4290 Molly Davis made changes -
            Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
            Show
            Mdavis4290 Molly Davis added a comment - PR : https://bitbucket.org/hotpollen/flavonoid-rnaseq/pull-requests/48
            Mdavis4290 Molly Davis made changes -
            Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ] Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
            Hide
            ann.loraine Ann Loraine added a comment -

            PR is merged. Ready for testing.

            Show
            ann.loraine Ann Loraine added a comment - PR is merged. Ready for testing.
            ann.loraine Ann Loraine made changes -
            Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ]
            Mdavis4290 Molly Davis made changes -
            Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
            Hide
            Mdavis4290 Molly Davis added a comment -

            Review:

            • All quick load files are accounted for in SL4 directory
            • I made sure permission were accessible for group chmod -R g+w *
            • The merged files on bitbucket are correct

            Moving to Done!

            Show
            Mdavis4290 Molly Davis added a comment - Review : All quick load files are accounted for in SL4 directory I made sure permission were accessible for group chmod -R g+w * The merged files on bitbucket are correct Moving to Done!
            Mdavis4290 Molly Davis made changes -
            Resolution Done [ 10000 ]
            Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]

              People

              • Assignee:
                Mdavis4290 Molly Davis
                Reporter:
                Mdavis4290 Molly Davis
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: