Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3883

Merge OLD NextFlow runs with Current de novo NEXTFLOW runs

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      GOAL: Now that we have a merged NEXTFLOW salmon counts table derived from the de novo rna-SPades assembly, we combine want to compare the original NETFLOW results (reference based) to the newly merged de novo table.

      Step 1 is to make an EVEN bigger table. Each row is a gene.
      To start I suggest the table be for only 1 variety ( all NAG, e.g.,).

      We create this 1 table. And then apply Deseq2 and a PCA to see what results we get. Do they various time points align perfectly when comparing the de novo to the reference based?

        Attachments

          Activity

          robofjoy Robert Reid created issue -
          robofjoy Robert Reid made changes -
          Field Original Value New Value
          Epic Link IGBF-2993 [ 21429 ]
          robofjoy Robert Reid made changes -
          Assignee Ann Loraine [ aloraine ] Robert Reid [ robertreid ]
          robofjoy Robert Reid made changes -
          Assignee Robert Reid [ robertreid ] Brandon Bendickson [ bbendick ]
          bbendick Brandon Bendickson made changes -
          Status To-Do [ 10305 ] In Progress [ 3 ]
          Hide
          bbendick Brandon Bendickson added a comment -

          I found an issue where I lost sequences when modifying my de novo counts files. I remade the counts files and got the right number of sequences. I merged the de novo runs with the old NEXTFLOW runs. From what I can tell, the de novo runs map closely to the old runs.

          My merged files are located in: /projects/tomato_genome/fnb/dataprocessing/brandon_work/NEXTFLOW/result_processing/combine_old_and_new

          I would disregard Nag for now. I may have to rerun RNA spades for Nag as per ticket #3901.

          Moving this to first level review.

          Show
          bbendick Brandon Bendickson added a comment - I found an issue where I lost sequences when modifying my de novo counts files. I remade the counts files and got the right number of sequences. I merged the de novo runs with the old NEXTFLOW runs. From what I can tell, the de novo runs map closely to the old runs. My merged files are located in: /projects/tomato_genome/fnb/dataprocessing/brandon_work/NEXTFLOW/result_processing/combine_old_and_new I would disregard Nag for now. I may have to rerun RNA spades for Nag as per ticket #3901. Moving this to first level review.
          bbendick Brandon Bendickson made changes -
          Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
          bbendick Brandon Bendickson made changes -
          Assignee Brandon Bendickson [ bbendick ] Robert Reid [ robertreid ]
          robofjoy Robert Reid made changes -
          Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
          robofjoy Robert Reid made changes -
          Status First Level Review in Progress [ 10301 ] To-Do [ 10305 ]
          ann.loraine Ann Loraine made changes -
          Sprint Fall 1 [ 202 ] Fall 1, Fall 2 [ 202, 203 ]
          ann.loraine Ann Loraine made changes -
          Rank Ranked higher
          bbendick Brandon Bendickson made changes -
          Assignee Robert Reid [ robertreid ] Brandon Bendickson [ bbendick ]
          bbendick Brandon Bendickson made changes -
          Status To-Do [ 10305 ] In Progress [ 3 ]
          Hide
          bbendick Brandon Bendickson added a comment -

          Successfully merged all de novo runs with previous runs.
          Results are in: /projects/tomato_genome/fnb/dataprocessing/brandon_work/NEXTFLOW/result_processing/combine_old_and_new

          Checked the files to make sure we didn't lose anything
          -bash-4.4$ wc -l Heinz_merged.tsv
          24832 Heinz_merged.tsv
          -bash-4.4$ wc -l Malintka_merged.tsv
          24841 Malintka_merged.tsv
          -bash-4.4$ wc -l Nagcarlang_merged.tsv
          24697 Nagcarlang_merged.tsv
          -bash-4.4$ wc -l Tamaulipas_merged.tsv
          25357 Tamaulipas_merged.tsv

          Show
          bbendick Brandon Bendickson added a comment - Successfully merged all de novo runs with previous runs. Results are in: /projects/tomato_genome/fnb/dataprocessing/brandon_work/NEXTFLOW/result_processing/combine_old_and_new Checked the files to make sure we didn't lose anything -bash-4.4$ wc -l Heinz_merged.tsv 24832 Heinz_merged.tsv -bash-4.4$ wc -l Malintka_merged.tsv 24841 Malintka_merged.tsv -bash-4.4$ wc -l Nagcarlang_merged.tsv 24697 Nagcarlang_merged.tsv -bash-4.4$ wc -l Tamaulipas_merged.tsv 25357 Tamaulipas_merged.tsv
          bbendick Brandon Bendickson made changes -
          Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
          bbendick Brandon Bendickson made changes -
          Assignee Brandon Bendickson [ bbendick ] Robert Reid [ robertreid ]
          Hide
          robofjoy Robert Reid added a comment -

          I checked out merge.py in the above folder.
          It is a clean script using pandas!! Great work.

          I think this naming conventin works really well:
          M.25C.0hr.S.2.R1.fastq.gz.denovo
          Malintka.R3.0hr.25C.ref ......

          I think this ticket is ready for closing!

          Show
          robofjoy Robert Reid added a comment - I checked out merge.py in the above folder. It is a clean script using pandas!! Great work. I think this naming conventin works really well: M.25C.0hr.S.2.R1.fastq.gz.denovo Malintka.R3.0hr.25C.ref ...... I think this ticket is ready for closing!
          robofjoy Robert Reid made changes -
          Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
          robofjoy Robert Reid made changes -
          Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
          robofjoy Robert Reid made changes -
          Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
          robofjoy Robert Reid made changes -
          Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
          robofjoy Robert Reid made changes -
          Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
          robofjoy Robert Reid made changes -
          Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
          robofjoy Robert Reid made changes -
          Resolution Done [ 10000 ]
          Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]

            People

            • Assignee:
              robofjoy Robert Reid
              Reporter:
              robofjoy Robert Reid
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: