Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3460

Make Markdown creating and summarizing temperature:genotype interaction testing

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      Create a Markdown creating and summarizing temperature:genotype interaction testing.

      Please follow the same general pattern as in the other statistical analysis markdowns.

      Start with data from SL4 not SL5.

      If the output is per time, then include a column: time indicating the time point being tested.

      Code Sources:

      https://rdrr.io/bioc/DESeq2/man/results.html#:~:text=object)%24tDegreesFreedom%20.-,Value,standard%20error%20of%20the%20log2FoldChange%20.
      https://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#note-on-factor-levels

        Attachments

          Issue Links

            Activity

            ann.loraine Ann Loraine created issue -
            ann.loraine Ann Loraine made changes -
            Field Original Value New Value
            Epic Link IGBF-3446 [ 22548 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Description Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in are2VF36.Rmd and the other statistical analysis markdowns.

            Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            ann.loraine Ann Loraine made changes -
            Description Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Start with data from SL4 not SL5.

            ann.loraine Ann Loraine made changes -
            Description Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Start with data from SL4 not SL5.

            Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Start with data from SL4 not SL5.

            If the output is per time, then include a column: time indicating the time point being tested.

            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ]
            Mdavis4290 Molly Davis made changes -
            Link This issue relates to IGBF-3436 [ IGBF-3436 ]
            Mdavis4290 Molly Davis made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            Mdavis4290 Molly Davis made changes -
            Description Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Start with data from SL4 not SL5.

            If the output is per time, then include a column: time indicating the time point being tested.

            Create a Markdown creating and summarizing temperature:genotype interaction testing.

            Please follow the same general pattern as in the other statistical analysis markdowns.

            Start with data from SL4 not SL5.

            If the output is per time, then include a column: time indicating the time point being tested.

            Code Sources:

            https://rdrr.io/bioc/DESeq2/man/results.html#:~:text=object)%24tDegreesFreedom%20.-,Value,standard%20error%20of%20the%20log2FoldChange%20.
            https://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#note-on-factor-levels
            Hide
            Mdavis4290 Molly Davis added a comment - - edited

            Used FindMutantVsWildtypeDEGenes-DESeq2.Rmd as an example of how to create my markdown.

            Some things I changed and created:

            • output file name
            • the 'v' object which grabs genotype A & V and now the time chunk that I want
            • coldata now includes temperature instead of time
            • the design was changed to genotype + temperature with an interaction term with both of them
            • the results now doesn't include a contrast but just a results name because the interaction term already made the comparison
            • I added a new column to the result. It is called time and includes the time chunk data that was used in the analysis. This column should now show up in the output files
            • I decided to change the time chunk and run the analysis each time point. In other words, I reran the analysis for each timepoint. This was due to needing fast results. I hope to go back and change the code to loop through each time and produce an output file that combines all of them.
            • For now I made an output txt file for each time and genome but didn't leave it that way.
            • I took all of the output txt files and combined them into excel file sheets. One for SL4 and one for SL5.
            • Results file names:
              MvW-temp-SL4.xlsx
              MvW-temp-SL5.xlsx
            • I then made sure the excel file were easy to read and the top row was frozen when scrolling down.

            Next Step: Write and explain everything in the markdown.

            Show
            Mdavis4290 Molly Davis added a comment - - edited Used FindMutantVsWildtypeDEGenes-DESeq2.Rmd as an example of how to create my markdown. Some things I changed and created: output file name the 'v' object which grabs genotype A & V and now the time chunk that I want coldata now includes temperature instead of time the design was changed to genotype + temperature with an interaction term with both of them the results now doesn't include a contrast but just a results name because the interaction term already made the comparison I added a new column to the result. It is called time and includes the time chunk data that was used in the analysis. This column should now show up in the output files I decided to change the time chunk and run the analysis each time point. In other words, I reran the analysis for each timepoint. This was due to needing fast results. I hope to go back and change the code to loop through each time and produce an output file that combines all of them. For now I made an output txt file for each time and genome but didn't leave it that way. I took all of the output txt files and combined them into excel file sheets. One for SL4 and one for SL5. Results file names : MvW-temp-SL4.xlsx MvW-temp-SL5.xlsx I then made sure the excel file were easy to read and the top row was frozen when scrolling down. Next Step : Write and explain everything in the markdown.
            Hide
            Mdavis4290 Molly Davis added a comment - - edited

            Branch: https://bitbucket.org/mdavis4290/molly3-flavonoid-rnaseq/branch/IGBF-3460

            File names:

            • 72_F3H_PollenTube/FindTreatmentEffectAcrossGenotypes-DESeq2.Rmd
            • 72_F3H_PollenTube/FindTreatmentEffectAcrossGenotypes-DESeq2.pdf
            • 72_F3H_PollenTube/results/MvW-temp-SL4.xlsx
            • 72_F3H_PollenTube/results/MvW-temp-SL5.xlsx

            Notes: There is some more work to do obviously and improvements to make but the results are good to go and are ready to be shown.

            Show
            Mdavis4290 Molly Davis added a comment - - edited Branch : https://bitbucket.org/mdavis4290/molly3-flavonoid-rnaseq/branch/IGBF-3460 File names : 72_F3H_PollenTube/FindTreatmentEffectAcrossGenotypes-DESeq2.Rmd 72_F3H_PollenTube/FindTreatmentEffectAcrossGenotypes-DESeq2.pdf 72_F3H_PollenTube/results/MvW-temp-SL4.xlsx 72_F3H_PollenTube/results/MvW-temp-SL5.xlsx Notes: There is some more work to do obviously and improvements to make but the results are good to go and are ready to be shown.
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ]
            Mdavis4290 Molly Davis made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            Mdavis4290 Molly Davis made changes -
            Assignee Ann Loraine [ aloraine ]
            ann.loraine Ann Loraine made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            Hide
            ann.loraine Ann Loraine added a comment -

            Adding Molly's fork as a new remote to my local repo copy. Starting to look at her new work. Thanks [~molly].

            Show
            ann.loraine Ann Loraine added a comment - Adding Molly's fork as a new remote to my local repo copy. Starting to look at her new work. Thanks [~molly] .
            Hide
            ann.loraine Ann Loraine added a comment - - edited

            For the next draft, I would do all four temperatures in the same Markdown.

            Probably the easiest way to code it would be to transform the block of code inside the "for" loop starting on line 62 into a function that accepts a data frame with counts and a time point. Then, the time point parameter would then get used in line 70 to subset the columns.

            The function would then return the DESeq object.

            Then, you'd add additional code chunks, one per time point, where you run the function and save the returned DESeq object in a list. Because you would now have eight DESeq objects, you'd need to use a different key than in the current code. For example, instead of saving the results objects with assembly names as keys, you could use assembly names plus time point as keys.

            I think it would be a good idea to do something like the above because currently, to re-run this code, we have to manually change it by editing the Markdown.

            If this could be done, then the Makdown code can write out the file, instead of having to manually combine everything.

            Also, another benefit is that the Markdown can then summarize the results by reporting the number of comparisons in each time point that met the target false discovery rate (Q) threshold.

            It looks like none of the comparisons met the threshold. This is a negative result. That's OK. The most important thing here, and with all analyses, is to "show your work" so that whoever is reading this can believe that the analysis is correct and then move on to the next question.

            Show
            ann.loraine Ann Loraine added a comment - - edited For the next draft, I would do all four temperatures in the same Markdown. Probably the easiest way to code it would be to transform the block of code inside the "for" loop starting on line 62 into a function that accepts a data frame with counts and a time point. Then, the time point parameter would then get used in line 70 to subset the columns. The function would then return the DESeq object. Then, you'd add additional code chunks, one per time point, where you run the function and save the returned DESeq object in a list. Because you would now have eight DESeq objects, you'd need to use a different key than in the current code. For example, instead of saving the results objects with assembly names as keys, you could use assembly names plus time point as keys. I think it would be a good idea to do something like the above because currently, to re-run this code, we have to manually change it by editing the Markdown. If this could be done, then the Makdown code can write out the file, instead of having to manually combine everything. Also, another benefit is that the Markdown can then summarize the results by reporting the number of comparisons in each time point that met the target false discovery rate (Q) threshold. It looks like none of the comparisons met the threshold. This is a negative result. That's OK. The most important thing here, and with all analyses, is to "show your work" so that whoever is reading this can believe that the analysis is correct and then move on to the next question.
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ] Molly Davis [ molly ]
            ann.loraine Ann Loraine made changes -
            Status First Level Review in Progress [ 10301 ] To-Do [ 10305 ]
            Hide
            ann.loraine Ann Loraine added a comment -

            Also, thanks for rebasing your branch

            Show
            ann.loraine Ann Loraine added a comment - Also, thanks for rebasing your branch
            Mdavis4290 Molly Davis made changes -
            Status To-Do [ 10305 ] In Progress [ 3 ]
            Show
            Mdavis4290 Molly Davis added a comment - Branch : https://bitbucket.org/mdavis4290/molly3-flavonoid-rnaseq/branch/IGBF-3460b [~aloraine]
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ] Ann Loraine [ aloraine ]
            Mdavis4290 Molly Davis made changes -
            Status In Progress [ 3 ] Needs 1st Level Review [ 10005 ]
            ann.loraine Ann Loraine made changes -
            Status Needs 1st Level Review [ 10005 ] First Level Review in Progress [ 10301 ]
            ann.loraine Ann Loraine made changes -
            Status First Level Review in Progress [ 10301 ] Ready for Pull Request [ 10304 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ] Molly Davis [ molly ]
            Hide
            ann.loraine Ann Loraine added a comment -

            Please submit PR.

            Show
            ann.loraine Ann Loraine added a comment - Please submit PR.
            Show
            Mdavis4290 Molly Davis added a comment - PR : https://bitbucket.org/hotpollen/flavonoid-rnaseq/pull-requests/26
            Mdavis4290 Molly Davis made changes -
            Assignee Molly Davis [ molly ] Ann Loraine [ aloraine ]
            Mdavis4290 Molly Davis made changes -
            Status Ready for Pull Request [ 10304 ] Pull Request Submitted [ 10101 ]
            ann.loraine Ann Loraine made changes -
            Status Pull Request Submitted [ 10101 ] Reviewing Pull Request [ 10303 ]
            ann.loraine Ann Loraine made changes -
            Status Reviewing Pull Request [ 10303 ] Merged Needs Testing [ 10002 ]
            ann.loraine Ann Loraine made changes -
            Link This issue relates to IGBF-3463 [ IGBF-3463 ]
            Hide
            ann.loraine Ann Loraine added a comment -

            I have reviewed the new content and have change requests. Moving to Done!

            Show
            ann.loraine Ann Loraine added a comment - I have reviewed the new content and have change requests. Moving to Done!
            ann.loraine Ann Loraine made changes -
            Status Merged Needs Testing [ 10002 ] Post-merge Testing In Progress [ 10003 ]
            ann.loraine Ann Loraine made changes -
            Resolution Done [ 10000 ]
            Status Post-merge Testing In Progress [ 10003 ] Closed [ 6 ]
            ann.loraine Ann Loraine made changes -
            Assignee Ann Loraine [ aloraine ] Molly Davis [ molly ]

              People

              • Assignee:
                Mdavis4290 Molly Davis
                Reporter:
                ann.loraine Ann Loraine
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: