Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-3290

Use new sample labels recommended by Muday Lab

    Details

    • Type: Task
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None

      Description

      See attached.

      For this task, modify AddGeneAnnotations.Rmd to change labels as advised.

      A.34.15.8 is actually F.34.15.8 and vice versa
      A.28.30.8 is actually V.28.30.8 and vice versa
      A.34.45.8 is actually F.34.45.8 and vice versa
      A.28.75.8 is actually V.28.75.8 and vice versa

        Attachments

          Issue Links

            Activity

            Hide
            ann.loraine Ann Loraine added a comment -

            Created new branch with updates. Merged into main in team repo.

            Show
            ann.loraine Ann Loraine added a comment - Created new branch with updates. Merged into main in team repo.
            Hide
            ann.loraine Ann Loraine added a comment -

            Molly - just letting you know that the new code addressing sample switching is merged into the main branch. Please take a look. If you do not observe anything that need to be changed, please move this ticket to DONE. If you do see a problem, make a note of it and move this ticket back to "To-Do"

            Thank you!!

            attn: [~molly]

            Show
            ann.loraine Ann Loraine added a comment - Molly - just letting you know that the new code addressing sample switching is merged into the main branch. Please take a look. If you do not observe anything that need to be changed, please move this ticket to DONE. If you do see a problem, make a note of it and move this ticket back to "To-Do" Thank you!! attn: [~molly]
            Hide
            ann.loraine Ann Loraine added a comment -

            Update:

            Muday lab tracked down the problem that caused the sample switching. It was due to misunderstanding on the sequencing facility side. See attached PDF.

            Re-opening the ticket.

            Show
            ann.loraine Ann Loraine added a comment - Update: Muday lab tracked down the problem that caused the sample switching. It was due to misunderstanding on the sequencing facility side. See attached PDF. Re-opening the ticket.
            Hide
            ann.loraine Ann Loraine added a comment -

            During scrum, decided to make copies of the original misnamed fastq files using new names that match what the samples truly are.
            Ann: I would like to use our sample codes in the names.

            Show
            ann.loraine Ann Loraine added a comment - During scrum, decided to make copies of the original misnamed fastq files using new names that match what the samples truly are. Ann: I would like to use our sample codes in the names.
            Hide
            ann.loraine Ann Loraine added a comment -

            I'd like to look into how the information shown on SRA record pages corresponds to columns in the sample submission table.

            Motivation:

            • Will help pick great values for each column that illuminate the experimental design.
            • Will help Johnson lab members, students, etc. re-use these data for stuff
            Show
            ann.loraine Ann Loraine added a comment - I'd like to look into how the information shown on SRA record pages corresponds to columns in the sample submission table. Motivation: Will help pick great values for each column that illuminate the experimental design. Will help Johnson lab members, students, etc. re-use these data for stuff
            Hide
            robofjoy Robert Reid added a comment -

            Azenta sample pictures with labels.pptx Muday lab RNA samples for sample name conversion.xls

            These are the 2 files shared by Gloria after they resolved the sample switching by the sequencing company.

            Gloria's email blurb:
            "Anthony and I separated worked through the misordered samples and both came up with the same pattern. In the attached Excel sheet, you can see in the FIRST column what your original sample names were (before that short switch of are). In the SECOND column are what those samples already are.

            When you rename the samples, we'd like you to keep the old name on the sheet someone to present in any confusion. What would be really helpful to us is if after you rename the lanes, you could put them in the intended order.

            We are mining this data to look at the transcription of specific genes. Ann did note that we should look at normalized reads. We'd love to have such a spreadsheet that had the normalized reads. We'd also love this exported to an excel sheet with the SolyIDs of each gene and the gene names.

            Now that we have clear sample identities, we'd love to see if the PCA plot shows groups based on temperature and genotypes!"

            Show
            robofjoy Robert Reid added a comment - Azenta sample pictures with labels.pptx Muday lab RNA samples for sample name conversion.xls These are the 2 files shared by Gloria after they resolved the sample switching by the sequencing company. Gloria's email blurb: "Anthony and I separated worked through the misordered samples and both came up with the same pattern. In the attached Excel sheet, you can see in the FIRST column what your original sample names were (before that short switch of are). In the SECOND column are what those samples already are. When you rename the samples, we'd like you to keep the old name on the sheet someone to present in any confusion. What would be really helpful to us is if after you rename the lanes, you could put them in the intended order. We are mining this data to look at the transcription of specific genes. Ann did note that we should look at normalized reads. We'd love to have such a spreadsheet that had the normalized reads. We'd also love this exported to an excel sheet with the SolyIDs of each gene and the gene names. Now that we have clear sample identities, we'd love to see if the PCA plot shows groups based on temperature and genotypes!"
            Hide
            ann.loraine Ann Loraine added a comment -

            New email from Muday lab with attachments Excel spreadsheet with new file name mappings and PowerPoint with images of the original sample boxes are now added to the repository.

            See flavonoid-rnaseq/muday-144-analysis/Documentation.

            Modifying code now.

            Show
            ann.loraine Ann Loraine added a comment - New email from Muday lab with attachments Excel spreadsheet with new file name mappings and PowerPoint with images of the original sample boxes are now added to the repository. See flavonoid-rnaseq/muday-144-analysis/Documentation. Modifying code now.
            Hide
            robofjoy Robert Reid added a comment -

            Updated the UNC Charlotte HPC cluster to reflect the changes in sample names.

            Sample names are being left in the original form. This is for the raw sequence data backed up on cluster at:

            /projects/tomato_genome/rnaseq/muday144-timeSeries-checkReadMEFIRST

            I changed the folder name to suggest that one should check out the README !!
            And in the folder I added the README.txt.

            Within the README.txt I mention what has happened and then point the user to the EXCEL file the highlights the sample name switching that is in bitBucket HotPollen.

            https://bitbucket.org/hotpollen/flavonoid-rnaseq/src/main/muday-144-analysis/Documentation/Muday-lab-RNA-samples-for-sample-name-conversion.xlsx

            Show
            robofjoy Robert Reid added a comment - Updated the UNC Charlotte HPC cluster to reflect the changes in sample names. Sample names are being left in the original form. This is for the raw sequence data backed up on cluster at: /projects/tomato_genome/rnaseq/muday144-timeSeries-checkReadMEFIRST I changed the folder name to suggest that one should check out the README !! And in the folder I added the README.txt. Within the README.txt I mention what has happened and then point the user to the EXCEL file the highlights the sample name switching that is in bitBucket HotPollen. https://bitbucket.org/hotpollen/flavonoid-rnaseq/src/main/muday-144-analysis/Documentation/Muday-lab-RNA-samples-for-sample-name-conversion.xlsx
            Hide
            ann.loraine Ann Loraine added a comment - - edited

            Updated AddGeneAnnotations.Rmd to incorporate NEW sample renaming scheme recommended by Muday lab in

            • flavonoid-rnaseq/72_F3H_PollenTube/Documentation/Muday-lab-RNA-samples-for-sample-name-conversion.xlsx

            To review:

            • Read the knitted Markdowns AddGeneAnnotations.pdf and MakeMdsPlots.pdf
            • Check output file sample_renaming_summary.txt. Compare to Muday-lab-RNA-samples-for-sample-name-conversion.xlsx.

            Note that the new code and new files are already committed to the main branch of the team repository.

            Show
            ann.loraine Ann Loraine added a comment - - edited Updated AddGeneAnnotations.Rmd to incorporate NEW sample renaming scheme recommended by Muday lab in flavonoid-rnaseq/72_F3H_PollenTube/Documentation/Muday-lab-RNA-samples-for-sample-name-conversion.xlsx To review: Read the knitted Markdowns AddGeneAnnotations.pdf and MakeMdsPlots.pdf Check output file sample_renaming_summary.txt. Compare to Muday-lab-RNA-samples-for-sample-name-conversion.xlsx. Note that the new code and new files are already committed to the main branch of the team repository.
            Hide
            Mdavis4290 Molly Davis added a comment - - edited

            Gene annotations and Mds plots look like they have accurate renaming for samples provided by Muday lab. I compared the renaming summary to the naming conversions and they seem to match each other. Let me know if you would like to make a new ticket to perform the DESeq analysis with the new renamed counts file muday-144-SL5_counts-salmon.txt.

            [~aloraine]

            Show
            Mdavis4290 Molly Davis added a comment - - edited Gene annotations and Mds plots look like they have accurate renaming for samples provided by Muday lab. I compared the renaming summary to the naming conversions and they seem to match each other. Let me know if you would like to make a new ticket to perform the DESeq analysis with the new renamed counts file muday-144-SL5_counts-salmon.txt. [~aloraine]
            Hide
            ann.loraine Ann Loraine added a comment - - edited

            No, instead make a ticket tracking notifying Muday Lab of the good news.

            Show
            ann.loraine Ann Loraine added a comment - - edited No, instead make a ticket tracking notifying Muday Lab of the good news.
            Hide
            ann.loraine Ann Loraine added a comment -

            Updating makeAnnotsXml.py to use the new labels.

            Show
            ann.loraine Ann Loraine added a comment - Updating makeAnnotsXml.py to use the new labels.
            Hide
            ann.loraine Ann Loraine added a comment -

            Committed new annots.xml file and new script version to repository

            Show
            ann.loraine Ann Loraine added a comment - Committed new annots.xml file and new script version to repository

              People

              • Assignee:
                ann.loraine Ann Loraine
                Reporter:
                ann.loraine Ann Loraine
              • Votes:
                0 Vote for this issue
                Watchers:
                Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: