Uploaded image for project: 'IGB'
  1. IGB
  2. IGBF-2998

Deploy alignments data to Quickload site

    Details

    • Type: Improvement
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None
    • Story Points:
      1.5
    • Sprint:
      Fall 9 2021 Dec 13 - Dec 24, Spring 1 2022 Jan 3 - Jan 14, Spring 2 2022 Jan 18 - Jan 28

      Description

      For this task, deploy data files to a quickload site. Create annots.xml for all the files.

        Attachments

          Activity

          Hide
          ann.loraine Ann Loraine added a comment -

          Copied makeAnnotsXml.py from rna-seq project in "hotpollen" workgroup (bitbucket) into src directory. Need to modify it to work with this project.

          Moving back to To-Do for the moment while I work on a couple of other things.

          Show
          ann.loraine Ann Loraine added a comment - Copied makeAnnotsXml.py from rna-seq project in "hotpollen" workgroup (bitbucket) into src directory. Need to modify it to work with this project. Moving back to To-Do for the moment while I work on a couple of other things.
          Hide
          ann.loraine Ann Loraine added a comment - - edited

          Added code and file "the_colors.txt" needed to use color scheme defined in the repository R code. See new method in Common.R:

          # For each value in ExternalDataSets/samples.csv, used to run
          # Nextflow's rna-seq pipeline, write out a data sheet mapping
          # sample names to hexadecimal color codes, used by makeAnnotsXml.py 
          # to configure colors for displaying bam, bedgraph, etc. data files
          # in IGB
          outputColorsForAnnotsXml = function(fname="../ExternalDataSets/the_colors.txt") {
            the_colors = getSampleColors(hex=T)
            df = data.frame(sample=names(the_colors),
                            the_colors=the_colors)
            write.table(df,file=fname,quote=F,sep="\t",row.names = F)
            return(df)
          }
          

          Note: there's a weird inconsistency between the header values in the counts data file and the bam files and samples.csv file provided to Nextflow nf-core/rnaseq pipeline. For some reason, the hyphen characters got replaced with period characters in the counts header. I'm not sure why this happened or if it means the files are somehow out of sync with each other.

          Show
          ann.loraine Ann Loraine added a comment - - edited Added code and file "the_colors.txt" needed to use color scheme defined in the repository R code. See new method in Common.R: # For each value in ExternalDataSets/samples.csv, used to run # Nextflow's rna-seq pipeline, write out a data sheet mapping # sample names to hexadecimal color codes, used by makeAnnotsXml.py # to configure colors for displaying bam, bedgraph, etc. data files # in IGB outputColorsForAnnotsXml = function(fname= "../ExternalDataSets/the_colors.txt" ) { the_colors = getSampleColors(hex=T) df = data.frame(sample=names(the_colors), the_colors=the_colors) write.table(df,file=fname,quote=F,sep= "\t" ,row.names = F) return (df) } Note : there's a weird inconsistency between the header values in the counts data file and the bam files and samples.csv file provided to Nextflow nf-core/rnaseq pipeline. For some reason, the hyphen characters got replaced with period characters in the counts header. I'm not sure why this happened or if it means the files are somehow out of sync with each other.
          Hide
          ann.loraine Ann Loraine added a comment -

          To test:

          1. Add this new Quickload site to IGB: http://lorainelab-quickload.scidas.org/hotpollen, call it "Hot Pollen" (or whatever you want)
          2. Open the latest tomato genome by clicking the tomato image on the IGB home screen
          3. Observe that the tomato gene models load
          4. Observe that the Quickload you added appears as a folder in the "Available Data" section of the Data Access panel in IGB
          5. Open the folder - you should see a subfolder named "ARE PGRP1939255"
          6. Open the above folder - you should see two sub-folders called "Reads" and "Graph - Scaled"
          7. Open the "Reads" folder and check that you can load some data from every data set shown
          8. Open the "Graph -Scaled" folder and check that you can load some data from every data set shown

          If the above tests work, then move the ticket to "Done" and re-assign it to [~aloraine].....

          Show
          ann.loraine Ann Loraine added a comment - To test: Add this new Quickload site to IGB: http://lorainelab-quickload.scidas.org/hotpollen , call it "Hot Pollen" (or whatever you want) Open the latest tomato genome by clicking the tomato image on the IGB home screen Observe that the tomato gene models load Observe that the Quickload you added appears as a folder in the "Available Data" section of the Data Access panel in IGB Open the folder - you should see a subfolder named "ARE PGRP1939255" Open the above folder - you should see two sub-folders called "Reads" and "Graph - Scaled" Open the "Reads" folder and check that you can load some data from every data set shown Open the "Graph -Scaled" folder and check that you can load some data from every data set shown If the above tests work, then move the ticket to "Done" and re-assign it to [~aloraine] .....
          Hide
          ann.loraine Ann Loraine added a comment -

          Added two files showing scaled coverage graphs.

          Show
          ann.loraine Ann Loraine added a comment - Added two files showing scaled coverage graphs.
          Hide
          omarne Omkar Marne (Inactive) added a comment -

          Test results:

          • I added IGB: http://lorainelab-quickload.scidas.org/hotpollen and named it Hot Pollen.
          • After opening the data source in IGB, clicking tomato gene, and opening folder and subfolders of the data source, data got loaded as expected.
          • Reads and Graphs folder loaded some data of every data set shown.

          Closing the ticket.

          Show
          omarne Omkar Marne (Inactive) added a comment - Test results: I added IGB: http://lorainelab-quickload.scidas.org/hotpollen and named it Hot Pollen. After opening the data source in IGB, clicking tomato gene, and opening folder and subfolders of the data source, data got loaded as expected. Reads and Graphs folder loaded some data of every data set shown. Closing the ticket.

            People

            • Assignee:
              ann.loraine Ann Loraine
              Reporter:
              ann.loraine Ann Loraine
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: