Tested with:
local aloraine$ wc -l 30-605730043_SL4_salmon.merged.gene_counts.tsv
34076 30-605730043_SL4_salmon.merged.gene_counts.tsv
local aloraine$ wc -l 30-804059537_SL4_salmon.merged.gene_counts.tsv
34076 30-804059537_SL4_salmon.merged.gene_counts.tsv
and
local aloraine$ wc -l 30-605730043_SL5_salmon.merged.gene_counts.tsv
36649 30-605730043_SL5_salmon.merged.gene_counts.tsv
local aloraine$ wc -l 30-804059537_SL5_salmon.merged.gene_counts.tsv
36649 30-804059537_SL5_salmon.merged.gene_counts.tsv
As shown above, the new data files for 30-605730043 have the same number of lines as their SL4 and SL5 counterparts from the 30-804059537 dataset. Also, the SL5 files have more lines than the SL4 files, which is consistent with prior knowledge that the SL5 assembly has more gene models in it.
Sanity check passes. Moving to DONE.
Branch: https://bitbucket.org/mdavis4290/molly-pistil-rna-seq/branch/IGBF-3466