Hi There,
I have been following this GuideLine:
And I am now trying to follow this:
Only problem is that it fails pretty fast when I get to the first step where I combine the matrices I get an error:
Error in calcNormFactors.DGEList(exp_study) : NAs not permitted
Can anyone tell me how to proceed as I have followed the exact pipeline to this point?
I use the below command:
/services/tools/ngs/trinityrnaseq_r20140717/util/
abundance_estimates_to_matrix.pl --est_method RSEM --out_prefix Trinity_genes ./Lt1160414/Lt11604.RSEM.genes.results ./Lt1180314/Lt1180314_estimate_out/Lt1180314.RSEM.genes.results ./St1180314/St1180314_estimate_out/St1180314.RSEM.genes.results ./St3160414/St3160414_estimate.results --name_sample_by_basedir
-reading file: ./Lt1160414/Lt1160414_estimate_out/Lt1160414.RSEM.genes.results
-reading file: ./Lt1180314/Lt1180314_estimate_out/Lt1180314.RSEM.genes.results
-reading file: ./St1180314/St1180314_estimate_out/St1180314.RSEM.genes.results
-reading file: ./St3160414/St3160414_estimate_out/St3160414.RSEM.genes.results
* Outputting combined matrix.
/services/tools/ngs/trinityrnaseq_r20140717/util/support_scripts/run_TMM_scale_matrix.pl --matrix Trinity_genes.not_cross_norm.fpkm.tmp > Trinity_genes.TMM.fpkm.matrixCMD: R --vanilla -q < __tmp_runTMM.R 1>&2
> library(edgeR)
Loading required package: limma
>
> rnaseqMatrix = read.table("Trinity_genes.not_cross_norm.fpkm.tmp", header=T, row.names=1, com='', check.names=F)
> rnaseqMatrix = round(rnaseqMatrix)
> exp_study = DGEList(counts=rnaseqMatrix, group=factor(colnames(rnaseqMatrix)))
> exp_study = calcNormFactors(exp_study)
Error in calcNormFactors.DGEList(exp_study) : NAs not permitted
Calls: calcNormFactors -> calcNormFactors.DGEList
Execution halted
Error, cmd: R --vanilla -q < __tmp_runTMM.R 1>&2 died with ret (256) at /services/tools/ngs/trinityrnaseq_r20140717/util/support_scripts/run_TMM_scale_matrix.pl line 98.
Error, CMD: /services/tools/ngs/trinityrnaseq_r20140717/util/support_scripts/run_TMM_scale_matrix.pl --matrix Trinity_genes.not_cross_norm.fpkm.tmp > Trinity_genes.TMM.fpkm.matrix died with ret 6400 at /services/tools/ngs/trinityrnaseq_r20140717/util/
abundance_estimates_to_matrix.pl line 240.
My combined matrix does have NA's but why is that not allowed? I mean, not all genes are expected to be in all samples?
here is head of the Matrix:
head Trinity_genes.counts.matrix
Lt1160414_estimate_out Lt1180314_estimate_out St1180314_estimate_out St3160414_estimate_out
c46397_g2 47.33 NA NA NA
c17494_g1 30.79 4.50 24.56 2004.71
c35331_g1 68.42 220.91 174.29 135.64
c35772_g2 NA 1.02 NA 0.00
c38800_g1 668.13 2275.87 613.69 4213.08
c13549_g1 14.26 7.66 NA 13.00
c73778_g1 0.00 3.00 0.00 0.00
c40067_g2 13.19 NA NA 118.29
head Trinity_genes.not_cross_norm.fpkm.tmp
Lt1160414_estimate_out Lt1180314_estimate_out St1180314_estimate_out St3160414_estimate_out
c46397_g2 2.42 NA NA NA
c17494_g1 1.48 0.62 0.81 13.64
c35331_g1 1.31 3.59 4.41 1.20
c35772_g2 NA 0.24 NA 0.00
c38800_g1 23.57 33.77 9.32 58.64
c13549_g1 0.51 0.24 NA 1.12
c73778_g1 0.00 0.75 0.00 0.00
c40067_g2 1.88 NA NA 0.63
I hope somebody will be able to help :)
Cheers,
Bent