Hello, Vasilis.
The background frequencies need to be calculated with the “4d” procedure using “phyloFit” with gene sets for each species. You can find a clear example of this in our hg38 make doc which you can view in our source tree at http://genome-source.cse.ucsc.edu/gitweb/?p=kent.git;a=blob;f=src/hg/makeDb/doc/hg38.txt. Look for the section titled “Phylogenetic tree from 7-way”.
The rho, expected-length and target-coverage phastCons parameters you referenced can be used as-is.
Please contact us again at gen...@soe.ucsc.edu if you have any further questions. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
---
Steve Heitner
UCSC Genome Bioinformatics Group
--
Hello, Vasilis.
The background frequencies need to be calculated with the “4d” procedure using “phyloFit” with gene sets for each species. You can find a clear example of this in our hg38 make doc which you can view in our source tree at http://genome-source.cse.ucsc.edu/gitweb/?p=kent.git;a=blob;f=src/hg/makeDb/doc/hg38.txt. Look for the section titled “Phylogenetic tree from 7-way”.
The rho, expected-length and target-coverage phastCons parameters you referenced can be used as-is.
Please contact us again at gen...@soe.ucsc.edu if you have any further questions. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
---
Steve Heitner
UCSC Genome Bioinformatics Group
Hello, Vasilis.
This is mostly correct. You should only need the gene set for your reference species. You should not need gene sets for all species. For future reference, if you do need a gene set, you can always use Genscan (http://genes.mit.edu/GENSCAN.html) or Augustus (http://bioinf.uni-greifswald.de/augustus/) to obtain rough gene predictions.
Please contact us again at gen...@soe.ucsc.edu if you have any further questions. Questions sent to that address will be archived in a publicly-accessible forum for the benefit of other users. If your question contains sensitive data, you may send it instead to genom...@soe.ucsc.edu.
---
Steve Heitner
UCSC Genome Bioinformatics Group
--
Hello, Vasilis.
This is mostly correct. You should only need the gene set for your reference species. You should not need gene sets for all species. For future reference, if you do need a gene set, you can always use Genscan (http://genes.mit.edu/GENSCAN.html) or Augustus (http://bioinf.uni-greifswald.de/augustus/) to obtain rough gene predictions.
Please contact us again at gen...@soe.ucsc.edu if you have any further questions.
Questions sent to that address will be archived in a publicly-accessible forum for the benefit of other users. If your question contains sensitive data, you may send it instead to genom...@soe.ucsc.edu.
---
Steve Heitner
UCSC Genome Bioinformatics Group