Hey Santiago,
I agree with you, I can't think a situation where it would make sense to have a calibration on internal nodes when using a coalescent tree prior. However, I was thinking more at the use of heterochronous data (e.g. aDNA). I can't see anything wrong in using tipdates in a coalescne-based analysis. Another situation is that you may have information to calibrate the root.
As for the generation time, from Drummond et al 2002: "It is assumed genealogies are realized by the Kingman coalescent process. Our time units in this article are “calendar units before the present” [e.g., days before present (BP)], where the present is the time of the most recent leaf and set to zero. Let p denote the number of calendar units per generation and Ө=Ne p. The scale factor Ө converts “coalescent time” to calendar time and is one of two key objects of our inference. Note that we do not estimate p and Ne separately, only their product." [just to clarify, Ө is not theta, as theta is defined earlier in the paper as theta=2Ne*mu].
My understanding from this is that: while you provide a calibration in a unit of calendar year (days, year. my etc), as you say, and the coalescent time (i.e. the x-axis) is rescaled in calendar year (days, year), the estimation of the population size is in Ne*p where p is the generation time expressed in the unit you provided in your calibration. I wouldn't be able to explain to you how this is handled mathematically, but biologically, it always made lots of sense to me because genetic drift operates on a 'per generation basis'. With EBSP being an extension of BSP, I always assumed the same approach is applied.
I'm not sure whether this helps or confuses even more things...
cheers,
carlo
Drummond, A.J., Nicholls, G.K., Rodrigo, A.G., and Solomon, W. (2002) Estimating Mutation Parameters, Population History and Genealogy Simultaneously From Temporally Spaced Sequence Data. Genetics 161(3), 1307-1320.