Hi Yiying,
Yes, you must always run your analysis first without data (`usedata = 0`) to check that the effective prior is sensible -- this has been emphasised in the PAML documentation and related articles and protocol papers/tutorials :)
If you have specified a lower-bound calibration with a minimum age of 28 Myr (e.g., `L(28)`), you expect to see a distribution with a minimum around 28 Myr (which you see in the plot you show above). The shape of the distribution that you expect depends on the type of node age constraint you have used and is implemented in MCMCtree. Following Table 7 in the PAML documentation...
... I believe you might have chosen the lower-bound constraint based on your first message. For instance, if you used `L(28)`, this means that you have used the default values for `p`, `c`, and `p_L` (`p=0.1`, `c=1`, and `p_L = 0.025`). Therefore, you are not using a hard bound. In other words, you are allowing for the possibility that the minimum age can be a little bit younger than 28Myr with a tail probability of 2.5% (which you also see on the plot you show in your last message).
Remember that you can use the mcmc3r R package to check how the distribution that you want to use to constrain a specific node age looks like. Assuming that you used `L(28)`, I have then used mcmc3r to plot how this distribution would look like, which is actually very similar to what you have obtained when running MCMCtree without data:
```R
curve( mcmc3r::dL( x, tL = 28 ), from = 26, to = 90, n = 1e4 )
abline(v=28, col = "blue")
```

Please note that the shape of the distribution from 28Myr to the edge of my x axis (region for older times) has changed when you ran MCMCtree as now you have to consider parental nodes in the tree topology (which may be uncalibrated and/or calibrated with additional constraints, I do not know this as I have not seen your input files); such nodes must also have a time density as they are part of the joint time prior. When considering the densities for calibrated and uncalibrated nodes to build the time prior, it is important to always bear in mind that daughter nodes can never be younger than parental nodes in the specified tree topology, and so marginal time densities for daughter nodes may have to be truncated so that this biological rule is not violated. Please note that this truncation in the time prior may happen if the densities assigned to daughter nodes ended up being older than parental nodes or when you have overlapping node age constraints in neighbouring nodes and the aforementioned restriction is to be applied. You can check this by plotting the marginal time densities that you have for "n_57", "n_56", "n_55", "n_54" when you are not using your data. If you do that, you will see the density with which you constrained the age of node "n_58" (i.e., `L(28)`) has the area for older ages "occupied" by the marginal time densities estimated for these four parental nodes, hence why the density for node "n_58" has been somewhat narrowed if compared to the plot obtained in mcmc3r.
If you wanted to use a hard bound instead, you will need to include `p_L = 1e-300` as specified in the PAML documentation, and also shown in the attached table above. You will need to include the values of `p` and `c` too. E.g.: if you keep `p` and `c` with default values, then 'L(28, 0.1, 1, 1e-300)' - I suggest that you play around and plot various distributions using the mcmc3r R package to check which shape distribution best represents your knowledge on the fossil specimen that you are using to constrain that specific node age on your tree topology!
Lastly, you will have to check whether your posterior densities obtained when using your data are in conflict with the the prior densities (no data) to assess whether your data are informative and to check whether there are serious conflicts between the specified prior and the posterior. Following
Nascimento et al. 2017, if the prior and posterior densities (e.g., for a specific node age) were very similar, this means that the data are not very informative (essentially, you have not updated your prior despite incorporating some molecular data). If there is an overlap between prior and posterior densities, but the posterior is more concentrated than the prior, then the data are informative and the prior you used is reasonable. If prior and posterior densities do not overlap, then the prior and your data are in conflict (e.g., perhaps the prior is misspecified). You can use Tracer or other in-house scripts to load the "mcmc.txt" files obtained when running MCMCtree with and without data and plot both prior and posterior densities for the same nodes to check for these issues as part of the MCMC diagnostics -- you always have to carry out MCMC diagnostics to make sure that your chains have reached convergence, you have not had mixing issues, etc. You can read
Nascimento et al. 2017 for more details and examples on Bayesian phylogenetic analyses.
Hope this helps!
All the best,
Sandy