Hello Mark,
This is one of those questions to which there is no simple answer.
In an ideal world we have detailed locations of all the taxa,
preferably as points. The reality is that we rarely do.
When we have point data they tend to be sparsely sampled - this is
one of the reasons cell sizes for many analyses tend to be
relatively large.
Sometimes we have range polygons, such as from the IUCN. These tend
to overestimate distributions, or at least generalise the
boundaries. This is partly because of the nominal map scale and
purpose they are generated for does not require a high level of
detail.
Sometimes we have rasters generated using species distribution
models (also known as habitat suitability models or environmental
niche models). These tend to also overestimate ranges, and in some
case underestimate them. These might have high spatial resolutions,
but remember that precision is not the same as accuracy.
You have the fourth case, where the spatial location of a taxon is
known to some administrative unit. Ideally such units are small,
much smaller than the resolution wanted for an analysis. In the
worst case you might have something the size of Western Australia
(2.6 million km^2) which spans a wide range of biomes. In your case
1400 km^2 is equivalent to a square of approximately 37.5 km x 37.5
km, so it might be reasonable to analyse the data at something like
a 50 km resolution.
It is probably worth trying several resolutions to assess
sensitivity, for example 10, 25, 50 etc, although bear in mind that
taxon ranges in Biodiverse are in cell units so the numbers are not
directly comparable across cell sizes (a taxon range in one 50 km
cell might span anywhere from one to twenty-five 10 km cells).
An alternative is to use the administrative units as the spatial
analysis units. If your data are a table then you can use the names
of the units to define the cells (referred to in Biodiverse as a
text_group). The results will not plot spatially (something to fix
one day) but the indices will all be calculated and can be exported
from Biodiverse and then reattached to the spatial data using a join
of some sort.
Note that the units will not be an issue for PD since it is just the
sum of branch lengths within the specified neighbourhood (usually
sp_self_only() which is each cell in isolation). The endemism
indices will be affected since now the ranges are estimated as the
number of administrative units. The amount they are affected
depends on how much variation there is in the areas of the admin
units. If they are all close in size then it will not make much
difference. If they vary widely (e.g. Western Australia and Belize)
then there are clearly issues. Using this approach also implies
that each taxon spans the full area of each admin unit, but that is
also assumed with equal area grid cells.
Another option, if you are confident that each taxon spans all (or
nearly all) of each admin unit in which it is found, is to import
the data directly from polygons to square cells in Biodiverse. This
means you do not need to first calculate the centroids.
https://biodiverse-analysis-software.blogspot.com/2018/12/import-polygon-and-polyline-data.html
Sorry there are no concrete recommendations but hopefully there is
enough for you to start some experimentation to see what approach
gives meaningful results that help answer the question(s) you are
asking.
Regards,
Shawn.