You can look at detection probability as a "nuisance" parameter. Ultimately, the parameter of interest is occupancy, but in order to estimate occupancy with accuracy, we have to take into account how effectively the species of interest can be detected. This is why detection probability should be approached first--it actually impacts your estimates of occupancy.
For example, a site surveyed for three seasons with three surveys in each season might appear:
101 000 001
If we could detect the species with 100% accuracy, we'd say that the site went extinct from year 1 to year 2. However, we know from seasons 1 and 3 that the species was present at the site, but not always detected (indicated by zeros). Clearly, detection probability is not 100%. Therefore it is possible that during year 2, the species was actually present (occupancy), but undetected (detection). This probability is directly incorporated into the calculation of occupancy probability.
Therefore, occupancy probability is an informed estimate of occupancy after taking into account detectability. Detection probability is a nuisance parameter that is determined from the 1's and 0's of your survey history.
This is a pretty basic concept--it might be helpful for you to read some of the Mackenzie papers from the early 2000s, and the text Occupancy Estimation and Modeling.