I don't know why, but it's almost impossible to find a fast, compact 16-bit signed division routine in assembly.
Ideally it would compute HL = DE / HL and leave the remainder in DE, but I'll take anything.
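
To pin down the behaviour I'm after, here is a rough C reference model. It is only a sketch for checking results against, not the z88dk code: the name `sdiv16` and the `divmod16_t` struct are made up for illustration. It assumes division truncates toward zero (so the remainder takes the sign of the dividend) and that the caller never passes a zero divisor.

```c
#include <stdint.h>

/* Hypothetical result pair: quot models HL on exit, rem models DE on exit. */
typedef struct {
    int16_t quot;
    int16_t rem;
} divmod16_t;

/* Reference model: dividend models DE on entry, divisor models HL on entry.
 * Assumption: divisor != 0 (division by zero is not handled here). */
static divmod16_t sdiv16(int16_t dividend, int16_t divisor)
{
    /* Work on magnitudes in unsigned arithmetic so that -32768 negates
     * without overflow (its magnitude 32768 fits in 16 unsigned bits). */
    uint16_t a = dividend < 0 ? (uint16_t)(0u - (uint16_t)dividend)
                              : (uint16_t)dividend;
    uint16_t b = divisor < 0 ? (uint16_t)(0u - (uint16_t)divisor)
                             : (uint16_t)divisor;

    uint16_t q = (uint16_t)(a / b); /* unsigned 16-bit divide */
    uint16_t r = (uint16_t)(a % b);

    /* Quotient is negative when the operand signs differ. */
    if ((dividend < 0) != (divisor < 0))
        q = (uint16_t)(0u - q);

    /* Remainder takes the sign of the dividend. */
    if (dividend < 0)
        r = (uint16_t)(0u - r);

    /* Converting back to int16_t relies on two's-complement wraparound,
     * which mainstream compilers provide. */
    divmod16_t out = { (int16_t)q, (int16_t)r };
    return out;
}
```

So `sdiv16(-7, 2)` gives quotient -3 and remainder -1, while `sdiv16(7, -2)` gives quotient -3 and remainder 1. The structure (strip signs, do one unsigned divide, patch the signs back) is also the usual shape of a small assembly routine.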
Here is the z88dk small implementation, which follows the C convention that the remainder takes the sign of the dividend.