Hi Sam
Not a daft question at all. Before I come to your question, you mentioned possibly transforming your non-normal variable prior to imputation. One thing to be aware of when you do this (and you may well be aware already given your second option to use a hot-deck type approach) is that when you impute using the transformed variable you change the assumed functional relationship between it and the other variables. For more on this, see the following paper by von Hippel:
http://smr.sagepub.com/content/42/1/105.abstract
As you have probably discovered, Rubin's combination rules are for estimation of a parameter, and hypothesis testing / confidence intervals for a parameter (or multiple parameters). There have been a few approaches developed for combining p-values directly (see
http://missingdata.lshtm.ac.uk/index.php?option=com_content&view=article&id=164:combining-p-values-from-multiple-imputations&catid=57:multiple-imputation&Itemid=98), but I think the approaches described in the two papers I listed on that page started off from an assumption that there is a target parameter being estimated, so I'm not sure whether it would be applicable in your case. I don't have Schafer's book to hand (also referred to on that page), but I recall that he has some discussion of applying rank based methods to multiply imputed datasets.
An alternative 'answer', that you might disagree with (!), is that arguably we should never perform analyses where we only report a p-value. We are almost always interested (or should be) in quantifying the direction and magnitude of some parameter in a model (possibly one with minimal assumptions), in which case you can make progress. For example, let's say you wanted to compare the medians of your two groups. Then so long as the sample size is not small, the sampling distribution of the sample median is approximately normal, with an analytical approximate formula for the standard error. Consequently you can apply Rubin's rules to the difference in group medians, and combine the standard errors etc as usual. If you happen to be a Stata user, Stata will fit a median (using the qreg command) to multiple imputed datasets, and thus do this all for you. I'm not sure whether mice in R will (yet).
Best wishes
Jonathan