My knee-jerk reaction would be to say that some other software
layer should strip these out, remember where they went, and
then put them back in after a parse. This is based on the
observation that links wouldn't improve the speed or accuracy
of the parse, and are thus irrelevant clutter.
> More generally, how could information from footnotes be captured? Used
> by Relex?
Relex would be a reasonable layer in which to do this kind of
processing; however, I don't see how it could be "used" by relex.
Relex is a mashup of two different things: a pathetic document
management framework wrapped around a core dependency parser.
The dependency parser is all about English grammar, and can't
per-se "do anything" with urls or footnotes.
However, patches that would strip out urls/footnotes, and then
reinsert them, post-parse, would be accepted.
-- linas