Fixme: Title::getLength() documentation claims that it adds things to the link cache, so the local cache here should be unnecessary, but in fact calling getLength() repeatedly for the same $page does seem to run one query for each call?
handling of "o" format character for Iranian, Hebrew, Hijri & Thai?
look at multiple extension, separately and together.
Check for unique id attribute :P
Check for unique id attribute :P
FIXME: we may want to blacklist some broken browsers
check XML more carefully
sensible defaults
document