Sam, need to apologise - I made the spurious claim that (...?)\1* didn't
define a regular language. I was, obviously, imagining the situation where
the alphabet was infinite (as you do). Over a finite alphabet the above
_is_ regular, but for the slightly awkward reason that it's equivalent to
(aaa)+|(aab)+|(aac)+| ... |(zzy)+|(zzz)+
... which is not an observation that is particularly useful, I'll admit.
As to the related question of whether
(AB)\1
(with A and B nontrivial CF expressions) is context-free: it isn't;
consider
(a+b+)\1
which isn't context-free (I'm not sure it's even context-sensitive - it
falls under the "can count two things, not three" rule).
--
jan grant
http://ioctl.org/jan/
Whenever I see a dog salivate I get an insatiable urge to ring a bell.