I should be more specific about my issue.
When I am not using the clean_url function, the crawler ends up with URLs like this:
"http://pharos.ece.utexas.edu/wiki/index.php?from=20121028231450&hideliu=1&hidemyself=1&target=Pharos_Tutorials&.........................................."
After I add the clean_url function (from my original post), my crawler finishes prematurely. Here are the first few debug lines from the shell:
...
...
The URL being crawled is obviously wrong. I can paste my parse code if need be.
Thanks,
Windter