I went to the openaustralia.org site the other day and noticed some missing speeches, then saw the twitter post about the parser.
I've made a few changes to the parser which got it working again on my local machine for all of Feb/March. Looks like the APH XML format has changed to what looks like a slightly cleaner output.
It might be worth someone with a better understanding of the code base testing the results but hopefully it'll help. For late Feb data I needed to add 'Robert John Carr' with alt name 'Bob Carr' to the senators/people csv files to fix an error that was occurring.
I also noticed after all the hansard data was loaded that I wasn't getting any speeches listed on the Senator's or Member's pages. It looks like there might be a different search system used for that rather then pulling the data directly out of mysql but I wasn't sure so perhaps I missed something.
> I went to the openaustralia.org site the other day and noticed some > missing speeches, then saw the twitter post about the parser.
> I've made a few changes to the parser which got it working again on my > local machine for all of Feb/March. Looks like the APH XML format has > changed to what looks like a slightly cleaner output.
> It might be worth someone with a better understanding of the code base > testing the results but hopefully it'll help. For late Feb data I needed to > add 'Robert John Carr' with alt name 'Bob Carr' to the senators/people csv > files to fix an error that was occurring.
I also noticed after all the hansard data was loaded that I wasn't getting
> any speeches listed on the Senator's or Member's pages. It looks like there > might be a different search system used for that rather then pulling the > data directly out of mysql but I wasn't sure so perhaps I missed something.
That uses the Xapian search index so it needs to be up to date (see `twfy/search/index.pl`). That's probably not clear in the install instructions so if you can suggest improvements that'd be great.
The speaker should be picked up now too, however I noticed that on the 28th Feb (and possibly other dates) it's assigning the speaker to the wrong person. Jenkins instead of Slipper. Not quite sure why that is yet, will keep looking.
> The speaker should be picked up now too, however I noticed that on the > 28th Feb (and possibly other dates) it's assigning the speaker to the wrong > person. Jenkins instead of Slipper. Not quite sure why that is yet, will > keep looking.
> To post to this group, send email to openaustralia-dev@googlegroups.com. > To unsubscribe from this group, send email to > openaustralia-dev+unsubscribe@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/openaustralia-dev?hl=en.
The issue with the speaker showing as the wrong person appears to be because the speaker hasn't been updated in the list of people in reprasentatives.csv. No commits on that as I'm still not all that sure how it works and what else it might effect.
I did take a brief look at the issue with the tabling of documents but wasn't able to find the cause. Haven't had time to get back for another look.
>> The speaker should be picked up now too, however I noticed that on the >> 28th Feb (and possibly other dates) it's assigning the speaker to the wrong >> person. Jenkins instead of Slipper. Not quite sure why that is yet, will >> keep looking.
>> To post to this group, send email to openaustralia-dev@googlegroups.com. >> To unsubscribe from this group, send email to >> openaustralia-dev+unsubscribe@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/openaustralia-dev?hl=en.
> The issue with the speaker showing as the wrong person appears to be
> because the speaker hasn't been updated in the list of people in
> reprasentatives.csv. No commits on that as I'm still not all that sure how
> it works and what else it might effect.
I've merged these changes and updated the sitting calendar, updated the
speaker, replaced a few Senators and loaded all of 2012 to date.