Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Hansard Parser
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  6 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
J Wells  
View profile  
 More options Mar 24 2012, 6:53 pm
From: J Wells <def...@gmail.com>
Date: Sat, 24 Mar 2012 15:53:41 -0700 (PDT)
Local: Sat, Mar 24 2012 6:53 pm
Subject: Hansard Parser

Hi all,

I went to the openaustralia.org site the other day and noticed some missing
speeches, then saw the twitter post about the parser.

I've made a few changes to the parser which got it working again on my
local machine for all of Feb/March. Looks like the APH XML format has
changed to what looks like a slightly cleaner output.

https://github.com/JWells/openaustralia-parser/commit/dc9cf62f25cb4e6...

It might be worth someone with a better understanding of the code base
testing the results but hopefully it'll help. For late Feb data I needed to
add 'Robert John Carr' with alt name 'Bob Carr' to the senators/people csv
files to fix an error that was occurring.

I also noticed after all the hansard data was loaded that I wasn't getting
any speeches listed on the Senator's or Member's pages. It looks like there
might be a different search system used for that rather then pulling the
data directly out of mysql but I wasn't sure so perhaps I missed something.

Regards,
Justin


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Henare Degan  
View profile  
 More options Mar 24 2012, 11:09 pm
From: Henare Degan <hen...@openaustralia.org>
Date: Sun, 25 Mar 2012 14:09:56 +1100
Local: Sat, Mar 24 2012 11:09 pm
Subject: Re: Hansard Parser

On 25 March 2012 09:53, J Wells <def...@gmail.com> wrote:

Very awesome Justin!

I've parsed the first day of sitting for 2012 (2012-02-07) and I found it
wasn't getting these speeches under Questions without Notice:
http://parlinfo.aph.gov.au/parlInfo/search/display/display.w3p;db=CHA...

Any ideas?

Also the speaker is not being picked up as a person in the Personal
Explanations debates (i.e. his speeches are just appearing in the text):
http://parlinfo.aph.gov.au/parlInfo/search/display/display.w3p;db=CHA...

BTW, here's the ticket related to this latest round of APH breakages:
http://tickets.openaustraliafoundation.org.au/browse/OA-499

In my quick testing I also picked up another bug with the parser but I
think it's from before this latest problem:
http://tickets.openaustraliafoundation.org.au/browse/OA-502

I also noticed after all the hansard data was loaded that I wasn't getting

> any speeches listed on the Senator's or Member's pages. It looks like there
> might be a different search system used for that rather then pulling the
> data directly out of mysql but I wasn't sure so perhaps I missed something.

That uses the Xapian search index so it needs to be up to date (see
`twfy/search/index.pl`). That's probably not clear in the install
instructions so if you can suggest improvements that'd be great.

Cheers,

Henare


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Justin Wells  
View profile  
 More options Mar 25 2012, 2:44 am
From: Justin Wells <def...@gmail.com>
Date: Sat, 24 Mar 2012 23:44:06 -0700 (PDT)
Local: Sun, Mar 25 2012 2:44 am
Subject: Re: Hansard Parser

https://github.com/JWells/openaustralia-parser/commit/b35c3da6507e443...
should fix the first issue with the missing speeches.

The speaker should be picked up now too, however I noticed that on the 28th
Feb (and possibly other dates) it's assigning the speaker to the wrong
person. Jenkins instead of Slipper. Not quite sure why that is yet, will
keep looking.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Henare Degan  
View profile  
 More options Apr 12 2012, 6:41 pm
From: Henare Degan <hen...@openaustralia.org>
Date: Fri, 13 Apr 2012 08:41:38 +1000
Local: Thurs, Apr 12 2012 6:41 pm
Subject: Re: Hansard Parser

Hey Justin,

I might try to take a look at getting the parser running again this
weekend. Did you make any more progress you want me to look at integrating?

Cheers,

Henare

On 25 March 2012 17:44, Justin Wells <def...@gmail.com> wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Justin Wells  
View profile  
 More options Apr 13 2012, 6:38 pm
From: Justin Wells <def...@gmail.com>
Date: Fri, 13 Apr 2012 15:38:03 -0700 (PDT)
Local: Fri, Apr 13 2012 6:38 pm
Subject: Re: Hansard Parser

Hi Henare,

I added a small change to get the questions without notices working,
https://github.com/JWells/openaustralia-parser/commit/b35c3da6507e443...

The issue with the speaker showing as the wrong person appears to be
because the speaker hasn't been updated in the list of people in
reprasentatives.csv. No commits on that as I'm still not all that sure how
it works and what else it might effect.

I did take a brief look at the issue with the tabling of documents but
wasn't able to find the cause. Haven't had time to get back for another
look.

Regards,
Justin


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Henare Degan  
View profile  
 More options Jun 24 2012, 3:48 am
From: Henare Degan <hen...@openaustralia.org>
Date: Sun, 24 Jun 2012 17:48:59 +1000
Local: Sun, Jun 24 2012 3:48 am
Subject: Re: Hansard Parser

On 14 April 2012 08:38, Justin Wells <def...@gmail.com> wrote:

> I added a small change to get the questions without notices working,

> https://github.com/JWells/openaustralia-parser/commit/b35c3da6507e443...

> The issue with the speaker showing as the wrong person appears to be
> because the speaker hasn't been updated in the list of people in
> reprasentatives.csv. No commits on that as I'm still not all that sure how
> it works and what else it might effect.

I've merged these changes and updated the sitting calendar, updated the
speaker, replaced a few Senators and loaded all of 2012 to date.

I've also added you to the list of contributors, thanks Justin!
http://www.openaustralia.org/about/

Cheers,

Henare


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »