With the availability of the PROIEL dependency tree analysis, I thought I'd give that a go.
I've uploaded to github code for extracting the clauses in John's Gospel and generating a very basic reading programme from that.
Clauses were extracted by looking at any 'pred' arc and linearizing all nodes from that point down. If there were embedded preds then clauses corresponding
to both inner and outer preds were generated.
Note that the current code is just based on forms with use made of syntactic or morphological information. I also can't do inline replacement into an English context because I don't have an English text mapped to the PROIEL analysis.
However, my initial impression is that the PROIEL analysis will be preferable to work with moving forward.
Could you clarify in what ways you see the PROIEL data being superior
to the opentext data? One obvious one that leaps to mind is that
OpenText seems to be a dead project...
"Let things true be preferred to things false, things eternal to
things momentary, things useful to things agreeable."
Lucius Caelius Lactantius
> You received this message because you are subscribed to the Google Groups "graded-reader" group.
> To post to this group, send email to graded...@googlegroups.com.
> To unsubscribe from this group, send email to graded-reade...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/graded-reader?hl=en.
Admittedly, I haven't spent THAT much time with it but it seems that
it will be easier to extract the kind of syntactic information I'm
interested in from it.
On Apr 12, 7:41 am, Patrick Narkinsky <patr...@narkinsky.com> wrote:
> Could you clarify in what ways you see the PROIEL data being superior
> to the opentext data? One obvious one that leaps to mind is that
> OpenText seems to be a dead project...
> Patrick Narkinsky