> On Thu, Feb 12, 2015 at 8:28 PM, David Hall <> wrote:>> I'm trying to figure out a data type for trees. Epic currently encodes the>> position of a tree node as a Span object relative to the token sequence. So>> in the tree>>>> A>> | \>> B C>>>> Would be encoded as>> Node(A, Span(0,2), List(Node(B, Span(0,1)), Node(C, Span(1,2)))>>> The current data structure Tree[L] is in service of the underlying> algorithms. It might make sense to add another data structure (Parse[L, W]?)> that is more specific.Or epic.slab.Tree. Or reuse scalaz.Tree. So another entirely different type.
>> In slabs, there is not really a concept of a sentence or a sorted list, so>> the Spans can't reference any index. But the spans would allow to flatten>> the tree into a list and reconstruct it, which would make the serialization>> easier.>>> Trees really are about relationships between spans (subject to certain> constraints). It might make more sense to extend slabs to have a notion of a> dependency link between two spans. (here, parent constituent would be the> head and its children the dependent.) This has come up before in our early> discussion of slabs (Steven Bethard, I think.)Are the two spans arranged in a context-free manner? Otherwise we'redrifting into graph territory. Also, is it possible to keep all data ona node or is it required to be able to annotate the connections too?
--
You received this message because you are subscribed to the Google Groups "ScalaNLP" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scalanlp+u...@googlegroups.com.
To post to this group, send email to scal...@googlegroups.com.
Visit this group at http://groups.google.com/group/scalanlp.
For more options, visit https://groups.google.com/d/optout.
Could you give me an example where you would need a reference to the Tokens in the parsetree?
--
Ah. Maybe rename to isContiguous?
}