I just started using dkpro-tc for my bachelor thesis. After going through the examples, I have a couple of questions.
- I am using a dataset in which I have to decide whether a sentence fits a template or not. In your examples you had one file per sentence. Is it necessary to split my data into several files? For now I have one file each for good and bad sentences.
- Every sentence I'm using has a unique ID. Can I use this ID in dkpro-tc? I saw that the evaluation is sorted by an ID. I want to access the original sentences after evaluating them, and it seems easier to me to map the IDs to the sentences. Or is there another option to get the original text?
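To make the second question concrete, here is a minimal sketch of the lookup I have in mind. It is plain Java and does not use any dkpro-tc API. The tab-separated `id<TAB>sentence` line format, the class name `SentenceLookup`, and the example IDs are assumptions for illustration only.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class SentenceLookup {

    // Build an ID -> sentence index from lines of the assumed form "id<TAB>sentence".
    static Map<String, String> index(List<String> lines) {
        Map<String, String> byId = new LinkedHashMap<>();
        for (String line : lines) {
            int tab = line.indexOf('\t');
            if (tab < 0) {
                continue; // skip lines that have no ID column
            }
            byId.put(line.substring(0, tab), line.substring(tab + 1));
        }
        return byId;
    }

    public static void main(String[] args) {
        Map<String, String> byId = index(List.of(
                "s1\tThis sentence fits the template.",
                "s2\tThis one does not."));
        // After evaluation, look up the original text for an ID taken from the report.
        System.out.println(byId.get("s2"));
    }
}
```

With such an index, any ID that shows up in the evaluation output could be mapped back to its original sentence, provided the same ID is carried through the pipeline.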
Best regards,
Sebastian
I have now written my own reader. After some problems (do units have to be read from annotated texts?) I decided to extend the Conll Reader. I'm setting each sentence as a unit. I tested the reader, and each unit contains a sentence. Your tip with the ID worked fine.
When I look at the IdOutcomeReport, I get only two outputs, one for each dataset. Do I have to set an option so that Weka knows it has to map the units?