ENhanced configuration file

39 views
Skip to first unread message

Karni Gilon

unread,
Oct 26, 2016, 6:41:53 AM10/26/16
to eop-users
HI

I successfuly ran the project with the basic configuration file:  MaxEntClassificationEDA_Base+OpenNLP_EN
I changed it now to:MaxEntClassificationEDA_Base+WN+VO+TP+TPPos_EN..
But now I'm stuck with the exception below. See below last lines of the console.
Is it again a configuration issue?
Thanks
Karni.


16/10/26 13:33:36 INFO implbase.LAP_ImplBase: Pair 800 written as /tmp/EN/dev1/800.xmi
Oct 26, 2016 1:33:36 PM eu.excitementproject.eop.core.MaxEntClassificationEDA initializeModel
INFO: The trained model will be stored in /cs/guest/karnigilon/workspace/analogies1/eop-resources-1.2.3/model/MaxEntClassificationEDAModel_Base+WN+VO+TP+TPPos_EN
Oct 26, 2016 1:33:36 PM eu.excitementproject.eop.core.component.scoring.BagOfLexesScoringEN <init>
INFO: Load WordNet done.
Oct 26, 2016 1:33:36 PM eu.excitementproject.eop.core.component.scoring.BagOfLexesScoringEN <init>
INFO: Load VerbOcean done.
The CAS has two needed Views: Okay
The Cas has EntailmentMetadata: Okay
Language:EN
Task:IE
Origin:null
Channel:null
TextDocumentID:null
TextCollectionID:null
HypothesisDocumentID:null
HypothesisCollectionID:null
The CAS has one (or more) Entailment.Pair
PairID: 22
GoldAnswer: NONENTAILMENT
TextSOFA:Chicago-based Boeing has already scrubbed three delivery slots in 2006 that had been booked by Air Canada.
HypothesisSOFA:Boeing's headquarters is in Canada.
The CAS has 1 Pair(s)
Checking Annotations of TextView
It has:
1 sentence Annotation(s)
18 token Annotation(s)
18 pos Annotation(s)
0 lemma Annotation(s)
0 NER Annotation(s)
0 Dependency Annotation(s)
Checking Annotations of HypothesisView
It has:
1 sentence Annotation(s)
7 token Annotation(s)
7 pos Annotation(s)
0 lemma Annotation(s)
0 NER Annotation(s)
0 Dependency Annotation(s)
Exception in thread "main" java.lang.NullPointerException
at eu.excitementproject.eop.core.component.scoring.BagOfLemmasScoring.countTokens(BagOfLemmasScoring.java:74)
at eu.excitementproject.eop.core.component.scoring.BagOfLemmasScoring.calculateScores(BagOfLemmasScoring.java:48)
at eu.excitementproject.eop.core.MaxEntClassificationEDA.constructContext(MaxEntClassificationEDA.java:453)
at eu.excitementproject.eop.core.MaxEntClassificationEDA.casToEvent(MaxEntClassificationEDA.java:578)
at eu.excitementproject.eop.core.MaxEntClassificationEDA.readInXmiFile(MaxEntClassificationEDA.java:560)
at eu.excitementproject.eop.core.MaxEntClassificationEDA.readInXmiFiles(MaxEntClassificationEDA.java:544)
at eu.excitementproject.eop.core.MaxEntClassificationEDA.startTraining(MaxEntClassificationEDA.java:513)
at hello2.pkg.HelloWorld2.creatingNewModels(HelloWorld2.java:83)
at hello2.pkg.HelloWorld2.basicTraining(HelloWorld2.java:298)
at hello2.pkg.DocCmp.main(DocCmp.java:56)

Roberto Ferrari

unread,
Oct 26, 2016, 8:07:58 AM10/26/16
to eop-...@googlegroups.com
Probably the EDA's developers could give you a more precise answer. However it seems to me that
the error is due to the fact that you are using an EDA's configuration that uses the lemma as a feature but
OpenNLP that is used for data preprocessing can't provide it. If you want to use the lemma in addition to
the other features you have to install TreeTagger. When you install the EOP there is an option to install the TreeTagger too.

Best
Roberto


--
You received this message because you are subscribed to the Google Groups "eop-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+unsubscribe@googlegroups.com.
To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/eop-users/b765f992-e510-4303-8d41-71591e715b39%40googlegroups.com.

Karni Gilon

unread,
Oct 26, 2016, 9:06:34 AM10/26/16
to eop-users
Hi 

I tired to install TreeTagger via install.sh and got the error below.
Which is recommended:  (i)search for ant-macro's elsehwere (ii) install treetagger directly without ant (the general information on ant says that it is needed in case of not using Eclipse....I'm using Eclipse. Do I still need it?)

Thanks
Karni.

Installing TreeTagger ...
Buildfile: /cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml

BUILD FAILED
/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml:23: Cannot find http://dkpro-core-asl.googlecode.com/svn/built-ant-macros/tags/0.4.0/ant-macros.xml imported from /cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml

Total time: 0 seconds
Error!
To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+...@googlegroups.com.

Roberto Zanoli

unread,
Oct 31, 2016, 6:34:57 AM10/31/16
to eop-...@googlegroups.com, Richard Eckart de Castilho
Hello Richard,

Our build.xml contains a reference to the macros file that doesn't exist any more (

Do you know if in DKPro that is in the github repository there is a copy of this file that we could use?

Thanks in advance
Roberto


To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+unsubscribe@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.



--
Roberto Zanoli
Technologist
Human Language Technologies Research Unit
Fondazione Bruno Kessler
Via Sommarive 18, 38123 Povo - Trento (Italy)
zan...@fbk.eu

Roberto Zanoli

unread,
Oct 31, 2016, 10:30:11 AM10/31/16
to Richard Eckart de Castilho, eop-...@googlegroups.com
Thanks a lot Richard
Roberto

On Mon, Oct 31, 2016 at 11:48 AM, Richard Eckart de Castilho <richard...@gmail.com> wrote:
Hi Roberto,


> On 31.10.2016, at 11:34, Roberto Zanoli <zan...@fbk.eu> wrote:
>
> Our build.xml contains a reference to the macros file that doesn't exist any more (
> http://dkpro-core-asl.googlecode.com/svn/built-ant-macros/tags/0.4.0/ant-macros.xml).
>
> Do you know if in DKPro that is in the github repository there is a copy of this file that we could use?

Yep, they are here:

  https://raw.githubusercontent.com/dkpro/resource-packager/0.4.0/ant-macros.xml

There are also newer versions:

  https://github.com/dkpro/dkpro-resource-packager/releases

Just replace the version in the first URL if you want to use a different version.
I think they are backwards compatible.

Best,

-- Richard



Roberto Zanoli

unread,
Oct 31, 2016, 10:45:30 AM10/31/16
to eop-...@googlegroups.com
---------- Forwarded message ----------
From: Richard Eckart de Castilho <richard...@gmail.com>
Date: Mon, Oct 31, 2016 at 11:48 AM
Subject: Re: [eop-users] ENhanced configuration file
To: Roberto Zanoli <zan...@fbk.eu>
Cc: eop-...@googlegroups.com


Hi Roberto,

> On 31.10.2016, at 11:34, Roberto Zanoli <zan...@fbk.eu> wrote:
>
> Our build.xml contains a reference to the macros file that doesn't exist any more (
> http://dkpro-core-asl.googlecode.com/svn/built-ant-macros/tags/0.4.0/ant-macros.xml).
>
> Do you know if in DKPro that is in the github repository there is a copy of this file that we could use?

Karni Gilon

unread,
Nov 1, 2016, 3:13:10 AM11/1/16
to eop-...@googlegroups.com
Hi Roberto

It seemed to work, but at the end of the build I got the following message:


BUILD FAILED
/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml:29: The following error occurred while executing this line:
/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml:40: The following error occurred while executing this line:
/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml:194: The following error occurred while executing this line:
/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/scripts/treetagger/build.xml:704: MD5 checksum mismatch for [linux-x86_64/tree-tagger].
Please verify the checksum and if necessary update this script.
Expected: 2a0a3740cc9a838949219caf0d64544c
Actual  : 16cf3578fafd8c39b5b9df0077297025


--
You received this message because you are subscribed to a topic in the Google Groups "eop-users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/eop-users/my7WE1xqg0I/unsubscribe.
To unsubscribe from this group and all its topics, send an email to eop-users+unsubscribe@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

Roberto Zanoli

unread,
Nov 1, 2016, 5:40:00 AM11/1/16
to eop-...@googlegroups.com

In build.xml replace 2a0a3740cc9a838949219caf0d64544c (expected) with 16cf3578fafd8c39b5b9df0077297025 (actual). 



--
You received this message because you are subscribed to the Google Groups "eop-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+unsubscribe@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

Karni Gilon

unread,
Nov 1, 2016, 7:03:36 AM11/1/16
to eop-...@googlegroups.com
Thanks. The build was successful now. but running eclipse I still get an exception:

Caused by: java.io.IOException: Unable to locate model [en] in the following locations [/cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/target/model-staging/de/tudarmstadt/ukp/dkpro/core/treetagger/lib/tagger-en-little-endian.par, classpath:/de/tudarmstadt/ukp/dkpro/core/treetagger/lib/tagger-en-little-endian.par].  Make sure the environment variable 'TREETAGGER_HOME' or 'TAGDIR' or the system property 'treetagger.home' point to the TreeTagger installation directory

Notes: 
(1) The file tagger-en-little-endian.par, does  exist in the location in the first row above
(2) I set (manually) the tree tagger variable to: /cs/guest/karnigilon/Excitement-Open-Platform-1.2.3/lap/src/target/model-staging/de/tudarmstadt/ukp/dkpro/core/treetagger


Roberto Zanoli

unread,
Nov 1, 2016, 10:43:44 AM11/1/16
to eop-...@googlegroups.com
Could you try to remove the content of your local maven repository under ./m2 directory and rerun the script to install the EOP (i.e., bash ./install.sh 1.2.3 treetagger)

at this point you should find treetagger here: /your_user_name/.m2/repository/de/tudarmstadt/ukp/dkpro/core/de.tudarmstadt.ukp.dkpro.core.treetagger-bin/

then set the EOP_RESOURCES variable (i.e., export EOP_RESOURCES="/home/user_name/Excitement-Open-Platform-1.2.3/target/EOP-1.2.3/eop-resources-1.2.3/")

Then try the example in the quick start, i.e., https://github.com/hltfbk/EOP-1.2.3/wiki/Quick-Start

and finally the ones in the step by step tutorial (https://github.com/hltfbk/EOP-1.2.3/wiki/Step-by-Step-Tutorial) where you can use treetagger for preprocessing the data




Karni Gilon

unread,
Nov 2, 2016, 2:33:22 AM11/2/16
to eop-users
Hi Roberto,

I followed your suggestion and ran through the quick start.
Steps 1-3 were successful.
Step 4 (evaluation) was not.
Looks like the "Gold standard" file is missing ?

Here is what I'm getting:
16/11/02 08:26:54 INFO runner.EOPRunner: running the EOP
Nov 02, 2016 8:26:54 AM eu.excitementproject.eop.util.eval.Scorer print
INFO: The results are in:./eop-resources-1.2.3/results/EditDistanceEDA_EN.xml_results.txt_report.xml
16/11/02 08:26:54 INFO runner.EOPRunner: Results file: ./eop-resources-1.2.3/results/EditDistanceEDA_EN.xml_results.txt
16/11/02 08:26:54 INFO runner.EOPRunner: Evaluation file: ./eop-resources-1.2.3/results/EditDistanceEDA_EN.xml_results.txt_report.xml
16/11/02 08:26:54 ERROR runner.EOPRunner: Could not score the results -- the testFile option is missing


To unsubscribe from this group and all its topics, send an email to eop-users+...@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

--
You received this message because you are subscribed to the Google Groups "eop-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+...@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

--
You received this message because you are subscribed to a topic in the Google Groups "eop-users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/eop-users/my7WE1xqg0I/unsubscribe.
To unsubscribe from this group and all its topics, send an email to eop-users+...@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

--
You received this message because you are subscribed to the Google Groups "eop-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+...@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

Roberto Zanoli

unread,
Nov 2, 2016, 3:44:17 AM11/2/16
to eop-...@googlegroups.com
OK. With exception evaluation (please check the EOPRunner and its parameters because it should run; if at the end you won't be able to run it we could ask Vivi who implemented it.) it works given that those examples do not require TreeTagger for producing the lemma.

What about TreeTagger, is it in your maven local repository (.m2)?

If yes go on with the examples in the step by step tutorial.

To unsubscribe from this group and stop receiving emails from it, send an email to eop-users+unsubscribe@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.

Karni Gilon

unread,
Nov 2, 2016, 4:22:17 AM11/2/16
to eop-...@googlegroups.com
Hן Roberto,

I used the command line method to run a different configuration file which includes the Lemma:
./eop-resources-1.2.3/configuration-files/MaxEntClassificationEDA_Base+VO+TP+TPPos+TS_EN.xml"
 
It worked.! 


As for Eclipse, the step-by-step tutorial shows the example of  OpenNLPTaggerEN. This works for me. But when I change to TreeTagger instead, it does not work (I get the exception regarding not finding the little endian param model, which I sent before).


Regards
Karni

PS. There are many files in .m2 repository that relate to treetagger (I assume that otherwise the command line operation would not work). Specifically I serached for "little-endian", and found "de/tudarmstadt/ukp/dkpro/core/de.tudarmstadt.ukp.dkpro.core.treetagger-model-en/20111109.0/de.tudarmstadt.ukp.dkpro.core.treetagger-model-en-20111109.0.jar" 


To unsubscribe from this group and all its topics, send an email to eop-users+unsubscribe@googlegroups.com.

To post to this group, send email to eop-...@googlegroups.com.
Visit this group at https://groups.google.com/group/eop-users.
Reply all
Reply to author
Forward
0 new messages