DKPro Core type system vs ClearTk type system

44 views
Skip to first unread message

Majid Laali

unread,
Nov 2, 2015, 4:44:46 PM11/2/15
to cleartk-developers
Hi, 

I came across DKPro Core website and noticed this project has nice a coverage over several NLP components and comprehensive type systems for UIMA. 

I am afraid I have not understood why ClearTK has its own type system and created several wrappers over NLP components while the DKPro project can provide those component for ClearTK. 

I would appreciate if someone could give me some hints of the idea behind this.

Thanks, 
Majid






Majid Laali   PhD CandidateConcordia University
 1515 St. Catherine St. West, EV9-401, Montreal QC, Canada
 

 

Richard Eckart de Castilho

unread,
Nov 2, 2015, 5:18:39 PM11/2/15
to cleartk-d...@googlegroups.com
Hi Majid,

I am one of the DKPro Core developers. DKPro Core and ClearTK developed in parallel and influenced each other quite a bit. We actually worked together quite intensively on uimaFIT (which was originally uutuc and created by the ClearTK folks and now is Apache uimaFIT maintained by myself).

At a point in time, we were considering to set up a common type system, but eventually never got around to doing it.

ClearTK and DKPro Core have always had different foci and different approaches to solve particular issues. E.g. DKPro Core is very strong in dynamic model loading and parametrization. ClearTK uses Java generics in very clever ways and has some interesting architectural approaches. ClearTK has always had a very strong ML focus while DKPro TC has picked up on that topic rather recently.

So you see, both projects have their benefits and users.

Cheers,

-- Richard

Majid Laali

unread,
Nov 2, 2015, 5:50:43 PM11/2/15
to cleartk-developers
Hi Richard, 

Thank you for the clarification. 

I assumed that the main focus of ClearTK is to bring the machine learning functionality to UIMA, not creating a comprehensive NLP components or even a type system. Actually I neglected the architectural differences between these two projects. 

Anyway, I hope these two projects come to one standard type system in future so that they benefit more from each other. 

Thanks, 
Majid






Majid Laali   PhD CandidateConcordia University
 1515 St. Catherine St. West, EV9-401, Montreal QC, Canada
 

 

Steven Bethard

unread,
Nov 6, 2015, 4:58:29 PM11/6/15
to cleartk-developers
I think it's only a matter of a lack of developer time which prevents the two type systems from being merged. If anyone wants to propose a common type system, and provide the necessary pull requests to update all the ClearTK and/or DKPro code, that would be awesome.

You'd want to be a bit careful to look into the cTAKES type system too though. That's heavily influenced by the ClearTK type system, and you'd want to align that one as well.

Steve

--
You received this message because you are subscribed to the Google Groups "cleartk-developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cleartk-develop...@googlegroups.com.
To post to this group, send email to cleartk-d...@googlegroups.com.
Visit this group at http://groups.google.com/group/cleartk-developers.
For more options, visit https://groups.google.com/d/optout.

Majid Laali

unread,
Nov 7, 2015, 12:07:28 PM11/7/15
to cleartk-developers
I am new to the both projects (i.e. ClearTK, DKPro), but at the first glance, my impression is that ClearTK project focuses on bringing the machine learning functionality to UIMA and DKPro Core creates wrappers around existing NLP components and, therefore, a type system. It is very nice these two projects collaborated with each other and had influence on each other. However, it would be great if the scope of these projects could be separated.

I believe most sub-projects in ClearTK that focus on the integration of machine learning is independent of any type system. For example, cleartk-ml-* projects do not usually use any type system. The only classes that coupled with the type system are classes in the org.cleartk.ml.feature package. I think these classes can also be updated so that they become independent of the type system (with approach like the one used in the FeatureFunctionExtractor class). However, I agree it needs some careful thought before doing so.

Thanks, 
Majid






Majid Laali   PhD CandidateConcordia University
 1515 St. Catherine St. West, EV9-401, Montreal QC, Canada
 

To unsubscribe from this group and stop receiving emails from it, send an email to cleartk-developers+unsub...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages