Text Classification API is now fully Open Source!


julien nioche

Jun 30, 2009, 4:17:54 PM
to DigitalPebble
Hi,

The source code of the TC API is now available under the Apache License
from the Resources section of the website. The documentation is a bit
sparse, but there is some JavaDoc, and the test classes can be used
for reference.

The aim of the API is not to be a generic solution for ML; there are
already plenty of projects for that, such as Weka, Rapid-I or Apache
Mahout. Nor is it meant to be used for text preprocessing, which can
be done with GATE or UIMA. Instead, it is intended to provide a reliable,
efficient and straightforward API specifically for Text Classification
purposes.

A GATE plugin using the API will be made public in August or
September. A UIMA integration could follow shortly after that.

Contributions from anyone are very welcome. Please use this group for
questions and discussions.

We would appreciate it if you mentioned DigitalPebble in any reports or
publications involving the use of the API.

All the best!

Julien

DigitalPebble

May 19, 2013, 2:36:29 PM
to digita...@googlegroups.com
Hi,

I'm not sure I understand the question, but if you mean "is the TC API limited to a specific domain or set of labels", then the answer is no. The API doesn't come with any pre-set model. It is domain neutral and works with whatever set of labels is used in your training data. It has been used successfully in quite a few very different applications for DigitalPebble's clients (Human Resources, Legal, filtering of adult content, etc.).
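
To illustrate the point about labels coming from the training data, here is a minimal sketch. It is NOT the TC API and does not use any of its classes: it is a toy Naive Bayes classifier written in Python purely for illustration, and the name ToyTextClassifier, its methods and the example sentences are all made up. The same code ends up with different label sets depending only on what it is trained on.

```python
from collections import Counter, defaultdict
import math


class ToyTextClassifier:
    """Toy multinomial Naive Bayes. Hypothetical illustration, not the TC API."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word frequencies
        self.doc_counts = Counter()              # label -> number of training documents
        self.vocabulary = set()

    def train(self, examples):
        # The label set is discovered from the data; nothing is hard-coded.
        for text, label in examples:
            tokens = text.lower().split()
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocabulary.update(tokens)

    def labels(self):
        return sorted(self.doc_counts)

    def classify(self, text):
        tokens = text.lower().split()
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, float("-inf")
        for label in self.labels():
            counts = self.word_counts[label]
            # Add-one smoothing so unseen words do not zero out the score.
            denom = sum(counts.values()) + len(self.vocabulary)
            score = math.log(self.doc_counts[label] / total_docs)
            for token in tokens:
                score += math.log((counts[token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Same class, two unrelated domains: only the training data differs.
news = ToyTextClassifier()
news.train([
    ("the team won the match", "Sports"),
    ("rain and wind expected tomorrow", "Weather"),
])

hr = ToyTextClassifier()
hr.train([
    ("ten years of java experience", "CV"),
    ("we are hiring a java developer", "JobAd"),
])

print(news.labels(), news.classify("heavy rain tomorrow"))
print(hr.labels(), hr.classify("hiring a developer"))
```

So for the news example, you would simply provide training documents labelled Sports, Weather and so on; a different application would provide its own labels.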

I hope this answers your question.

Julien

On 18 May 2013 22:59, <link2...@gmail.com> wrote:
Thanks Julien, nice post. I have one question: is there a category model we need to train for ML? For example, with news, should we train our model for specific categories such as Sports and Weather?

Thanks

--
 
Open Source Solutions for Text Engineering
 
http://digitalpebble.blogspot.com
http://www.digitalpebble.com