ARFF DataWriter


Torsten Zesch

Jan 26, 2012, 1:59:36 PM
to cleart...@googlegroups.com
Hi,

I want to use ClearTK's nice feature extraction capabilities to create
ARFF files (to be used with Weka).
There seems to be no DataWriter for ARFF in ClearTK.
Has anyone already written such a writer and would be willing to share the code?

While we are talking about Weka:
Are there any specific obstacles to adding Weka support to ClearTK, or
has simply nobody found the time to do it yet?
(I am thinking about trying to implement the necessary parts and don't
want to run into obvious traps.)

-Torsten

Lee Becker

Jan 26, 2012, 2:10:17 PM
to cleartk-users
We have mostly steered away from Weka because its GNU General
Public License does not play too well with ClearTK's Apache license
(for more details see http://www.apache.org/licenses/GPL-compatibility.html).
That said, I know some people have written ARFF writers in the past,
and actually getting Weka to play with ClearTK shouldn't be too
difficult. It might make sense to include Weka DataWriters in our
code, but leave out any Weka distribution (Philip, Steve, thoughts?).
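
For readers who have not seen the format, here is a minimal sketch of what such a writer has to emit. It is not the code mentioned in this thread, and it deliberately uses neither ClearTK nor Weka classes. The class name ArffSketch, the method toArff, the class attribute name "outcome", the single-quote escaping, and the choice to write absent features as 0 are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.StringJoiner;
import java.util.TreeSet;

// Hypothetical helper, not part of ClearTK or Weka.
class ArffSketch {

    // Wrap a name or nominal value in single quotes, escaping backslashes and quotes.
    static String quote(String s) {
        return "'" + s.replace("\\", "\\\\").replace("'", "\\'") + "'";
    }

    // Render numeric feature maps plus one nominal label per instance as dense ARFF text.
    static String toArff(String relation,
                         List<Map<String, Double>> instances,
                         List<String> labels) {
        if (instances.size() != labels.size()) {
            throw new IllegalArgumentException("need exactly one label per instance");
        }

        // ARFF declares every attribute up front, so collect all feature names first.
        TreeSet<String> names = new TreeSet<>();
        for (Map<String, Double> instance : instances) {
            names.addAll(instance.keySet());
        }
        List<String> attributes = new ArrayList<>(names);

        StringBuilder sb = new StringBuilder();
        sb.append("@relation ").append(quote(relation)).append("\n\n");
        for (String attribute : attributes) {
            sb.append("@attribute ").append(quote(attribute)).append(" numeric\n");
        }

        // The class attribute is nominal: list each distinct label once, in sorted order.
        // Assumption: no feature is itself named "outcome".
        StringJoiner classes = new StringJoiner(",", "{", "}");
        for (String label : new TreeSet<>(labels)) {
            classes.add(quote(label));
        }
        sb.append("@attribute ").append(quote("outcome")).append(' ')
          .append(classes).append("\n\n@data\n");

        // One comma-separated row per instance; absent features are written as 0.0.
        for (int i = 0; i < instances.size(); i++) {
            Map<String, Double> instance = instances.get(i);
            for (String attribute : attributes) {
                sb.append(instance.getOrDefault(attribute, 0.0)).append(',');
            }
            sb.append(quote(labels.get(i))).append('\n');
        }
        return sb.toString();
    }
}
```

The header can only be written once all feature names and labels are known, which is why the sketch takes the whole data set at once. A writer that receives instances one at a time would need to buffer them, or write the data rows to a temporary location and prepend the header at the end.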

Richard Eckart de Castilho

Jan 26, 2012, 3:01:50 PM
to cleart...@googlegroups.com
On 26.01.2012 at 20:10, Lee Becker wrote:

> We have mostly steered away from Weka because its GNU General
> Public License does not play too well with ClearTK's Apache license
> (for more details see http://www.apache.org/licenses/GPL-compatibility.html).
> That said, I know some people have written ARFF writers in the past,
> and actually getting Weka to play with ClearTK shouldn't be too
> difficult. It might make sense to include Weka DataWriters in our
> code, but leave out any Weka distribution (Philip, Steve, thoughts?).

ClearTK provides wrappers for the Stanford tools, which are GPL-licensed. It shouldn't be too much of a problem to wrap Weka as long as the core of ClearTK has no dependencies on that module.

Cheers,

-- Richard

Philip Ogren

Jan 26, 2012, 3:19:58 PM
to cleart...@googlegroups.com
The GPL is fine with our new project structure so long as we keep the dependency isolated. I think this would be a really important contribution.

I have written a nearly working Weka wrapper for ClearTK which I never polished up and released. There are a few difficulties with it that I can't remember off the top of my head. I will go dig around for the code and see if I can find anything that would help you get started.


Philip Ogren

Jan 26, 2012, 11:57:00 PM
to cleart...@googlegroups.com
Torsten,

I spent some time resurrecting my Weka code this evening. It looks like I wrote the training data writer part but didn't do the classifier, though I am certain that I had at one point figured out the necessary Weka APIs for this. I wasn't able to get it cleaned up enough to check it into Subversion. I will give a status update by the end of this coming weekend.

Cheers,
Philip

Philip Ogren

Jan 29, 2012, 4:30:45 PM
to cleart...@googlegroups.com
Torsten,

I have created an issue for this here:


It doesn't add anything to what's already been said here. I hope to spend a couple of hours on this later this afternoon, and any updates will be posted to that issue.

Philip

Philip Ogren

Jan 29, 2012, 11:22:30 PM
to cleart...@googlegroups.com
I plugged away at this a bit more this evening but am unable to commit any code just yet. Perhaps this week. I'm a bit under the weather just now.

Torsten Zesch

Jan 30, 2012, 10:18:54 AM
to cleart...@googlegroups.com
Thanks Philip. I am looking forward to any code you might be able to share.

-Torsten

