About additional training material


Eric Sadit Tellez

Apr 27, 2017, 1:37:55 PM
to PAN Workshop Series on Digital Text Forensics
Dear all,

Is it valid to use additional training data? The PAN site mentions http://ttg.uni-saarland.de/resources/DSLCC, but we are interested in additional sources, in particular Twitter's streaming API.

Best regards,
Eric

Martin Potthast

Apr 27, 2017, 2:46:42 PM
to pan-workshop-series
Hi Eric,

I don't see any problems with using additional data for training. In fact, some of the most successful author identification approaches do so on a regular basis, based on the impostor method. Everything is allowed, as long as it is not cheating, unethical, or any other form of scientific misconduct. One additional limitation, however, is that we cannot grant access to any web service while your software is executed, lest the test data leak.

So, if you wish to use external data, you can do so only by downloading it up front and using it offline. The important thing is that your notebook paper tells us what you did, how you did it, and why you believe it works.
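
The offline workflow described above could be sketched roughly as follows. This is only an illustration under assumptions: the file name extra_training.jsonl, the record fields "text" and "label", and the helpers save_corpus and load_corpus are hypothetical and not part of any PAN tooling. The idea is that the data is collected and written to disk beforehand, and the submitted software only ever reads the local file.

```python
import json


def save_corpus(records, path):
    """Run BEFORE submission, on a machine with network access.

    Writes records that were already collected (for example from a
    streaming API) to a local JSON Lines file, one record per line.
    """
    with open(path, "w", encoding="utf-8") as fh:
        for rec in records:
            fh.write(json.dumps(rec, ensure_ascii=False) + "\n")


def load_corpus(path):
    """Run at execution time: reads only from local disk, no network."""
    with open(path, encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]


if __name__ == "__main__":
    # Hypothetical records standing in for data downloaded up front.
    collected = [{"text": "example post", "label": "variety-a"}]
    save_corpus(collected, "extra_training.jsonl")

    # Inside the submitted software, only the local copy is used.
    extra = load_corpus("extra_training.jsonl")
    print(len(extra), "extra training records loaded from disk")
```

Shipping the file together with the software keeps the run independent of any web service, and the notebook paper can then describe how the file was collected.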

Best,
Martin

--
Dr. Martin Potthast
Bauhaus-Universität Weimar
Digital Bauhaus Lab
Bauhausstr. 9a
99423 Weimar
Germany

+49 3643 58 3567
+49 171 809 1945

www.potthast.net

Eric Sadit Tellez

Apr 27, 2017, 6:29:30 PM
to PAN Workshop Series on Digital Text Forensics
Excellent!

Best regards,
Eric