Author profiling task at PAN - minor change in output format.

7 views
Skip to first unread message

Francisco Rangel

unread,
Apr 6, 2018, 3:59:28 AM4/6/18
to pan-workshop-series, pan
Dear participants,

This is to inform you about a minor change in the output format you should take into account.

As it is explained in the webpage, your software must take as input the absolute path to an unpacked dataset, and has to output for each document of the dataset a corresponding XML file that looks like this:
  <author id="author-id"
	  lang="en|es|ar"
	  gender_txt="male|female"
	  gender_img="male|female"
	  gender_comb="male|female"
  />
  

We ask you to provide with three different predictions for the author's gender depending on your approach:

  • gender_txt: gender prediction by using only text
  • gender_img: gender prediction by using only images
  • gender_comb: gender prediction by using both text and images

As previously said, you can participate in both textual and images classification, or only in one of them. Hence, if your approach uses only textual features, your prediction should be given in gender_txt. Similarly, if your approach relies on images, your prediction should be given in gender_img. In case you use both text and images, your prediction should be given in gender_comb. Furthermore, in such a case, if you can provide also the prediction by using both approaches separately, this would allow us to perform a more in-depth analysis of the results and to compare textual vs. image based author profiling. In this case, you should provide for the same author the three predictions: gender_txt, gender_img and gender_comb.

The naming of the output files is up to you, we recommend to use the author-id as filename and "xml" as extension.

IMPORTANT! Languages should not be mixed. A folder should be created for each language and place inside only the files with the prediction for this language.


Please, do not hesitate to contact us if you have any doubt,

--
Francisco M. Rangel Pardo
CTO Autoritas Consulting S.A.
Twitter: @kicorangel
Reply all
Reply to author
Forward
0 new messages