Top HXL hashtags

30 views
Skip to first unread message

David Megginson

unread,
May 10, 2018, 9:45:31 AM5/10/18
to HXL public mailing list
Hi, everyone. As many of you will have seen elsewhere, HXL 1.1 is out now, and we'd like to thank everyone who contributed to the discussions and development over the last year and a half.

We've just run an analysis of the HXL-hashtagged datasets on the Humanitarian Data Exchange to see which humanitarian hashtags you all use the most. Here are some of the results.

A. Top 10 hashtags by number of columns tagged
  1. #affected 11511
  2. #date 5916
  3. #country 5013
  4. #meta 2907
  5. #value 1934
  6. #loc 1786
  7. #activity 1779
  8. #org 1686
  9. #x_applicants 975
  10. #x_decisions 975

B. Top 10 hashtags by number of unique datasets using them

  1. #date 1054
  2. #country 841
  3. #status 551
  4. #affected 505
  5. #loc 469
  6. #meta 463
  7. #population 423
  8. #adm1 364
  9. #activity 334
  10. #org 328

C. Top 10 hashtags by number of unique data providers using them

  1. #adm1 28
  2. #adm2 28
  3. #date 28
  4. #country 22
  5. #org 21
  6. #loc 20
  7. #adm3 19
  8. #meta 19
  9. #sector 19
  10. #affected 18


Cheers, David

Chair, HXL WG



Paola Di Maio

unread,
Jun 6, 2018, 4:23:43 AM6/6/18
to hxlpr...@googlegroups.com
David
kindly share a bit about your method
(as I am learning from you for my project as briefly discussed)

the HXL top cats are derived from a most frequent terms in use?
how did you shortlist the source documents?
what tools have you used to extract the term and analyse them?

thank you


--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--

David Megginson

unread,
Jun 6, 2018, 4:43:00 PM6/6/18
to hxlpr...@googlegroups.com
Hi, Paola. I scanned all of the public datasets tagged "hxl" on HDX. Code is here: https://github.com/HXLStandard/hdx-hashtag-crawler

A. Top 10 hashtags by number of columns tagged - total number of columns (in all scanned datasets) where the hashtag appears.

B. Top 10 hashtags by number of unique datasets using them - total number of datasets scanned where the hashtag appears at least once (if it appears more than once in the same dataset, it's still counted only the once).

C. Top 10 hashtags by number of unique data providers using them - total number of data providers who used the hashtag (each provider counted only once, no matter how often the provider used the tag).

I hope this is helpful.


Cheers, David

To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.

Paola Di Maio

unread,
Jun 7, 2018, 12:34:55 AM6/7/18
to hxlpr...@googlegroups.com
Thanks
will check
so I guess I should start building a library of 'datasets'


To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

David Megginson

unread,
Jun 7, 2018, 9:29:14 AM6/7/18
to hxlpr...@googlegroups.com
CKAN is a decent (and free) platform for collecting and organising them—worth a shot.



D

To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "Humanitarian Exchange Language (HXL)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to hxlproject+...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages