using Wikipedia as an auxiliary data source


Fernando Diaz

Jul 20, 2015, 4:46:22 PM
to tre...@googlegroups.com

A question came up about whether the event's Wikipedia page can be used, so long as it is time-aligned with the decision making. The experimental setup would be to download the revision history of the event's Wikipedia page and then inspect only those revisions published before the decision-making time (usually the time of the current document/hour batch).
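To make the time alignment concrete, here is a minimal sketch of the filtering step. It assumes the revision history has already been downloaded into a list of dicts, each with a UTC `timestamp` string in the form `2015-07-20T16:00:00Z`; the function names and the `revid` field are illustrative, not part of any required tooling.

```python
from datetime import datetime, timezone


def parse_ts(ts):
    """Parse a UTC timestamp string such as '2015-07-20T16:00:00Z'."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def revisions_before(revisions, decision_time):
    """Return the revisions published strictly before decision_time, oldest first.

    `revisions` is an assumed format: a list of dicts with a 'timestamp' key.
    `decision_time` is a timezone-aware datetime (e.g. the current hour batch).
    """
    visible = [r for r in revisions if parse_ts(r["timestamp"]) < decision_time]
    return sorted(visible, key=lambda r: parse_ts(r["timestamp"]))


def latest_visible_revision(revisions, decision_time):
    """Return the newest revision the system may look at, or None if there is none."""
    visible = revisions_before(revisions, decision_time)
    return visible[-1] if visible else None
```

At each decision point, the system would call `latest_visible_revision` with that batch's timestamp, so that no text written after the decision time can leak into the run.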

This method is safe only if you automate the detection of the Wikipedia page. That is, you CANNOT assume that you are provided with a handle to the event's Wikipedia page. You ONLY have the query when the event starts. If you can automatically find the event's Wikipedia page in a temporally aligned version of all of Wikipedia, then you are free to use the above scheme.
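As a rough illustration of what automatic detection could look like, the sketch below picks a page using only the query and a snapshot of Wikipedia as of the decision time. Everything here is an assumption for the sake of example: the snapshot is a list of dicts with a `title` and a timezone-aware `created` datetime, and the matching is naive word overlap on titles. A real system would likely use a proper retrieval model, but the key constraint is the same: pages that did not exist before the decision time are never considered.

```python
from datetime import datetime, timezone


def tokens(text):
    """Lowercase whitespace tokenization (deliberately simplistic)."""
    return set(text.lower().split())


def find_event_page(query, pages, decision_time):
    """Return the title that best matches the query among pages that existed
    strictly before decision_time, or None if no title shares a word with it.

    `pages` is a hypothetical snapshot format: dicts with 'title' and 'created'.
    """
    query_tokens = tokens(query)
    best_title, best_score = None, 0
    for page in pages:
        if page["created"] >= decision_time:
            continue  # page did not exist yet, so it must not be used
        score = len(query_tokens & tokens(page["title"]))
        if score > best_score:
            best_title, best_score = page["title"], score
    return best_title
```

Note that the answer can change over time: early in the event the best match may be a general page, and the dedicated event page only becomes eligible once it has been created.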

If you have questions about this, please let me know.

Fernando