Hi GEM benchmark team,
I recently processed thousands of questions and answers from /r/AskNYC, a forum for questions from residents and visitors of New York City.
I'm continuing to work on this dataset to improve its overall quality, and I hope to develop a process for removing more of the toxic or forum-specific answers.
I saw that GEM has a couple of fact-based generation benchmarks for restaurants, and I wondered whether this dataset could be adapted for your use or could become a future collaboration.
Some issues with the dataset: the questions are long; there are multiple valid answers to the same question (i.e. it is not a simple q -> a mapping); copyright could be an issue; and filtering by Reddit votes cannot remove all offensive content.
-- Nick Doiron