Is there any categorization of datasets that contain simple(r) sentences?

Colin Goldberg

Jul 19, 2019, 3:12:09 PM

to SQuAD - The Stanford Question Answering Dataset

Hi,

For developing and testing my approach, I would very much like to start with a dataset that contains simple (or simpler) sentences. The contexts I have inspected manually in the 2.0 dev set seem to consist mostly of long or compound sentences. I would like to start with simpler sentences and then move on to longer, compound ones.

Is there any categorization that indicates the simplicity or otherwise of the sentences in any context in any dataset?

If not, I would like to recommend this for your consideration.
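
To make the idea concrete, here is a rough sketch of the kind of categorization I have in mind. It is only an illustration under my own assumptions: the helper names, the regex-based sentence splitter, and the thresholds (at most 15 words and at most one comma per sentence) are all hypothetical choices, not anything defined by SQuAD. It assumes the usual SQuAD JSON layout, where each entry in "data" has a list of "paragraphs", each with a "context" string.

```python
import re

# Hypothetical thresholds for calling a sentence "simple" (illustrative only).
MAX_WORDS = 15
MAX_COMMAS = 1


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def is_simple(sentence, max_words=MAX_WORDS, max_commas=MAX_COMMAS):
    """Crude proxy: a short sentence with few commas is treated as simple."""
    return len(sentence.split()) <= max_words and sentence.count(",") <= max_commas


def simple_fraction(context):
    """Fraction of sentences in a context that pass the is_simple check."""
    sentences = split_sentences(context)
    if not sentences:
        return 0.0
    return sum(is_simple(s) for s in sentences) / len(sentences)


def rank_contexts(squad):
    """Return all contexts in a SQuAD-format dict, simplest first."""
    contexts = [
        paragraph["context"]
        for article in squad["data"]
        for paragraph in article["paragraphs"]
    ]
    return sorted(contexts, key=simple_fraction, reverse=True)
```

With a local copy of the dev set loaded via json.load, rank_contexts(squad) would put the contexts with the highest share of short, low-comma sentences first. Word and comma counts are a blunt measure and will misjudge abbreviations and lists, so a parser-based measure (for example, counting clauses) would probably be a better basis for an official categorization.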

Thanks

Colin Goldberg
