Dear Sophie,
As I noted in a posting to chibolts and info-childes about 3 months ago, I have been implementing UD (Universal Dependencies) tagging for about half of the languages in CHILDES so far, including all of the Romance and Germanic languages, along with Turkish and, soon, the East Asian languages. Many of these languages previously had no MOR grammars, and many of their corpora were untagged, but most are tagged now. All of the French data is tagged using UD.
During this process, it was necessary to move as far as possible toward the standard orthography for each language, as determined by the computational linguists who created the UD training sets. For French, this means, for example, treating j’ai as a single unit.
If you wish to run UD on new French data, please first make sure that the files pass check. After that, you can either send the data to me for addition to CHILDES or use the morphotag command in Batchalign, which you can download from
https://github.com/talkbank.
— Brian MacWhinney
Teresa Heinz Professor of Cognitive Psychology,
Language Technologies and Modern Languages, CMU