Hello again,
Since I had seen interest on our R project, I would like to offer some general view of what we want to deliver and why:
R language is very good at managing, calculating quantitative data at this point. Our goal is to make it able to handle qualitative data in the form of controlled vocabulary and common terminology. We want ability to process the following more interesting functions:
On the whole dataframe:
-export/import to the cloud
-verify integrity
-upgrade to the last accepted definition
(in our case the cloud is a constellation (as a matter of speaking :)) of database servers)
On each observation/record:
-translate to a new language
-update/correct term locally and replicate the update into the cloud
-calculate plural, singular, masculine, feminine, likely synonyms, antonyms
-provide a complete history of updates and their authors since the creation of the record
on a text in a known natural language:
-recognize the terms of a chosen controlled vocabulary and mark them with a pointer to the current version of the terms.
-calculate a map of meanings (and other functions of Semantic Web)
-perform related statistical analysis of those 2 operations
This is not an exhaustive list of interesting features.
Thank you,
Christian Tzurcanu