Greetings!
Thank you for attending the meeting and for your comments.
As you probably noticed, we have run into the problem with reasoning
involving complex logical definitions pertaining to dietary intakes of
nutrients.
I have tried using different reasoners (Pellet, Racer, HermiT) on two
different machines, but only HermiT 1.3.x worked and only on very limited
number of data instances.
The logical definitions that I talked about during the meeting have all
been independently tested and they classify the target data correctly. In
the test data scenario, when the ontology contains only the logical
definition, for say "low intake of A", all test data instances that fall
in the "low intake of A" range get classified correctly and reasoning is
completed in minutes time.
However, the problem with very long reasoning time arises when the small
set of data instances of type "intake of A" has to be "evaluated" against
2 or 3 logical definitions (e.g.,"low intake of A", "high intake of A" and
"normal intake of A"), which is the case the closest to the real life
scenario. In such cases as well as in the case when the file contains all
test data instances (app 300 instances) the reasoning takes hours.
The following five files include different combinations of data sets and
logical definitions, with data pertaining to average daily valine intake
(which is one or 9 amino acid intakes currently covered by the relevant
literature). Your time allowing, if you could take a look, try them out on
your machine(s) and provide some comments, that would be very appreciated.
1. OwTestVAL-1LogDef-11data.owl
https://code.google.com/p/onstr/source/browse/data/testDataSets/OwTestVAL-1
LogDef-11data.owl
This file contains ONSTR with the logical definition for "low valine
intake data" and 11 data instances for low valine intake. This is the case
for testing the correctness of this logical definition for "low valine
intake" (ONSTR_ Reasoning takes minutes or less, depending on processor(s).
2. OwTestVAL-3LogDef-11data.owl
https://code.google.com/p/onstr/source/browse/data/testDataSets/OwTestVAL-3
LogDef-11data.owl
This is ONSTR with 11 data instances for low valine intake and 3 logical
definitions (for low/high/normal valine intake). Reasoning takes longer
than in 1) but still in minutes range.
3.OwTestVAL-2LogDef-16data.owl
https://code.google.com/p/onstr/source/browse/data/testDataSets/OwTestVAL-2
LogDef-16data.owl
ONSTR with only 16 test data instances and two logical definitions. (11
instances of low valine intake data + 5 for normal valine intake).
Reasoner takes hours to complete the task.
4. OwTestVAL-3LogDef-33data.owl
https://code.google.com/p/onstr/source/browse/data/testDataSets/OwTestVAL-3
LogDef-33data.owl
ONSTR with 33 test data instances that are "handled" by all three logical
definitions pertaining to average daily valine intake (ONSTR_6097970).
Longer reasoning time than 3) and 4).
5. ONSTR_withTESTdata.owl
https://code.google.com/p/onstr/source/browse/data/testDataSets/ONSTR_withT
ESTdata.owl
This file contains ONSTR with all test data instances, roughly 300 test
data instances for all 9 amino acid intakes and total protein intakes.
Note: In all files, ONSTR contains logical definitions for intakes of all
relevant amino acids and total protein, but these are used in reasoning
(in data classification) only in the case 5).
For those who like to read the .owl file:
In all files, some HTML/XML comments for data instances are not 100%
correct. Please don't rely on them.
If you would be so kind to provide some/any feedback in the next 2-3 weeks
that would be greatly appreciated!
P.S.
If you would need more information about the project, you may take a look
at:
1. The ONSTR ICBO2013 presentation is available at:
https://code.google.com/p/onstr/source/browse/docs/Publications/ICBO2013_ON
STR_NikolicEtAl_FINALpres.pptx
2. Oct 21, 2013 Meeting presentation is available at:
https://code.google.com/p/onstr/source/browse/docs/MeetingNotesAndPresentat
ions/Meeting_2013-10-21.pptx
3. Other NBSDC project related info are available at:
https://nbsdc.org
Thank you in advance!
snez
________________________________
This e-mail message (including any attachments) is for the sole use of
the intended recipient(s) and may contain confidential and privileged
information. If the reader of this message is not the intended
recipient, you are hereby notified that any dissemination, distribution
or copying of this message (including any attachments) is strictly
prohibited.
If you have received this message in error, please contact
the sender by reply e-mail message and destroy all copies of the
original message (including attachments).