FTP 36 beats Prover 9 on region connection calculus

28 views

Skip to first unread message

Mark Tarver

unread,

Sep 3, 2022, 9:08:02 PM9/3/22

to Shen

For aficionados of automated reasoning, Prover 9 is the lineal descendent of OTTER which was until the early years of this century, the best automated theorem prover in the world. The title is now held by Vampire in Manchester.

Region connection calculus is a first-order theory covered in https://en.wikipedia.org/wiki/Region_connection_calculus. I'm running FTP 36 under Bruno's Shen/Scheme port. The axioms are entered as definitions.

[[all x [c x x]]
[all x [all y [[c x y] => [c y x]]]]
[[dc X Y] =df [~ [c X Y]]]
[[p X Y] =df [all z [[c z X] => [c z Y]]]]
[[pp X Y] =df [[p X Y] & [~ [p Y X]]]]
[[e X Y] =df [[p X Y] & [p Y X]]]
[[o X Y] =df [exists z [[p z X] & [p z Y]]]]
[[po X Y] =df [[o X Y] & [[~ [p X Y]] & [~ [p Y X]]]]]
[[ec X Y] =df [[c X Y] & [~ [o X Y]]]]
[[dr X Y] =df [~ [o X Y]]]
[[tpp X Y] =df [[pp X Y] & [exists z [[ec z X] & [ec z Y]]]]]
[[ntpp X Y] =df [[pp X Y] & [~ [exists z [[ec z X] & [ec z Y]]]]]]]

The theorem to prove is:

[[[tpp a b] & [ec b c]] => [[dc a c] v [ec a c]]]

The resulting proof was gained in 0.062 seconds and is 439 steps long and 5,442 lines long and takes 29,028 inferences to secure. It's too large to paste into this message, so I'm attaching it to this email.

Prover 9 gets a proof in 0.49 seconds; meaning FTP is running at 8X faster in terms of arriving at a solution. So we've actually graduated out of the minor leagues into the major.

Prover 9 is ballpark about 30,000 lines of C vs FTP 36 which is currently 634 lines of Shen.

It's a good result so I'm going to bed.

Mark

prf.txt

Mark Tarver

unread,

Sep 4, 2022, 5:47:14 AM9/4/22

to Shen

FTP 37 - an optimised version of FTP 36 gets the same result in 16,039 inferences

compared to 29,028 inferences of FTP 36.

Mark

Reply all

Reply to author

Forward

0 new messages