They should contain 30 positions each.
Enter each position into your computer, and let it think for 15
minutes (900 seconds).
If a position is solved, write down its solution time in seconds.
It doesn't count as a solution if the program finds the move
and then changes its mind.
If it finds the move, changes its mind, and then finds
it again, you should use the *last* time found.
Score any position that is not solved as 900 seconds.
Add up all the times.
Divide by 30.
Subtract the result from either 2630 or 2450, depending on which test you ran.
The final figure is the ELO rating.
Or isn't the ELO rating, as the case may be.
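
The scoring steps above can be sketched in a few lines of Python. This is only an illustration of the arithmetic as described in this post, not code from Rebel or from the test authors. The function names, the `(seconds, move)` history format and the sample moves are all made up for the example.

```python
from typing import Optional, Sequence, Tuple

TIME_LIMIT = 900  # seconds per position (15 minutes), as described above


def solution_time(history: Sequence[Tuple[float, str]],
                  key_move: str,
                  limit: float = TIME_LIMIT) -> float:
    """Return the scored time for one position.

    `history` is a hypothetical log: one (seconds, best_move) entry,
    in chronological order, each time the program reports a best move.
    The time that counts is the start of the final unbroken run in
    which the program prefers `key_move`.
    """
    solved_at: Optional[float] = None
    for seconds, move in history:
        if seconds > limit:
            break  # anything after the time limit is ignored
        if move == key_move:
            if solved_at is None:
                solved_at = seconds  # found (or found again)
        else:
            solved_at = None  # changed its mind: earlier find is void
    return solved_at if solved_at is not None else limit


def bt_rating(times: Sequence[float], base: int,
              limit: float = TIME_LIMIT) -> float:
    """Subtract the average solution time from `base` (2630 or 2450)."""
    if not times:
        raise ValueError("need at least one solution time")
    capped = [min(t, limit) for t in times]
    # The post divides by 30 because each test has 30 positions.
    return base - sum(capped) / len(capped)


if __name__ == "__main__":
    # Finds the move, drops it, finds it again: the last time counts.
    log = [(10, "Nf5"), (50, "Qd2"), (200, "Nf5")]
    print(solution_time(log, "Nf5"))      # 200
    print(bt_rating([900] * 30, 2630))    # 1730.0, nothing solved
```

So the scale runs from the base figure (everything solved instantly) down to 900 points below it (nothing solved).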
Old tests get frigged by some programmers (they recognise the
position and apply a special kludge factor to 'help' the solution).
Old tests are often used for tuning purposes. This results in programs
performing better on them, either by design or as a side effect.
30 positions are not enough.
Test positions favour 'finders' rather than 'planners'. It's quite
easy to make a program be a finder, very good at finding
combinations *if they are there*, but then it becomes hopeless
at chess games with many positions where there is nothing
to be found, just good positional moves.
Like all numbers trying to encapsulate complex behaviour, BT test
results have their place. But ......
Chris Whittington
>Two of the game databases that came with Rebel 8 were BT2450.DAT and
>BT2630.DAT. The instructions book calls them BT tests but doesn't say
>anything else about them other than that BT2630 is the "latest
>BT-test". Can anyone explain how to use them?
We have two testers in Germany who published nice test suites in the
big German computer chess magazine Computer-Schach und Spiele.
One is named Hubert Bednorz, the other Fred Toennissen.
Bednorz is the B in BT. Toennissen is the T in BT.
The BT2630 is the successor of the BT2450 test suite.
If you find all positions in the BT2630 in 0 seconds, you have 2630
ELO.
The same goes for the BT2450 test suite.
The BT2450 was too easy for the strong tactical programs, so B and T
exchanged some positions and the result is the BT2630 test suite.
In my opinion the test works nicely, but it only measures the
tactical strength of a program. I know many programs that do not have
a high BT rating, but play strong chess.
thanks,
vince