> Due to cost limitations we had to limit crafty to 2 seconds of analysis time p...

jdoliner · on Feb 17, 2015

This is definitely the biggest limitation of our approach right now and there are certainly some things that we counted as blunders that aren't true blunders. We're working on rectifying this by doing another pass with a better engine and more time to analyze.

That said, we tested this on a smaller set of games by comparing crafty's results to those from better engines, and found that only a very small number of moves tricked crafty. It's still generally quite reliable for the majority of moves.

logicallee · on Feb 17, 2015

You could just rewrite your article to call these "obvious blunders", i.e. ones that crafty identifies in 2 seconds or less. Redefine what you're doing so your methodology is correct :) Plus it's still interesting. Probably more interesting than blunders that take longer to identify!

Once you have found the blunders, you can verify them by analyzing the found positions more deeply. (Of course you should also report the number of false positives: ones that appear to be blunders after 2 seconds but turn out not to be on slightly longer analysis.)
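
A rough sketch of that two-pass check, in case it helps. Everything here is made up for illustration: `Candidate`, `verify_blunders`, the `deep_eval_drop` callback (which stands in for whatever deeper engine run you would actually use), and the 2.0-pawn threshold are all assumptions, not anything from the article.

```python
from typing import Callable, Iterable, List, NamedTuple, Tuple


class Candidate(NamedTuple):
    """A move flagged as a blunder by the fast (2-second) pass. Hypothetical type."""
    position: str  # identifier for the position before the move, e.g. a FEN string
    move: str      # the flagged move


def verify_blunders(
    candidates: Iterable[Candidate],
    deep_eval_drop: Callable[[str, str], float],
    threshold: float = 2.0,  # assumed blunder cutoff, in pawns
) -> Tuple[List[Candidate], List[Candidate], float]:
    """Re-check each flagged move with a deeper evaluation.

    deep_eval_drop(position, move) is assumed to return how much the
    evaluation falls (in pawns) after the move, per the deeper analysis.
    Returns (confirmed, false_positives, false_positive_rate).
    """
    confirmed: List[Candidate] = []
    false_positives: List[Candidate] = []
    for cand in candidates:
        drop = deep_eval_drop(cand.position, cand.move)
        if drop >= threshold:
            confirmed.append(cand)
        else:
            false_positives.append(cand)
    total = len(confirmed) + len(false_positives)
    rate = len(false_positives) / total if total else 0.0
    return confirmed, false_positives, rate
```

The deeper pass only runs on positions the fast pass already flagged, so it should cost far less than re-analyzing every move. It can't tell you about blunders the fast pass missed, though; that would need a separate sample.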

verteu · on Feb 17, 2015

Thanks for the response -- I really enjoyed your article.

The results of the cross-validation you mentioned would be interesting as well.