I mean that the expert considers 90% a match, but when the person demonstrating compared two recordings of the same person (Nixon), it only displayed an 86% match. I guess I'm not understanding how he expects to get over 90% when two very similar recordings were compared and only came out as 86%.
MOO and I am not an expert so I could be wrong.
I think there is always room for inconclusive results in this sort of thing because there probably is some overlap between the results of non-matches and matches.
Suppose you have tested thousands and thousands of samples and have seen consistent patterns.
Suppose you take any two random male voices and they usually get about 40-85 % result.
Suppose you take any two matching voices and they usually get about 80-98% result.
Suppose you and your colleagues have never seen a matching sample that goes below 74%.
Further suppose you've never seen a non-matching sample score over 88%
Now, if you look at this pattern and see that your test scored 48 %. You can be pretty darn sure they're not a match.
Look at another test that scored 95. You can be very sure they're a match.
But if you get a score between 74-88 you just can't say anything for certain and would have to conclude that your test is inconclusive because you've seen those scores go either way.
Add a couple of percent to be sure you eliminate false positives and you come up with the 90 % threshold, knowing that some 86 % Nixons will go inconclusive.
I pulled the percentages out of my hat so don't anybody quote me on this, they're just an illustration and nothing to do with the actual test.