Yes, a friend from another board took the test a second time, passing on every question. He felt that the results weren’t very impressive. He reported that it got seventeen correct answers, and only two incorrect, but thought that it didn’t do well on the other eleven questions. When Watson’s answer is “I’m not sure”, it gives the responses it “considered”, in order of probability – a guess, essentially. Watson’s guesswork would have yielded a correct answer only four times out of eleven.
He also decided that unless Watson is “playing the digital equivalent of possum”, it has problems with context & category. And he noted that most of the questions in this quiz were fairly easy by “Jeopardy!” standards – first-round stuff, not real challengers.
I wouldn’t be a bit surprised if Watson had been “dumbed down” a bit for this game. How much fun would it be if everyone who played got blown away by Watson? Although, without of the time pressure of who can ring in first, it’s much easier for the human contestant to get a decent score.