It means that if a randomly chosen, verified human is asked whether 'subject x' seems human (in a context where either answer is reasonable), there is a better than 50% chance that they will say yes.
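To make that criterion concrete, here is a rough sketch of how I read it. The function names and the list-of-votes format are my own assumptions for illustration, not anything taken from the actual test: each judge's answer is a boolean, and the subject "passes" if more than half of the judges said it seemed human.

```python
def humanness_rate(votes):
    """Return the fraction of judges who answered 'seems human'.

    `votes` is a hypothetical list of booleans, one per judge
    (True = judged human). This format is an assumption.
    """
    if not votes:
        raise ValueError("need at least one vote")
    return sum(votes) / len(votes)


def seems_human(votes, threshold=0.5):
    """True if strictly more than `threshold` of judges said 'human'."""
    return humanness_rate(votes) > threshold


# Example: three of four judges thought the subject was human.
print(seems_human([True, True, False, True]))
```

With only a handful of judges the observed fraction is a noisy estimate of the underlying chance, so a real evaluation would need many more votes than this.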

What got me, given that these are AIs designed as opponents for a first-person shooter, was the part about how they could be used in robotics for more pleasant interactions. The drones will shoot more nicely?