Phonexia voice biometrics more than doubles its accuracy
27 March 2019 16:06 GMT

Phonexia has released the fourth generation of its voice biometrics technology, called Phonexia Deep Embeddings.

The firm said Deep Embeddings for Speaker Identification, now released in its production version, is the world’s first commercially available voice biometrics engine based exclusively on deep neural networks (DNN). The fourth generation uses deep neural networks together with more robust speaker models, leading to major improvements compared to the previous third generation.

"Every performance improvement in voice biometrics makes the technology more usable in existing and new scenarios. Voice biometrics can be used in criminal investigations, financial services, virtual personal assistants, smart homes, IoT, automotive, Industry 4.0, embedded devices (devices with no permanent connection to the Internet) and much more."

The fourth generation of Phonexia voice biometrics technology has broken through the 99% accuracy barrier, reaching a 0.96% Equal Error Rate. The previous generation’s Equal Error Rate was 1.24%, which was already considered one of the most accurate on the market.
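
For readers unfamiliar with the metric, the Equal Error Rate is the operating point at which the rate of impostors wrongly accepted equals the rate of genuine speakers wrongly rejected. The following is a minimal Python sketch of that textbook definition, not Phonexia's evaluation code: the function names and the sample scores are hypothetical, and the article does not describe how the company ran its measurements.

```python
def equal_error_rate(genuine_scores, impostor_scores):
    """Return (eer, threshold) at the point where the false-accept rate
    and the false-reject rate are closest to each other.

    Assumption: a higher score means the system is more confident that
    the two recordings come from the same speaker.
    """
    thresholds = sorted(set(genuine_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # A trial is accepted when its score is at or above the threshold.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


def relative_reduction(old_rate, new_rate):
    """Fraction by which an error rate dropped, relative to the old rate."""
    return (old_rate - new_rate) / old_rate


# Hypothetical toy scores, for illustration only.
genuine = [0.9, 0.8, 0.7, 0.6, 0.4]
impostor = [0.5, 0.3, 0.2, 0.1, 0.05]
eer, threshold = equal_error_rate(genuine, impostor)
print(f"EER = {eer:.0%} at threshold {threshold}")

# The two Equal Error Rates quoted in the article, in percent.
print(f"Relative EER reduction: {relative_reduction(1.24, 0.96):.1%}")
```

On the toy scores, one impostor in five is accepted and one genuine speaker in five is rejected at a threshold of 0.5, so the sketch reports an EER of 20%. Applied to the two figures quoted above, the same arithmetic gives a relative EER reduction of roughly 23% between the third and fourth generations.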

Phonexia Deep Embeddings™ achieved even larger accuracy improvements when tested on multiple clients’ datasets. In almost all cases, accuracy more than doubled compared to the previous iVector-based generation. The chart below shows the Speaker Identification Equal Error Rate measurements for Phonexia’s third and fourth generations.