The first field, the letter P or W, is "blink" data. The second is a time stamp, which would need to be converted to a frame number (time × 24 frames per second). The third is the viseme/phoneme. The words (hey, look) would be discarded, and the expressions (anger, happy) would be added in much the same way the current face shapes are created from the visemes (etc, E, L, AL and so on). I can provide a more comprehensive explanation and examples when required.
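
As a rough illustration of the conversion described above, here is a minimal Python sketch. It rests on several assumptions that are not confirmed by the actual file format: that each record is one line of three whitespace-separated fields, that the time stamp is in seconds, and that visemes and expressions can be recognised by name. The names `parse_line`, `parse_track`, `time_to_frame`, `VISEMES` and `EXPRESSIONS` are hypothetical, and the label sets contain only the names quoted in this post.

```python
FPS = 24  # frame rate mentioned in the post

# Assumed label sets: only the names quoted in the post, not a full list.
VISEMES = {"etc", "E", "L", "AL"}
EXPRESSIONS = {"anger", "happy"}


def time_to_frame(seconds, fps=FPS):
    """Convert a time stamp (assumed to be in seconds) to the nearest frame."""
    return round(seconds * fps)


def parse_line(line):
    """Parse one assumed 'FLAG TIME LABEL' record.

    Returns a dict for visemes and expressions, or None for plain words,
    which are discarded.
    """
    flag, stamp, label = line.split()
    if label in VISEMES:
        kind = "viseme"
    elif label in EXPRESSIONS:
        kind = "expression"
    else:
        return None  # spoken words such as "hey" or "look" are dropped
    return {
        "flag": flag,  # "P" or "W", the blink data
        "frame": time_to_frame(float(stamp)),
        "kind": kind,
        "label": label,
    }


def parse_track(text):
    """Parse a block of records, skipping blank lines and discarded words."""
    records = (parse_line(line) for line in text.splitlines() if line.strip())
    return [r for r in records if r is not None]
```

Calling `parse_track` on the text of a file would give a list of frame-stamped viseme and expression records, which could then drive the face shapes in the same way the existing visemes do. The real field separator and label names may differ, so the sets and the `split()` call would need adjusting to match.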