Does the tone analyser process speech differently?

Does the tone analyser work differently with speech? As in, can it detect sarcasm, or anger based on our tone (the way we enunciate the words I mean), rather than convert it from speech to text and then processing the text?