It is quite fast I agree. You get the most control by recording your own voice.
The National Centre for languages has a couple of tutorials (linked to on this page).
There is a text to speech example in the second tutorial.
As goneunderground has said the full stops certainly seem to act as pauses.