The first improvement (i. and i: for floating point and integer vectors)
is minor, a saving of under 15% in run time.
The other improvements are more substantial; benchmarks demonstrating
them are available by following the links above.