As usual, you shouldn't assume anything about the ordering of the results - but if you don't care about that, this should be faster on suitable hardware. (It will be slightly less efficient overall though, of course. It still needs to do the same work, with added overhead for parallelization.)