>I think I remember setting up the MTT tests on Sif so that tests
>are run both with and without the coll_hierarch component selected.
>The coll_hierarch component stresses code paths and potential
>race conditions in its own way. So, if the problems are showing up
>more frequently for the test runs with the coll_hierarch component
>enabled, then I would check the communicator creation code paths.
>
>
Going back to the subject heading "SM init failures", I looked at a
bunch of the MTT stack traces. E.g., the 143 failures with 20880 on
IU_Sif seen on http://www.open-mpi.org/mtt/index.php?do_redir=973 . If
you look at the failures where "MPI_Init" shows up in the stack trace,
you get one of these two: