Could you clarify how you are running this application? We have a known problem with comm_spawn from a singleton (i.e., if you just did
a.out instead of mpirun —np 1 a.out) - the errors look somewhat like what you are showing here, hence our curiousity.

I am getting the following error messages every time I run a very simple program that spawns child processes:[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/soh_base_get_proc_soh.c at line 80
[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/oob_base_xcast.c at line 108[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/rmgr_base_stage_gate.c at line 276[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/soh_base_get_proc_soh.c at line 80
[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/oob_base_xcast.c at line 108[turkana:27949] [0,0,0] ORTE_ERROR_LOG: Not found in file base/rmgr_base_stage_gate.c at line 276

These errors are being generated by the master process. Does any body know what do they mean?

Also, if I spawn four child processes, not all of them run to completion, i.e. till MPI_Finalize.