This parallelized version of CNS v1.21 enables parallel computations for some steps (especially FFTs) in the crystallographic refinement process. By default, the cns_solve_env script is set to utilize all processors on a node by setting the variable OMP_NUM_THREADS to the appropriate value. Benchmark runs indicate that CNS runs with the best efficiency on a 2p or 4p node.

All benchmarks were run on a 2.8 GHz Intel EMT 64((E5462) with 8 GB of memory and 12 MB secondary cache.
CNS was built with the Intel compiler and OpenMP. The programs in the benchmark runs were the standard input suite from CNS.

Bottom line: The efficiency of parallelization drops rapidly after 2p. For long jobs, it may be worth using 4p on a dual-core node.