The PARAM Padma is the newest systems made by the Indian C-DAC. It is built
somewhat asymmetrically from 54 4-processor SMPs and 1 32-processor node. All
nodes employ 1 GHz IBM POWER4 processors. As an interconnection network C-DACs
own PARAMnet-II is used for which a peak bandwidth of 2.5 Gb/s (312 MB/s) is
given with a latency for short messages of ≅ 10 µs. The network is
build from 16-port PARAMnet-II switches and has a Clos64 topology, very similar
to the structure used by Myrinet. No MPI results over
this network are available.
C-DAC has already a long tradition of building parallel machines and it has
always provided its own software to go with them. Therefore, the Padma comes
with Fortran 90, C(++), MPI, and a Parallel Files System.

Measured Performances:
The Padma performs at 532 Gflop/s with the HPC Linpack Benchmark (see [42]) for a linear system of size N =
224,000 on a 62-node machine with a theoretical peak of 992 Gflop/s. That
amounts to an efficiency of 53.6% for this benchmark.