Quadrics Background • Develops interconnect products for the HPC market – HPC Linux systems – AlphaServer SC systems • Quadrics is owned by the Finmeccanica group • Quadrics will be 12 years old in July

Bandwidth scalability – 1024 nodes • Bandwidth achieved when 1024 nodes all communicate at the same time • QsNetII provides better average bandwidth and much narrower spread in best to worst case performance System Interconnect Min Max Average Atlas Infiniband 95 762 263 QsNetII Thunder 248 403 369 Data from Lawrence Livermore National Lab, published at the Sonoma OpenFabrics workshop June 2007

Building a 16K node system in 2009/10 • Single water cooled rack will • 8 Blade switches per rack provide 1000-2000 standard • Connect 128 of these racks cores ~12-25 TF. with 1024-way top switches • Single fibre cable per node - for full bi-section bandwidth.

QsNetIII Fault Tolerance • All of the QsNetII Features – CRCs on every packet – Automatic retransmission – Adaptive routing avoids failed links – Redundant routes – Redundant, hot plugable, PSUs and fans + Full line rate testing of each link as it comes up – Switches generate CRPAT, CJPAT or PRBS packets – Links are only added to the route tables when they are (a) up, (b) connect to the right place, and (c) can transfer data without error.

Why Quadrics? • Focus on the most demanding HPC applications • Delivers large system scalability – All nodes achieve host adapter bandwidth at the same time – Minimal spread between best and worst case performance – Low and uniform latency – Highly optimised collectives • Single supplier of interconnect hardware, software, support • Stability of our products • Track record of delivering production systems • European company

ppOpen-HPC: Open Source Infrastructure for Development and ...

post-peta-scale system with heterogeneous computing nodes. “ppOpen-HPC” is five-year ... ¾ Final version of ppOpen-HPC for Post-Peta-Scale SystemRead more

ppOpen-HPC: Open Source Infrastructure for Development and ...

mized for post-peta-scale systems. ... Proceedings of the 4th Fault Tolerance for HPC at eXtreme Scale (FTXS) 2014, in conjunction with DSN2014 (2014)Read more

These presentations are classified and categorized, so you will always find everything clearly laid out and in context.
You are watching All you need to know about the Microsoft Band presentation right now. We are staying up to date!