Dr. Martin Schulz

Leading the Way to Ultrascale Computing: The BG/L System Software Environment

Zeit: Mon 19.12.2005, 11:00, 60 Minuten
Ort: HS14

Zusammenfassung

Abstract:
Unprecedented levels of performance have been achieved on BlueGene/L (BG/L). The order of magnitude improvement in performance was enabled by an order of magnitude increase in processors. This unprecedented scale has presented unique challenges for the system software environment. We have successfully met these challenges by carefully designing the system software to reflect our experiences with previous ASC systems. In this talk, we will describe the overall BG/L system software environment, including its multiple operating systems (lightweight compute node kernel or CNK, I/O node kernel and front-end traditional OS), MPI implementation and parallel file system.
We will then discuss specific details of implementing TotalView, the high performance debugger for the system, including a description of BG/L's unique debugger interface. We conclude with current work in porting the DPCL dynamic instrumentation infrastructure to BG/L, a particularly challenging activity since the CNK does not support dynamic libraries.

Vortragender

Biography:
Martin Schulz joined CASC in 2004 as a computer scientist. His research interests include parallel and distributed architectures and applications; performance monitoring, modeling and analysis; memory system optimization; parallel programming paradigms; tool support for parallel programming; and fault tolerance at the application and system level.
Martin earned his doctorate in computer science in 2001 from the Technische Universität München (Munich, Germany). He also holds a master of science in computer science from the University of Illinois at Urbana-Champaign. After completing his graduate studies and before joining LLNL, he worked for two years as a research associate at Cornell University.