HPTT: A High-Performance Tensor Transposition C++ Library

… Recently we presented TTC, a domain-specific compiler for tensor transpositions. Despite the fact that the performance of the generated code is nearly optimal, due to its offline nature, TTC cannot be utilized in all the application codes …

… Static program analysis is increasingly recognized as a fundamental tool for compiler optimization, bug detection and security analysis (among others). Pointer analysis is the foundation on which virtually all other analyses are based …

RTHMS: A Tool for Data Placement on Hybrid Memory System

… Modern applications running on supercomputers have increasing demands for memory, in terms of capacity, speed, power efficiency, and persistence. Since there is no single memory technology today that can satisfy all these requirements …

… of all the tasks scheduled on each core into a DAG by considering … allocator converts the interference graph of all the tasks into a DAG by using a k … task. Both cache allocators significantly improve the utilization of all …

"What's in a Name?" Going Beyond Allocation Site Names in Heap Analysis

OSEK-V: Application-Specific RTOS Instantiation in Hardware

… The employment of a real-time operating system (RTOS) in an embedded control system is often an all-or-nothing decision: While the RTOS abstractions provide for easier software composition and development, the price in terms of event …

[Poster] Understanding the Performance of Dynamic Binary Modification on ARM Using MAMBO

LCTES 2017 People: Cosmin Gorgovan, Amanieu d'Antras, Mikel Luján

… do not affect performance on all available systems and microarchitectures …

Skeletal Program Enumeration for Rigorous Compiler Testing

… skeleton P and a set of variables V, enumerate a set of programs P exhibiting all … and variable set, offering a level of guarantee absent from all existing compiler … is a novel algorithm for computing the canonical (and smallest) set of all non-α …