
The principal objective for the project is to facilitate the development of a software environment that meets the demands of the new generation of HPC architectures. This will cover the whole software stack (system and programming environments), numerical solvers and pre/post/co processing software.
A co-design approach is employed, that covers the software environment for computer architectures, the requirements of more demanding applications, and is adapted to future hardware architectures (multicore/many core processors, high-speed networks and data storage).
These developments will be validated according to their capacity to deal with the new exascale challenges- larger scalability, higher resiliency, greater security, improved modularity, with better abstraction and interactivity for application cases.
Start Date: September 2014
Duration: 3 years
Avalon Members: T. Gautier, L.Lefevre, C. Perez, I. Rais, J. Richard
More information on the ELCI web site.
o propose to revisit the principles of existing schedulers after studying the main factors impacted by job submissions. Then, we will propose novel efficient algorithms for optimizing the schedule for unconventional objectives like energy consumption and to design provable approximation multi-objective optimization algorithms for some relevant combinations of objectives (performance, fairness, energy consumption, etc.). An important characteristic of the project is its right balance between theoretical analysis and practical implementation. The most promising ideas will lead to integration in reference systems such as SLURM and OAR as well as new features in programming standards implementations such as MPI or OpenMP. We expect MOEBUS results to impact further use of very large scale parallel platforms.
The last decade has brought tremendous changes to the characteristics of large scale distributed computing platforms. Large grids processing terabytes of information a day and the peer-to-peer technology have become common even though understanding how to efficiently such platforms still raises many challenges. As demonstrated by the USS SimGrid project funded by the ANR in 2008, simulation has proved to be a very effective approach for studying such platforms. Although even more challenging, we think the issues raised by petaflop/exaflop computers and emerging cloud infrastructures can be addressed using similar simulation methodology.
The goal of the SONGS project is to extend the applicability of the SimGrid simulation framework from Grids and Peer-to-Peer systems to Clouds and High Performance Computation systems. Each type of large-scale computing system will be addressed through a set of use cases and lead by researchers recognized as experts in this area.


