Thesis Defense – Hadrien Croubois

Hadrien Croubois: Toward an autonomic engine for scientific workflows and elastic Cloud infrastructure

Everyone is welcome to attend Hadrien Croubois’s thesis defense, which will take place Tuesday 16th October at 14h at Salle des thèse (ENS de Lyon).

You are also invited to the cocktail that follows the defense.


Eddy Caron, ENS de Lyon

Committee members:

Noël De Palma, Université Jospeh Fourier, reviewer
Johan Montagnat, CNRS, Laboratoire I3S UMR 7271, reviewer
Luciana Arantes, Université Sorbonne, examiner
Frédéric Desprez, Inria, examiner
Pushpinder Kaur Chouhan,Ulster University, examiner


The constant development of scientific and industrial computation infrastructures requires the concurrent development of scheduling and deployment mechanisms to manage such infrastructures. Throughout the last decade, the emergence of the Cloud paradigm raised many hopes, but achieving full platform autonomicity is still an ongoing challenge.

Work undertaken during this Ph.D. aimed at building a workflow engine that integrated the logic needed to manage workflow execution and Cloud deployment on its own. More precisely, we focus on Cloud solutions with a dedicated Data as a Service (DaaS) data management component. Our objective was to automate the execution of workflows submitted by many users on elastic Cloud resources.

This contribution proposes a modular middleware infrastructure and details the implementation of the underlying modules:

  • A workflow clustering algorithm that optimises data locality in the context of DaaS-centered communications;
  • A dynamic scheduler that executes clustered workflows on Cloud resources;
  • A deployment manager that handles the allocation and deallocation of Cloud resources according to the workload characteristics and users’ requirements.

All these modules have been implemented in a simulator to analyse their behaviour and measure their effectiveness when running both synthetic and real scientific workflows. We also implemented these modules in the DIET middleware to give it new features and prove the versatility of this approach. Simulation running the WASABI workflow (waves analysis based inference, a framework for the reconstruction of gene regulatory networks) showed that our approach can decrease the deployment cost by up to 44% while meeting the required deadlines.

Phd Crossover

Phd Crossover

LIP & LBMC – Avalon & Gandrillon – Hadrien & Arnaud –

En octobre vous pouvez découvrir le crossover entre la thèse d’Hadrien Croubois (Avalon) du laboratoire de l’Informatique du Parallélisme (LIP) de l’ENS  et d’Arnaud Bonnafoux (Gandrillon) du Laboratoire de Biologie Moléculaire de la Cellule (LBMC) de l’ENS.


Commencez par découvrir le vendredi 12 octobre 2018, à 14h à l’ENS de Lyon site Monod (salle de réunion M6) comment Arnaud Bonnafoux a conçu WASABI. (Plus d’information ici)

au LIP

Puis venez découvrir le mardi 16 octobre 2018, à 14h à l’ENS de Lyon site Monod (salle des thèses) comment le moteur de workflow d’Hadrien Croubois est profitable aux besoins en calcul de WASABI. (Plus d’information ici)

Thesis Defense — Pedro Silva

Everyone is welcome to attend Pedro Silva’s thesis defense, which will take place next Monday 11th December at 14h at Salle Place de l’Ecole (1, Place de l’École, 69003).

On the mapping of distributed applications onto multiple Clouds.

Christian Perez, Inria
Frédéric Desprez, Inria

Committee members:
Patricia PASCAL-STOLF, IUT de Blagnac, reviewer
Emmanuel Jeannot, Inria, reviewer
Christine Morin, Inria, examiner
Erik Elmroth, University of Umea – Sweden, examiner

You are also invited to the cocktail that follows the defense at the
LIP’s coffee area (coin café) (46 Allée d’Italie, 3rd floor).


The Cloud has become a very popular platform for deploying distributed applications. Today, virtually any credit card holder can have access to Cloud services. There are many different ways of offering Cloud services to customers. In this thesis we specially focus on the Infrastructure as a Service (IaaS), a model that, usually, proposes virtualized computing resources to costumers in the form of virtual machines (VMs).
Thanks to its attractive pay-as-you-use cost model, it is easier for customers to outsource hosting infrastructures and benefit of
savings related to upfront investments and maintenance costs. Also, customers can have access to features such as scalability, availability, and reliability, which previously were almost exclusive for large companies.

To place a distributed application on the Cloud, a customer must first consider the mapping between her application (or its parts) to the target infrastructure. She needs to take into consideration cost, resource, and communication constraints to select the most suitable set of VMs, from private and public Cloud providers. However, defining a mapping manually may be a challenge in large-scale or time constrained scenarios since the number of possible configuration explodes. The large offer of Cloud providers in the market and eventual advantages of having an application deployed over different Cloud sites, like redundancy and reachability, for example, make this challenge even more complex. Furthermore, when automating the mapping definition, scalability issues must be taken into account since this problem is a generalization of the graph homomorphism problem, which is NP-complete.

In this thesis we address the problem of calculating initial and reconfiguration placements for distributed applications over possibly multiple Clouds. Our objective is to minimize renting and migration costs while satisfying applications’ resource and communication constraints. We concentrate on the mapping between applications and Cloud infrastructures. Using an incremental approach, we split the problem into three different parts and propose efficient heuristics that can compute good quality placements very quickly for small and large scenarios. First we model the problem as a communication oblivious problem and propose vector packing based heuristics able to calculate initial placement solutions. Then, we extend our application and infrastructure models by introducing communication constraints, and propose a graph homomorphism based heuristic to calculate initial placement solutions. In the last part, we introduce application reconfiguration to our models and propose a heuristic able to calculate communication and reconfiguration aware placement solutions.

These heuristics have been extensively evaluated against state of the art approaches such as MIP solvers and meta-heuristics. We show through simulations that the proposed heuristics manage to compute solutions in a few seconds that would take many hours or days for other approaches to compute.

3ème journée du thème IDCHP de la FIL

Troisième journée du thème Informatique distribuée et calcul haute performance (IDCHP) de la Fédération Informatique de Lyon

Le 13 octobre à 9h30 à L’ENS de Lyon. Salle des conseils de l’ENS. Batiment Monod (2ème étage)
Comment venir: Accès à l’ENS. Site de Monod.
Inscription (gratuite mais fortement souhaitée pour des questions de logistique via le pad communiqué dans la mailing list)

Journée d’échange axée sur les systèmes distribuées


Talk invité ouvert à tous: The Red Belly Blockchain: Consistency and Scalability. Vincent Gramoli (Université de Sydney).  (voir détails ci-dessous)

 Suite à la première journée du thème IDCHP de la FIL nous avons prévu de nous retrouver sur des thèmes plus spécifiques pour des journées d’échange.

  • La deuxième journée sur l’IoT et les capteur aura lieu le 27 novembre (lieu et détails à venir)
  • La troisième journée sur les systèmes distribuées le 13 octobre (voir le programme ci-dessous)


9h30 – 9h45 Accueil

9h45 – 12h00 Session plénière

Quelques sujets que nous pouvons aborder …

Autonomic workflow Deployment on dynamic platforms Hadrien Croubois (LIP) / Eddy Caron (LIP)
Taming Tail Latency in Key-Value Stores for Heterogeneous Workloads Sonia Ben Mokhtar (LIRIS)
Blockchain: an Ephemeral Buzzword or a New Paradigm for Distributed Systems? – What Research Agenda for the Next 5 Years?  Lionel Brunie (LIRIS)

Achieving Privacy and Utility Trade-off in User Mobility Data Sara Bouchenak (LIRIS)

12h00 – 13h30 Repas offert par la FIL

14h00Exposé ouvert à tous à l’Amphi B – Site de Monod – ENS de Lyon

Titre: The Red Belly Blockchain: Consistency and Scalability
Vincent Gramoli (University Sydney)
Abstract: To cope with blockchain inconsistencies, like double-spending, developers started building upon Byzantine Fault-Tolerant (BFT) consensus to solve this blockchain problem. At first it seems reasonable because consensus can be effective at totally ordering transactions into a chain.
Unfortunately, these two problems are different. In short, the blockchain problem aims at totally ordering blocks of transactions issued by a large set of potentially misbehaving internet machines, whereas the consensus problem aims at deciding upon one of the values proposed by typically fewer but non-misbehaving machines.
In this talk, I will describe the blockchain consensus problem, a variant of the classic consensus definition, that was recently defined for blockchains. I will present the red belly blockchain, a blockchain that does not use BFT consensus as a black box but rather solves the blockchain consensus problem and illustrate empirically the benefits it offers.
Bio: Vincent Gramoli is the Head of the Concurrent Systems Research Group at the University of Sydney and a Senior Researcher at Data61-CSIRO.
He received his PhD from University of Rennes and his Habilitation from UPMC Sorbonne University. Prior to this, he was affiliated with Cornell University and EPFL. His research interest is in distributed computing.


Ben Mokhtar Sonia LIRIS
Bennani Nadia LIRIS
Bouchenak Sara LIRIS
Brunie Lionel LIRI
Caniou Yves LIP
Caron Eddy LIP
Cazabet Rémy LIRIS
Croubois Hadrien LIP
Furno Angelo CITI
Gramoli Vincent Sydney University
Kotto Kombi Roland LIRIS
Le Mouël Frédéric CITI
Lefevre Laurent LIP
Moyaux Thierry DISP
Médini Lionel LIRIS
Perez Christian LIP
Simonin Olivier CITI
Urbain Xavier LIRIS
Vivien Frédéric LIP
Canon Louis-Claude LIP







Pour toutes questions merci de contacter Eddy Caron et/ou Sonia Ben Mokhtar.

Posted in FIL

GrPPI: A Generic Parallel Pattern Interface for Stream and Data Processing

Next Avalon working group will be tomorrow (3/10/2016) at 15h in amphi L. Dr. Manuel F. Dolz, currently visiting us from Madrid, will be talking about his work.

GrPPI: A Generic Parallel Pattern Interface for Stream and Data Processing

Current parallel programming frameworks aid developers to a great extent in implementing applications that exploit parallel hardware resources. Nevertheless, developers require additional expertise to properly use and tune them to operate efficiently on specific parallel platforms. With the lack of high-level parallel pattern abstractions, we present GrPPI, a generic and reusable parallel pattern interface for both stream processing and data-intensive C++ applications ( GrPPI accommodates a layer between developers and existing parallel programming frameworks targeting multi-core processors, such as C++ threads, OpenMP and Intel TBB. To achieve this goal, this interface leverages modern C++ features, metaprogramming techniques, and template-based programming to act as switch among those frameworks. All in all, thanks to its high-level API and compact design, GrPPI allows users to easily expose parallelism and hide away the complexity behind concurrency mechanisms. We evaluate this interface using an image processing use case and demonstrate its benefits from the usability, flexibility, and performance points of view.



Avalon members contribute (Eddy Caron as header of the parallelism group) and will be attending to the Compas’17 conference.

Avalon’s talks at Compas’2017.

Four talks:

  • Comment estimer la consommation énergétique de processus avec seulement un wattmètre (et un solveur) ? Valentin Lorentz, Laurent Lefèvre and Gilles Fedak. (Wednesday 11:30 – 12:00)
  • On the (In)Efficiency of IPFS for Geo-distributed VM Images. Jad Darrous (Wednesday 14:00-14:30)
  • Ordonnancement multi-objectifs de workflows dans le cloud : un modèle plus réaliste avec tâches de durée stochastique. Aurélie Kong Win Chang (ROMA member, co-supervised by AVALON members) (Thursday 11:30 – 12:00)
  • Assessing the Impact of Network Bandwidth and Operator Placement on Data Stream Processing for Edge Computing Environments. Alexandre Da Silva Veith, Marcos Dias de Assuncao and Laurent Lefevre  (Thursday 14:00 – 15:30)

Avalon’s keynote by Gilles Fedak (Thursday 9:00 – 10:00)

Title: From research to company. Two technology transfers stories.

Abstract: During this keynote we will have the opportunity to discover successful technology transfers. Two speakers, two stories, two points of view, two feedback for young researchers and so much more…

From an INRIA/University Research Team to an International Innovative Company. Denis presents in this talks the major steps that were needed to achieve a successful technology transfer from an academic team to a startup company, and then growing to a well established company with international presence. From the many difficulties, and often failure, ache, bitterness and headache, to the satisfaction, pleasure, and enjoyment that you sometimes get.

The story behind iExec, the first French major ICO success. What’s an ICO ? ICO stands for Initial Coin Offering. It’s a new way for startup to get funding by emitting new crypto-currencies or tokens on the blockchain. On April 19, iExec raised 10.000 Bitcoins in less than 3 hours, making it the 5th largest ICO in history. In this talk, you will learn about Bitcoins, the decentralization of the economy, and also about the scientific background of iExec, the first blockchain-based distributed Cloud platform.

Laurent Lefevre co-organizes the GreenDays@Sophia event co-located with Compas’2017 ! (June 26-27, 2017)

It is free for Compas participants, so come, assist and discuss with the French Green community !

More information on : GreenDays@Sophia website

More information on the Compas Website.

1ère journée du thème IDCHP de la FIL

Première journée du thème Informatique distribuée et calcul haute performance de la Fédération Informatique de Lyon

Le 13 juin à 9h30 à L’INSA. Amphi Bâtiment Blaise Pascal 501-337 (3ème étage)
Comment venir:
Inscription (gratuite mais fortement souhaitée pour des questions de logistique)

Cette journée de réflexion de la FIL est axée sur le partage des données au sein de notre thème


Partant du constat que beaucoup de nos thématiques peuvent se retrouver autour des données, nous vous proposons de réfléchir à la manière de structurer nos échanges:
  • Comment partager des données au sein de la fédération ?
  • Vous avez réalisé des acquisitions de données pour un papier ou des expériences et ces données sont partageables ? 
  • Vous avez une collaboration académique ou industrielle et ces données sont partageables ? 
  • Vous avez des matrices de très grande taille qui pourrait intéresser vos collègues ? 
  • Vous avez réalisé des benchmarks à partir de data-sets significatifs selon vous qui pourraient servir à d’autres ? 
  • Vous avez généré des dizaines de To de log ?
  • Vous avez des wattmètres ou des capteurs qui produisent des données ?
  • Vous avez un accès à des bases de données de génomique, d’images satellitaires, de data-set issus des réseaux sociaux … ?
  •  Vous commencez à voir où on veut en venir et vous avez d’autres idées à soumettre ? 


9h20 – 9h30 Accueil

9h30 – 12h00 Session plénière

9h30 – 9h45 Intro de la FIL et de la journée [Lionel Brunie / Eddy Caron]
9h45 – 9h55 Sonia Ben Mokhtar (LIRIS)
Données de Mobilité. projet IMU PRIVAMOV
9h55 – 10h05 Jonathan Rouzaud Cornabas (LIRIS)
Expérience évolution in-silico de bactérie
10h05 – 10h15 Laurent Lefevre (LIP)
Mise en place d’une infrastructure de wattmetres pour la plate-forme Grid5000 lyonnaise.
10h15 – 10h25 Stéphane Frénot (CITI)
La plateforme Jumplyn: Collecte et aide à la gestion de projets étudiants.
10h25 – 10h35 Lionel Médini (LIRIS)
Le Web sémantique et le Web des objets.
10h35 – 10h45 Simon Delamare (LIP)
Collecte de données sur la plateforme Grid’5000.
10h45 – 11h00 Frédéric Le Mouël (CITI)
Plateforme capteurs IoT sur une architecture Fog/Edge computing

11h00 – 12h00 Ordre du jour et Discussions

– La FIL et le partage des données. Bilan.
– Vos attentes vis à vis de la FIL
– L’organisations des prochaines journées
– Idées pour le prochain appel de la FIL.

12h00 – 13h30 Repas offert par la FIL

13h30 – 15h30 Ateliers de discussions en petits groupes en fonction des sujets et questions du matin
15h30 – 16h00 Restitution & Conclusions




Alias Christophe LIP
Ben Mokhtar Sonia LIRIS
Benslimane Djamal LIRIS
Biennier Frédérique LIRIS
Brunie Lionel LIRI
Caron Eddy LIP
Delamare Simon LIP
Frénot Stéphane CITI
Gautier Thierry LIP
Ghedira Chirine LIRIS
Hassas Salima LIRIS
Le Mouel Frederic CITI
Lefevre Laurent LIP
Lesueur Francois CITI
Matignon Laetitia LIRIS et CITI
Médini Lionel LIRIS
Nicolas Olivier CITI
Pérez Christian LIP
Puzenat Didier LIRIS
Rouzaud-Cornabas Jonathan LIRIS
Stouls Nicolas CITI
Subercaze Julien Laboratoire Hubert Curien
Trystram Jean-Baptiste CITI
Urbain Xavier LIRIS


Pour toutes questions merci de contacter et/ou

Posted in FIL

Sensibilisation à la valorisation de la recherche (ENS de Lyon)

Suivez les retours d’expériences de 4 chercheurs de l’ENS de Lyon côté valorisation.


– 9h00 : Accueil et Introduction (Service valorisation)
– 9h10 : Guillaume Fantino : Retour d’expérience sur la création de start-up et thèse CIFRE
– 9h50 : Patrick Picard : Retour d’expérience sur la valorisation d’une base de données et prestation de service
– 10h20 : Bertrand Mollereau : Retour d’expérience sur la valorisation d’un savoir-faire et d’un brevet via une collaboration de recherche
– 11h00 : Eddy Caron (Avalon) : Retour d’expérience sur la valorisation d’un logiciel via une collaboration de recherche
– 11h40 – 12h30 : Clôture et invitation au pot (Service valorisation).