Recent Teaching

  • 2020 Fall, CS2212 Discrete Structures.
  • 2020 Fall, CS6320 Algorithms for Parallel Computing.
  • 2020 Spring, SC3260/5260 High-Performance Computing (with Ana Gainaru).
  • 2019 Fall, CS6320 Algorithms for Parallel Computing.

Recent Talks on Scheduling & Resilience

  • Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms. 22nd IEEE International Conference on Cluster Computing, Virtual, Sept. 2020. [slides]
  • Selective Protection for Sparse Iterative Solvers to Reduce the Resilience Overhead. 32nd IEEE International Symposium on Computer Architecture and High Performance Com-
    puting (SBAC-PAD)
    , Virtual, Sept. 2020. [slides]
  • Scheduling Stochastic Jobs on HPC Systems (and Beyond). The 9th Joint Laboratory for Extreme Scale Computing Workshop, Knoxville, TN, USA, Apr. 2019. [slides]
  • Multi-Resource Scheduling of Parallel Jobs. The 13th Scheduling for Large Scale Systems Workshop, Berkeley, CA, USA, June 2018. [slides]
  • Scheduling Parallel Tasks under Multiple Resources: List vs. Pack. The 32nd International Parallel and Distributed Processing Symposium (IPDPS), Vancouver, Canada, May 2018. [slides]
  • Identifying the Right Replication Level to Detect and Correct Silent Errors at Scale. The 7th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Washington D.C., USA, June 2017. [slides]
  • When Amdahl Meets Young/Daly. The 18th IEEE International Conference on Cluster Computing, Taipei, Taiwan, September 2016. [slides]
  • Towards Optimal Multi-Level Checkpointing. The 5th Joint Laboratory for Extreme Scale Computing (JLESC) Workshop, Lyon, France, June 2016. [slides]
  • Mathematical Exercises on Daly and Extensions (with Aurélien Cavelan). The 3rd JLESC Summer School, Lyon, France, June 2016. [slides]
  • Resilience Algorithms to Cope with Fail-stop and Silent Errors. HPC Days in Lyon, Lyon, France, April 2016. [slides]
  • Resilient Algorithms for Coping with Silent Errors. Guest lecture delivered at ENS Lyon for the Master 2 course “Resilient and energy-aware scheduling algorithms”, Lyon, France, October 2015. [slides]
  • Assessing the Impact of Partial Verifications Against Silent Data Corruptions. The 44th International Conference on Parallel Processing (ICPP’15), Beijing, China, September 2015. [slides]
  • Which Verification for Soft Error Detection? The 3rd Joint Laboratory for Extreme Scale Computing (JLESC) Workshop, Barcelona, Spain, July 2015. [slides]

Professional Service

  • Editorial and Organizational
    • Guest Editor for Sustainable Computing: Informatics and Systems (Special Issue on RE-HPC)
    • Co-chair and Co-organizer for the  First International Workshop on Resilience and/or Energy-aware techniques for High-Performance Computing (RE-HPC’16)
  • Program Committee Member
    • 32nd IEEE International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD’20)
    • 49th International Conference on Parallel Processing (ICPP’20)
    • 8th International Workshop on Energy Efficient Data Centres (E2DC’20)
    • 26th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC’19)
    • 48th International Conference on Parallel Processing (ICPP’19)
    • 9th International Conference on Advanced Communications and Computation (INFOCOMP’19)
    • IEEE International Symposium on Object/component/service-oriented Real-time distributed Computing (ISORC’19)
    • International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’18) – Poster Committee
    • 47th International Conference on Parallel Processing (ICPP’18)
    • 7th International Workshop on Energy Efficient Data Centres (E2DC’18)
    • 8th International Conference on Advanced Communications and Computation (INFOCOMP’18)
    • 24th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC’17)
    • 17th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP’17)
    • 6th International Workshop on Energy Efficient Data Centres (E2DC’17)
    • 4th International Workshop on High Performance Computing for Big Data (HPC4BD’17)
    • International Conference on Supercomputing (ICS’17) – External Reviewer Committee
    • 7th International Conference on Advanced Communications and Computation (INFOCOMP’17)
    • 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS’16)
    • 3rd International Workshop on High Performance Computing for Big Data (HPC4BD’16)
    • 27th International Symposium on Computer Architecture and High Performance
      Computing (SBAC-PAD’15)
  • Journal/External Reviewer
    • TPDS, JPDC, IJHPCA, Cluster Computing, Sustainable Computing, Concurrency and Computation, IJFCS, PPL, IPL, TII, COMNET, PDP’17, IPDPS’14, Euro-Par’13, GreenCom’13, IC3’13, ICPADS’12.