Talks
I presented the following talks at international conferences and professional events:
An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models. [Slide]
- ICPE, Virtual, Toronto, Canada, May 7, 2025.
- ICT.OPEN, Utrecht, the Netherlands, April 16, 2025.
Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis. [Slide]
- ICPADS, Belgrade, Serbia, October 12, 2024.
How Do ML Jobs Fail in Datacenters? Analysis of a Long-Term Dataset from an HPC Cluster. [Slide]
- HotCloudPerf, Coimbra, Portugal, April 16, 2023.
Last Update: