| 2025 |
MSc Thesis |
Yiren Bai |
Understanding Service Reliability of Large Language Models: An Empirical Characterization on Operator and User Reports |
|
| 2025 |
BSc Thesis |
Maja Bińkowska |
DataViz: A Business Data Visualization System Using LLMs |
|
| 2025 |
Lab Project |
Sándor Battaglini-Fischer, Nishanthi Srinivasan, Bálint László Szarvas |
FAILS: A Framework for Automated Collection and Analysis of LLM Service Incidents |
HotCloudPerf2025 |
| 2024 |
MSc Thesis |
Qingxian Lu |
Operational Analysis of OpenAI Services Using Self-Reported Outages and Incidents |
ICPE2025 |
| 2024 |
BSc Thesis |
Yizhen Zang |
Enhancing Operational Data Synthesis and Predictive Analysis in HPC Clusters Using Large Language Models |
Under Review |
| 2023 |
MSc Thesis |
Shekhar Suman |
'ODAbler': Design and Evaluation of an Operational Data Analytics Framework for Energy-efficient management of Workloads in a Data Centre Simulator OpenDC |
GraphSys2024 |
| 2023 |
MSc Thesis |
Jianyang Gu |
Characterization and Modelling of Resource Usage and Energy Consumption in HPC Datacenters by Machine Learning |
|
| 2023 |
MSc Thesis |
Wenjun Liang |
Data Characterization and Anomaly Detection for HPC Datacenters Using Machine Learning |
|