LLM-based cost-aware task scheduling for cloud computing systems

[thumbnail of Pei_et_al-2025-Journal_of_Cloud_Computing.pdf]
Text
- Published Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Pei, H., Gu, Y., Sun, Y., Wang, Q., Liu, C., Chen, X. ORCID: https://orcid.org/0000-0001-9267-355X and Cheng, L. (2025) LLM-based cost-aware task scheduling for cloud computing systems. Journal of Cloud Computing, 14. 81. ISSN 2192-113X

Abstract/Summary

Cloud task scheduling faces significant challenges due to resource heterogeneity, conflicting optimization objectives, and dynamic workload fluctuations. Traditional heuristic algorithms often necessitate comprehensive knowledge of environmental parameters, significantly constraining their efficacy in dynamic cloud computing environments. While Deep Reinforcement Learning (DRL) methods have shown promise in intelligent scheduling via continuous environment interaction, they suffer from limited generalization to diverse cloud scenarios and lack decision interpretability. To address these shortcomings, this paper proposes LarS, a scheduling framework that employs Large Language Models (LLMs) as high-level decision agents for cloud task scheduling. In LarS, DRL agents trained in carefully chosen representative cloud environments generate a high-quality dataset of scheduling decisions, which is used to fine-tune an LLM. By jointly optimizing average response time, task success rate, and average rental cost, LarS achieves strong generalization across heterogeneous cloud deployments. Experimental results demonstrate that LarS surpasses current approaches in average response time, success rate, and average cost, and maintains strong generalization performance under varied experimental settings.

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/127770
Refereed Yes
Divisions Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
Publisher Springer Nature
Download/View statistics View download statistics for this item

University Staff: Request a correction | Centaur Editors: Update this record