LLM-based cost-aware task scheduling for cloud computing systems

Pei, Haoran; Gu, Yan; Sun, Yajuan; Wang, Qingle; Liu, Cong; Chen, Xiaomin; Cheng, Long

Text (Open Access), Published Version. Available under License Creative Commons Attribution.

Pei, H., Gu, Y., Sun, Y., Wang, Q., Liu, C., Chen, X. (ORCID: https://orcid.org/0000-0001-9267-355X) and Cheng, L. (2025) LLM-based cost-aware task scheduling for cloud computing systems. Journal of Cloud Computing, 14, 81. ISSN 2192-113X. doi: 10.1186/s13677-025-00822-0

Abstract/Summary

Cloud task scheduling faces significant challenges due to resource heterogeneity, conflicting optimization objectives, and dynamic workload fluctuations. Traditional heuristic algorithms often necessitate comprehensive knowledge of environmental parameters, significantly constraining their efficacy in dynamic cloud computing environments. While Deep Reinforcement Learning (DRL) methods have shown promise in intelligent scheduling via continuous environment interaction, they suffer from limited generalization to diverse cloud scenarios and lack decision interpretability. To address these shortcomings, this paper proposes LarS, a scheduling framework that employs Large Language Models (LLMs) as high-level decision agents for cloud task scheduling. In LarS, DRL agents trained in carefully chosen representative cloud environments generate a high-quality dataset of scheduling decisions, which is used to fine-tune an LLM. By jointly optimizing average response time, task success rate, and average rental cost, LarS achieves strong generalization across heterogeneous cloud deployments. Experimental results demonstrate that LarS surpasses current approaches in average response time, success rate, and average cost, and maintains strong generalization performance under varied experimental settings.
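
The abstract describes DRL agents producing a dataset of scheduling decisions that is then used to fine-tune an LLM, but this record does not give the data format. As a rough illustration only, the sketch below shows one way such decisions could be serialized into prompt/completion pairs. The `VM` and `Decision` types, their fields, the prompt wording, and the JSONL layout are all hypothetical assumptions and are not taken from the LarS paper.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class VM:
    # Hypothetical VM description; every field here is an illustrative assumption.
    vm_id: int
    cpu_mips: float
    price_per_hour: float
    queue_length: int


@dataclass
class Decision:
    # One (state, action) pair as a trained DRL agent might log it (assumed layout).
    task_length_mi: float
    deadline_s: float
    vms: List[VM]
    chosen_vm_id: int


def to_example(d: Decision) -> dict:
    """Turn one logged decision into a prompt/completion pair for supervised fine-tuning."""
    vm_lines = [
        f"VM {vm.vm_id}: speed={vm.cpu_mips:g} MIPS, "
        f"price={vm.price_per_hour:g}/h, queued={vm.queue_length}"
        for vm in d.vms
    ]
    prompt = (
        f"Task: length={d.task_length_mi:g} MI, deadline={d.deadline_s:g} s\n"
        + "\n".join(vm_lines)
        + "\nWhich VM should run this task?"
    )
    # The completion is simply the action the DRL agent took in this state.
    return {"prompt": prompt, "completion": f"VM {d.chosen_vm_id}"}


def to_jsonl(decisions: List[Decision]) -> str:
    """Serialize decisions as JSON Lines, one training example per line."""
    return "\n".join(json.dumps(to_example(d)) for d in decisions)
```

In this sketch the task attributes and VM prices appear in the prompt so that a fine-tuned model could, in principle, weigh response time, deadline success, and rental cost together. How LarS actually encodes state and objectives is documented in the published version.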

Item Type	Article
URI	https://centaur.reading.ac.uk/id/eprint/127770
Identification Number/DOI	10.1186/s13677-025-00822-0
Refereed	Yes
Divisions	Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
Publisher	Springer Nature

Deposit Details

Date Deposited:	07 Jan 2026 11:54
Last Modified:	18 Jan 2026 08:00