Show simple item record

dc.contributor.advisorLakshmi, J
dc.contributor.authorSaraf, Prathamesh
dc.date.accessioned2024-05-29T07:19:02Z
dc.date.available2024-05-29T07:19:02Z
dc.date.submitted2023
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/6522
dc.description.abstractCloud workload orchestration plays a pivotal role in optimizing the performance, resource utilization, and cost effectiveness of applications in data centers. As modern businesses and IT operations are migrating their businesses to the cloud, understanding the dynamics of cloud data centers has become indispensable. Often, two perspectives play a pivotal role in workload orchestration in data centers. One is from the cloud provider side, whose goal is to provision as many applications as possible on the available resources biding to SLA constraints thereby increasing return on investment. Other being from the side of enterprises and individual customers, often referred to as end users, whose primary objective is to ensure application performance with reduction in deployment cost. Containerization has gained popularity for deploying applications on public clouds, where large enterprises manage numerous applications through thousands of containers placed onto Virtual Machines (VMs). While the need for cost efficient placement in cloud data centers is undeniable, the complexities involved in achieving this goal cannot be understated. This problem is usually modelled as a multi-dimensional Vector Bin packing Problem (VBP). Solving VBP optimally is NP-hard and practical solutions requiring real-time decisions use heuristics. This work explores the landscape of cloud data centers, emphasizing the significance of efficient bin packing in achieving optimal cost and resource utilization. Traditional methods, including heuristics and optimal algorithms, face limitations in handling continuous request arrivals and the dynamic nature of cloud workloads. Integer Linear Programming (ILP), which can provide optimal solutions for small problem sizes with tens of requests, may take minutes to hours to complete even at such scales. Moreover, optimal algorithms inherently demand perfect knowledge of all current and future requests to be placed within the bins, rendering them unsuitable for the dynamic and often unpredictable online placement scenarios prevalent in cloud setups. To address these challenges, this work introduces a novel approach to solving VBP through Reinforcement Learning (RL), trained on the historical container workload trace for an enterprise a.k.a. CARL (Cost-optimized container placement using Adversarial Reinforcement Learning). The proposed work evaluates the effectiveness of CARL in comparison to traditional methods. CARL leverages historical container workload traces, learning from a semi-optimal VBP solver while optimizing VM costs. The contributions of this research extend beyond traditional methods, providing insights into the advantages and disadvantages of heuristics, optimal algorithms, and learning approaches. We trained and evaluated CARL on workloads derived from realistic traces from Google Cloud and Alibaba for the placement of 10,000 container requests onto over 8000 VMs. CARL is fast, making placement decisions for request sets with 124 containers per second within 65 ms onto 1000s of potential VMs. It is also efficient, achieving up to 13.98% lower VM costs than baseline heuristics for larger traces. To push the boundaries further, the research uses Mixture of Experts (MoE) strategy in CARL wherein multiple experts are used that helps CARL in learning placement policies of various approaches combined. The inclusion of a MoE strategy enhances CARL’s adaptability to changes in workload distribution, ensuring competitive performance in scenarios with skewed resource needs or inter-arrival times.en_US
dc.description.sponsorshipAccenture India Limiteden_US
dc.language.isoen_USen_US
dc.relation.ispartofseries;ET00531
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertationen_US
dc.subjectSchedulingen_US
dc.subjectVirtual Machine Placementen_US
dc.subjectDeep Reinforcement Learningen_US
dc.subjectAdversarial Learningen_US
dc.subjectClouden_US
dc.subjectCloud Workloadsen_US
dc.subjectOnline Placementen_US
dc.subjectWorkload Orchestrationen_US
dc.subjectImitation Learningen_US
dc.subject.classificationResearch Subject Categories::TECHNOLOGY::Information technology::Computer scienceen_US
dc.titleIntelligent Methods for Cloud Workload Orchestration in Data Centersen_US
dc.typeThesisen_US
dc.degree.nameMTech (Res)en_US
dc.degree.levelMastersen_US
dc.degree.grantorIndian Institute of Scienceen_US
dc.degree.disciplineEngineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record