Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers
This study proposes the use of a hybrid precopy/postcopy virtual machine (VM) migration framework to aid an autonomous agent when making migration decisions to continuously optimize the balance among migration time, downtime, and energy consumption. The data center state and the resource load, inclu...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | en en en |
| Published: |
IEEE
2025
|
| Subjects: | |
| Online Access: | http://irep.iium.edu.my/123767/7/123767_%20Reinforcement%20learning-driven.pdf http://irep.iium.edu.my/123767/8/123767_%20Reinforcement%20learning-driven_Scopus.pdf http://irep.iium.edu.my/123767/9/123767_%20Reinforcement%20learning-driven_WoS.pdf http://irep.iium.edu.my/123767/ https://ieeexplore.ieee.org/document/11175406 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1847096656840884224 |
|---|---|
| author | Hidayat, Taufik Ramli, Kalamullah Harwahyu, Ruki Salman, Muhammad Gunawan, Teddy Surya |
| author_facet | Hidayat, Taufik Ramli, Kalamullah Harwahyu, Ruki Salman, Muhammad Gunawan, Teddy Surya |
| author_sort | Hidayat, Taufik |
| building | IIUM Library |
| collection | Institutional Repository |
| content_provider | International Islamic University Malaysia |
| content_source | IIUM Repository (IREP) |
| continent | Asia |
| country | Malaysia |
| description | This study proposes the use of a hybrid precopy/postcopy virtual machine (VM) migration framework to aid an autonomous agent when making migration decisions to continuously optimize the balance among migration time, downtime, and energy consumption. The data center state and the resource load, including the CPU, memory, and network, are represented in the agent’s state space using a two-layer graph neural network (GNN), and the asynchronous advantage actor–critic (A3C) algorithm is employed to dynamically determine whether to continue the precopy phase or switch to postcopy and optimize the trade-off among the total migration time, downtime, and energy consumption while adhering to the service-level agreement (SLA) constraints. An adaptive host selection policy ensures that VMs are migrated only to underloaded machines, preventing overload and ensuring system stability. A simulation evaluation that employed the VM workload from the GWA-Bitbrains dataset revealed that this framework achieved a total migration time of 45.5 s, with 30.1 s spent on the precopy phase and 15.4 s spent on the postcopy phase, resulting in a downtime of 15.4 s. Compared with previous approaches, this result represents an decrease in total migration time of 12.5% from 52 s to 45.5 s; a 23% decrease in downtime from 20 s to 15.4 s; and a 4.4% increase in energy efficiency from 87% to 91.4%. The SLA compliance remained stable at 92.8%, affirming that the service quality was preserved. This study demonstrates the effectiveness of integrating GNN-based embeddings and A3C scheduling in terms of reducing downtime and energy usage while maintaining reliable service delivery in data centers. |
| format | Article |
| id | my.iium.irep-123767 |
| institution | Universiti Islam Antarabangsa Malaysia |
| language | en en en |
| publishDate | 2025 |
| publisher | IEEE |
| record_format | dspace |
| spelling | my.iium.irep-1237672025-10-17T08:09:43Z http://irep.iium.edu.my/123767/ Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers Hidayat, Taufik Ramli, Kalamullah Harwahyu, Ruki Salman, Muhammad Gunawan, Teddy Surya TK7885 Computer engineering This study proposes the use of a hybrid precopy/postcopy virtual machine (VM) migration framework to aid an autonomous agent when making migration decisions to continuously optimize the balance among migration time, downtime, and energy consumption. The data center state and the resource load, including the CPU, memory, and network, are represented in the agent’s state space using a two-layer graph neural network (GNN), and the asynchronous advantage actor–critic (A3C) algorithm is employed to dynamically determine whether to continue the precopy phase or switch to postcopy and optimize the trade-off among the total migration time, downtime, and energy consumption while adhering to the service-level agreement (SLA) constraints. An adaptive host selection policy ensures that VMs are migrated only to underloaded machines, preventing overload and ensuring system stability. A simulation evaluation that employed the VM workload from the GWA-Bitbrains dataset revealed that this framework achieved a total migration time of 45.5 s, with 30.1 s spent on the precopy phase and 15.4 s spent on the postcopy phase, resulting in a downtime of 15.4 s. Compared with previous approaches, this result represents an decrease in total migration time of 12.5% from 52 s to 45.5 s; a 23% decrease in downtime from 20 s to 15.4 s; and a 4.4% increase in energy efficiency from 87% to 91.4%. The SLA compliance remained stable at 92.8%, affirming that the service quality was preserved. This study demonstrates the effectiveness of integrating GNN-based embeddings and A3C scheduling in terms of reducing downtime and energy usage while maintaining reliable service delivery in data centers. IEEE 2025-09-22 Article PeerReviewed application/pdf en http://irep.iium.edu.my/123767/7/123767_%20Reinforcement%20learning-driven.pdf application/pdf en http://irep.iium.edu.my/123767/8/123767_%20Reinforcement%20learning-driven_Scopus.pdf application/pdf en http://irep.iium.edu.my/123767/9/123767_%20Reinforcement%20learning-driven_WoS.pdf Hidayat, Taufik and Ramli, Kalamullah and Harwahyu, Ruki and Salman, Muhammad and Gunawan, Teddy Surya (2025) Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers. IEEE Access, 13. pp. 169521-169533. E-ISSN 2169-3536 https://ieeexplore.ieee.org/document/11175406 10.1109/ACCESS.2025.3613235 |
| spellingShingle | TK7885 Computer engineering Hidayat, Taufik Ramli, Kalamullah Harwahyu, Ruki Salman, Muhammad Gunawan, Teddy Surya Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title | Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title_full | Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title_fullStr | Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title_full_unstemmed | Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title_short | Reinforcement learning-driven hybrid precopy/postcopy VM migration for energy-efficient data centers |
| title_sort | reinforcement learning-driven hybrid precopy/postcopy vm migration for energy-efficient data centers |
| topic | TK7885 Computer engineering |
| url | http://irep.iium.edu.my/123767/7/123767_%20Reinforcement%20learning-driven.pdf http://irep.iium.edu.my/123767/8/123767_%20Reinforcement%20learning-driven_Scopus.pdf http://irep.iium.edu.my/123767/9/123767_%20Reinforcement%20learning-driven_WoS.pdf http://irep.iium.edu.my/123767/ https://ieeexplore.ieee.org/document/11175406 |
| url_provider | http://irep.iium.edu.my/ |
