Abstract

Energy consumption is an emerging concern in many fields, including information technology, particularly in data warehousing environments where Extract, Transform, Load (ETL) processes account for a significant portion of operational costs and resource utilization. Despite advances in hardware-level optimization, limited attention has been given to software-level energy optimization within ETL workflows. In reality, software is as important as hardware, and it is equally responsible for a decrease or increase in energy consumption. We argue that for modern applications in which energy efficiency is a priority, ETL processes should be optimally designed. This paper addresses this gap by proposing a Green ETL (GETL) approach designed to reduce energy consumption while maintaining high performance. The proposed method integrates transformation-level reuse through a shared transformation cache and adaptive parallel execution using Apache Spark, enabling efficient resource utilization and elimination of redundant computations. The proposed GETL removes unnecessary calculations and reduces both execution time and energy consumption, without requiring any modifications to the underlying data processing engines. To evaluate the effectiveness of the proposed GETL, experiments were conducted using the Transaction Processing Council Data Integration (TPC-DI) benchmark across multiple scale factors. The results demonstrate that the proposed approach achieves an average energy reduction of approximately 30%, with higher savings observed under large-scale workloads. In addition, GETL improves execution efficiency and reduces resource utilization compared to a traditional Spark-based ETL implementation.


Document

The PDF file did not load properly or your web browser does not support viewing PDF files. Download directly to your device: Download PDF document
Back to Top
GET PDF

Document information

Published on 08/06/26
Accepted on 08/06/26
Submitted on 07/06/26

Volume Online First, 2026
DOI: 10.23967/j.rimni.2026.10.82763
Licence: CC BY-NC-SA license

Document Score

0

Views 18
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?