A robust methodology for long-term sustainability evaluation of Machine Learning models

arXiv — cs.LGWednesday, November 12, 2025 at 5:00:00 AM
The development of a robust methodology for assessing the long-term sustainability of machine learning models marks a significant advancement in the field of artificial intelligence. Current evaluation practices primarily focus on short-term resource usage, leading to a lack of standardized protocols that can accurately reflect the real-world lifecycles of AI systems. The proposed comprehensive evaluation protocol aims to address these gaps by applying to both batch and streaming learning scenarios. Experiments conducted on various classification tasks reveal that traditional static evaluations fail to capture the sustainability of models as they evolve and undergo repeated updates. The findings indicate that long-term sustainability varies significantly across different models, with many exhibiting higher environmental costs that yield little performance benefit. This research is essential for guiding future AI development towards more sustainable practices.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended Readings
Advanced Torrential Loss Function for Precipitation Forecasting
PositiveArtificial Intelligence
Accurate precipitation forecasting is increasingly crucial due to climate change. Recent machine learning approaches have emerged as alternatives to traditional methods like numerical weather prediction. However, many of these methods still use standard loss functions, which may not perform well during prolonged dry spells when precipitation is below the threshold. To overcome this issue, a new advanced torrential (AT) loss function is introduced, formulated as a quadratic unconstrained binary optimization (QUBO), which aims to enhance forecasting accuracy.
Fairness for the People, by the People: Minority Collective Action
PositiveArtificial Intelligence
Machine learning models often reflect biases found in their training data, resulting in unfair treatment of minority groups. While various bias mitigation techniques exist, they typically involve utility costs and require organizational support. This article introduces the concept of Algorithmic Collective Action, where end-users from minority groups can collaboratively relabel their data to promote fairness without changing the firm's training process. Three model-agnostic methods for effective relabeling are proposed and validated on real-world datasets, demonstrating that a minority subgroup can significantly reduce unfairness with minimal impact on prediction error.
Adaptive Detection of Software Aging under Workload Shift
PositiveArtificial Intelligence
Software aging is a phenomenon that affects long-running systems, resulting in gradual performance degradation and an increased risk of failures. To address this issue, a new adaptive approach utilizing machine learning for software aging detection in dynamic workload environments has been proposed. This study compares static models with adaptive models, specifically the Drift Detection Method (DDM) and Adaptive Windowing (ADWIN). Experiments demonstrate that while static models experience significant performance drops with unseen workloads, the adaptive model with ADWIN maintains high accuracy, achieving an F1-Score above 0.93.
SemanticNN: Compressive and Error-Resilient Semantic Offloading for Extremely Weak Devices
PositiveArtificial Intelligence
The article presents SemanticNN, a novel semantic codec designed for extremely weak embedded devices in the Internet of Things (IoT). It addresses the challenges of integrating artificial intelligence (AI) on such devices, which often face resource limitations and unreliable network conditions. SemanticNN focuses on achieving semantic-level correctness despite bit-level errors, utilizing a Bit Error Rate (BER)-aware decoder and a Soft Quantization (SQ)-based encoder to enhance collaborative inference offloading.
MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos
PositiveArtificial Intelligence
The article presents MADiff, a novel method for predicting hand trajectories in egocentric videos using diffusion models. This approach aims to enhance the understanding of human intentions and actions, which is crucial for advancements in embodied artificial intelligence. The challenges of capturing high-level human intentions and the effects of camera egomotion interference are addressed, making this method significant for applications in extended reality and robot manipulation.
Power Ensemble Aggregation for Improved Extreme Event AI Prediction
PositiveArtificial Intelligence
The paper titled 'Power Ensemble Aggregation for Improved Extreme Event AI Prediction' addresses the challenge of predicting climate extreme events, particularly heat waves, using machine learning. It frames the prediction as a classification problem, aiming to determine if surface air temperature will exceed a specific local quantile. The study finds that using a power mean to aggregate ensemble predictions significantly enhances prediction accuracy compared to traditional methods, especially for higher quantile thresholds.
Optical Echo State Network Reservoir Computing
PositiveArtificial Intelligence
A new design for an optical Echo State Network (ESN) has been proposed, enhancing reservoir computing capabilities. This innovative architecture allows for flexible optical matrix multiplication and nonlinear activation, utilizing the nonlinear properties of stimulated Brillouin scattering (SBS). The approach promises reduced computational overhead and energy consumption compared to traditional methods, with simulations demonstrating strong memory capacity and processing capabilities, making it suitable for various machine learning applications.
Do AI Voices Learn Social Nuances? A Case of Politeness and Speech Rate
PositiveArtificial Intelligence
A recent study published on arXiv investigates whether advanced text-to-speech systems can learn social nuances, specifically the human tendency to slow speech for politeness. Researchers tested 22 synthetic voices from AI Studio and OpenAI under polite and casual conditions, finding that the polite prompts resulted in significantly slower speech across both platforms. This suggests that AI can internalize and replicate subtle psychological cues in human communication.