Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport

arXiv — cs.CV · Thursday, November 13, 2025 at 5:00:00 AM
A recent study on self-supervised procedure learning presents a framework that leverages fused Gromov-Wasserstein optimal transport to identify key steps and their ordering from unlabeled videos. Traditional methods often struggle with variations in step order, background frames, and repeated actions, which hinder their performance. To address these issues, the proposed framework integrates contrastive regularization to avoid degenerate solutions that map all frames to a single cluster. This not only improves temporal alignment but also yields superior performance in extensive experiments on egocentric and third-person benchmarks, outperforming previous approaches, including OPEL, which relied on classical Kantorovich optimal transport. The findings underscore the method's potential to advance video analysis and procedure learning, marking a significant step forward in the use of self-supervised learning techniques.
— via World Pulse Now AI Editorial System
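
For readers curious what such an alignment looks like in practice, below is a minimal sketch (not the paper's implementation) of matching frame features from two videos of the same procedure with fused Gromov-Wasserstein optimal transport, using the open-source POT library. The random feature arrays, the alpha trade-off value, and the omission of the paper's contrastive regularization term are all assumptions made for illustration.

```python
# Minimal sketch: aligning two videos' frame features with fused
# Gromov-Wasserstein optimal transport using the POT library.
# Illustrative only; the paper's contrastive regularization term and
# training loop are not reproduced here.
import numpy as np
import ot  # pip install pot

# Hypothetical frame embeddings for two videos of the same procedure
# (e.g., from a self-supervised video encoder).
rng = np.random.default_rng(0)
X1 = rng.normal(size=(40, 128))   # video 1: 40 frames, 128-dim features
X2 = rng.normal(size=(55, 128))   # video 2: 55 frames, 128-dim features

# Intra-video structure matrices (pairwise frame distances): the
# "Gromov" part compares these relational structures across videos.
C1 = ot.dist(X1, X1)
C2 = ot.dist(X2, X2)
C1 /= C1.max()
C2 /= C2.max()

# Cross-video feature cost: the "fused" part adds direct feature matching.
M = ot.dist(X1, X2)
M /= M.max()

# Uniform marginals over frames.
p = ot.unif(X1.shape[0])
q = ot.unif(X2.shape[0])

# Fused GW coupling: a soft correspondence between frames of the two
# videos. alpha balances feature cost (M) against structural cost (C1, C2).
T = ot.gromov.fused_gromov_wasserstein(
    M, C1, C2, p, q, loss_fun='square_loss', alpha=0.5
)

# For each frame in video 1, its most strongly matched frame in video 2.
matches = T.argmax(axis=1)
print(matches[:10])
```

A degenerate solution would collapse all frames onto a single cluster, which is exactly the failure mode the paper's contrastive regularization is described as preventing; the sketch above omits that term.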


Recommended Readings
OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation
Positive · Artificial Intelligence
The paper 'OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation' introduces a framework that addresses challenges faced by the Dual Diffusion Implicit Bridge (DDIB), particularly low translation efficiency and trajectory deviations caused by mismatched latent distributions. By aligning latent distributions with optimal transport, OT-ALD improves sampling efficiency by 20.29% and reduces the FID score by 2.6.
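
As a rough illustration of the general idea of aligning latent distributions with optimal transport (not OT-ALD's actual algorithm), the sketch below uses entropic OT from the POT library to push samples from a source latent distribution toward a target one via a barycentric projection; the Gaussian latents, the regularization value, and the mapping step are assumptions made for the example.

```python
# Minimal sketch: aligning a source latent distribution to a target one
# with entropic optimal transport (Sinkhorn) and a barycentric map.
# Illustrative of the general OT-alignment idea only, not OT-ALD itself.
import numpy as np
import ot  # pip install pot

rng = np.random.default_rng(0)
# Hypothetical latent samples from two models' latent spaces.
Z_src = rng.normal(loc=0.0, scale=1.0, size=(256, 64))
Z_tgt = rng.normal(loc=0.5, scale=1.2, size=(256, 64))

# Cost matrix and uniform marginals over the two sample sets.
M = ot.dist(Z_src, Z_tgt)
M /= M.max()
a = ot.unif(Z_src.shape[0])
b = ot.unif(Z_tgt.shape[0])

# Entropic OT coupling between source and target latents.
G = ot.sinkhorn(a, b, M, reg=0.05)

# Barycentric projection: move each source latent toward the target
# distribution using its row of the coupling as mixture weights.
Z_aligned = (G / G.sum(axis=1, keepdims=True)) @ Z_tgt
print(Z_aligned.shape)  # (256, 64)
```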