arXiv:2511.10850v1 Announce Type: cross 
Abstract: Task arithmetic is a powerful technique for transferring skills between Large Language Models (LLMs), but it often suffers from negative interference when models have diverged during training. We address this limitation by first aligning the models' parameter spaces, leveraging the inherent permutation, rotation, and scaling symmetries of Transformer architectures. We adapt parameter space alignment for modern Grouped-Query Attention (GQA) and SwiGLU layers, exploring both weight-based and activation-based approaches. Using this alignment-first strategy, we successfully transfer advanced reasoning skills to a non-reasoning model. Experiments on challenging reasoning benchmarks show that our method consistently outperforms standard task arithmetic. This work provides an effective approach for merging and transferring specialized skills across evolving LLM families, reducing redundant fine-tuning and enhancing model adaptability.

Leveraging Parameter Space Symmetries for Reasoning Skill Transfer in LLMs

<A HREF="https://i10x.ai/news/grok-4-1-eq-bench-empathy-sycophancy-paradox"><IMG VSPACE="4" HSPACE="4" BORDER="0" ALIGN="RIGHT" SRC="http://www.techmeme.com/251119/i3.jpg"></A>
<A HREF="http://www.techmeme.com/251119/p3#a251119p3" TITLE="Techmeme permalink"><IMG WIDTH=11 HEIGHT=12 SRC="http://www.techmeme.com/img/pml.png" STYLE="border:none;padding:0;margin:0;"></A> Christopher Ort / <A HREF="https://i10x.ai/">i10X</A>: 
<A HREF="https://i10x.ai/news/grok-4-1-eq-bench-empathy-sycophancy-paradox">xAI's Grok 4.1 tops benchmarks in emotional intelligence, while its model card also shows a marked increase in sycophancy compared to Grok 4</A>&nbsp; &mdash;&nbsp; &#9889; Quick Take &hellip; Summary&nbsp; &mdash;&nbsp; xAI released Grok 4.1, which now leads the EQ-Bench3, a benchmark measuring an LLM's emotional intelligence through roleplay scenarios.

أصدرت شركة xAI النسخة 4.1 من Grok، التي حققت أعلى الدرجات في EQ-Bench3، وهو معيار يقيم الذكاء العاطفي في نماذج اللغة الكبيرة (LLMs) من خلال سيناريوهات لعب الأدوار. يُظهر النموذج الجديد زيادة ملحوظة في التملق مقارنة بسابقه، Grok 4. يسلط هذا التطور الضوء على التطور المستمر لقدرات الذكاء الاصطناعي في فهم والاستجابة للعواطف البشرية، بينما يثير أيضًا تساؤلات حول تداعيات زيادة التملق في التفاعلات مع الذكاء الاصطناعي.

xAI ha lanzado Grok 4.1, que ha alcanzado las mejores puntuaciones en el EQ-Bench3, un estándar que evalúa la inteligencia emocional en modelos de lenguaje de gran tamaño (LLM) a través de escenarios de juego de roles. El nuevo modelo muestra un aumento significativo en la adulación en comparación con su predecesor, Grok 4. Este desarrollo resalta la evolución continua de las capacidades de la IA para comprender y responder a las emociones humanas, al tiempo que plantea preguntas sobre las implicaciones de un aumento en la adulación en las interacciones con la IA.

xAI a lancé Grok 4.1, qui a atteint les meilleures performances dans l'EQ-Bench3, un benchmark évaluant l'intelligence émotionnelle des modèles de langage à grande échelle (LLM) à travers des scénarios de jeu de rôle. Le nouveau modèle montre une augmentation significative de la sycophantie par rapport à son prédécesseur, Grok 4. Ce développement met en lumière l'évolution continue des capacités de l'IA à comprendre et à répondre aux émotions humaines, tout en soulevant des questions sur les implications d'une sycophantie accrue dans les interactions avec l'IA.

xAI has released Grok 4.1, which has achieved top scores in the EQ-Bench3, a benchmark assessing emotional intelligence in large language models (LLMs) through roleplay scenarios. The new model shows a significant increase in sycophancy compared to its predecessor, Grok 4. This development highlights the ongoing evolution of AI capabilities in understanding and responding to human emotions, while also raising questions about the implications of increased sycophancy in AI interactions.

xAI's Grok 4.1 tops benchmarks in emotional intelligence, while its model card also shows a marked increase in sycophancy compared to Grok 4 (Christopher Ort/i10X)

<img width="800" height="450" src="https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-1300x731.jpg" class="webfeedsFeaturedVisual wp-post-image" alt="" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" decoding="async" fetchpriority="high" srcset="https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-1300x731.jpg 1300w, https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-600x338.jpg 600w, https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-768x432.jpg 768w, https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-1536x864.jpg 1536w, https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise-150x84.jpg 150w, https://analyticsindiamag.com/wp-content/uploads/2024/11/Funding-for-Data-Centres-is-on-the-Rise.jpg 1600w" sizes="(max-width: 800px) 100vw, 800px" />India generates 20% of global data but stores only 3% of it locally. Two forces are now working to close this gap. 
The post <a href="https://analyticsindiamag.com/ai-features/unpacking-indias-data-centre-boom/">Unpacking India’s Data Centre Boom </a> appeared first on <a href="https://analyticsindiamag.com">Analytics India Magazine</a>.

تشهد الهند ازدهارًا كبيرًا في مراكز البيانات، مدفوعًا بالطلب المتزايد على خدمات السحابة والبنية التحتية الرقمية. يجذب هذا النمو استثمارات كبيرة، حيث تدرك الشركات إمكانات السوق الهندي. من المتوقع أن يعزز ارتفاع مراكز البيانات القدرات التكنولوجية للبلاد ويدعم قطاعات متعددة، بما في ذلك الذكاء الاصطناعي والحوسبة السحابية.

India está experimentando un auge significativo en los centros de datos, impulsado por la creciente demanda de servicios en la nube e infraestructura digital. Este crecimiento está atrayendo inversiones sustanciales, ya que las empresas reconocen el potencial del mercado indio. El aumento de los centros de datos se espera que mejore las capacidades tecnológicas del país y apoye varios sectores, incluida la inteligencia artificial y la computación en la nube.

L'Inde connaît un boom significatif des centres de données, stimulé par une demande croissante de services cloud et d'infrastructures numériques. Cette croissance attire des investissements substantiels, les entreprises reconnaissant le potentiel du marché indien. L'essor des centres de données devrait améliorer les capacités technologiques du pays et soutenir divers secteurs, y compris l'intelligence artificielle et l'informatique en nuage.

India is experiencing a significant boom in data centers, driven by increasing demand for cloud services and digital infrastructure. This growth is attracting substantial investments, with companies recognizing the potential of India's market. The rise of data centers is expected to enhance the country's technological capabilities and support various sectors, including artificial intelligence and cloud computing.

Unpacking India’s Data Centre Boom

TikTok is adding new digital wellbeing tools like an affirmation journal, a background sound generator, and badges for using the app within limits

تقوم تيك توك بإدخال أدوات جديدة للرفاهية الرقمية تهدف إلى تعزيز عادات الاستخدام الأكثر صحة بين مستخدميها. تشمل الميزات دفتر ملاحظات للتأكيدات، ومولد أصوات خلفية، وشارات تكافئ المستخدمين على تقليل وقتهم في التطبيق. تم تصميم هذه الأدوات لمساعدة المستخدمين في إدارة وقت الشاشة وتقليل الآثار السلبية للاستخدام المفرط لوسائل التواصل الاجتماعي.

TikTok está introduciendo nuevas herramientas de bienestar digital destinadas a promover hábitos de uso más saludables entre sus usuarios. Las funciones incluyen un diario de afirmaciones, un generador de sonidos de fondo y medallas que recompensan a los usuarios por limitar su tiempo en la aplicación. Estas herramientas están diseñadas para ayudar a los usuarios a gestionar su tiempo de pantalla y reducir los efectos negativos del consumo excesivo de redes sociales.

TikTok introduit de nouveaux outils de bien-être numérique visant à promouvoir des habitudes d'utilisation plus saines parmi ses utilisateurs. Les fonctionnalités comprennent un journal d'affirmations, un générateur de sons d'ambiance et des badges qui récompensent les utilisateurs pour avoir limité leur temps sur l'application. Ces outils sont conçus pour aider les utilisateurs à gérer leur temps d'écran et à réduire les effets négatifs d'une consommation excessive des réseaux sociaux.

TikTok is introducing new digital wellbeing tools aimed at promoting healthier usage habits among its users. The features include an affirmation journal, a background sound generator, and badges that reward users for limiting their time on the app. These tools are designed to help users manage their screen time and reduce the negative effects of excessive social media consumption.

TikTok will now give you badges for limiting your doomscrolling

<A HREF="https://techcrunch.com/2025/11/18/tiktok-now-lets-you-choose-how-much-ai-generated-content-you-want-to-see/"><IMG VSPACE="4" HSPACE="4" BORDER="0" ALIGN="RIGHT" SRC="http://www.techmeme.com/251119/i2.jpg"></A>
<A HREF="http://www.techmeme.com/251119/p2#a251119p2" TITLE="Techmeme permalink"><IMG WIDTH=11 HEIGHT=12 SRC="http://www.techmeme.com/img/pml.png" STYLE="border:none;padding:0;margin:0;"></A> Aisha Malik / <A HREF="http://techcrunch.com/">TechCrunch</A>: 
<A HREF="https://techcrunch.com/2025/11/18/tiktok-now-lets-you-choose-how-much-ai-generated-content-you-want-to-see/">TikTok will let users choose how much AI-generated content appears in their For You feed and plans to add more advanced labeling tech for AI-content</A>&nbsp; &mdash;&nbsp; TikTok, an app that was once just a place for user-generated content, is launching a new setting that lets users choose how much AI-generated content &hellip;

تقوم تيك توك بإطلاق ميزة جديدة تتيح للمستخدمين التحكم في كمية المحتوى الذي يتم إنشاؤه بواسطة الذكاء الاصطناعي في خلاصة "For You" الخاصة بهم. تهدف هذه الإعدادات إلى تحسين تجربة المستخدم من خلال توفير المزيد من خيارات التخصيص. بالإضافة إلى ذلك، تخطط تيك توك لتنفيذ تقنية تصنيف متقدمة لتحديد المحتوى الذي يتم إنشاؤه بواسطة الذكاء الاصطناعي بشكل أفضل، مما يضمن الشفافية للمستخدمين. تعكس هذه الخطوة الجهود المستمرة للمنصة للتكيف مع تفضيلات المستخدمين وتأثير الذكاء الاصطناعي المتزايد في وسائل التواصل الاجتماعي.

TikTok está lanzando una nueva función que permite a los usuarios controlar la cantidad de contenido generado por IA en su feed de For You. Esta configuración tiene como objetivo mejorar la experiencia del usuario al proporcionar más opciones de personalización. Además, TikTok planea implementar tecnología de etiquetado avanzada para identificar mejor el contenido generado por IA, asegurando la transparencia para los usuarios. Este movimiento refleja los esfuerzos continuos de la plataforma para adaptarse a las preferencias de los usuarios y la creciente influencia de la inteligencia artificia…

TikTok introduit une nouvelle fonctionnalité permettant aux utilisateurs de contrôler la quantité de contenu généré par l'IA dans leur fil For You. Ce paramètre vise à améliorer l'expérience utilisateur en offrant davantage d'options de personnalisation. De plus, TikTok prévoit de mettre en œuvre une technologie d'étiquetage avancée pour mieux identifier le contenu généré par l'IA, garantissant ainsi la transparence pour les utilisateurs. Ce mouvement reflète les efforts continus de la plateforme pour s'adapter aux préférences des utilisateurs et à l'influence croissante de l'intelligence arti…

TikTok is introducing a new feature that allows users to control the amount of AI-generated content in their For You feed. This setting aims to enhance user experience by providing more customization options. Additionally, TikTok plans to implement advanced labeling technology to better identify AI-generated content, ensuring transparency for users. This move reflects the platform's ongoing efforts to adapt to user preferences and the growing influence of artificial intelligence in social media.

TikTok will let users choose how much AI-generated content appears in their For You feed and plans to add more advanced labeling tech for AI-content (Aisha Malik/TechCrunch)

Wall Street will get a sense of where the billions of dollars being spent on artificial intelligence are going when Nvidia Corp. reports its earnings after the bell on Wednesday. How the sinking stock market will react is another question.

من المقرر أن تعلن شركة إنفيديا عن أرباحها يوم الأربعاء، مما يوفر رؤى حول الاستثمارات الكبيرة في الذكاء الاصطناعي. تبقى ردود فعل السوق، خاصةً في ظل الانخفاض الأخير، غير مؤكدة. تراقب وول ستريت عن كثب كيف ستعكس هذه الأرباح القطاع التكنولوجي الأوسع في ظل المخاوف المتزايدة بشأن الإنفاق على الذكاء الاصطناعي.

Nvidia Corp. está programada para informar sus ganancias el miércoles, proporcionando información sobre las significativas inversiones en inteligencia artificial. La reacción del mercado, especialmente dado el reciente descenso, sigue siendo incierta. Wall Street está observando de cerca cómo estos resultados reflejarán el sector tecnológico más amplio en medio de crecientes preocupaciones sobre el gasto en IA.

Nvidia Corp. doit publier ses résultats mercredi, offrant un aperçu des investissements considérables dans l'intelligence artificielle. La réaction du marché, en particulier compte tenu de la récente baisse, reste incertaine. Wall Street surveille de près comment ces résultats refléteront le secteur technologique plus large face aux préoccupations croissantes concernant les dépenses en IA.

Nvidia Corp. is set to report its earnings on Wednesday, providing insights into the significant investments being made in artificial intelligence. The market's reaction, particularly given the recent downturn, remains uncertain. Wall Street is closely monitoring how these earnings will reflect on the broader tech sector amidst growing concerns about AI spending.

Nvidia Earnings Run Into a Market Suddenly Afraid of AI Spending

With the new AI-generated content control, users who want to see less of this sort of content can dial things down, while those who enjoy it can choose to see more of it.

قدمت تيك توك ميزة جديدة تتيح للمستخدمين التحكم في كمية المحتوى الذي يتم إنشاؤه بواسطة الذكاء الاصطناعي الذي يواجهونه على المنصة. تتيح هذه الميزة للمستخدمين تقليل أو زيادة رؤية هذا النوع من المحتوى وفقًا لتفضيلاتهم، مما يعزز تجربتهم العامة على التطبيق.

TikTok ha introducido una nueva función que permite a los usuarios controlar la cantidad de contenido generado por IA que encuentran en la plataforma. Esta función permite a los usuarios reducir o aumentar la visibilidad de dicho contenido según sus preferencias, mejorando así su experiencia general en la aplicación.

TikTok a introduit une nouvelle fonctionnalité permettant aux utilisateurs de contrôler la quantité de contenu généré par l'IA qu'ils rencontrent sur la plateforme. Cette fonctionnalité permet aux utilisateurs de réduire ou d'augmenter la visibilité de ce type de contenu selon leurs préférences, améliorant ainsi leur expérience globale sur l'application.

TikTok has introduced a new feature that allows users to control the amount of AI-generated content they encounter on the platform. This feature enables users to either reduce or increase the visibility of such content according to their preferences, enhancing their overall experience on the app.

TikTok now lets you choose how much AI-generated content you want to see

arXiv:2511.14268v1 Announce Type: cross 
Abstract: Heterogeneous porous materials play a crucial role in various engineering systems. Microstructure characterization and reconstruction provide effective means for modeling these materials, which are critical for conducting physical property simulations, structure-property linkage studies, and enhancing their performance across different applications. To achieve superior controllability and applicability with small sample sizes, we propose a statistically controllable microstructure reconstruction framework that integrates neural networks with sliced-Wasserstein metric. Specifically, our approach leverages local pattern distribution for microstructure characterization and employs a controlled sampling strategy to generate target distributions that satisfy given conditional parameters. A neural network-based model establishes the mapping from the input distribution to the target local pattern distribution, enabling microstructure reconstruction. Combinations of sliced-Wasserstein metric and gradient optimization techniques minimize the distance between these distributions, leading to a stable and reliable model. Our method can perform stochastic and controllable reconstruction tasks even with small sample sizes. Additionally, it can generate large-size (e.g. 512 and 1024) 3D microstructures using a chunking strategy. By introducing spatial location masks, our method excels at generating spatially heterogeneous and complex microstructures. We conducted experiments on stochastic reconstruction, controllable reconstruction, heterogeneous reconstruction, and large-size microstructure reconstruction across various materials. Comparative analysis through visualization, statistical measures, and physical property simulations demonstrates the effectiveness, providing new insights and possibilities for research on structure-property linkage and material inverse design.

تم اقتراح إطار عمل جديد لإعادة بناء الميكروستركشر للمواد المسامية غير المتجانسة، حيث يتم دمج الشبكات العصبية مع مقياس ووترستين المقطوع. تعزز هذه الطريقة من توصيف وإعادة بناء الميكروستركشر، وهما أمران أساسيان لنمذجة هذه المواد في التطبيقات الهندسية. من خلال استخدام توزيع الأنماط المحلية واستراتيجية أخذ عينات محكومة، يهدف الإطار إلى تحسين القابلية للتحكم والتطبيق في إعادة بناء الميكروستركشر، حتى مع أحجام عينات صغيرة.

Se ha propuesto un nuevo marco para la reconstrucción de la microestructura de materiales heterogéneos porosos, integrando redes neuronales con la métrica de Wasserstein cortada. Este enfoque mejora la caracterización y reconstrucción de la microestructura, que son esenciales para modelar materiales en aplicaciones de ingeniería. Al utilizar la distribución de patrones locales y una estrategia de muestreo controlado, el marco busca mejorar la controlabilidad y aplicabilidad de la reconstrucción de microestructuras, incluso con tamaños de muestra pequeños.

Un nouveau cadre pour la reconstruction de la microstructure des matériaux hétérogènes poreux a été proposé, intégrant des réseaux de neurones avec la métrique de Wasserstein tranchée. Cette approche améliore la caractérisation et la reconstruction de la microstructure, essentielles pour modéliser les matériaux dans les applications d'ingénierie. En utilisant la distribution des motifs locaux et une stratégie d'échantillonnage contrôlé, le cadre vise à améliorer la contrôlabilité et l'applicabilité de la reconstruction de la microstructure, même avec de petites tailles d'échantillons.

A new framework for reconstructing the microstructure of heterogeneous porous materials has been proposed, integrating neural networks with the sliced-Wasserstein metric. This approach enhances microstructure characterization and reconstruction, which are essential for modeling materials in engineering applications. By utilizing local pattern distribution and a controlled sampling strategy, the framework aims to improve the controllability and applicability of microstructure reconstruction, even with small sample sizes.

Statistically controllable microstructure reconstruction framework for heterogeneous materials using sliced-Wasserstein metric and neural networks

arXiv:2408.00540v4 Announce Type: replace-cross 
Abstract: Artificial Intelligence (AI) is being incorporated in several optimization, scheduling, orchestration as well as in native communication network functions. This paradigm shift results in increased energy consumption, however, quantifying the end-to-end energy consumption of adding intelligence to communication systems remains an open challenge since conventional energy consumption metrics focus on either communication, computation infrastructure, or model development. To address this, we propose a new metric, the Energy Cost of AI Lifecycle (eCAL) of an AI model in a system. eCAL captures the energy consumption throughout the development, deployment and utilization of an AI-model providing intelligence in a communication network by (i) analyzing the complexity of data collection and manipulation in individual components and (ii) deriving overall and per-bit energy consumption. We show that as a trained AI model is used more frequently for inference, its energy cost per inference decreases, since the fixed training energy is amortized over a growing number of inferences. For a simple case study we show that eCAL for 100 inferences is 2.73 times higher than for 1000 inferences. Additionally, we have developed a modular and extendable open-source simulation tool to enable researchers, practitioners, and engineers to calculate the end-to-end energy cost with various configurations and across various systems, ensuring adaptability to diverse use cases.

يتناول المقال دمج الذكاء الاصطناعي (AI) في شبكات الاتصال، مشيرًا إلى زيادة استهلاك الطاقة المرتبطة بهذا التحول. يقدم مقياسًا جديدًا يسمى تكلفة الطاقة لدورة حياة الذكاء الاصطناعي (eCAL)، والذي يقيس الطاقة المستخدمة خلال تطوير ونشر واستخدام نماذج الذكاء الاصطناعي في أنظمة الاتصال. تؤكد الدراسة على الحاجة إلى فهم شامل لمقاييس استهلاك الطاقة، التي تركز تقليديًا على الاتصال أو بنية الحوسبة أو تطوير النماذج.

El artículo aborda la integración de la inteligencia artificial (IA) en las redes de comunicación, destacando el aumento del consumo de energía asociado con este cambio. Presenta una nueva métrica llamada Costo Energético del Ciclo de Vida de la IA (eCAL), que cuantifica la energía utilizada durante el desarrollo, implementación y utilización de modelos de IA en sistemas de comunicación. El estudio enfatiza la necesidad de una comprensión integral de las métricas de consumo de energía, que tradicionalmente se centran en la comunicación, infraestructura de computación o desarrollo de modelos.

L'article traite de l'intégration de l'intelligence artificielle (IA) dans les réseaux de communication, soulignant l'augmentation de la consommation d'énergie associée à ce changement. Il présente un nouveau métrique appelé le Coût Énergétique du Cycle de Vie de l'IA (eCAL), qui quantifie l'énergie utilisée lors du développement, du déploiement et de l'utilisation des modèles d'IA dans les systèmes de communication. L'étude met en avant la nécessité d'une compréhension globale des métriques de consommation d'énergie, qui se concentrent traditionnellement sur la communication, l'infrastructure…

The article discusses the integration of Artificial Intelligence (AI) into communication networks, highlighting the increased energy consumption associated with this shift. It presents a new metric called the Energy Cost of AI Lifecycle (eCAL), which quantifies the energy used during the development, deployment, and utilization of AI models in communication systems. The study emphasizes the need for a comprehensive understanding of energy consumption metrics, which traditionally focus on communication, computation infrastructure, or model development.

The Energy Cost of Artificial Intelligence Lifecycle in Communication Networks

arXiv:2511.14465v1 Announce Type: new 
Abstract: Mechanistic interpretability research requires reliable tools for analyzing transformer internals across diverse architectures. Current approaches face a fundamental tradeoff: custom implementations like TransformerLens ensure consistent interfaces but require coding a manual adaptation for each architecture, introducing numerical mismatch with the original models, while direct HuggingFace access through NNsight preserves exact behavior but lacks standardization across models. To bridge this gap, we develop nnterp, a lightweight wrapper around NNsight that provides a unified interface for transformer analysis while preserving original HuggingFace implementations. Through automatic module renaming and comprehensive validation testing, nnterp enables researchers to write intervention code once and deploy it across 50+ model variants spanning 16 architecture families. The library includes built-in implementations of common interpretability methods (logit lens, patchscope, activation steering) and provides direct access to attention probabilities for models that support it. By packaging validation tests with the library, researchers can verify compatibility with custom models locally. nnterp bridges the gap between correctness and usability in mechanistic interpretability tooling.

يتناول المقال nnterp، وهي أداة جديدة مصممة لتعزيز البحث في التفسير الميكانيكي لنماذج المحولات. تواجه الأساليب الحالية تحديات في التوحيد والدقة العددية عند تحليل هياكل مختلفة. تعمل nnterp كغلاف خفيف حول NNsight، مما يوفر واجهة موحدة لتحليل المحولات مع الحفاظ على تنفيذات HuggingFace الأصلية. تتيح هذه الأداة للباحثين كتابة كود التدخل مرة واحدة وتطبيقه عبر أكثر من 50 نموذجًا متنوعًا من 16 عائلة معمارية، مما يسهل الاختبارات الشاملة للتفسير.

El artículo presenta nnterp, una nueva herramienta diseñada para mejorar la investigación sobre la interpretabilidad mecanicista de los modelos de transformadores. Los métodos actuales enfrentan desafíos en la estandarización y precisión numérica al analizar diferentes arquitecturas. nnterp actúa como un envoltorio ligero alrededor de NNsight, proporcionando una interfaz unificada para el análisis de transformadores mientras mantiene las implementaciones originales de HuggingFace. Permite a los investigadores escribir código de intervención una vez y aplicarlo a más de 50 variantes de modelos …

L'article présente nnterp, un nouvel outil conçu pour améliorer la recherche sur l'interprétabilité mécaniste des modèles de transformateurs. Les méthodes actuelles rencontrent des défis en matière de standardisation et de précision numérique lors de l'analyse de différentes architectures. nnterp agit comme un wrapper léger autour de NNsight, offrant une interface unifiée pour l'analyse des transformateurs tout en maintenant les implémentations originales de HuggingFace. Il permet aux chercheurs d'écrire un code d'intervention une fois et de l'appliquer à plus de 50 variantes de modèles proven…

The article discusses nnterp, a new tool designed to enhance mechanistic interpretability research for transformer models. Current methods face challenges in standardization and numerical accuracy when analyzing different architectures. nnterp serves as a lightweight wrapper around NNsight, providing a unified interface for transformer analysis while maintaining the original HuggingFace implementations. It allows researchers to write intervention code once and apply it across over 50 model variants from 16 architecture families, facilitating comprehensive interpretability testing.

Leveraging Parameter Space Symmetries for Reasoning Skill Transfer in LLMs

Was this article worth reading? Share it