arXiv:2511.02587v1 Announce Type: new 
Abstract: The research explores error analysis in the performance of translating by Machine Translation from English into Romanian, and it focuses on lexical errors found in texts which include official information, provided by the World Health Organization (WHO), the Gavi Organization, by the patient information leaflet (the information about the active ingredients of the vaccines or the medication, the indications, the dosage instructions, the storage instructions, the side effects and warning, etc.). All of these texts are related to Covid-19 and have been translated by Google Translate, a multilingual Machine Translation that was created by Google. In the last decades, Google has actively worked to develop a more accurate and fluent automatic translation system. This research, specifically focused on improving Google Translate, aims to enhance the overall quality of Machine Translation by achieving better lexical selection and by reducing errors. The investigation involves a comprehensive analysis of 230 texts that have been translated from English into Romanian.

تستكشف هذه الدراسة تحليل الأخطاء المعجمية في الترجمة الآلية من الإنجليزية إلى الرومانية. تركز على التحديات التي تواجه ترجمة النصوص الرسمية، خاصة تلك المتعلقة بمعلومات الصحة المقدمة من منظمات مثل منظمة الصحة العالمية وجافي.

Esta investigación analiza los errores léxicos en la traducción automática del inglés al rumano. Se centra en los desafíos de traducir textos oficiales, especialmente aquellos relacionados con la información de salud proporcionada por organizaciones como la Organización Mundial de la Salud y Gavi.

Cette recherche se penche sur l'analyse des erreurs lexicales dans la traduction automatique de l'anglais vers le roumain. Elle met en lumière les défis rencontrés lors de la traduction de textes officiels, en particulier ceux liés aux informations sanitaires fournies par des organisations telles que l'Organisation mondiale de la santé et Gavi.

This research delves into the analysis of lexical errors in machine translation from English to Romanian. It highlights the challenges faced in translating official texts, particularly those related to health information provided by organizations like the World Health Organization and Gavi.

The Analysis of Lexical Errors in Machine Translation from English into Romanian

According to recent research, the average financial advisor in the United States is in their late 50s, and around 40% are expected to retire within the next decade.

تحدث Iron Tree Financial ضجة من خلال إنشاء ممارسة تدعم ليس فقط المستشارين الماليين الذين يقتربون من التقاعد، ولكن أيضًا تضمن الحفاظ على ثقة عملائهم. مع توقع تقاعد نسبة كبيرة من المستشارين في الولايات المتحدة قريبًا، فإن هذه المبادرة ضرورية للحفاظ على استمرارية الخدمات المالية. من خلال التركيز على احتياجات المستشارين المتقاعدين وعملائهم، تضع Iron Tree مثالًا إيجابيًا في الصناعة.

Iron Tree Financial está causando sensación al crear una práctica que no solo apoya a los asesores financieros que se acercan a la jubilación, sino que también asegura que la confianza de sus clientes se mantenga. Con una parte significativa de los asesores en EE. UU. que se espera que se jubilen pronto, esta iniciativa es crucial para mantener la continuidad en los servicios financieros. Al centrarse tanto en las necesidades de los asesores que se retiran como en las de sus clientes, Iron Tree está estableciendo un ejemplo positivo en la industria.

Iron Tree Financial fait sensation en créant une pratique qui soutient non seulement les conseillers financiers proches de la retraite, mais qui garantit également que la confiance de leurs clients est préservée. Avec une part importante des conseillers aux États-Unis prévue pour prendre leur retraite bientôt, cette initiative est cruciale pour maintenir la continuité des services financiers. En se concentrant à la fois sur les besoins des conseillers partant à la retraite et de leurs clients, Iron Tree donne un exemple positif dans l'industrie.

Iron Tree Financial is making waves by creating a practice that not only supports financial advisors nearing retirement but also ensures that their clients' trust is preserved. With a significant portion of advisors in the U.S. expected to retire soon, this initiative is crucial for maintaining continuity in financial services. By focusing on both the needs of retiring advisors and their clients, Iron Tree is setting a positive example in the industry.

How Iron Tree Financial Built a Practice That Helps Financial Advisors Retire While Preserving Client Trust

Pinterest CEO Bill Ready says open source AI is offering cost savings to the company, particularly in visual search.

سلط الرئيس التنفيذي لشركة بينتيريست، بيل ريدي، الضوء على الفوائد الكبيرة للذكاء الاصطناعي مفتوح المصدر، وخاصة قدرته على تحسين البحث المرئي مع تقليل التكاليف للشركة. هذا التطور مهم لأنه لا يحسن تجربة المستخدم فحسب، بل يضع بينتيريست أيضًا في موقع تنافسي في مشهد التكنولوجيا، مما يظهر كيف يمكن أن تؤدي الحلول المبتكرة إلى الكفاءة والتوفير.

El CEO de Pinterest, Bill Ready, ha destacado los beneficios significativos de la IA de código abierto, especialmente su capacidad para mejorar la búsqueda visual mientras reduce los costos para la empresa. Este desarrollo es crucial, ya que no solo mejora la experiencia del usuario, sino que también posiciona a Pinterest de manera competitiva en el panorama tecnológico, demostrando cómo las soluciones innovadoras pueden llevar a la eficiencia y al ahorro.

Le PDG de Pinterest, Bill Ready, a souligné les avantages significatifs de l'IA open source, en particulier sa capacité à améliorer la recherche visuelle tout en réduisant les coûts pour l'entreprise. Ce développement est crucial car il améliore non seulement l'expérience utilisateur, mais positionne également Pinterest de manière compétitive dans le paysage technologique, montrant comment des solutions innovantes peuvent conduire à l'efficacité et aux économies.

Pinterest CEO Bill Ready has highlighted the significant benefits of open source AI, particularly its ability to enhance visual search while also reducing costs for the company. This development is crucial as it not only improves user experience but also positions Pinterest competitively in the tech landscape, showcasing how innovative solutions can lead to efficiency and savings.

Pinterest CEO touts open source AI: ‘tremendous performance’ with reduced costs

According to Oxford Economics, TikTok contributed $24.2 billion to U.S. GDP in 2023, supporting over 224,000 jobs. For small businesses and content creators—constituencies Republicans claim to champion—the platform has become essential infrastructure.

في عام 2023، كان لتطبيق تيك توك تأثير كبير على الاقتصاد الأمريكي، حيث ساهم بمبلغ 24.2 مليار دولار في الناتج المحلي الإجمالي ودعم أكثر من 224,000 وظيفة، وفقًا لاقتصاد أكسفورد. يبرز هذا أهمية المنصة للشركات الصغيرة وصانعي المحتوى، وهي مجموعات يدعي الجمهوريون أنهم يدعمونها. بينما يتجادل السياسيون التقليديون حول مستقبل المنصة، يمهد جون مكينتي الطريق لنموذج جديد من التأثير يعترف بدور تيك توك كجزء أساسي من البنية التحتية للنمو الاقتصادي.

En 2023, TikTok tuvo un impacto significativo en la economía de EE. UU., contribuyendo con 24.2 mil millones de dólares al PIB y apoyando a más de 224,000 empleos, según Oxford Economics. Esto resalta la importancia de la plataforma para las pequeñas empresas y los creadores de contenido, grupos que los republicanos suelen afirmar apoyar. Mientras los políticos establecidos debaten sobre el futuro de la plataforma, John McEntee está allanando el camino para un nuevo modelo de influencia que reconoce el papel de TikTok como infraestructura esencial para el crecimiento económico.

En 2023, TikTok a eu un impact significatif sur l'économie américaine, contribuant à hauteur de 24,2 milliards de dollars au PIB et soutenant plus de 224 000 emplois, selon Oxford Economics. Cela souligne l'importance de la plateforme pour les petites entreprises et les créateurs de contenu, des groupes que les républicains affirment souvent soutenir. Alors que les politiciens établis débattent de l'avenir de la plateforme, John McEntee ouvre la voie à un nouveau modèle d'influence qui reconnaît le rôle de TikTok comme une infrastructure essentielle pour la croissance économique.

In 2023, TikTok made a significant impact on the U.S. economy, contributing $24.2 billion to the GDP and supporting over 224,000 jobs, according to Oxford Economics. This highlights the platform's importance for small businesses and content creators, groups that Republicans often claim to support. While establishment politicians debate the platform's future, John McEntee is paving the way for a new model of influence that recognizes TikTok's role as essential infrastructure for economic growth.

While Establishment Politicians Squabbled Over TikTok, John McEntee Built a New Model for Influence

بدأت شركة فيتبيت عروض الجمعة السوداء مبكرًا، حيث تقدم أحد أشهر أجهزة تتبع اللياقة البدنية بسعر رائع يبلغ 100 دولار. هذه أخبار رائعة لعشاق اللياقة البدنية الذين يتطلعون إلى ترقية معداتهم أو بدء رحلتهم الصحية دون إنفاق الكثير. مع ميزات تساعد في تتبع التمارين ومعدل ضربات القلب وأنماط النوم، تجعل هذه الصفقة من السهل على الجميع البقاء متحفزين وصحيين.

Fitbit ha comenzado sus ofertas de Black Friday temprano, ofreciendo uno de sus rastreadores de fitness más populares a un precio fantástico de 100 $. Esta es una gran noticia para los entusiastas del fitness que buscan actualizar su equipo o comenzar su viaje de salud sin gastar demasiado. Con características que ayudan a rastrear entrenamientos, frecuencia cardíaca y patrones de sueño, esta oferta facilita que todos se mantengan motivados y saludables.

Fitbit a lancé ses offres de Black Friday en avance, proposant l'un de ses trackers de fitness les plus populaires à un prix fantastique de 100 $. C'est une excellente nouvelle pour les passionnés de fitness qui cherchent à améliorer leur équipement ou à commencer leur parcours de santé sans se ruiner. Avec des fonctionnalités qui aident à suivre les entraînements, le rythme cardiaque et les habitudes de sommeil, cette offre facilite la motivation et le maintien d'une bonne santé.

Fitbit has kicked off its Black Friday deals early, offering one of its most popular fitness trackers at a fantastic price of $100. This is great news for fitness enthusiasts looking to upgrade their gear or start their health journey without breaking the bank. With features that help track workouts, heart rate, and sleep patterns, this deal makes it easier for everyone to stay motivated and healthy.

Fitbit Black Friday deals are here early and one of our favorite fitness trackers is on sale for $100

Khan's appointment sends a message to the tech industry, whose most powerful players have already been critical of Mamdani, a Democratic socialist.

تعيين لينا خان لرئاسة فريق الانتقال لعمدة نيويورك المنتخب زوهرا مامداني هو خطوة مهمة تشير إلى تغيير في نهج الحكومة في المدينة، خاصة فيما يتعلق بصناعة التكنولوجيا. خان، المعروفة بموقفها النقدي تجاه الشركات التكنولوجية الكبرى، تجلب منظورًا جديدًا يمكن أن يعيد تشكيل السياسات لصالح المساءلة والعدالة. هذه الانتقالة مهمة لأنها تعكس قيم جيل جديد من القيادة وقد تؤثر على كيفية عمل عمالقة التكنولوجيا في المدينة.

La designación de Lina Khan como copresidenta del equipo de transición del alcalde electo de Nueva York, Zohran Mamdani, es un movimiento significativo que señala un cambio en el enfoque de la gobernanza de la ciudad, especialmente en lo que respecta a la industria tecnológica. Khan, conocida por su postura crítica hacia las grandes empresas tecnológicas, aporta una nueva perspectiva que podría remodelar las políticas en favor de la responsabilidad y la equidad. Esta transición es importante ya que refleja los valores de una nueva generación de liderazgo y podría influir en cómo operan los gigantes tecnológicos en la ciudad.

La nomination de Lina Khan pour coprésider l'équipe de transition du maire élu de NYC, Zohran Mamdani, est un mouvement significatif qui signale un changement dans l'approche de la gouvernance de la ville, en particulier en ce qui concerne l'industrie technologique. Khan, connue pour sa position critique envers les grandes entreprises technologiques, apporte une nouvelle perspective qui pourrait remodeler les politiques en faveur de la responsabilité et de l'équité. Cette transition est importante car elle reflète les valeurs d'une nouvelle génération de dirigeants et pourrait influencer le fonctionnement des géants de la technologie dans la ville.

Lina Khan's appointment to co-chair the transition team for NYC mayor-elect Zohran Mamdani is a significant move that signals a shift in the city's approach to governance, particularly regarding the tech industry. Khan, known for her critical stance on major tech companies, brings a fresh perspective that could reshape policies in favor of accountability and fairness. This transition is important as it reflects the values of a new generation of leadership and could influence how tech giants operate in the city.

Lina Khan to co-chair NYC mayor-elect Zohran Mamdani’s transition team

<a href="https://petapixel.com/2025/11/05/motorolas-edge-70-doesnt-discard-cameras-despite-ultra-thin-chassis/"><img width="1600" height="840" src="https://petapixel.com/assets/uploads/2025/11/Motorola-Edge-70-PetaPixel.jpg" class="attachment-card-large size-card-large wp-post-image" alt="Four Motorola smartphones are displayed on a curved metallic surface; three show their stylish rear designs in green, blue, and greenish-gray, while one shows its front with a portrait on the screen." decoding="async" fetchpriority="high" /></a>Motorola has unveiled the Motorola Edge 70, its thinnest smartphone yet, measuring just 0.24 of an inch (5.99 millimeters) thick. The device headlines a new generation of ultra-thin handsets that aim to balance durability, performance, and advanced camera technology without sacrificing design.
[<a href="https://petapixel.com/2025/11/05/motorolas-edge-70-doesnt-discard-cameras-despite-ultra-thin-chassis/">Read More</a>]

يُعجب أحدث إصدار من موتورولا، Edge 70، بتصميمه النحيف للغاية مع الحفاظ على كاميرات عالية الجودة. إن هذا التوازن بين الأناقة والوظائف مهم لأنه يُظهر التزام موتورولا بالابتكار في تكنولوجيا الهواتف الذكية، مما يجذب المستهلكين الذين يفضلون كل من الجماليات والأداء. قد يُحدد Edge 70 معيارًا جديدًا للأجهزة المستقبلية، مما يجعله تطورًا مثيرًا في عالم التكنولوجيا.

El último lanzamiento de Motorola, el Edge 70, impresiona con su diseño ultra delgado mientras mantiene cámaras de alta calidad. Este equilibrio entre estilo y funcionalidad es significativo, ya que muestra el compromiso de Motorola con la innovación en la tecnología de teléfonos inteligentes, atrayendo a consumidores que priorizan tanto la estética como el rendimiento. El Edge 70 podría establecer un nuevo estándar para futuros dispositivos, convirtiéndolo en un desarrollo emocionante en el mundo tecnológico.

Le dernier modèle de Motorola, l'Edge 70, impressionne par son design ultra-fin tout en intégrant des caméras de haute qualité. Cet équilibre entre style et fonctionnalité est important car il démontre l'engagement de Motorola envers l'innovation dans la technologie des smartphones, attirant les consommateurs qui privilégient à la fois l'esthétique et la performance. L'Edge 70 pourrait établir une nouvelle norme pour les futurs appareils, ce qui en fait un développement passionnant dans le monde de la technologie.

Motorola's latest release, the Edge 70, impresses with its ultra-thin design while still incorporating high-quality cameras. This balance of style and functionality is significant as it showcases Motorola's commitment to innovation in smartphone technology, appealing to consumers who prioritize both aesthetics and performance. The Edge 70 could set a new standard for future devices, making it an exciting development in the tech world.

Motorola’s Edge 70 Doesn’t Discard Cameras Despite Ultra-Thin Chassis

Vaccines train the body's defenses to fight infections safely. Learn how vaccines work, how the immune system responds, and the science behind lasting immunity.

تلعب اللقاحات دورًا حاسمًا في تدريب دفاعات جسمنا على محاربة العدوى بشكل آمن وفعال. يتناول هذا المقال كيفية عمل اللقاحات، واستجابة الجهاز المناعي، والابتكارات العلمية التي تسهم في مناعة دائمة. فهم هذه المفاهيم أمر حيوي لأنه يبرز أهمية التطعيم في الصحة العامة والوقاية من الأمراض.

Las vacunas desempeñan un papel crucial en el entrenamiento de las defensas de nuestro cuerpo para combatir infecciones de manera segura y efectiva. Este artículo profundiza en cómo funcionan las vacunas, la respuesta del sistema inmunológico y los avances científicos que contribuyen a una inmunidad duradera. Comprender estos conceptos es vital, ya que resalta la importancia de la vacunación en la salud pública y la prevención de enfermedades.

Les vaccins jouent un rôle crucial en entraînant les défenses de notre corps à combattre les infections de manière sûre et efficace. Cet article explore le fonctionnement des vaccins, la réponse du système immunitaire et les avancées scientifiques qui contribuent à une immunité durable. Comprendre ces concepts est essentiel car cela souligne l'importance de la vaccination dans la santé publique et la prévention des maladies.

Vaccines play a crucial role in training our body's defenses to combat infections safely and effectively. This article delves into how vaccines function, the immune system's response, and the scientific breakthroughs that contribute to lasting immunity. Understanding these concepts is vital as it highlights the importance of vaccination in public health and disease prevention.

How Vaccines Work: Exploring the Immune System and Breakthroughs in Vaccine Science

arXiv:2511.01090v1 Announce Type: new 
Abstract: Large Language Models (LLMs) have recently exploded in popularity, often matching or outperforming human abilities on many tasks. One of the key factors in training LLMs is the availability and curation of high-quality data. Data quality is especially crucial for under-represented languages, where high-quality corpora are scarce. In this work we study the characteristics and coverage of Romanian pretraining corpora and we examine how they differ from English data. By training a lightweight multitask model on carefully LLM-annotated Romanian texts, we are able to analyze and perform multi-level filtering (e.g., educational value, topic, format) to generate high-quality pretraining datasets. Our experiments show noteworthy trends in the topics present in Romanian and English data, while also proving the effectiveness of filtering data through improved LLM pretraining performance across multiple benchmarks.

تسلط دراسة حديثة الضوء على أهمية تحسين جودة وتنوع بيانات التدريب المسبق لنماذج اللغة الكبيرة (LLMs) الرومانية. مع تزايد شعبية LLMs على مستوى العالم، فإن ضمان حصول اللغات الممثلة تمثيلاً ناقصًا مثل الرومانية على بيانات عالية الجودة أمر بالغ الأهمية لتطويرها. لا تسلط هذه الأبحاث الضوء فقط على الحالة الحالية لمجموعات البيانات الرومانية، بل تؤكد أيضًا على الحاجة إلى تحسين تنسيق البيانات، مما قد يعزز أداء LLMs في تطبيقات متنوعة. هذه خطوة مهمة نحو جعل تقنيات اللغة المتقدمة أكثر شمولاً.

Un estudio reciente destaca la importancia de mejorar la calidad y diversidad de los datos de preentrenamiento para los modelos de lenguaje (LLMs) rumanos. A medida que los LLMs ganan popularidad a nivel mundial, asegurar que lenguas subrepresentadas como el rumano tengan acceso a datos de alta calidad es crucial para su desarrollo. Esta investigación no solo arroja luz sobre el estado actual de los corpus rumanos, sino que también enfatiza la necesidad de una mejor curación de datos, lo que podría mejorar el rendimiento de los LLMs en diversas aplicaciones. Este es un paso significativo hacia la creación de tecnologías lingüísticas avanzadas más inclusivas.

Une étude récente souligne l'importance d'améliorer la qualité et la diversité des données de préentraînement pour les modèles de langage (LLMs) roumains. Alors que les LLMs gagnent en popularité dans le monde entier, il est crucial de garantir que des langues sous-représentées comme le roumain aient accès à des données de haute qualité. Cette recherche met en lumière l'état actuel des corpus roumains et souligne la nécessité d'une meilleure curation des données, ce qui pourrait améliorer les performances des LLMs dans diverses applications. C'est une étape significative vers la création de technologies linguistiques avancées plus inclusives.

A recent study highlights the importance of improving the quality and diversity of pretraining data for Romanian large language models (LLMs). As LLMs gain traction globally, ensuring that under-represented languages like Romanian have access to high-quality data is crucial for their development. This research not only sheds light on the current state of Romanian corpora but also emphasizes the need for better data curation, which could enhance the performance of LLMs in various applications. This is a significant step towards making advanced language technologies more inclusive.

Improving Romanian LLM Pretraining Data using Diversity and Quality Filtering

arXiv:2511.01854v2 Announce Type: replace 
Abstract: Recent advances in LLM Multi-Agent Systems enable scalable orchestration of sub-agents, each coordinating hundreds or thousands of tools or Model Context Protocol (MCP) servers. However, existing retrieval methods typically match queries against coarse agent-level descriptions before routing, which obscures fine-grained tool functionality and often results in suboptimal agent selection. We introduce Tool-to-Agent Retrieval, a unified framework that embeds both tools and their parent agents in a shared vector space and connects them through metadata relationships. By explicitly representing tool capabilities and traversing metadata to the agent level, Tool-to-Agent Retrieval enables granular tool-level or agent-level retrieval, ensuring that agents and their underlying tools or MCP servers are equally represented without the context dilution that arises from chunking many tools together. Evaluating Tool-to-Agent Retrieval across eight embedding models, our approach achieves consistent improvements of 19.4% in Recall@5 and 17.7% in nDCG@5 over previous state-of-the-art agent retrievers on the LiveMCPBench benchmark.

تساعد التطورات الحديثة في أنظمة الوكلاء المتعددة LLM على إدارة العديد من الأدوات والوكلاء الفرعيين بشكل فعال. تهدف تقنية استرجاع الأداة إلى الوكيل إلى تحسين اختيار الوكلاء من خلال توفير فهم أوضح لوظائف الأدوات، مما يؤدي إلى تحسين التنسيق والأداء.

Los avances recientes en los sistemas multiagente LLM están facilitando la gestión efectiva de numerosas herramientas y subagentes. La introducción de la recuperación de herramienta a agente tiene como objetivo mejorar la selección de agentes al proporcionar una comprensión más clara de las funcionalidades de las herramientas, lo que lleva a una mejor orquestación y un rendimiento mejorado.

Les avancées récentes dans les systèmes multi-agents LLM facilitent la gestion efficace de nombreux outils et sous-agents. L'introduction de la récupération outil-agent vise à améliorer la sélection des agents en fournissant une compréhension plus claire des fonctionnalités des outils, ce qui conduit à une meilleure orchestration et à de meilleures performances.

Recent advancements in LLM Multi-Agent Systems are making it easier to manage numerous tools and sub-agents effectively. The introduction of Tool-to-Agent Retrieval aims to enhance agent selection by providing a clearer understanding of tool functionalities, leading to better orchestration and improved performance.

Tool-to-Agent Retrieval: Bridging Tools and Agents for Scalable LLM Multi-Agent Systems

arXiv:2509.15207v3 Announce Type: replace 
Abstract: We propose FlowRL: matching the full reward distribution via flow balancing instead of maximizing rewards in large language model (LLM) reinforcement learning (RL). Recent advanced reasoning models adopt reward-maximizing methods (\eg, PPO and GRPO), which tend to over-optimize dominant reward signals while neglecting less frequent but valid reasoning paths, thus reducing diversity. In contrast, we transform scalar rewards into a normalized target distribution using a learnable partition function, and then minimize the reverse KL divergence between the policy and the target distribution. We implement this idea as a flow-balanced optimization method that promotes diverse exploration and generalizable reasoning trajectories. We conduct experiments on math and code reasoning tasks: FlowRL achieves a significant average improvement of $10.0\%$ over GRPO and $5.1\%$ over PPO on math benchmarks, and performs consistently better on code reasoning tasks. These results highlight reward distribution-matching as a key step toward efficient exploration and diverse reasoning in LLM reinforcement learning.

تقدم FlowRL نهجًا جديدًا للتعلم المعزز لنماذج اللغة الكبيرة من خلال مطابقة توزيعات المكافآت عبر توازن التدفق. تعالج هذه الطريقة قيود تقنيات تعظيم المكافآت التقليدية، التي غالبًا ما تتجاهل مسارات التفكير الأقل تكرارًا ولكنها صالحة، مما يعزز التنوع في استجابات النموذج.

FlowRL presenta un enfoque novedoso para el aprendizaje por refuerzo en modelos de lenguaje grandes al igualar las distribuciones de recompensa mediante el equilibrio de flujos. Este método aborda las limitaciones de las técnicas tradicionales de maximización de recompensas, que a menudo pasan por alto caminos de razonamiento menos frecuentes pero válidos, mejorando así la diversidad en las respuestas del modelo.

FlowRL propose une nouvelle approche de l'apprentissage par renforcement pour les grands modèles de langage en faisant correspondre les distributions de récompense grâce à l'équilibrage des flux. Cette méthode répond aux limites des techniques traditionnelles de maximisation des récompenses, qui négligent souvent des chemins de raisonnement moins fréquents mais valides, améliorant ainsi la diversité des réponses du modèle.

FlowRL introduces a novel approach to reinforcement learning for large language models by matching reward distributions through flow balancing. This method addresses the limitations of traditional reward-maximizing techniques, which often overlook less frequent but valid reasoning paths, ultimately enhancing diversity in model responses.

FlowRL: Matching Reward Distributions for LLM Reasoning

arXiv:2511.02805v1 Announce Type: new 
Abstract: Typical search agents concatenate the entire interaction history into the LLM context, preserving information integrity but producing long, noisy contexts, resulting in high computation and memory costs. In contrast, using only the current turn avoids this overhead but discards essential information. This trade-off limits the scalability of search agents. To address this challenge, we propose MemSearcher, an agent workflow that iteratively maintains a compact memory and combines the current turn with it. At each turn, MemSearcher fuses the user's question with the memory to generate reasoning traces, perform search actions, and update memory to retain only information essential for solving the task. This design stabilizes context length across multi-turn interactions, improving efficiency without sacrificing accuracy. To optimize this workflow, we introduce multi-context GRPO, an end-to-end RL framework that jointly optimize reasoning, search strategies, and memory management of MemSearcher Agents. Specifically, multi-context GRPO samples groups of trajectories under different contexts and propagates trajectory-level advantages across all conversations within them. Trained on the same dataset as Search-R1, MemSearcher achieves significant improvements over strong baselines on seven public benchmarks: +11% on Qwen2.5-3B-Instruct and +12% on Qwen2.5-7B-Instruct relative average gains. Notably, the 3B-based MemSearcher even outperforms 7B-based baselines, demonstrating that striking a balance between information integrity and efficiency yields both higher accuracy and lower computational overhead. The code and models will be publicly available at https://github.com/icip-cas/MemSearcher

MemSearcher هو نهج مبتكر يعزز كفاءة وكلاء البحث من خلال إدارة الذاكرة عبر التعلم المعزز من النهاية إلى النهاية. على عكس الطرق التقليدية التي تواجه صعوبة مع السياقات الطويلة، يقوم MemSearcher بتحسين تاريخ التفاعل، موازنًا بين الاحتفاظ بالمعلومات وتكاليف الحوسبة. يعد هذا التدفق العملي بتحسين القابلية للتوسع والأداء في مهام البحث.

MemSearcher es un enfoque innovador que mejora la eficiencia de los agentes de búsqueda al gestionar la memoria mediante el aprendizaje por refuerzo de extremo a extremo. A diferencia de los métodos tradicionales que luchan con contextos largos, MemSearcher optimiza el historial de interacciones, equilibrando la retención de información y los costos computacionales. Este flujo de trabajo promete mejorar la escalabilidad y el rendimiento en tareas de búsqueda.

MemSearcher est une approche révolutionnaire qui améliore l'efficacité des agents de recherche en gérant la mémoire grâce à l'apprentissage par renforcement de bout en bout. Contrairement aux méthodes traditionnelles qui ont du mal avec de longs contextes, MemSearcher optimise l'historique des interactions, équilibrant la rétention d'informations et les coûts computationnels. Ce flux de travail innovant promet d'améliorer l'évolutivité et les performances dans les tâches de recherche.

MemSearcher is a groundbreaking approach that enhances the efficiency of search agents by managing memory through end-to-end reinforcement learning. Unlike traditional methods that struggle with long contexts, MemSearcher optimizes the interaction history, balancing information retention and computational costs. This innovative workflow promises to improve scalability and performance in search tasks.

The Analysis of Lexical Errors in Machine Translation from English into Romanian

The Analysis of Lexical Errors in Machine Translation from English into Romanian

Was this article worth reading? Share it