FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement

arXiv — cs.LG•Wednesday, November 26, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

FunReason has been introduced as a novel framework aimed at enhancing the function calling capabilities of large language models (LLMs) through an automated data refinement strategy and a Self-Refinement Multiscale Loss (SRML) approach. This development addresses the challenges of integrating reasoning processes with accurate function execution, which has been a significant hurdle in optimizing LLM performance in real-world applications.
The introduction of FunReason is significant as it leverages the inherent reasoning abilities of LLMs to generate high-quality training examples, thereby improving query parseability, reasoning coherence, and function call precision. This advancement could lead to more effective applications of LLMs in various domains, enhancing their practical utility and reliability.
The evolution of LLMs is marked by ongoing efforts to refine their reasoning capabilities and function execution accuracy. Recent studies have explored various methodologies, including selective self-generated calibration for pruning models and frameworks for evaluating derivation capabilities. These developments reflect a broader trend in AI research focused on optimizing LLMs for complex reasoning tasks and integrating them with external tools to enhance problem-solving capabilities.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Kansei

Practice and improve your language skills with personalized AI conversations.

AI & DataTry the app

One More Thing in AI

Master AI with curated tools and tutorials for practical, real-world applications.

AI & DataTry the app

Langtail

Build and deploy robust LLM applications quickly with your team.

Business & ProductivityTry the app

Continue Readings

Tech Monitor20 hours ago

Look to the human brain for a glimpse of AI’s future

PositiveArtificial Intelligence

Recent discussions highlight the potential of the human brain as a low-power model for the future of artificial intelligence (AI), particularly in the development of large language models (LLMs). This perspective shifts the focus from AI's traditionally high energy demands to a more sustainable approach inspired by biological systems.

Read full article

via Tech Monitor

arXiv — cs.CLa day ago

MindEval: Benchmarking Language Models on Multi-turn Mental Health Support

NeutralArtificial Intelligence

The introduction of MindEval marks a significant advancement in the evaluation of language models for multi-turn mental health support, addressing the limitations of current AI chatbots that often reinforce maladaptive beliefs. Developed in collaboration with Ph.D-level Licensed Clinical Psychologists, this framework aims to enhance the realism of simulated therapeutic conversations through automated evaluation methods.

Read full article

via arXiv — cs.CL

arXiv — cs.CVa day ago

VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning

PositiveArtificial Intelligence

The introduction of VideoChat-M1 represents a significant advancement in video understanding through a novel multi-agent system that employs Collaborative Policy Planning (CPP). This system allows multiple agents to generate, execute, and communicate unique tool invocation policies tailored to user queries, enhancing the exploration of complex video content.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation

PositiveArtificial Intelligence

A new method called SpLap has been introduced for proxy-free deformation of Gaussian splats, utilizing a surface-aware splat graph to enhance the quality of deformations while minimizing computational overhead. This approach overcomes limitations of traditional methods that rely on proxies, which can be of varying quality and add complexity to the deformation process.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

While recognizing actions, LMMs struggle to detect core interaction events

NeutralArtificial Intelligence

Large multi-modal models (LMMs) have shown improved performance in visual tasks, particularly in analyzing video sequences. A recent study evaluated their ability to detect core interaction events, such as when hands contact or release objects, using a new dataset with over 20,000 annotated interactions from the Something-Something-V2 dataset.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs

PositiveArtificial Intelligence

A new study introduces V-Attack, a method designed to enhance controllability in adversarial attacks on Large Vision-Language Models (LVLMs) by targeting disentangled value features. This approach addresses the limitations of existing methods that struggle with precise semantic manipulation due to the entanglement of semantic information in patch-token representations.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

Zoo3D: Zero-Shot 3D Object Detection at Scene Level

PositiveArtificial Intelligence

Zoo3D has been introduced as the first training-free 3D object detection framework, enabling the construction of 3D bounding boxes through graph clustering of 2D instance masks. This innovative approach allows for the recognition of previously unseen objects without the need for extensive training, marking a significant advancement in 3D object detection technology.

Read full article

via arXiv — cs.CV

arXiv — cs.CLa day ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

PositiveArtificial Intelligence

The introduction of Sparse Sparse Attention (SSA) aims to enhance the efficiency of large language models (LLMs) by aligning outputs from both sparse and full attention mechanisms. This approach addresses the limitations of traditional sparse attention methods, which often suffer from performance degradation due to inadequate gradient updates during training. SSA proposes a unified framework that seeks to improve attention sparsity while maintaining model effectiveness.

Read full article

via arXiv — cs.CL