ReFineG: Synergizing Small Supervised Models and LLMs for Low-Resource Grounded Multimodal NER
The paper 'ReFineG: Synergizing Small Supervised Models and LLMs for Low-Resource Grounded Multimodal NER' introduces a three-stage framework for improving Grounded Multimodal Named Entity Recognition (GMNER) in low-resource settings. Existing methods depend on costly multimodal annotations and can underperform in specific domains. ReFineG addresses both problems by combining small supervised models with frozen multimodal large language models (MLLMs). The framework includes a domain-aware data synthesis strategy, an uncertainty-based refinement mechanism, and a multimodal context selection algorithm. The approach is intended to improve the accuracy of entity recognition while also supporting effective visual grounding. ReFineG placed second in the CCKS2025 GMNER Shared Task with an F1 score of 0.6461, which suggests potential for practical applications in the field.
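The summary does not say how ReFineG measures uncertainty or where it sets the cutoff. As a minimal sketch of what an uncertainty-based refinement step could look like, the example below assumes the small supervised model emits per-label probabilities, uses Shannon entropy as the uncertainty score, and applies a made-up threshold of 0.5. The function names, the threshold, and the example predictions are all hypothetical and are not taken from the paper.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def route_predictions(predictions, threshold=0.5):
    """Split small-model predictions into confident and uncertain ones.

    `predictions` is a list of (span, {label: probability}) pairs.
    The threshold is an illustrative assumption, not a value from the paper.
    """
    kept, to_refine = [], []
    for span, label_probs in predictions:
        if entropy(label_probs.values()) > threshold:
            # High uncertainty: hand the span to the frozen MLLM for refinement.
            to_refine.append(span)
        else:
            # Low uncertainty: keep the small model's most probable label.
            best_label = max(label_probs, key=label_probs.get)
            kept.append((span, best_label))
    return kept, to_refine


# Hypothetical small-model outputs for two entity spans.
predictions = [
    ("Paris", {"LOC": 0.97, "PER": 0.02, "ORG": 0.01}),
    ("Jordan", {"LOC": 0.40, "PER": 0.45, "ORG": 0.15}),
]
kept, to_refine = route_predictions(predictions)
```

Under these assumptions, the confidently labeled span stays with the small model and only the ambiguous span is forwarded, so the more expensive MLLM is called only where the supervised model is unsure.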
— via World Pulse Now AI Editorial System
