MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment

arXiv — cs.CVMonday, November 24, 2025 at 5:00:00 AM
  • A new framework called MCMoE has been proposed to address the challenges of Multimodal Action Quality Assessment (AQA), particularly when certain modalities are missing during inference. This framework integrates unimodal and joint representation learning through a single-stage training process, utilizing an adaptive gated modality generator to reconstruct absent modalities.
  • The introduction of MCMoE is significant as it enhances the robustness of multimodal models, enabling them to maintain performance even in the absence of complete data. This advancement could lead to improved applications in fields requiring nuanced action recognition, such as robotics and surveillance.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
A Survey of Generative Categories and Techniques in Multimodal Generative Models
NeutralArtificial Intelligence
A comprehensive survey on Multimodal Generative Models (MGMs) has been published, detailing their evolution from text generation to various output modalities such as images, music, and video. The study categorizes six primary generative modalities and discusses foundational techniques like Self-Supervised Learning and Chain-of-Thought prompting that enable cross-modal capabilities.