RoCoISLR: A Romanian Corpus for Isolated Sign Language Recognition

arXiv — cs.LGTuesday, November 18, 2025 at 5:00:00 AM
  • The introduction of RoCoISLR marks a significant advancement in Romanian Isolated Sign Language Recognition, providing a comprehensive dataset that includes over 9,000 video samples and nearly 6,000 glosses. This initiative addresses the existing gap in resources for Romanian sign language, which has hindered research and development in the field.
  • The development of RoCoISLR is crucial as it enhances the potential for automatic sign language recognition, facilitating better communication between deaf and hearing communities. The benchmark results from various models indicate a promising direction for future research and applications in sign language technology.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended Readings
Temporal Realism Evaluation of Generated Videos Using Compressed-Domain Motion Vectors
PositiveArtificial Intelligence
The paper discusses the evaluation of temporal realism in generative video models, highlighting a significant limitation in current metrics that focus primarily on spatial appearance. A new framework is introduced that utilizes motion vectors extracted from compressed video streams to assess temporal behavior. By analyzing Kullback-Leibler, Jensen-Shannon, and Wasserstein divergences between real and generated videos, the study identifies discrepancies in motion dynamics, with specific models showing varying degrees of realism.
Real-Time Sign Language to text Translation using Deep Learning: A Comparative study of LSTM and 3D CNN
PositiveArtificial Intelligence
This study investigates the performance of 3D Convolutional Neural Networks (3D CNNs) and Long Short-Term Memory (LSTM) networks for real-time American Sign Language (ASL) recognition. The evaluation is based on a dataset of 1,200 ASL signs across 50 classes, focusing on accuracy, computational efficiency, and latency. Results show that 3D CNNs achieve 92.4% recognition accuracy but require more processing time per frame compared to LSTMs, which maintain 86.7% accuracy with lower resource consumption. A hybrid model demonstrates decent performance, highlighting the importance of architecture s…