arXiv:2510.15269v2
Abstract: Medical texts, particularly electronic medical records (EMRs), are a cornerstone of modern healthcare, capturing critical information about patient care, diagnoses, and treatments. These texts hold immense potential for advancing clinical decision-making and healthcare analytics. However, their unstructured nature, domain-specific language, and variability across contexts make automated understanding an intricate challenge. Despite advances in natural language processing, existing methods often treat all data as equally challenging, ignoring the inherent differences in complexity across clinical records. This oversight limits the ability of models to generalize and to perform well on rare or complex cases. In this paper, we present TACL (Threshold-Adaptive Curriculum Learning), a novel framework designed to address these challenges by rethinking how models interact with medical texts during training. Inspired by the principle of progressive learning, TACL dynamically adjusts the training process based on the complexity of individual samples. By categorizing data into difficulty levels and prioritizing simpler cases early in training, the model builds a strong foundation before tackling more complex records. By applying TACL to multilingual medical data, including English and Chinese clinical records, we observe significant improvements across diverse clinical tasks, including automatic ICD coding, readmission prediction, and traditional Chinese medicine (TCM) syndrome differentiation. TACL not only enhances the performance of automated systems but also demonstrates the potential to unify approaches across disparate medical domains, paving the way for more accurate, scalable, and globally applicable medical text understanding solutions.
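
The easy-to-hard idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how difficulty is scored or how TACL adapts its thresholds, so the per-sample difficulty scores, the fixed thresholds, the linear schedule, and every function name below are hypothetical choices made only for illustration.

```python
from bisect import bisect_right
from typing import List, Sequence


def assign_levels(scores: Sequence[float], thresholds: Sequence[float]) -> List[int]:
    """Map each difficulty score to a level (0 = easiest, len(thresholds) = hardest).

    Assumption: a scalar difficulty score per sample already exists.
    """
    cuts = sorted(thresholds)
    return [bisect_right(cuts, s) for s in scores]


def max_level_for_epoch(epoch: int, num_epochs: int, num_levels: int) -> int:
    """Hypothetical linear schedule: admit harder levels as training progresses."""
    if num_epochs <= 1:
        return num_levels - 1
    progress = epoch / (num_epochs - 1)
    return min(num_levels - 1, int(progress * num_levels))


def curriculum_indices(
    levels: Sequence[int], epoch: int, num_epochs: int, num_levels: int
) -> List[int]:
    """Return indices of the samples eligible for training in the given epoch."""
    cap = max_level_for_epoch(epoch, num_epochs, num_levels)
    return [i for i, lvl in enumerate(levels) if lvl <= cap]


if __name__ == "__main__":
    # Toy difficulty scores; in practice these might come from record length,
    # label rarity, or model loss (all assumptions, not stated in the abstract).
    scores = [0.1, 0.5, 0.9, 0.3]
    levels = assign_levels(scores, thresholds=[0.33, 0.66])
    for epoch in range(3):
        print(epoch, curriculum_indices(levels, epoch, num_epochs=3, num_levels=3))
```

In this sketch the thresholds are fixed, whereas the framework's name suggests that TACL adjusts them during training; a threshold-adaptive variant would recompute the cut points between epochs.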

TACL (Threshold-Adaptive Curriculum Learning) is a framework for improving automated understanding of medical texts, particularly electronic medical records (EMRs). It dynamically adjusts the training process based on the complexity of individual samples, and the authors report significant improvements on tasks such as automatic ICD coding and readmission prediction.

TACL: Threshold-Adaptive Curriculum Learning Strategy for Enhancing Medical Text Understanding
