Time-To-Inconsistency: A Survival Analysis of Large Language Model Robustness to Adversarial Attacks
- A recent study titled 'Time-To-Inconsistency' presents a large-scale survival analysis of Large Language Model (LLM) robustness to adversarial attacks, examining 36,951 dialogue turns across nine state-of-the-art models. The research finds that abrupt semantic shifts in prompts significantly raise the risk of an inconsistent response, while cumulative semantic shifts may have a protective effect, suggesting adaptive conversational dynamics (see the survival-analysis sketch after this list).
- This work matters because it deepens the understanding of LLM behavior in multi-turn dialogues, which is essential for improving the reliability and safety of conversational AI systems. The findings suggest that LLMs can adapt to certain conversational shifts, which could translate into more robust real-world deployments.
- The study also highlights ongoing challenges in keeping LLMs consistent and reliable under adversarial inputs. It reflects a broader discourse on the limitations of current methods for detecting malicious inputs, the need for better evaluation frameworks, and the importance of aligning LLM outputs with human perceptions, as researchers continue to explore ways to improve the performance and safety of these models.
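The survival framing described above can be illustrated with a Cox proportional-hazards model, where the "event" is the first inconsistent turn in a dialogue and semantic-shift measures enter as covariates. The sketch below is a minimal illustration, not the paper's actual pipeline: the column names, toy data, and use of lifelines' CoxPHFitter are assumptions made for clarity.

```python
# Minimal sketch of time-to-inconsistency as a Cox proportional-hazards problem.
# All column names and data are hypothetical; the paper's exact covariates and
# modeling choices are not reproduced here.
import pandas as pd
from lifelines import CoxPHFitter

# Hypothetical per-dialogue records: number of turns until the first
# inconsistency (or censoring), plus prompt semantic-shift covariates.
df = pd.DataFrame({
    "turns_to_inconsistency": [4, 12, 7, 20, 3, 15, 9, 18],
    "inconsistency_observed": [1, 0, 1, 0, 1, 1, 1, 0],   # 0 = censored (no inconsistency seen)
    "abrupt_shift":           [0.9, 0.1, 0.7, 0.2, 0.8, 0.4, 0.3, 0.5],
    "cumulative_shift":       [1.2, 3.5, 1.8, 4.0, 0.9, 2.7, 2.1, 3.1],
})

cph = CoxPHFitter()
cph.fit(df, duration_col="turns_to_inconsistency", event_col="inconsistency_observed")

# A hazard ratio > 1 for abrupt_shift would mean abrupt semantic shifts shorten the
# time to inconsistency; a ratio < 1 for cumulative_shift would be consistent with
# the protective effect the study reports.
cph.print_summary()
```

Under this framing, the reported findings would correspond to an estimated hazard ratio above 1 for the abrupt-shift covariate and below 1 for the cumulative-shift covariate.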
— via World Pulse Now AI Editorial System