Functional Localization Enforced Deep Anomaly Detection Using Fundus Images

arXiv — cs.LGTuesday, November 25, 2025 at 5:00:00 AM
  • A recent study has demonstrated the effectiveness of a Vision Transformer (ViT) classifier in detecting retinal diseases from fundus images, achieving accuracies between 0.789 and 0.843 across various datasets, including the newly developed AEyeDB. The study highlights the challenges posed by imaging quality and subtle disease manifestations, particularly in diabetic retinopathy and age-related macular degeneration, while noting glaucoma as a frequently misclassified condition.
  • This advancement is significant as it enhances the reliability of early detection methods for retinal diseases, which are critical for preventing vision loss. The consistent performance of the ViT classifier across heterogeneous datasets underscores its potential utility in clinical settings, providing a robust tool for ophthalmologists and researchers in the field of medical imaging.
  • The findings reflect a growing trend in the application of advanced machine learning techniques, such as Vision Transformers, across various medical domains, including brain aging and pneumonia detection. This shift towards integrating sophisticated AI models aims to improve diagnostic accuracy and reduce subjectivity in medical assessments, addressing longstanding challenges in healthcare technology.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
NOVAK: Unified adaptive optimizer for deep neural networks
PositiveArtificial Intelligence
The recent introduction of NOVAK, a unified adaptive optimizer for deep neural networks, combines several advanced techniques including adaptive moment estimation and lookahead synchronization, aiming to enhance the performance and efficiency of neural network training.
Out-of-distribution generalization of deep-learning surrogates for 2D PDE-generated dynamics in the small-data regime
NeutralArtificial Intelligence
A recent study published on arXiv investigates the out-of-distribution generalization capabilities of deep-learning surrogates for two-dimensional partial differential equation (PDE) dynamics, particularly under small-data conditions. The research introduces a multi-channel U-Net architecture and evaluates its performance against various models, including ViT and PDE-Transformer, across different PDE families.
Knowledge-based learning in Text-RAG and Image-RAG
NeutralArtificial Intelligence
A recent study analyzed the multi-modal approach in the Vision Transformer (EVA-ViT) image encoder combined with LlaMA and ChatGPT large language models (LLMs) to address hallucination issues and enhance disease detection in chest X-ray images. The research utilized the NIH Chest X-ray dataset, comparing image-based and text-based retrieval-augmented generation (RAG) methods, revealing that text-based RAG effectively mitigates hallucinations while image-based RAG improves prediction confidence.
Temporal-Enhanced Interpretable Multi-Modal Prognosis and Risk Stratification Framework for Diabetic Retinopathy (TIMM-ProRS)
PositiveArtificial Intelligence
A novel deep learning framework named TIMM-ProRS has been introduced to enhance the prognosis and risk stratification of diabetic retinopathy (DR), a condition that threatens the vision of millions worldwide. This framework integrates Vision Transformer, Convolutional Neural Network, and Graph Neural Network technologies, utilizing both retinal images and temporal biomarkers to achieve a high accuracy rate of 97.8% across multiple datasets.
Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge
PositiveArtificial Intelligence
A novel Price-Incentive Mechanism (PRINCE) has been proposed to enhance Multi-Tenant Split Federated Learning (SFL) for Foundation Models (FMs) like GPT-4, enabling efficient fine-tuning on resource-constrained devices while maintaining privacy. This mechanism addresses the coordination challenges faced by multiple SFL tenants with diverse fine-tuning needs.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about