SURFing to the Fundamental Limit of Jet Tagging

arXiv (cs.LG) — Friday, November 21, 2025 at 5:00:00 AM
  • The SURF method introduces a novel framework for validating generative models in jet tagging by probing the performance limits of tagging algorithms. The approach uses generative surrogate models to conduct Neyman–Pearson hypothesis tests.
  • This development is significant because it suggests that jet tagging algorithms are approaching their optimal achievable performance, which could shape future research and applications in particle physics and machine learning.
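The "fundamental limit" invoked here follows from the Neyman–Pearson lemma: once the signal and background densities are known (or approximated by generative surrogates), thresholding their likelihood ratio is the best possible tagger at any fixed false-positive rate. A minimal sketch of that idea, using 1-D Gaussians as hypothetical stand-ins for the surrogate densities (the actual SURF models and data are not reproduced here):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 1-D stand-ins for the signal/background jet densities;
# in practice these would be learned generative surrogate models.
def log_density_signal(x):
    return -0.5 * (x - 1.0) ** 2  # unit Gaussian centred at +1 (up to a constant)

def log_density_background(x):
    return -0.5 * (x + 1.0) ** 2  # unit Gaussian centred at -1

# Neyman–Pearson: thresholding the (log) likelihood ratio yields the
# optimal classifier at every fixed false-positive rate.
def log_likelihood_ratio(x):
    return log_density_signal(x) - log_density_background(x)

# Evaluate the optimal tagger on samples drawn from each hypothesis.
signal = rng.normal(1.0, 1.0, 100_000)
background = rng.normal(-1.0, 1.0, 100_000)

threshold = 0.0  # sweeping this cut traces out the optimal ROC curve
tpr = np.mean(log_likelihood_ratio(signal) > threshold)
fpr = np.mean(log_likelihood_ratio(background) > threshold)
print(f"TPR={tpr:.3f}, FPR={fpr:.3f}")
```

With these symmetric Gaussians the log-likelihood ratio reduces to 2x, so the cut at 0 accepts roughly 84% of signal while accepting roughly 16% of background, and no other decision rule can beat that trade-off.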
— via World Pulse Now AI Editorial System


Continue Reading
Shape and Texture Recognition in Large Vision-Language Models
Neutral · Artificial Intelligence
The study introduces the Large Shape and Textures dataset (LAS&T), a comprehensive collection of diverse shapes and textures extracted from natural images. This dataset is utilized to evaluate the performance of leading Large Vision-Language Models (VLMs) in recognizing and representing shapes and textures in various contexts. Results indicate that VLMs still lag behind human capabilities in shape recognition, particularly when variations in orientation, texture, and color are present.
Staffordshire student confronts lecturer for using AI-generated slides – video
Negative · Artificial Intelligence
Students at the University of Staffordshire expressed feelings of being 'robbed' after discovering that a course intended to launch their digital careers was largely taught using AI-generated slides. In a recorded confrontation, student James challenged a lecturer about the reliance on AI for teaching a coding module, stating, 'I do not want to be taught by GPT.' The university has since posted a policy on its course website to justify the use of AI in academic settings.
Measuring the (Un)Faithfulness of Concept-Based Explanations
Neutral · Artificial Intelligence
Deep vision models are complex systems that perform computations difficult to interpret. Concept-based explanation methods (CBEMs) aim to enhance interpretability by using human-understandable concepts. However, ensuring the faithfulness of these explanations, which represent the model's internal workings, involves trade-offs. Recent advancements in unsupervised CBEMs (U-CBEMs) claim to improve both interpretability and faithfulness, but these improvements may stem from complex surrogates or deletion-based methods that compromise clarity.
Spot The Ball: A Benchmark for Visual Social Inference
Neutral · Artificial Intelligence
The article introduces 'Spot The Ball', a benchmark designed to evaluate visual social inference in vision-language models (VLMs) using sports imagery. The task involves localizing a missing sports ball in images from soccer, basketball, and volleyball. The study compares human performance against four advanced VLMs, revealing that humans are significantly more accurate, achieving 20-34% accuracy compared to the models' maximum of 17%. This highlights the limitations of current AI in understanding complex visual cues.