Retrieving Semantically Similar Decisions under Noisy Institutional Labels: Robust Comparison of Embedding Methods
Neutral | Artificial Intelligence
- A recent study compared two models for retrieving decisions of the Czech Constitutional Court: a general-purpose embedder from OpenAI and a domain-specific BERT model trained on approximately 30,000 decisions. Using a noise-aware evaluation designed to account for unreliable institutional labels, the study found that the OpenAI embedder significantly outperformed the BERT model across settings.
- The finding matters because it suggests that general-purpose embedders can retrieve case law more reliably than models specialized for the domain, a result that could shape future research in legal informatics and the design of AI-driven legal research tools.
- The results add to an ongoing debate about the reliability of AI models in specialized domains, particularly law, where accuracy is paramount. The strong showing of a general-purpose model raises questions about when domain-specific training actually pays off, especially under noisy labels, reflecting broader challenges in applied AI.
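The noise-aware comparison described above can be sketched generically. The study's actual protocol is not reproduced here; everything below is an illustrative assumption: two toy embedders of different quality are simulated, roughly 10% of document labels are randomly flipped to mimic noisy institutional annotations, and retrieval is scored with precision@k over cosine similarity.

```python
import numpy as np

# Hypothetical sketch, not the study's method: simulate two embedders of
# different quality and score retrieval with precision@k against labels
# corrupted by random flips ("noisy institutional labels").

def precision_at_k(queries, docs, q_labels, d_labels, k=5):
    """Mean fraction of a query's top-k cosine neighbours sharing its label."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    d = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    topk = np.argsort(-(q @ d.T), axis=1)[:, :k]  # indices of k most similar docs
    return (d_labels[topk] == q_labels[:, None]).mean()

rng = np.random.default_rng(0)
n_docs, n_queries, dim, n_classes = 400, 60, 32, 4
centers = rng.normal(size=(n_classes, dim))        # one centroid per "legal topic"
d_labels = rng.integers(0, n_classes, n_docs)
q_labels = rng.integers(0, n_classes, n_queries)

def embed(labels, noise_scale):
    """Toy embedder: true topic centroid plus Gaussian noise."""
    return centers[labels] + rng.normal(scale=noise_scale, size=(len(labels), dim))

# Simulate noisy institutional labels: flip ~10% of the document labels.
noisy_d = d_labels.copy()
flip = rng.random(n_docs) < 0.10
noisy_d[flip] = rng.integers(0, n_classes, flip.sum())

# A tighter embedding space stands in for the stronger (general-purpose) model.
p_strong = precision_at_k(embed(q_labels, 0.5), embed(d_labels, 0.5), q_labels, noisy_d)
p_weak = precision_at_k(embed(q_labels, 2.5), embed(d_labels, 2.5), q_labels, noisy_d)
print(f"precision@5 strong={p_strong:.2f} weak={p_weak:.2f}")
```

Because the noise corrupts the evaluation labels rather than the embeddings, even a perfect retriever cannot reach a score of 1.0 here, which is exactly why a noise-aware reading of such metrics matters.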
— via World Pulse Now AI Editorial System



