Retrieving Semantically Similar Decisions under Noisy Institutional Labels: Robust Comparison of Embedding Methods

arXiv — cs.CLMonday, December 8, 2025 at 5:00:00 AM
  • A recent study compared two models for retrieving decisions from the Czech Constitutional Court, focusing on a general-purpose embedder from OpenAI and a domain-specific BERT model trained on approximately 30,000 decisions. The evaluation employed a noise-aware approach, revealing that the OpenAI embedder significantly outperformed the BERT model in various settings despite the challenges posed by noisy institutional labels.
  • This development is significant as it highlights the effectiveness of general-purpose models like OpenAI's in legal contexts, suggesting that they may provide more reliable retrieval of case law compared to specialized models. The findings could influence future research and applications in legal informatics and AI-driven legal research tools.
  • The results underscore ongoing discussions about the reliability of AI models in specialized domains, particularly in legal contexts where accuracy is paramount. The performance of general-purpose models raises questions about the adequacy of domain-specific training, especially when faced with noisy data, reflecting broader challenges in AI applications across various fields.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
RAM Prices Surge as Soaring Demand From AI Giants Like OpenAI Pushes Costs Higher
NegativeArtificial Intelligence
RAM prices have surged sharply due to unprecedented demand from AI companies like OpenAI, leading to increased hardware costs for consumers and businesses. This price increase is attributed to a global shortage of memory chips, exacerbated by the ongoing AI boom.
State attorneys general warn Microsoft, OpenAI, Google, and other AI giants to fix ‘delusional’ outputs
NegativeArtificial Intelligence
State attorneys general have issued a warning to major AI companies, including Microsoft, OpenAI, and Google, demanding the implementation of new safeguards to prevent harmful psychological impacts from their AI outputs, which have been described as 'delusional.'
OpenAI says the capabilities of its frontier AI models are accelerating and warns that upcoming models are likely to pose a "high" cybersecurity risk (Ina Fried/Axios)
NegativeArtificial Intelligence
OpenAI has announced that the capabilities of its frontier AI models are accelerating, warning that upcoming models could present a "high" cybersecurity risk. This statement reflects the company's growing concerns about the implications of advanced AI technologies on security and safety.
OpenAI's house of cards seems primed to collapse
NegativeArtificial Intelligence
OpenAI is facing significant challenges as its financial stability appears increasingly precarious, with concerns mounting over its partnerships and market position. Recent reports indicate that the company's collaboration with Oracle, valued at $300 billion, has resulted in a staggering loss of $315 billion in market value, raising alarms about its reliance on a single customer.
The fix for messy AI agent ecosystems might finally be here
PositiveArtificial Intelligence
A new initiative called the Agentic AI Foundation (AAIF), backed by prominent companies including OpenAI and Anthropic, aims to standardize AI agents and create an open, interoperable foundation for AI technologies. This effort seeks to address the complexities and fragmentation within the current AI ecosystem.
OpenAI report reveals a 6x productivity gap between AI power users and everyone else
NeutralArtificial Intelligence
A recent report from OpenAI highlights a significant productivity gap, revealing that AI power users send six times more messages to ChatGPT than the median employee in their companies. This disparity is even more pronounced in specific roles, such as coding and data analysis, where top users engage 17 times more than their peers.
You Can Now Edit Images Using Photoshop Inside ChatGPT
PositiveArtificial Intelligence
Adobe has integrated its tools from Photoshop, Adobe Express, and Acrobat into OpenAI's ChatGPT, enabling users to edit images and documents directly within the AI platform. This integration aims to enhance user experience by combining Adobe's creative capabilities with ChatGPT's conversational interface.
Google launches a cheaper AI Plus plan in India, costing ~$2.21 per month for the first six months and ~$4.44 thereafter, to compete with ChatGPT Go (Ivan Mehta/TechCrunch)
PositiveArtificial Intelligence
Google has launched a new AI Plus subscription plan in India, priced at approximately $2.21 per month for the first six months and $4.44 thereafter, aiming to compete with OpenAI's ChatGPT Go. This initiative reflects Google's strategy to enhance its presence in the AI market by offering affordable options to users in India.