Improving Latent Reasoning in LLMs via Soft Concept Mixing
Positive | Artificial Intelligence
- Recent research on large language models (LLMs) has introduced Soft Concept Mixing (SCM), a training scheme that enhances latent reasoning by integrating soft concept representations into the model's hidden states. The approach aims to narrow the gap between the discrete-token training of LLMs and the more abstract, continuous reasoning observed in human cognition.
- SCM is significant because it directly targets a limitation of conventional LLM training: models learn only from discrete tokens, while reasoning may benefit from softer, blended representations. If the reported gains hold across benchmarks, the technique could yield more nuanced, contextually aware outputs in applications that demand complex reasoning.
- The development of SCM reflects a broader trend in AI research focusing on improving reasoning capabilities in LLMs. This includes exploring analogical reasoning, causal relationships, and confidence estimation in model outputs, highlighting ongoing efforts to refine LLMs' cognitive-like functions and their application in real-world scenarios.
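The core idea described above, forming a "soft concept" as a probability-weighted mixture of token embeddings and blending it into a hidden state, can be sketched in a few lines. This is a minimal illustration, not the paper's actual method: the interpolation weight `alpha`, the mixing function, and all tensor shapes are assumptions introduced here for clarity.

```python
import numpy as np

def soft_concept(embeddings, probs):
    # Probability-weighted mixture of token embeddings: instead of committing
    # to one discrete token, average the embedding table under the model's
    # next-token distribution to get a continuous "soft concept" vector.
    return probs @ embeddings

def mix_into_hidden(hidden, concept, alpha=0.1):
    # Blend the soft concept into the hidden state by linear interpolation.
    # (alpha and the interpolation form are hypothetical, for illustration.)
    return (1.0 - alpha) * hidden + alpha * concept

rng = np.random.default_rng(0)
vocab_size, hidden_dim = 5, 8

E = rng.normal(size=(vocab_size, hidden_dim))   # toy embedding table
logits = rng.normal(size=vocab_size)            # toy next-token logits
probs = np.exp(logits) / np.exp(logits).sum()   # softmax distribution

concept = soft_concept(E, probs)                # shape: (hidden_dim,)
hidden = rng.normal(size=hidden_dim)            # toy hidden state
mixed = mix_into_hidden(hidden, concept, alpha=0.2)

assert mixed.shape == hidden.shape
```

With `alpha=0` the hidden state passes through unchanged, so a scheme like this can be annealed in gradually during training, one plausible way to keep the intervention from disrupting an already-trained model.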
— via World Pulse Now AI Editorial System
