Improving Alignment Between Human and Machine Codes: An Empirical Assessment of Prompt Engineering for Construct Identification in Psychology

arXiv — cs.CLThursday, December 4, 2025 at 5:00:00 AM
  • A recent study published on arXiv presents an empirical framework aimed at optimizing large language models (LLMs) for identifying psychological constructs through prompt engineering. The research evaluates five prompting strategies, revealing that certain methods, such as persona and chain-of-thought prompting, do not fully address the challenges of classification in psychology.
  • This development is significant as it enhances the ability of LLMs to accurately classify psychological constructs, which are often defined by precise theoretical frameworks. Improved classification can lead to better applications in psychological research and clinical settings.
  • The findings contribute to ongoing discussions about the effectiveness of prompt engineering in LLMs, particularly in specialized fields like psychology. As LLMs evolve, the need for robust frameworks that ensure accurate and contextually relevant outputs becomes increasingly critical, highlighting the intersection of AI technology and human cognitive processes.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
A smarter way for large language models to think about hard problems
PositiveArtificial Intelligence
Researchers have discovered that allowing large language models (LLMs) more time to contemplate potential solutions can enhance their accuracy in addressing complex questions. This approach aims to improve the models' performance in challenging scenarios, where quick responses may lead to errors.
NLP Datasets for Idiom and Figurative Language Tasks
NeutralArtificial Intelligence
A new paper on arXiv presents datasets aimed at improving the understanding of idiomatic and figurative language in Natural Language Processing (NLP). These datasets are designed to assist large language models (LLMs) in better interpreting informal language, which has become increasingly prevalent in social media and everyday communication.
Robust Multimodal Sentiment Analysis of Image-Text Pairs by Distribution-Based Feature Recovery and Fusion
PositiveArtificial Intelligence
A new method for robust multimodal sentiment analysis of image-text pairs has been proposed, addressing challenges related to low-quality and missing modalities. The Distribution-based feature Recovery and Fusion (DRF) technique utilizes a feature queue for each modality to approximate feature distributions, enhancing sentiment prediction accuracy in real-world applications.
ZIP-RC: Optimizing Test-Time Compute via Zero-Overhead Joint Reward-Cost Prediction
PositiveArtificial Intelligence
The recent introduction of ZIP-RC, an adaptive inference method, aims to optimize test-time compute for large language models (LLMs) by enabling zero-overhead joint reward-cost prediction. This innovation addresses the limitations of existing test-time scaling methods, which often lead to increased costs and latency due to fixed sampling budgets and a lack of confidence signals.
Identifying attributions of causality in political text
NeutralArtificial Intelligence
A new framework has been introduced for identifying attributions of causality in political text, utilizing a lightweight causal language model to generate structured data sets of causal claims. This approach aims to enhance the systematic analysis of explanations in political science, an area that has been historically fragmented and underdeveloped.
A Group Fairness Lens for Large Language Models
PositiveArtificial Intelligence
A recent study introduces a group fairness lens for evaluating large language models (LLMs), proposing a novel hierarchical schema to assess bias and fairness. The research presents the GFAIR dataset and introduces GF-THINK, a method aimed at mitigating biases in LLMs, highlighting the critical need for broader evaluations of these models beyond traditional metrics.
Culture Affordance Atlas: Reconciling Object Diversity Through Functional Mapping
PositiveArtificial Intelligence
The Culture Affordance Atlas has been introduced as a function-centric framework aimed at addressing cultural biases in mainstream Vision-Language datasets, which often favor higher-income, Western contexts. This initiative involves a re-annotation of the Dollar Street dataset, categorizing 288 objects based on 46 functions to enhance model generalizability across diverse cultural and economic backgrounds.
Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models
PositiveArtificial Intelligence
Recent research has introduced a novel approach to tokenizer adaptation for pre-trained language models, focusing on vocabulary extension and pruning. The method, termed continued BPE training, enhances tokenization efficiency by continuing the BPE merge learning process on new data, while leaf-based vocabulary pruning removes redundant tokens without compromising model quality.