TopiCLEAR: Topic extraction by CLustering Embeddings with Adaptive dimensional Reduction

arXiv cs.CL | Tuesday, December 9, 2025 at 5:00:00 AM
  • TopiCLEAR is a new method for extracting topics from social media posts, designed to cope with the informal text found on platforms like X, Facebook, and Reddit. It embeds each post with Sentence-BERT, clusters the embeddings with Gaussian Mixture Models, and then refines the clusters iteratively to improve topic modeling accuracy.
  • More accurate topic extraction makes it easier to analyze public perception of social issues, politics, and consumer sentiment as expressed in online posts.
  • The work reflects a broader trend in artificial intelligence research toward better analysis techniques for informal and fragmented text. Related efforts include frameworks that quantify argument strength and frameworks that enrich data for mental health and online safety.
— via World Pulse Now AI Editorial System
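
The pipeline summarized above (embed, cluster with a Gaussian Mixture Model, refine iteratively) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the summary does not say how the dimensional reduction is done, so the sketch assumes a linear discriminant projection fitted on the current cluster labels. Synthetic vectors stand in for Sentence-BERT embeddings so that the example runs without downloading a model, and the function names `make_synthetic_embeddings` and `cluster_with_refinement` are hypothetical.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import adjusted_rand_score
from sklearn.mixture import GaussianMixture


def make_synthetic_embeddings(n_topics=3, per_topic=60, dim=16, seed=0):
    """Stand-in for Sentence-BERT output: one Gaussian blob per topic."""
    rng = np.random.default_rng(seed)
    centers = rng.normal(scale=10.0, size=(n_topics, dim))
    points = [c + rng.normal(size=(per_topic, dim)) for c in centers]
    return np.vstack(points)


def cluster_with_refinement(embeddings, n_topics, max_rounds=10, seed=0):
    """Alternate GMM clustering with a label-driven projection.

    Assumption: the reduction step is LDA fitted on the current labels.
    The source only says the clusters are refined iteratively.
    """
    # Initial clustering in the full embedding space.
    gmm = GaussianMixture(
        n_components=n_topics, covariance_type="diag", random_state=seed
    )
    labels = gmm.fit_predict(embeddings)

    for _ in range(max_rounds):
        n_found = len(np.unique(labels))
        if n_found < 2:
            break  # LDA needs at least two populated clusters

        # Project onto the directions that best separate the current clusters.
        n_dims = min(n_found - 1, embeddings.shape[1])
        lda = LinearDiscriminantAnalysis(n_components=n_dims)
        reduced = lda.fit_transform(embeddings, labels)

        # Re-cluster in the reduced space.
        gmm = GaussianMixture(n_components=n_topics, random_state=seed)
        new_labels = gmm.fit_predict(reduced)

        # Stop once the partition no longer changes (cluster ids may permute,
        # so compare partitions with the adjusted Rand index).
        converged = adjusted_rand_score(labels, new_labels) >= 0.999999
        labels = new_labels
        if converged:
            break

    return labels


embeddings = make_synthetic_embeddings()
labels = cluster_with_refinement(embeddings, n_topics=3)
print(np.bincount(labels, minlength=3))  # number of posts assigned to each topic
```

With real posts, the synthetic array would be replaced by actual sentence embeddings, for example from the `sentence-transformers` package via `SentenceTransformer(model_name).encode(posts)`. Which model to use is left open here, since the summary does not name one.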

Continue Reading
Australia Has Banned Social Media for Kids Under 16. How Does It Work?
Negative | Artificial Intelligence
Australia has enacted a ban on social media access for individuals under the age of 16, effective December 10, 2025. This legislation targets major platforms such as Facebook, Instagram, and TikTok, marking one of the strictest measures globally to protect minors from online risks.
Meta is trying to make Facebook suck less by simplifying things a bit
Neutral | Artificial Intelligence
Meta is implementing changes to simplify Facebook, aiming to improve user experience by streamlining various features and functionalities. This initiative reflects the company's ongoing efforts to address user feedback and enhance engagement on the platform.
SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations
Neutral | Artificial Intelligence
The introduction of SPOT (Stopping Points in Online Threads) marks a significant advancement in the field of natural language processing, providing the first annotated French corpus designed to identify critical interventions in online discussions. This corpus consists of 43,305 annotated Facebook comments related to misinformation, offering a new lens through which to analyze online discourse.
Automated Data Enrichment using Confidence-Aware Fine-Grained Debate among Open-Source LLMs for Mental Health and Online Safety
Positive | Artificial Intelligence
A new study introduces a Confidence-Aware Fine-Grained Debate (CFD) framework that utilizes multiple open-source large language models (LLMs) to enhance data enrichment for mental health and online safety. This framework simulates human annotators to reach consensus on labeling real-world indicators, addressing the challenges of dynamic life events. Two expert-annotated datasets were created, focusing on mental health discussions on Reddit and risks associated with sharenting on Facebook.
Meta will let Facebook and Instagram users in the EU share less data
Neutral | Artificial Intelligence
Meta has announced that users of Facebook and Instagram in the European Union will have the option to share less data, a move aimed at complying with EU regulations and addressing privacy concerns. This decision follows a €200 million fine imposed on the company for its advertising practices, prompting a shift in its data-sharing policies.
Musk goes full Musk after X gets hit with a €120 million EU fine
Negative | Artificial Intelligence
The European Union has imposed a €120 million fine on Elon Musk's social media platform, X, following a two-year investigation into violations of the Digital Services Act (DSA). This penalty marks the first enforcement action under this legislation, which was introduced in 2022, specifically targeting misleading practices related to the platform's blue check verification system.
Meta offers EU users ad-light option in push to end investigation
Positive | Artificial Intelligence
Meta has announced a new ad-light option for Facebook users in the European Union, a significant shift from its previous 'pay or consent' advertising model. This change follows discussions with the European Commission aimed at addressing regulatory concerns and ending ongoing investigations into its advertising practices.
X deactivates EU’s ad account after €120m DSA fine
Negative | Artificial Intelligence
X has deactivated the European Commission's advertising account following a €120 million fine imposed for violating the Digital Services Act, marking the first enforcement action under this legislation. The fine was primarily due to the platform's misleading blue check verification system, which allowed users to purchase verification without proper checks.