Multilingual corpora for the study of new concepts in the social sciences and humanities:
NeutralArtificial Intelligence
- A new article has been published on arXiv detailing a hybrid methodology for constructing a multilingual corpus aimed at studying emerging concepts in the humanities and social sciences, specifically focusing on 'non-technological innovation'. The corpus is built from automatically extracted textual content from company websites and annual reports, which are filtered and processed to create a dataset for machine learning applications.
- This development is significant as it enhances the ability to analyze and understand new concepts within the humanities and social sciences, potentially leading to more informed discussions and advancements in these fields. The methodology could serve as a model for future research in multilingual data analysis.
— via World Pulse Now AI Editorial System
