A systematic review of neurological symptoms and complications of COVID-19
Xiangliang Chen, S. Laurent, O. Onur
et al.
To study the frequency of neurological symptoms and complications in COVID-19 patients in a systematic review of the literature. Relevant studies were identified through electronic explorations of PubMed, medRxiv, and bioRxiv. Besides, three Chinese databases were searched. A snowballing method searching the bibliographies of the retrieved references was applied to identify potentially relevant articles. Articles published within 1 year prior to April 20th, 2020, were screened with no language restriction imposed. Databases were searched for terms related to SARS-CoV-2/COVID-19 and neurological manifestations, using a pre-established protocol registered on the International Prospective Register of Systematic Reviews database (ID: CRD42020187994). A total of 2441 articles were screened for relevant content, of which 92 full-text publications were included in the analyses of neurological manifestations of COVID-19. Headache, dizziness, taste and smell dysfunctions, and impaired consciousness were the most frequently described neurological symptoms, the latter more often among patients with a severe or critical disease course. To date, only smaller cohort studies or single cases have reported cerebrovascular events, seizures, meningoencephalitis, and immune-mediated neurological diseases, not suitable for quantitative analysis. The most frequent neurological symptoms reported in association with COVID-19 are non-specific for the infection with SARS-CoV-2. Although SARS-CoV-2 may have the potential to gain direct access to the nervous system, so far, SARS-CoV-2 was detected in the cerebrospinal fluid in two cases only. Standardized international registries are needed to clarify the clinical relevance of the neuropathogenicity of SARS-CoV-2 and to elucidate a possible impact of SARS-CoV-2 infection on common neurological disease, such as Alzheimer’s, Parkinson’s disease or multiple sclerosis.
Translation and back-translation in qualitative nursing research: methodological review.
Hsiao-Yu Chen, J. Boore
Prevalence of depressive symptoms among older adults in mainland China: A systematic review and meta-analysis.
T. Tang, Jianling Jiang, Xinfeng Tang
Exploration of Acupoint Compatibility Patterns in Acupuncture Treatment for Infertility Based on Data Mining
Liu J, Zhang J, Li H
et al.
Jun Liu,1,2,* Juncha Zhang,1,2,* Haiping Li,1,* Zichun Yuan,3 Jiaxin Liu,1 Weiwei Jia,1 Yi Sun,1 Xuesong Wang,4 Yanfen She1,2 1College of Acupuncture-Moxibustion and Massage, Hebei University of Chinese Medicine, Shijiazhuang, Hebei, People’s Republic of China; 2Hebei International Joint Research Center for Dominant Diseases in Chinese Medicine and Acupuncture, Hebei University of Chinese Medicine, Shijiazhuang, Hebei, People’s Republic of China; 3Department of Graduate School, Beijing University of Chinese Medicine, Beijing, People’s Republic of China; 4Center for Rehabilitation Medicine, Rehabilitation & Sports Medicine Research Institute of Zhejiang Province, Zhejiang Provincial People’s Hospital (Affiliated People’s Hospital), Hangzhou Medical College, Hangzhou, Zhejiang, People’s Republic of China*These authors contributed equally to this workCorrespondence: Xuesong Wang, Center for Rehabilitation Medicine, Rehabilitation & Sports Medicine Research Institute of Zhejiang Province, Zhejiang Provincial People’s Hospital (Affiliated People’s Hospital), Hangzhou Medical College, No. 158 Shangtang Road, Gongshu District, Hangzhou, Zhejiang, 310000, People’s Republic of China, Tel +8618271838070, Email wangxuesong@hebcm.edu.cn Yanfen She, College of Acupuncture-Moxibustion and Massage,Hebei University of Chinese Medicine, No. 3 Xingyuan Road, Luquan District, Shijiazhuang, Hebei, 050200, People’s Republic of China, Tel +8615833955088, Email she@hebcm.edu.cnObjective: This study aimed to explore the compatibility rules of acupoint selection in acupuncture treatment for infertility using data mining techniques.Methods: Randomized controlled trials (RCTs)on acupuncture treatment for infertility were retrieved from the China National Knowledge Infrastructure, Wanfang Data Knowledge Service Platform, VIP Chinese Journal Service Platform, Chinese Biomedical Literature Database, PubMed, Embase, Web of Science, and Cochrane Library up to September 22, 2024. The frequency of acupoint use, meridian attribution, selection of specific points, and acupoint association rules in the treatment of polycystic ovary syndrome-related infertility, diminished ovarian reserve related infertility, and unexplained infertility were analyzed using R language. Cytoscape was employed to visualize acupoint associations and disease-acupoint co-occurrence networks.Results: A total of 220 prescriptions for infertility were included, covering 92 acupoints with an overall frequency of 3,653 uses. The top three frequently used acupoints were Guanyuan (CV4), Sanyinjiao (SP6), Zigong (EX-CA1). The most frequently involved meridians were the Conception Vessel, the Spleen Meridian of Foot-Taiyin, and the Stomach Meridian of Foot-Yangming. The specific acupoint usage frequency from high to low was Front Mu points, Five-shu points, and Yuan-primary points. Network analysis indicated that the core acupoint group consisted of CV4, SP6, Zusanli (ST36), EX-CA1, Qihai (CV6), Baihui (GV20), Xuehai (SP10), Shenshu (BL23), Ganshu (BL18), Guilai (ST29), Zhongji (CV3), Taichong (LR3), Taixi (KI3), Zhongwan (CV12), Tianshu (ST25). The overall therapeutic principle focused on regulating the Flush Vessel and Conception Vessel and harmonizing qi and blood.Conclusion: This study provides valuable insights for the clinical selection of acupoint combinations in the acupuncture treatment of infertility, offering directions for optimizing acupoint strategies in clinical practice.Keywords: infertility, acupuncture, acupoints, association rules, data mining
Gynecology and obstetrics
Long term mental health trajectories after disasters and pandemics: A multilingual systematic review of prevalence, risk and protective factors.
Elizabeth A. Newnham, Enrique L. P. Mergelsberg, Yanyu Chen
et al.
The patterns of long-term psychological response after disasters and pandemics remain unclear. We aimed to determine the trajectories for post-traumatic stress symptoms (PTSS), depression and anxiety prevalence following disasters and pandemic exposure; and identify associated risk and protective factors. A systematic review of the English, Chinese, and Japanese longitudinal mental health literature was conducted. We searched Cochrane, MEDLINE, ProQuest, PsycINFO, PubMed, Web of Science, and CINAHL (English), CNKI and SINOMED (Chinese) and CiNii (Japanese) for studies published between January 2000 and May 2022. Following a pre-specified protocol (PROSPERO: CRD42020206424), conditional linear growth curve models and ANOVA analyses were conducted. The search identified 77,891 papers, with a final sample of 234: 206 English, 24 Chinese, and 4 Japanese-language papers. PTSS rates improved for all ages (p = .018, eta2 = 0.035). In contrast, depression and anxiety prevalence remained elevated for years following exposure (p = .424, eta2 = 0.019 and p = .051, eta2 = 0.064, respectively), with significantly higher rates for children and adolescents (p 0.056). Earthquakes and pandemics were associated with higher prevalence of PTSS (p 0.019). Multi-level risk and protective factors were identified. The chronicity of mental health outcomes highlights a critical need for tailored, sustainable mental health services, particularly for children and adolescents, in disaster- and pandemic-affected settings.
The Routledge Handbook of Stylistics
M. Burke
Cognitive Poetics and Cultural MemoryCritical StylisticsLanguage in LiteratureLiterary Reading, Cognition and EmotionThe Routledge Handbook of Chinese Discourse AnalysisThe Routledge Handbook of Language and HumorThe Routledge Handbook of Translation Studies and LinguisticsA Dictionary of StylisticsThe Routledge Handbook of English Language and Digital HumanitiesThe Routledge Handbook of Literary TranslationThe Routledge Handbook of Educational LinguisticsThe Cambridge Handbook of Cognitive LinguisticsStyle and StylisticsMind The GapCreative Writing and StylisticsAnalyzing Digital FictionForensic LinguisticsThe Routledge Handbook of Translation StudiesHow to Do Critical Discourse AnalysisForensic LinguisticsThe Routledge Handbook of Language AwarenessThe Routledge Handbook of English Language and Digital HumanitiesThe Routledge Handbook of Language in ConflictLanguage and Literature (general)StylisticsThe Routledge Handbook of StylisticsThe Routledge Handbook of English Language StudiesThe Cambridge Handbook of StylisticsThe Routledge Handbook of Forensic LinguisticsLinguistic CriticismAn Introduction to Forensic LinguisticsCritical StylisticsStylisticsThe Routledge Handbook of Applied LinguisticsWorld BuildingStylistic Approaches to TranslationThe Routledge Handbook of Language and CreativityLanguage and PowerRelevance TheoryThe Routledge Handbook of Stylistics
CULTURAL SENSITIVITY IN AI TRANSLATION OF CHINESE LANGUAGE: EVALUATING AI’S CAPABILITY TO GRASP CULTURALLY RICH TEXTS
Iulia Elena CÎNDEA, Jiong WANG
Cultural Sensitivity in AI Translation of Chinese Language: Evaluating AI’s Capability to Grasp Culturally Rich Texts. The widespread adoption of Artificial Intelligence (AI) in translation has fundamentally reshaped global communication by enhancing efficiency, cost-effectiveness, and accessibility. Despite these advancements, critical questions persist about AI’s capacity to navigate the inherent complexities of language, particularly regarding cultural sensitivity and contextual accuracy. These challenges are especially pronounced in Chinese, a language abundant in idiomatic phrases, literary allusions, and culturally embedded references, where mistranslations risk significant misunderstanding. This study examines the capabilities of leading AI translation tools in rendering culturally rich Chinese texts into Romanian and English. Drawing on a curated corpus of classical literature, contemporary literature and news articles, the research evaluates each tool’s performance in terms of accuracy, cultural sensitivity, contextual understanding, emotional tone, and overall fluency. Both qualitative and quantitative analyses reveal considerable variation among popular platforms. While high fluency and coherence are attainable, some persistent gaps remain in the handling of idiomatic expressions and nuanced cultural elements. These findings reaffirm the need for expert post-editing and a judicious, human-centered integration of AI into translation workflows.
REZUMAT. Sensibilitate culturală în traducerea asistată de IA a limbii chineze: evaluarea capacității de redare a textelor cu densitate culturală. Adoptarea pe scară largă a inteligenței artificiale (IA) în traducere a remodelat fundamental comunicarea globală, sporind eficiența, costurile și accesibilitatea. În pofida acestor progrese, persistă întrebări critice privind capacitatea IA de a gestiona complexitățile inerente ale limbajului, în special sensibilitatea culturală și acuratețea contextuală. Provocările sunt deosebit de pronunțate în chineză, limbă bogată în expresii idiomatice, aluzii literare și referințe puternic ancorate cultural, unde erorile de traducere pot genera neînțelegeri majore. Studiul de față examinează capacitățile principalelor instrumente de traducere bazate pe IA în redarea în română și engleză a textelor chineze bogate cultural. Pe baza unui corpus atent selectat, cuprinzând literatură clasică, literatură contemporană și articole de presă, cercetarea evaluează performanța fiecărui instrument în termeni de acuratețe, sensibilitate culturală, înțelegere contextuală, ton emoțional și fluență generală. Analizele calitative și cantitative evidențiază variații considerabile între platformele populare. Deși fluența și coerența ridicate sunt posibile, persistă anumite lacune în tratarea expresiilor idiomatice și a elementelor culturale nuanțate. Aceste constatări reafirmă necesitatea post-editării de către traducători profesioniști și a unei integrări cumpătate, centrate pe om, a IA în fluxurile de traducere.
Cuvinte-cheie: instrumente de traducere bazate pe IA; sensibilitate culturală; hermeneutică; cultureme chineze; post-editare; traducere hibridă
Article history: Received 13 June 2025; Revised 12 November 2025; Accepted 20 November 2025;
Available online 12 December 2025; Available print 30 December 2025
Disorder, Punishment, and Grace: The Harmonization of Divine Will and Fate in the <i>Prometheus</i> Trilogy
Xiao Ren
In the <i>Prometheus</i> trilogy, fate dictates critical actions taken by Prometheus, such as forming alliances, stealing fire, facing punishment, and eventual liberation. This trajectory gradually aligns with the divine will of Zeus, reflecting the theological framework of early Greek religion. Within the play, Prometheus’s rebellion against the established order of distribution determines his “unlawful act”, which brings about retributive justice—a theological necessity for restoring the balance between human advancement and divine sovereignty. In essence, Prometheus’s punishment results from the interplay between fate and Zeus’s rule, yet this suffering is essential for the harmonization of the cosmic order. Consequently, throughout this process, Zeus’s divine justice undergoes continuous evolution, ultimately establishing the foundation for the legitimacy of civic ethics and providing a theological justification for the justice of human suffering. Ultimately, Aeschylus traces civic ethical norms to Zeus’s justice, demonstrating how democracy gains legitimacy through theological discourse, which highlights the intricate connections among Greek religion, democracy, and tragedy.
Religions. Mythology. Rationalism
Systematic review and meta-analysis of urban-rural differences in parenting styles among Chinese secondary school students
Lixia Yan, Guanqi Sun, Junhua Zhang
Objective: Previous literature has reported inconsistent findings regarding urban–rural differences in parenting styles among Chinese parents. This meta-analysis aims to reconcile these discrepancies. Design: A comprehensive literature search was conducted across PubMed, OVID, Web of Knowledge, CNKI, Wan Fang, and Chongqing VIP databases through February 11, 2024, with no restrictions on language, date, or document type. A random effects model was applied using Comprehensive Meta-Analysis 3.0 software, and study quality was assessed using a standardized checklist. Results: Forty-nine studies met the inclusion criteria, including 16,222 urban and 14,975 rural secondary school students. The analysis showed that urban parents demonstrate significantly higher levels of warmth and overprotection than rural parents. These differences are likely attributable to disparities in economic status, educational attainment, parental age, household structure, and broader social conditions. Notably, regional factors moderated urban–rural differences in maternal overprotection, with more pronounced contrasts observed in Eastern China than in other regions. Conclusions: The findings highlight the complex interaction between urban–rural disparities and regional variation in Chinese parenting practices. These insights may inform the development of more context-sensitive policies and interventions to support families across diverse settings in China.
The Difference and Motivation of the Semantic Conflation Patterns in Chinese and English Autonomous Motion Event Sentences: Path and Containers
Xu Xiaotong, Chen Mitian
In English, a single sentence can be formed with only one core verb, while in Chinese, a serial verb construction is required to encode elements that are semantically equivalent to the English core verb. In English, verbs that involve no horizontal/vertical path, such as swirl, or even verbs that lack the concept of motion, such as roar, can directly enter a sentence when coupled with an element denoting the path. Yet in Chinese, the complement from a main movement verb is required. For example, verbs like xuánzhuǎn ‘swirl’ and hūxiào ‘roar’ can only function as additional elements to modify main movement verbs like fēi ‘fly’ or shǐ ‘drive’. Building on this observation, the present research investigates autonomous motion event sentences in Chinese and English, proposing that in both languages, characterizing an autonomous motion event requires the semantic conflation of its motion and path to express a concept of translational movement. The differences in semantic conflation patterns between Chinese and English can be attributed to the greater number and variety of prepositions in English. Some English prepositions, such as across, encode a directional path, while others, such as in, do not. Prepositions expressing a directional path could compensate for the absence of directional motion in non-horizontal/vertical movement verbs. However, Chinese has far fewer prepositions, and none encode a directional path, making movement verbs or directional path verbs indispensable. This study concludes that when a complement for movement is needed to characterize an autonomous motion event, prepositions expressing path perform this function in English, whereas in Chinese, the role is fulfilled by verbs.
Chinese language and literature
Perspectives and Experiences of Family Caregivers Using Supportive Mobile Apps in Dementia Care: Meta-Synthesis of Qualitative Research
Haifei Shen, Yi Han, Wen Shi
et al.
BackgroundSupportive mobile apps are effective tools for family caregivers of persons with dementia to obtain online information and psychological support. Nevertheless, details about the experiences of family caregivers of persons with dementia using mobile apps are limited.
ObjectiveThis study aimed to synthesize the perspectives and experiences of family caregivers of persons with dementia regarding supportive mobile apps.
MethodsWe conducted a synthesis of qualitative research and searched 7 English-language databases and 4 Chinese-language databases. We included qualitative studies (peer-reviewed studies and gray literature) written in English and Chinese on the perspectives and experiences of family caregivers of persons with dementia regarding supportive mobile apps published from database establishment to March 2025. Two researchers independently screened the literature and used the JBI Critical Appraisal Checklist for Qualitative Research to conduct quality assessments on the final included studies. Themes were integrated using the 3-stage thematic synthesis approach by Thomas and Harden.
ResultsA preliminary search yielded 4772 studies, of which 12 (0.25%) met the criteria. The included studies were from 7 different countries or regions, of which the only low- or middle-income country was Brazil. The studies involved a total of 232 family caregivers, most of whom were older adults and female. The integration of extracted content resulted in 4 themes: dynamic changes in value perception—complex attitudes toward mobile app adoption; from tools to partners—a technology-empowered multidimensional support system for family caregivers; external and internal barriers—challenges in family caregivers’ use of mobile apps; and person-centered design—future directions for improving mobile apps.
ConclusionsThis study found that family caregivers’ attitudes toward using supportive mobile apps are influenced by their perceived value of mobile apps and their caregiving burden. In addition, such supportive mobile apps serve as valuable tools for family caregivers to enhance their caregiving abilities and efficiency, alleviate the burden of care, improve negative emotions, foster social connections, and promote self-care. Future mobile app design needs to address obstacles such as design flaws, family caregivers’ lack of technological literacy, time constraints, concerns about privacy breaches, and other device-related issues, with particular attention to the ease of use of mobile apps. Meanwhile, developers need to commit to designing personalized and multifunctional mobile apps as well as promote online collaboration among members of the care network. Overall, our study offers an important reference for developing person-centered supportive mobile apps for family caregivers of persons with dementia.
Trial RegistrationPROSPERO CRD42024510905; https://www.crd.york.ac.uk/PROSPERO/view/CRD42024510905
Information technology, Public aspects of medicine
OphthBench: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Ophthalmology
Chengfeng Zhou, Ji Wang, Juanjuan Qin
et al.
Large language models (LLMs) have shown significant promise across various medical applications, with ophthalmology being a notable area of focus. Many ophthalmic tasks have shown substantial improvement through the integration of LLMs. However, before these models can be widely adopted in clinical practice, evaluating their capabilities and identifying their limitations is crucial. To address this research gap and support the real-world application of LLMs, we introduce the OphthBench, a specialized benchmark designed to assess LLM performance within the context of Chinese ophthalmic practices. This benchmark systematically divides a typical ophthalmic clinical workflow into five key scenarios: Education, Triage, Diagnosis, Treatment, and Prognosis. For each scenario, we developed multiple tasks featuring diverse question types, resulting in a comprehensive benchmark comprising 9 tasks and 591 questions. This comprehensive framework allows for a thorough assessment of LLMs' capabilities and provides insights into their practical application in Chinese ophthalmology. Using this benchmark, we conducted extensive experiments and analyzed the results from 39 popular LLMs. Our evaluation highlights the current gap between LLM development and its practical utility in clinical settings, providing a clear direction for future advancements. By bridging this gap, we aim to unlock the potential of LLMs and advance their development in ophthalmology.
Language Lives in Sparse Dimensions: Toward Interpretable and Efficient Multilingual Control for Large Language Models
Chengzhi Zhong, Fei Cheng, Qianying Liu
et al.
Large language models exhibit strong multilingual capabilities despite limited exposure to non-English data. Prior studies show that English-centric large language models map multilingual content into English-aligned representations at intermediate layers and then project them back into target-language token spaces in the final layer. From this observation, we hypothesize that this cross-lingual transition is governed by a small and sparse set of dimensions, which occur at consistent indices across the intermediate to final layers. Building on this insight, we introduce a simple, training-free method to identify and manipulate these dimensions, requiring only as few as 50 sentences of either parallel or monolingual data. Experiments on a multilingual generation control task reveal the interpretability of these dimensions, demonstrating that the interventions in these dimensions can switch the output language while preserving semantic content, and that it surpasses the performance of prior neuron-based approaches at a substantially lower cost.
The Role of Self-Efficacy, Task Value, and Intrinsic and Extrinsic Motivations in Students’ Feedback Engagement in English Learning
Zhengdong Gan, Fulan Liu, Honghan Nang
Drawing on Wigfield and Eccles’s motivational theory, which is acclaimed for explaining individual behavioral intentions, this study investigated the extent to which different forms of motivation (i.e., self-efficacy, task value, and intrinsic and extrinsic motivations) predicted student behavioral feedback engagement (i.e., action on teacher feedback and feedback seeking) in English learning. The participants were 276 male and female students who were enrolled in a second-year full-time English language and literature program at two Chinese universities. Multiple regression analyses showed that task value emerged as the only motivational variable that significantly predicted both students’ action on teacher feedback and feedback seeking. Intrinsic motivation significantly predicted action on teacher feedback, whereas extrinsic motivation and self-efficacy significantly predicted feedback seeking. Pedagogical implications for endeavors to support students in their engagement with feedback in learning English as a foreign language in China are discussed.
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Ziqiao Ma, Zekun Wang, Joyce Chai
Humans are efficient language learners and inherently social creatures. Our language development is largely shaped by our social interactions, for example, the demonstration and feedback from caregivers. Contrary to human language learning, recent advancements in large language models have primarily adopted a non-interactive training paradigm, and refined pre-trained models through feedback afterward. In this work, we explore how corrective feedback from interactions influences neural language acquisition from scratch through systematically controlled experiments, assessing whether it contributes to word learning efficiency in language models. We introduce a trial-and-demonstration (TnD) learning framework that incorporates three distinct components: student trials, teacher demonstrations, and a reward conditioned on language competence at various developmental stages. Our experiments reveal that the TnD approach accelerates word acquisition for student models of equal and smaller numbers of parameters, and we highlight the significance of both trials and demonstrations. We further show that the teacher's choices of words influence students' word-specific learning efficiency, and a practice-makes-perfect effect is evident by a strong correlation between the frequency of words in trials and their respective learning curves. Our findings suggest that interactive language learning, with teacher demonstrations and active trials, can facilitate efficient word learning in language models.
MERaLiON-TextLLM: Cross-Lingual Understanding of Large Language Models in Chinese, Indonesian, Malay, and Singlish
Xin Huang, Tarun Kumar Vangani, Minh Duc Pham
et al.
Multilingual large language models (MLLMs) have shown impressive capabilities across a variety of languages. However, efficacy can differ greatly between different language families, especially for those with limited linguistic resources. This report presents MERaLiON-TextLLM, a series of open-source language models specifically tailored to improve understanding and generation in Chinese, Indonesian, Malay, and Singlish. The initial released model is built on Llama-3-8B-Base and refined through a meticulously crafted process of continued pre-training and weight merging. Our approach achieves performance improvements across benchmarks in these languages, exceeding the capabilities of the official Llama-3 models. We provide the model checkpoints as a resource to support further research and development in cross-lingual language understanding.
Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing
Tianchi Liu, Ivan Kukanov, Zihan Pan
et al.
The effects of language mismatch impact speech anti-spoofing systems, while investigations and quantification of these effects remain limited. Existing anti-spoofing datasets are mainly in English, and the high cost of acquiring multilingual datasets hinders training language-independent models. We initiate this work by evaluating top-performing speech anti-spoofing systems that are trained on English data but tested on other languages, observing notable performance declines. We propose an innovative approach - Accent-based data expansion via TTS (ACCENT), which introduces diverse linguistic knowledge to monolingual-trained models, improving their cross-lingual capabilities. We conduct experiments on a large-scale dataset consisting of over 3 million samples, including 1.8 million training samples and nearly 1.2 million testing samples across 12 languages. The language mismatch effects are preliminarily quantified and remarkably reduced over 15% by applying the proposed ACCENT. This easily implementable method shows promise for multilingual and low-resource language scenarios.
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset
Jie Zhu, Junhui Li, Yalong Wen
et al.
In light of recent breakthroughs in large language models (LLMs) that have revolutionized natural language processing (NLP), there is an urgent need for new benchmarks to keep pace with the fast development of LLMs. In this paper, we propose CFLUE, the Chinese Financial Language Understanding Evaluation benchmark, designed to assess the capability of LLMs across various dimensions. Specifically, CFLUE provides datasets tailored for both knowledge assessment and application assessment. In knowledge assessment, it consists of 38K+ multiple-choice questions with associated solution explanations. These questions serve dual purposes: answer prediction and question reasoning. In application assessment, CFLUE features 16K+ test instances across distinct groups of NLP tasks such as text classification, machine translation, relation extraction, reading comprehension, and text generation. Upon CFLUE, we conduct a thorough evaluation of representative LLMs. The results reveal that only GPT-4 and GPT-4-turbo achieve an accuracy exceeding 60\% in answer prediction for knowledge assessment, suggesting that there is still substantial room for improvement in current LLMs. In application assessment, although GPT-4 and GPT-4-turbo are the top two performers, their considerable advantage over lightweight LLMs is noticeably diminished. The datasets and scripts associated with CFLUE are openly accessible at https://github.com/aliyun/cflue.
Why do objects have many names? A study on word informativeness in language use and lexical systems
Eleonora Gualdoni, Gemma Boleda
Human lexicons contain many different words that speakers can use to refer to the same object, e.g., "purple" or "magenta" for the same shade of color. On the one hand, studies on language use have explored how speakers adapt their referring expressions to successfully communicate in context, without focusing on properties of the lexical system. On the other hand, studies in language evolution have discussed how competing pressures for informativeness and simplicity shape lexical systems, without tackling in-context communication. We aim at bridging the gap between these traditions, and explore why a soft mapping between referents and words is a good solution for communication, by taking into account both in-context communication and the structure of the lexicon. We propose a simple measure of informativeness for words and lexical systems, grounded in a visual space, and analyze color naming data for English and Mandarin Chinese. We conclude that optimal lexical systems are those where multiple words can apply to the same referent, conveying different amounts of information. Such systems allow speakers to maximize communication accuracy and minimize the amount of information they convey when communicating about referents in contexts.
Neuroanatomical correlates of phonological processing of Chinese characters and alphabetic words: A meta‐analysis
L. Tan, A. Laird, Karl Li
et al.
571 sitasi
en
Medicine, Psychology