Hasil untuk "English literature"

Menampilkan 20 dari ~9547024 hasil · dari CrossRef, arXiv, DOAJ, Semantic Scholar

JSON API
S2 Open Access 2019
The Country and the City

Raymond Williams

As a brilliant survey of English literature in terms of changing attitudes towards country and city, Williams' highly-acclaimed study reveals the shifting images and associations between these two traditional poles of life throughout the major developmental periods of English culture.

2670 sitasi en History
arXiv Open Access 2026
Bidirectional Chinese and English Passive Sentences Dataset for Machine Translation

Xinyue Ma, Pol Pastells, Mireia Farrús et al.

Machine Translation (MT) evaluation has gone beyond metrics, towards more specific linguistic phenomena. Regarding English-Chinese language pairs, passive sentences are constructed and distributed differently due to language variation, thus need special attention in MT. This paper proposes a bidirectional multi-domain dataset of passive sentences, extracted from five Chinese-English parallel corpora and annotated automatically with structure labels according to human translation, and a test set with manually verified annotation. The dataset consists of 73,965 parallel sentence pairs (2,358,731 English words, 3,498,229 Chinese characters). We evaluate two state-of-the-art open-source MT systems with our dataset, and four commercial models with the test set. The results show that, unlike humans, models are more influenced by the voice of the source text rather than the general voice usage of the source language, and therefore tend to maintain the passive voice when translating a passive in either direction. However, models demonstrate some knowledge of the low frequency and predominantly negative context of Chinese passives, leading to higher voice consistency with human translators in English-to-Chinese translation than in Chinese-to-English translation. Commercial NMT models scored higher in metric evaluations, but LLMs showed a better ability to use diverse alternative translations. Datasets and annotation script will be shared upon request.

en cs.CL, cs.DB
arXiv Open Access 2026
EmoAra: Emotion-Preserving English Speech Transcription and Cross-Lingual Translation with Arabic Text-to-Speech

Besher Hassan, Ibrahim Alsarraj, Musaab Hasan et al.

This work presents EmoAra, an end-to-end emotion-preserving pipeline for cross-lingual spoken communication, motivated by banking customer service where emotional context affects service quality. EmoAra integrates Speech Emotion Recognition, Automatic Speech Recognition, Machine Translation, and Text-to-Speech to process English speech and deliver an Arabic spoken output while retaining emotional nuance. The system uses a CNN-based emotion classifier, Whisper for English transcription, a fine-tuned MarianMT model for English-to-Arabic translation, and MMS-TTS-Ara for Arabic speech synthesis. Experiments report an F1-score of 94% for emotion classification, translation performance of BLEU 56 and BERTScore F1 88.7%, and an average human evaluation score of 81% on banking-domain translations. The implementation and resources are available at the accompanying GitHub repository.

en cs.CL
DOAJ Open Access 2026
Shared Decision-Making With a Surrogate for Life-Sustaining Treatment of Critically Ill Patients: Protocol for a Scoping Review

Yoshiyasu Ito, Mika Moriyama, Akemi Nasu et al.

Abstract BackgroundShared decision-making (SDM) is a collaborative process that integrates patients’ values and preferences into health care decisions. In intensive care units, patients who are critically ill often lack the capacity to make decisions, necessitating surrogates to make complex choices regarding life-sustaining treatments (LSTs). ObjectiveThis scoping review aims to assess the range of research conducted on surrogate SDM for LSTs among patients who are critically ill over the past decade and highlight areas where current research remains limited. MethodsThis scoping review will follow the Joanna Briggs Institute methodology and adhere to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) reporting guidelines. Studies will be included if they examine SDM involving surrogates of adult patients who are critically ill in relation to LST decisions within intensive care unit settings. SDM is defined using 4 criteria: participation of both health care professionals and surrogates, mutual information sharing, consensus building, and agreement on treatment based on the patient’s values and preferences. A comprehensive search will be performed across PubMed, CINAHL, PsycInfo, CENTRAL, and Ichushi-Web for English- and Japanese-language studies published between 2016 and 2025. Eligible study designs will include quantitative, qualitative, and mixed methods research. Title and abstract screening, as well as full-text selection, will be conducted independently by 2 reviewers using Rayyan. Data will be extracted on study characteristics, SDM definitions, participant roles, and key findings. Results will be synthesized descriptively and presented in tables and narrative summaries to identify research gaps and inform future investigations. ResultsAs of June 13, 2025, the literature search has been completed. A total of 2899 citations were identified through the specified database searches, and 527 (18.2%) duplicates were removed. Title and abstract screening are currently in progress, and full-text review is expected to be completed by September 2025. ConclusionsThis scoping review will systematically map recent evidence on surrogate SDM in the context of LST decisions for patients who are critically ill. By synthesizing diverse studies, it will identify challenges faced by surrogates and summarize existing interventions that aim to improve SDM processes. The findings are expected to inform future interventions and policies and advance patient- and family-centered care in critical care settings.

Medicine, Computer applications to medicine. Medical informatics
DOAJ Open Access 2026
Rates of Drug-Induced Uveitis: A Review by Medication Class

Jasti R, Wang Z, Zhou L et al.

Raghuram Jasti,1,* Zhenghao Wang,1,* Lucy Zhou,1 Baotram V Nguyen,1 Meghan K Berkenstock2 1Department of Surgery, Drexel University College of Medicine, Philadelphia, PA, USA; 2Division of Ocular Immunology, Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, MD, USA*These authors contributed equally to this workCorrespondence: Meghan K Berkenstock, Division of Ocular Immunology, Wilmer Eye Institute, The Johns Hopkins School of Medicine, 600 N. Wolfe St., Maumenee Building Third Floor, Baltimore, MD, 21087, USA, Tel +1 410 894 0480, Fax +1 410 893 9796, Email mberken2@jhmi.eduPurpose: Drug-induced uveitis is an uncommon but clinically important cause of intraocular inflammation across diverse systemic and ophthalmic therapies. As the use of biologics, targeted agents, and chronic ocular medications expands, clinicians need drug class–specific data on risk, phenotype, and clinical course. In this study, we synthesize and review the published literature on drug-induced uveitis, summarizing reported incidence, clinical phenotypes, latency to onset, and time to resolution by medication class, route of administration, and patient-level factors.Methods: A systematic literature search was conducted with PubMed, Scopus, and the Cochrane Library in September 2025 using terms related to drug-induced and medication-associated uveitis. Articles were included if they discussed human cases of noninfectious uveitis attributed to a specific medication or drug class and were written in English. Data analysis was performed to assess relationships between medication class, medication exposure time, uveitis location, uveitis treatment, and uveitis resolution time.Results: 317 articles with 690 unique patient cases met inclusion criteria for case-level data analysis. The mean age at drug-induced uveitis onset was 54.4 years; most patients were female, 63.4% had bilateral disease, and 74.8% had anterior uveitis. Mean exposure time from first dose to uveitis onset was 197.2 days (SD 497.6; range, 0– 6205), and mean resolution time was 61.0 days (SD 157.4; range, 1– 2520). Antineoplastics (29.1%), vaccines (15.8%), antibiotics (13.0%), intraocular pressure–lowering drops (11.9%), bisphosphonates (9.7%), vascular endothelial growth factor (VEGF) inhibitors (6.4%), antivirals (5.1%), and disease-modifying antirheumatic drugs (4.3%) were the most frequently implicated classes.Conclusion: Drug-induced uveitis, although rare, represents a broad array of presentations, mechanisms, and clinical course. As systemic and targeted therapeutic use continues to expand, understanding the clinical presenation and course will help patient outcomes and minimize vision-threatening risks.Keywords: uveitis, drug-induced uveitis, medication adverse effects, ocular complications

Ophthalmology
arXiv Open Access 2025
Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis

Ahan Prasannakumar Shetty

Machine translation has become a critical tool in bridging linguistic gaps, especially between languages as diverse as English and Hindi. This paper comprehensively evaluates various machine translation models for translating between English and Hindi. We assess the performance of these models using a diverse set of automatic evaluation metrics, both lexical and machine learning-based metrics. Our evaluation leverages an 18000+ corpus of English Hindi parallel dataset and a custom FAQ dataset comprising questions from government websites. The study aims to provide insights into the effectiveness of different machine translation approaches in handling both general and specialized language domains. Results indicate varying performance levels across different metrics, highlighting strengths and areas for improvement in current translation systems.

en cs.CL, cs.LG
arXiv Open Access 2025
Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Md. Tanzib Hosain, Rajan Das Gupta, Md. Kishor Morol

In this work, we provide DZEN, a dataset of parallel Dzongkha and English test questions for Bhutanese middle and high school students. The over 5K questions in our collection span a variety of scientific topics and include factual, application, and reasoning-based questions. We use our parallel dataset to test a number of Large Language Models (LLMs) and find a significant performance difference between the models in English and Dzongkha. We also look at different prompting strategies and discover that Chain-of-Thought (CoT) prompting works well for reasoning questions but less well for factual ones. We also find that adding English translations enhances the precision of Dzongkha question responses. Our results point to exciting avenues for further study to improve LLM performance in Dzongkha and, more generally, in low-resource languages. We release the dataset at: https://github.com/kraritt/llm_dzongkha_evaluation.

en cs.CL
DOAJ Open Access 2025
An overview of the treatment interventions and assessment of fear-avoidance for chronic musculoskeletal pain in adults: A scoping review protocol.

Sam Tan, Anju Jaggi, Alex Tasker et al.

<h4>Introduction</h4>The Fear-Avoidance (FA) model aims to explain how an acute pain experience can develop into a persistent state. The FA model considers five core components: kinesiophobia, pain-related fear, catastrophisation, victimisation, and interpersonal social environment. Amongst these, kinesiophobia, tends to dominate the literature on chronic musculoskeletal pain. As a result, current reviews have not considered the other core components of the FA model when exploring its interventions. Moreover, several synonyms of the term kinesiophobia is not reflected in their search strategies. Coupled with the preference of particular study designs and outcome measures, this scoping review aims to provide and characterise an overview of treatment interventions that consider all study designs, relevant outcome measures, FA components, and FA component synonyms.<h4>Methods and analysis</h4>Eligible studies will be in English or with an available English translation from 1970 onwards. Databases to be searched include Cochrane Central Register of Controlled Trials (CENTRAL), MEDLINE, Embase, The Allied and Complementary Database (AMED), PEDro, Web of Science, and grey literature. We will include studies involving participants ≥18 years old with chronic musculoskeletal pain, and interventions targeting FA and/or its components. Three review authors will independently screen papers using preestablished eligibility criteria and conduct assessments of risk of bias, with a fourth independent researcher employed to resolve disagreements where found. Qualitative synthesis techniques will be used to characterise the interventions. Patient and Public Involvement (PPI) has been utilised to develop this protocol and will be conducted following completion of the systematic review to discuss and reflect on the findings.<h4>Ethics and dissemination</h4>This systematic review does not require ethical approval as existing data will be used and the PPI to be conducted is an involvement activity rather than study data. The results will be disseminated through a peer-reviewed journal and via national and international conferences.<h4>Open science framework registration number</h4>This protocol is registered on Open Science Framework: https://doi.org/10.17605/OSF.IO/NR37A.

Medicine, Science
DOAJ Open Access 2025
Overcoming Tiki Pop: Polynesian Translingual Literature Against Cultural Exoticization

Semyon S. Galaktionov, Zoya G. Proshina

This study analyzes Tiki Pop as a cultural phenomenon of the 20th century and provides insight into how Polynesian translingual literature helps eliminate stereotypes imposed on indigenous cultures in the region. The author traces the history of Tiki Pop, from its inception in the 1930s to its decline at the turn of the century, and argues that this phenomenon was a byproduct of colonial times that affected the way Western audiences perceive Polynesia. This exoticizing view of the region is then contrasted to the way it is presented in Polynesian translingual literature. The author then delineates several linguistic devices that are utilized by indigenous ambilingual authors in order to outline their identity and combat stereotypical conceptualization of local cultures.

Philology. Linguistics
arXiv Open Access 2024
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2

Aniket Tathe, Anand Kamble, Suyash Kumbharkar et al.

Speech has long been a barrier to effective communication and connection, persisting as a challenge in our increasingly interconnected world. This research paper introduces a transformative solution to this persistent obstacle an end-to-end speech conversion framework tailored for Hindi-to-English translation, culminating in the synthesis of English audio. By integrating cutting-edge technologies such as XLSR Wav2Vec2 for automatic speech recognition (ASR), mBART for neural machine translation (NMT), and a Text-to-Speech (TTS) synthesis component, this framework offers a unified and seamless approach to cross-lingual communication. We delve into the intricate details of each component, elucidating their individual contributions and exploring the synergies that enable a fluid transition from spoken Hindi to synthesized English audio.

en eess.AS, cs.AI
arXiv Open Access 2024
Benchmarking terminology building capabilities of ChatGPT on an English-Russian Fashion Corpus

Anastasiia Bezobrazova, Miriam Seghiri, Constantin Orasan

This paper compares the accuracy of the terms extracted using SketchEngine, TBXTools and ChatGPT. In addition, it evaluates the quality of the definitions produced by ChatGPT for these terms. The research is carried out on a comparable corpus of fashion magazines written in English and Russian collected from the web. A gold standard for the fashion terminology was also developed by identifying web pages that can be harvested automatically and contain definitions of terms from the fashion domain in English and Russian. This gold standard was used to evaluate the quality of the extracted terms and of the definitions produced. Our evaluation shows that TBXTools and SketchEngine, while capable of high recall, suffer from reduced precision as the number of terms increases, which affects their overall performance. Conversely, ChatGPT demonstrates superior performance, maintaining or improving precision as more terms are considered. Analysis of the definitions produced by ChatGPT for 60 commonly used terms in English and Russian shows that ChatGPT maintains a reasonable level of accuracy and fidelity across languages, but sometimes the definitions in both languages miss crucial specifics and include unnecessary deviations. Our research reveals that no single tool excels universally; each has strengths suited to particular aspects of terminology extraction and application.

en cs.CL
arXiv Open Access 2024
CroissantLLM: A Truly Bilingual French-English Language Model

Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro et al.

We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French Language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases, and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models, and strong translation models. We evaluate our model through the FMTI framework, and validate 81 % of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models.

en cs.CL, cs.LG
DOAJ Open Access 2024
Human versus Neural Machine Translation Creativity: A Study on Manipulated MWEs in Literature

Gloria Corpas Pastor, Laura Noriega-Santiáñez

In the digital era, the (r)evolution of neural machine translation (NMT) has reshaped both the market and translators’ workflow. However, the adoption of this technology has not fully reached the creative field of literary translation. Against this background, this study aims to explore to what extent NMT systems can be used to translate the creative challenges posed by idioms, specifically manipulated multiword expressions (MWEs) found in literary texts. To carry out this pilot study, five manipulated MWEs were selected from a fantasy novel and machine-translated (English > Spanish) by four NMT systems (DeepL, Google Translate, Bing Translator, and Reverso). Then, each NMT output as well as a human translation are assessed by six professional literary translators by using a human evaluation sheet. Based on these results, the creativity obtained in each translation method was calculated. Despite the satisfactory performance of both DeepL and Google Translate, HT creativity was highly superior in almost all manipulated MWEs. To the best of our knowledge, this paper not only contributes to the ongoing study of NMT applied to literature, but it is also one of the few studies that delve into the almost unexplored field of assessing creativity in neural machine-translated MWEs.

Information technology
S2 Open Access 2014
Implementing electronic health records in hospitals: a systematic literature review

A. Boonstra, A. Versluis, Janita F. J. Vos

BackgroundThe literature on implementing Electronic Health Records (EHR) in hospitals is very diverse. The objective of this study is to create an overview of the existing literature on EHR implementation in hospitals and to identify generally applicable findings and lessons for implementers.MethodsA systematic literature review of empirical research on EHR implementation was conducted. Databases used included Web of Knowledge, EBSCO, and Cochrane Library. Relevant references in the selected articles were also analyzed. Search terms included Electronic Health Record (and synonyms), implementation, and hospital (and synonyms). Articles had to meet the following requirements: (1) written in English, (2) full text available online, (3) based on primary empirical data, (4) focused on hospital-wide EHR implementation, and (5) satisfying established quality criteria.ResultsOf the 364 initially identified articles, this study analyzes the 21 articles that met the requirements. From these articles, 19 interventions were identified that are generally applicable and these were placed in a framework consisting of the following three interacting dimensions: (1) EHR context, (2) EHR content, and (3) EHR implementation process.ConclusionsAlthough EHR systems are anticipated as having positive effects on the performance of hospitals, their implementation is a complex undertaking. This systematic review reveals reasons for this complexity and presents a framework of 19 interventions that can help overcome typical problems in EHR implementation. This framework can function as a reference for implementers in developing effective EHR implementation strategies for hospitals.

310 sitasi en Medicine
arXiv Open Access 2023
Improving Speech Recognition for African American English With Audio Classification

Shefali Garg, Zhouyuan Huo, Khe Chai Sim et al.

Automatic speech recognition (ASR) systems have been shown to have large quality disparities between the language varieties they are intended or expected to recognize. One way to mitigate this is to train or fine-tune models with more representative datasets. But this approach can be hindered by limited in-domain data for training and evaluation. We propose a new way to improve the robustness of a US English short-form speech recognizer using a small amount of out-of-domain (long-form) African American English (AAE) data. We use CORAAL, YouTube and Mozilla Common Voice to train an audio classifier to approximately output whether an utterance is AAE or some other variety including Mainstream American English (MAE). By combining the classifier output with coarse geographic information, we can select a subset of utterances from a large corpus of untranscribed short-form queries for semi-supervised learning at scale. Fine-tuning on this data results in a 38.5% relative word error rate disparity reduction between AAE and MAE without reducing MAE quality.

en eess.AS, cs.CL
arXiv Open Access 2023
Marathi-English Code-mixed Text Generation

Dhiraj Amin, Sharvari Govilkar, Sagar Kulkarni et al.

Code-mixing, the blending of linguistic elements from distinct languages to form meaningful sentences, is common in multilingual settings, yielding hybrid languages like Hinglish and Minglish. Marathi, India's third most spoken language, often integrates English for precision and formality. Developing code-mixed language systems, like Marathi-English (Minglish), faces resource constraints. This research introduces a Marathi-English code-mixed text generation algorithm, assessed with Code Mixing Index (CMI) and Degree of Code Mixing (DCM) metrics. Across 2987 code-mixed questions, it achieved an average CMI of 0.2 and an average DCM of 7.4, indicating effective and comprehensible code-mixed sentences. These results offer potential for enhanced NLP tools, bridging linguistic gaps in multilingual societies.

en cs.CL

Halaman 21 dari 477352