Results for "Greek philology and language"

Showing 20 of ~1,458,131 results · from DOAJ, arXiv, Semantic Scholar, CrossRef

arXiv Open Access 2026
Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Jinghan Cao, Yu Ma, Xinjin Li et al.

Large Language Models achieve remarkable performance but incur substantial computational costs unsuitable for resource-constrained deployments. This paper presents the first comprehensive task-specific efficiency analysis comparing 16 language models across five diverse NLP tasks. We introduce the Performance-Efficiency Ratio (PER), a novel metric integrating accuracy, throughput, memory, and latency through geometric mean normalization. Our systematic evaluation reveals that small models (0.5--3B parameters) achieve superior PER scores across all given tasks. These findings establish quantitative foundations for deploying small models in production environments prioritizing inference efficiency over marginal accuracy gains.
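The abstract does not give the exact PER formula; a minimal sketch of one plausible reading (per-metric normalization to (0, 1] with cost metrics inverted, combined by a geometric mean; the reference values and normalization scheme are assumptions, not the paper's definition) could look like:

```python
import math

def per_score(accuracy, throughput, memory_gb, latency_ms,
              max_throughput, min_memory_gb, min_latency_ms):
    """Performance-Efficiency Ratio sketch: normalize each metric to (0, 1]
    (costs are inverted so that higher is always better), then combine the
    four normalized values with a geometric mean."""
    norm = [
        accuracy,                      # already in [0, 1]
        throughput / max_throughput,   # higher throughput is better
        min_memory_gb / memory_gb,     # lower memory is better
        min_latency_ms / latency_ms,   # lower latency is better
    ]
    return math.prod(norm) ** (1 / len(norm))
```

Under this reading, a small model with slightly lower accuracy but much better throughput, memory, and latency scores higher than a large model, matching the abstract's finding.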

en cs.CL, cs.LG
arXiv Open Access 2025
Language-conditioned world model improves policy generalization by reading environmental descriptions

Anh Nguyen, Stefan Lee

To interact effectively with humans in the real world, it is important for agents to understand language that describes the dynamics of the environment--that is, how the environment behaves--rather than just task instructions specifying "what to do". Understanding this dynamics-descriptive language is important for human-agent interaction and agent behavior. Recent work addresses this problem using a model-based approach: language is incorporated into a world model, which is then used to learn a behavior policy. However, these existing methods either do not demonstrate policy generalization to unseen games or rely on limiting assumptions, for instance that the latency induced by inference-time planning is tolerable for the target task or that expert demonstrations are available. Expanding on this line of research, we focus on improving policy generalization from a language-conditioned world model while dropping these assumptions. We propose a model-based reinforcement learning approach in which a language-conditioned world model is trained through interaction with the environment, and a policy is learned from this model--without planning or expert demonstrations. Our method, the Language-aware Encoder for Dreamer World Model (LED-WM), is built on top of DreamerV3. LED-WM features an observation encoder that uses an attention mechanism to explicitly ground language descriptions to entities in the observation. We show that policies trained with LED-WM generalize more effectively to unseen games described by novel dynamics and language compared to other baselines in several settings in two environments: MESSENGER and MESSENGER-WM. To highlight how the policy can leverage the trained world model before real-world deployment, we demonstrate that the policy can be improved through fine-tuning on synthetic test trajectories generated by the world model.
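The entity-grounding idea can be illustrated with a toy cross-attention step. This is not the authors' LED-WM encoder, just a minimal numpy sketch of attending from observation entities to language tokens and appending the attended summary to each entity's features:

```python
import numpy as np

def ground_language(entity_emb, token_emb):
    """Each entity embedding attends over the language-token embeddings
    (dot-product attention + softmax); the attended language summary is
    concatenated onto the entity features."""
    scores = entity_emb @ token_emb.T            # (n_entities, n_tokens)
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)      # softmax over tokens
    context = attn @ token_emb                   # per-entity language summary
    return np.concatenate([entity_emb, context], axis=1)
```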

en cs.CL, cs.LG
arXiv Open Access 2025
Classifying German Language Proficiency Levels Using Large Language Models

Elias-Leander Ahlers, Witold Brunsmann, Malte Schilling

Assessing language proficiency is essential for education, as it enables instruction tailored to learners' needs. This paper investigates the use of Large Language Models (LLMs) for automatically classifying German texts into proficiency levels according to the Common European Framework of Reference for Languages (CEFR). To support robust training and evaluation, we construct a diverse dataset by combining multiple existing CEFR-annotated corpora with synthetic data. We then evaluate prompt-engineering strategies, fine-tuning of a LLaMA-3-8B-Instruct model, and a probing-based approach that utilizes the internal neural state of the LLM for classification. Our results show a consistent performance improvement over prior methods, highlighting the potential of LLMs for reliable and scalable CEFR classification.
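As an illustration of the probing idea (classifying directly from internal hidden states), here is a toy nearest-centroid probe. The paper's actual probe architecture and feature extraction are not specified in the abstract, so the classifier choice here is an assumption:

```python
import numpy as np

class NearestCentroidProbe:
    """Toy probe over LLM hidden states: fit one centroid per CEFR level,
    then classify a new hidden state by its nearest centroid."""
    def fit(self, hidden_states, labels):
        self.centroids = {
            lvl: hidden_states[labels == lvl].mean(axis=0)
            for lvl in np.unique(labels)
        }
        return self

    def predict(self, hidden_states):
        levels = list(self.centroids)
        dists = np.stack([
            np.linalg.norm(hidden_states - self.centroids[lvl], axis=1)
            for lvl in levels
        ])
        return np.array(levels)[dists.argmin(axis=0)]
```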

en cs.CL, cs.AI
arXiv Open Access 2025
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language

Kristin Gnadt, David Thulke, Simone Kopeinik et al.

In recent years, various methods have been proposed to evaluate gender bias in large language models (LLMs). A key challenge lies in the transferability of bias measurement methods initially developed for the English language when applied to other languages. This work aims to contribute to this research strand by presenting five German datasets for gender bias evaluation in LLMs. The datasets are grounded in well-established concepts of gender bias and are accessible through multiple methodologies. Our findings, reported for eight multilingual LLMs, reveal unique challenges associated with gender bias in German, including the ambiguous interpretation of male occupational terms and the influence of seemingly neutral nouns on gender perception. This work contributes to the understanding of gender bias in LLMs across languages and underscores the necessity for tailored evaluation frameworks.

en cs.CL, cs.LG
arXiv Open Access 2025
Effectiveness of Chain-of-Thought in Distilling Reasoning Capability from Large Language Models

Cong-Thanh Do, Rama Doddipatla, Kate Knill

Chain-of-Thought (CoT) prompting is a widely used method to improve the reasoning capability of Large Language Models (LLMs). More recently, CoT has been leveraged in Knowledge Distillation (KD) to transfer reasoning capability from a larger LLM to a smaller one. This paper examines the role of CoT in distilling the reasoning capability from larger LLMs to smaller LLMs using white-box KD, analysing its effectiveness in improving the performance of the distilled models for various natural language reasoning and understanding tasks. We conduct white-box KD experiments using LLMs from the Qwen and Llama2 families, employing CoT data from the CoT-Collection dataset. The distilled models are then evaluated on natural language reasoning and understanding tasks from the BIG-Bench-Hard (BBH) benchmark, which presents complex challenges for smaller LLMs. Experimental results demonstrate the role of CoT in improving white-box KD effectiveness, enabling the distilled models to achieve better average performance in natural language reasoning and understanding tasks from BBH.
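White-box KD typically minimizes a divergence between the teacher's and student's token distributions over the vocabulary. A common formulation (temperature-scaled forward KL, which may differ from this paper's exact objective) can be sketched as:

```python
import numpy as np

def softmax(logits, T=1.0):
    """Temperature-scaled softmax over the last axis."""
    z = logits / T
    z -= z.max(axis=-1, keepdims=True)  # numerical stability
    p = np.exp(z)
    return p / p.sum(axis=-1, keepdims=True)

def white_box_kd_loss(student_logits, teacher_logits, T=2.0):
    """Forward KL from the teacher distribution to the student distribution,
    averaged over positions and scaled by T^2 (the usual gradient correction)."""
    p_t = softmax(teacher_logits, T)
    log_ratio = np.log(p_t) - np.log(softmax(student_logits, T))
    return float((p_t * log_ratio).sum(axis=-1).mean() * T ** 2)
```

The loss is zero when student and teacher logits agree and positive otherwise, so minimizing it pulls the student's (CoT-augmented) predictions toward the teacher's.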

en cs.CL
arXiv Open Access 2025
Safer in Translation? Presupposition Robustness in Indic Languages

Aadi Palnitkar, Arjun Suresh, Rishi Rajesh et al.

More and more people are turning to large language models (LLMs) for healthcare advice and consultation, making it important to gauge the efficacy and accuracy of LLM responses to such queries. While pre-existing medical benchmark literature seeks to accomplish this task, these benchmarks are almost universally in English, leaving a notable gap in the literature on multilingual LLM evaluation. In this work, we help address this gap with Cancer-Myth-Indic, an Indic-language benchmark built by translating a 500-item subset of Cancer-Myth, sampled evenly across its original categories, into five under-served but widely used languages of the subcontinent (500 per language; 2,500 translated items total). Native-speaker translators followed a style guide for preserving implicit presuppositions in translation; items feature false presuppositions relating to cancer. We evaluate several popular LLMs under this presupposition stress.

en cs.CL
DOAJ Open Access 2024
„Thia Sinensis” Pierre’a Petita. Przekład wybranych fragmentów siedemnastowiecznego poematu o herbacie

Mikołaj Marcinkowski

The aim of this paper is to present a Polish translation of selected fragments of the Latin didactic epic poem Thia Sinensis, composed by Pierre Petit, a 17th-century French scholar. The poem is an interesting example of the Neo-Latin texts that introduce the subject of exotic beverages such as tea, coffee, and cacao, and it may be investigated as a valuable source for the reception of distant cultures in 17th-century Europe.

Philology. Linguistics, Greek language and literature. Latin language and literature
arXiv Open Access 2024
Auxiliary task demands mask the capabilities of smaller language models

Jennifer Hu, Michael C. Frank

Developmental psychologists have argued about when cognitive capacities such as language understanding or theory of mind emerge. These debates often hinge on the concept of "task demands" -- the auxiliary challenges associated with performing a particular evaluation -- that may mask the child's underlying ability. The same issues arise when measuring the capacities of language models (LMs): performance on a task is a function of the model's underlying knowledge, combined with the model's ability to interpret and perform the task given its available resources. Here, we show that for analogical reasoning, reflective reasoning, word prediction, and grammaticality judgments, evaluation methods with greater task demands yield lower performance than evaluations with reduced demands. This "demand gap" is most pronounced for models with fewer parameters and less training data. Our results illustrate that LM performance should not be interpreted as a direct indication of intelligence (or lack thereof), but as a reflection of capacities seen through the lens of researchers' design choices.
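One way evaluations differ in task demands: asking the model to generate an answer freely (high demands) versus scoring each candidate answer's probability and picking the best (reduced demands). A sketch of the reduced-demand protocol, with `lm_logprob` standing in for any model's character-level log-probability function (a hypothetical interface, not a specific library's API):

```python
def score_option(lm_logprob, prompt, option):
    """Total log-probability of an answer option, scored one character at a
    time conditioned on the prompt plus the option's prefix."""
    return sum(lm_logprob(prompt + option[:i], option[i])
               for i in range(len(option)))

def pick_answer(lm_logprob, prompt, options):
    """Reduced-demand evaluation: choose the most probable candidate answer
    instead of asking the model to produce one in free-form generation."""
    return max(options, key=lambda o: score_option(lm_logprob, prompt, o))
```

Smaller models often look more capable under this protocol than under free generation, which is exactly the "demand gap" the abstract describes.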

en cs.CL, cs.AI
arXiv Open Access 2024
Large Language Models Lack Understanding of Character Composition of Words

Andrew Shin, Kunitake Kaneko

Large language models (LLMs) have demonstrated remarkable performance on a wide range of natural language tasks. Yet, their successes have been largely restricted to tasks concerning words, sentences, or documents, and it remains questionable how much they understand the minimal units of text, namely characters. In this paper, we examine contemporary LLMs' ability to understand the character composition of words, and show that most of them fail to reliably carry out even simple tasks that humans handle flawlessly. We analyze their behavior in comparison to token-level performance, and discuss potential directions for future research.
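The "simple tasks" in question are character-level probes. A few examples of the kind of ground-truth checks such a benchmark might use (the paper's exact task list may differ):

```python
def char_tasks(word):
    """Character-composition probes that are trivial for humans but hard
    for token-based LLMs: length, reversal, counting, membership."""
    return {
        "length": len(word),          # how many characters?
        "reversed": word[::-1],       # spell it backwards
        "count_e": word.count("e"),   # how many 'e's?
        "contains_z": "z" in word,    # does it contain a 'z'?
    }
```

Because LLMs operate on subword tokens rather than characters, a word like "cheese" may be a single opaque token to the model, making even these checks unreliable.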

en cs.CL
arXiv Open Access 2024
A Capabilities Approach to Studying Bias and Harm in Language Technologies

Hellina Hailu Nigatu, Zeerak Talat

Mainstream Natural Language Processing (NLP) research has ignored the majority of the world's languages. In moving from excluding the majority of the world's languages to blindly adopting what we make for English, we first risk importing the same harms we have at best mitigated and at least measured for English. However, in evaluating and mitigating harms arising from adopting new technologies into such contexts, we often disregard (1) the actual community needs of Language Technologies, and (2) biases and fairness issues within the context of the communities. In this extended abstract, we consider fairness, bias, and inclusion in Language Technologies through the lens of the Capabilities Approach. The Capabilities Approach centers on what people are capable of achieving, given their intersectional social, political, and economic contexts instead of what resources are (theoretically) available to them. We detail the Capabilities Approach, its relationship to multilingual and multicultural evaluation, and how the framework affords meaningful collaboration with community members in defining and measuring the harms of Language Technologies.

en cs.CL, cs.CY
arXiv Open Access 2024
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation

Evangelia Gkritzali, Panagiotis Kaliosis, Sofia Galanaki et al.

In the vast majority of academic and scientific domains, LaTeX has established itself as the de facto standard for typesetting complex mathematical equations and formulae. However, LaTeX's complex syntax and code-like appearance present accessibility barriers for individuals with disabilities, as well as those unfamiliar with coding conventions. In this paper, we address this challenge by developing a novel speech-to-LaTeX equation system designed specifically for the Greek language. We propose an end-to-end system that harnesses Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) techniques to let users verbally dictate mathematical expressions and equations in natural language, which are subsequently converted into LaTeX format. We present the architecture and design principles of our system, highlighting key components such as the ASR engine and the LLM-based, prompt-driven equation-generation mechanism, as well as a custom evaluation metric employed throughout the development process. We have made our system open source and available at https://github.com/magcil/greek-speech-to-math.
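The described pipeline can be sketched as a composition of an ASR stage and a prompted LLM stage. The component interfaces and prompt below are illustrative assumptions, not the released system's API:

```python
def speech_to_latex(audio, asr, llm):
    """End-to-end sketch: `asr` transcribes the spoken Greek expression,
    then `llm` is prompted to emit the corresponding LaTeX. Both are
    caller-supplied callables (hypothetical interfaces)."""
    transcript = asr(audio)
    prompt = (
        "Convert this spoken mathematical expression into LaTeX. "
        f"Reply with LaTeX only.\n\nTranscript: {transcript}\nLaTeX:"
    )
    return llm(prompt).strip()
```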

en cs.CL, cs.AI
arXiv Open Access 2024
Is Self-knowledge and Action Consistent or Not: Investigating Large Language Model's Personality

Yiming Ai, Zhiwei He, Ziyin Zhang et al.

In this study, we delve into the validity of conventional personality questionnaires in capturing the human-like personality traits of Large Language Models (LLMs). Our objective is to assess the congruence between the personality traits LLMs claim to possess and their demonstrated tendencies in real-world scenarios. By conducting an extensive examination of LLM outputs against observed human response patterns, we aim to understand the disjunction between self-knowledge and action in LLMs.

en cs.CL, cs.CY
DOAJ Open Access 2023
Un recurso jurídico de apelación

María Delia Buisel

Following chapters 21-28 of the Acts of the Apostles, we treat the conflict between the apostle Paul and the authorities of both the Temple of Jerusalem and the Tribunal of the Sanhedrin, who seek his condemnation to death. We examine Paul's pleas before both, his exculpation on grounds of malicious accusation and lack of evidence, their lack of standing to judge him since, as a Roman citizen, he answered to Roman authorities, the schemes of his accusers, his two trials before the imperial procurators in Caesarea, his two years of imprisonment, his appeal to Caesar, his hearing before King Herod Agrippa II, his arrival in Rome after an eventful voyage, and the positive outcome of his appeal. We also incorporate the interesting, and rarely considered, commentary of F. de Quevedo.

Philology. Linguistics, Greek language and literature. Latin language and literature
DOAJ Open Access 2023
Cicero’s De Divinatione in Religious and Historical Perspective

Elisabeth Begemann

The article argues that the imperial expansion of the Late Roman Republic is reflected in Cicero’s philosophical texts by example of De divinatione. The expansion of Rome made it necessary to consider what it meant to be Roman and what Roman practices are, especially confronted with other, alien practices that might seem similar. Cicero offers his texts as an admonition to consider religious practices – here: divination – the Romans might encounter in the provinces and how to deal with them, to consider their usefulness while also bearing in mind the latent danger in not doing them right or exceeding the religious need which upholds the pax deorum. Being put in the context of the expanding empire, the article makes sense of the multiple non-Roman examples cited especially in Book 1 of the treatise De divinatione.

Philology. Linguistics, Greek language and literature. Latin language and literature
arXiv Open Access 2023
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition

Tzu-Ting Yang, Hsin-Wei Wang, Berlin Chen

In recent years, end-to-end speech recognition has emerged as a technology that integrates the acoustic, pronunciation dictionary, and language model components of the traditional Automatic Speech Recognition (ASR) pipeline, making human-like recognition possible without building a pronunciation dictionary in advance. However, due to the relative scarcity of code-switching training data, the performance of ASR models tends to degrade drastically when encountering this phenomenon. Most past studies have simplified the learning complexity of the model by splitting the code-switching task into multiple single-language tasks and then learning the domain-specific knowledge of each language separately. In contrast, in this paper we introduce language identification information into the middle layer of the ASR model's encoder. We aim to generate acoustic features that imply language distinctions in a more implicit way, reducing the model's confusion when dealing with language switching.
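The proposed objective pairs the main CTC loss with an auxiliary language-ID CTC loss computed at an intermediate encoder layer. A sketch of the target construction and the combined loss (the vocabulary-based tagging heuristic and the weight `lam` are assumptions, not the paper's stated values):

```python
def lid_targets(tokens, lang_a_vocab):
    """Derive language-ID targets from a code-switched transcript by
    tagging each token with its language (toy membership heuristic)."""
    return ["A" if t in lang_a_vocab else "B" for t in tokens]

def total_loss(asr_ctc, inter_lid_ctc, lam=0.3):
    """Combined objective: the main CTC loss on transcripts plus a weighted
    intermediate CTC loss on the language-ID tag sequence."""
    return (1 - lam) * asr_ctc + lam * inter_lid_ctc
```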

en cs.CL, cs.SD
arXiv Open Access 2023
UzbekTagger: The rule-based POS tagger for Uzbek language

Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev et al.

This paper presents a part-of-speech (POS) annotated dataset and tagger tool for the low-resource Uzbek language. The dataset includes 12 tags, which were used to develop a rule-based POS tagger. The corpus text used in the annotation process was balanced over 20 different fields to ensure its representativeness. Uzbek is an agglutinative language, so most words in an Uzbek sentence are formed by adding suffixes, which makes it difficult for a POS tagger to find the stems of words and the right part of speech they belong to. The methodology proposed in this research is stemming of the words with an affix/suffix-stripping approach, backed by a database of the stem forms of Uzbek words. The tagger was tested on the annotated dataset and showed high accuracy in identifying and tagging parts of speech in Uzbek text. This newly presented dataset and tagger can be used for a variety of natural language processing tasks such as language modeling, machine translation, and text-to-speech synthesis. The dataset is the first of its kind to be made publicly available for Uzbek, and the POS tagger can also serve as a base for other closely related Turkic languages.
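The affix/suffix-stripping idea can be sketched with a toy rule table and stem database. The real UzbekTagger uses a far richer rule set and a 12-tag tagset, so the suffixes and example stems below are illustrative only:

```python
# Toy suffix rules: (suffix, POS tag implied by the suffix).
SUFFIX_RULES = [
    ("larni", "NOUN"),  # plural + accusative
    ("lar", "NOUN"),    # plural
    ("ning", "NOUN"),   # genitive
    ("moq", "VERB"),    # infinitive
]

def strip_and_tag(word, stems):
    """Strip the longest matching suffix whose remaining stem appears in
    the stem database, and return (stem, POS tag); otherwise (word, UNK)."""
    for suffix, tag in sorted(SUFFIX_RULES, key=lambda r: -len(r[0])):
        stem = word[: len(word) - len(suffix)]
        if word.endswith(suffix) and stem in stems:
            return stem, tag
    return word, "UNK"
```

For example, "kitoblar" ("books") strips "lar" to reach the stem "kitob" and is tagged as a noun, illustrating why a stem database is central to the approach.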

en cs.CL

Page 10 of 72,907