Hasil "Comparative grammar"

DOAJ Open Access 2025

La punteggiatura italiana nella traduzione. Analisi comparativa della Costituzione italiana nella versione originale e dell’apposita traduzione in polacco

Katarzyna Maniowska

Quanto pesa una virgola nel testo? Quanto vale l’aggiunta o l’omissione della virgola nella traduzione? Un diverso uso della punteggiatura può comportare differenze semantiche? La lettura parallela del testo della Costituzione della Repubblica Italiana e della sua traduzione polacca ci offrirà il punto di partenza per ulteriori riflessioni sul significato di questo elemento tanto labile. L’analisi comparativa verrà preceduta da brevi cenni sullo sviluppo dell’interpunzione in italiano e in polacco, nonché da osservazioni sugli usi diversi della punteggiatura in entrambe le lingue. Dal momento che l’interpunzione italiana è di tipo prosodico-sintattica, mentre il polacco opta per l’interpunzione basata su rigide regole sintattiche, quest’incongruenza dei sistemi di interpunzione è rilevante anche dal punto di vista traduttivo. Infatti, una corretta trasposizione del senso viene effettuata anche attraverso la punteggiatura. La Costituzione della Repubblica Italiana appartiene alla categoria dei testi vincolanti, perciò richiede dal traduttore la massima rigidità interpretativa. Si ipotizza che, nonostante diversi criteri di applicazione della virgola in entrambe le lingue, sia il testo originale che il testo tradotto esprimano lo stesso significato senza maggiori deviazioni di senso. Attraverso esempi tratti dal prototesto si individueranno possibili punti di incontro nel sistema di interpunzione italiano e polacco, nondimeno saranno evidenziate le differenze inconciliabili che derivano dalla diversa concezione della punteggiatura in ambedue le lingue.

Romanic languages, Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

DOAJ Open Access 2025

Gender agreement mismatch in sandwiched coordination

Marija Brašić

This paper reports on an experimental study on noun-verb agreement in so-called sandwiched configurations, where a coordinated subject (Goal) is situated between two participles (Probes). As reported by Marušič & Nevins (2020), in such configurations the two conjuncts of a coordinate subject control agreement with different probes, each one with the linearly closest probe – the first conjunct with the probe preceding the subject and the second conjunct with the probe following the subject. This study in addition confirms that two participles show agreement with the single, hierarchically highest conjunct, as well as having the first probe reflecting gender features of the closest (hierarchically highest) noun and the second one displaying default gender agreement. Hence, the configurations mirror exactly the agreement strategies found in simple, pre- and postverbal, coordinate structures. The acceptability rating experiment of Marušič & Nevins (2020) is conceptually replicated in Bosnian-Croatian-Montenegrin-Serbian (BCMS), with results confirming their study and their conclusions. The distributed model of agreement (Arregi & Nevins 2012) considers a two-step Agree operation, the second step of which, Agree-Copy, taking place post-syntactically, before or after linearisation, thus allowing Linear order to have a direct effect on agreement.

Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

arXiv Open Access 2025

Proposing TAGbank as a Corpus of Tree-Adjoining Grammar Derivations

Jungyeul Park

The development of lexicalized grammars, particularly Tree-Adjoining Grammar (TAG), has significantly advanced our understanding of syntax and semantics in natural language processing (NLP). While existing syntactic resources like the Penn Treebank and Universal Dependencies offer extensive annotations for phrase-structure and dependency parsing, there is a lack of large-scale corpora grounded in lexicalized grammar formalisms. To address this gap, we introduce TAGbank, a corpus of TAG derivations automatically extracted from existing syntactic treebanks. This paper outlines a methodology for mapping phrase-structure annotations to TAG derivations, leveraging the generative power of TAG to support parsing, grammar induction, and semantic analysis. Our approach builds on the work of CCGbank, extending it to incorporate the unique structural properties of TAG, including its transparent derivation trees and its ability to capture long-distance dependencies. We also discuss the challenges involved in the extraction process, including ensuring consistency across treebank schemes and dealing with language-specific syntactic idiosyncrasies. Finally, we propose the future extension of TAGbank to include multilingual corpora, focusing on the Penn Korean and Penn Chinese Treebanks, to explore the cross-linguistic application of TAG's formalism. By providing a robust, derivation-based resource, TAGbank aims to support a wide range of computational tasks and contribute to the theoretical understanding of TAG's generative capacity.

en cs.CL

Detail Sumber

arXiv Open Access 2025

Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels

Sourya Dipta Das, Shubham Kumar, Kuldeep Yadav

Grammar competency estimation is essential for assessing linguistic proficiency in both written and spoken language; however, the spoken modality presents additional challenges due to its spontaneous, unstructured, and disfluent nature. Developing accurate grammar scoring models further requires extensive expert annotation, making large-scale data creation impractical. To address these limitations, we propose a zero-shot grammar competency estimation framework that leverages unlabeled data and Large Language Models (LLMs) without relying on manual labels. During training, we employ LLM-generated predictions on unlabeled data by using grammar competency rubric-based prompts. These predictions, treated as pseudo labels, are utilized to train a transformer-based model through a novel training framework designed to handle label noise effectively. We show that the choice of LLM for pseudo-label generation critically affects model performance and that the ratio of clean-to-noisy samples during training strongly influences stability and accuracy. Finally, a qualitative analysis of error intensity and score prediction confirms the robustness and interpretability of our approach. Experimental results demonstrate the efficacy of our approach in estimating grammar competency scores with high accuracy, paving the way for scalable, low-resource grammar assessment systems.

en cs.CL

Detail Sumber

S2 Open Access 2024

Роль переводческих трансформаций в акцентуации значения языковых единиц (на примере немецкого перевода романа «Harry Potter and the Chamber of Secrets»)

Анна Ильясовна Мухаметова

The article examines the role of translation transformations in comprehension of a text in a foreign language. The object of the research includes the original text of the novel "Harry Potter and the Chamber of Secrets" and its German version. The subject of the research is translation transformations used in the process of translating the English text into the German language. Methods of the research include comparative-contrastive analysis and contextual analysis. The results of the research revealed that in the German translation of the novel "Harry Potter and the Chamber of Secrets" mostly concretization (type of lexical transformation) and grammar replacements (grammatical transformations) are used. This proves different perception of the reality described in the literary work of art by the representatives of different cultures.

8 sitasi en

Detail DOI Sumber

S2 Open Access 2024

A Morphological Analysis of Micro-variations on Verbal Extensions in ciLungu, ciMambwe and ciNamwanga Languages

Pethias Siame, Felix Banda

Guthrie (1948) classifies ciLungu as M14, ciMambwe as M15 and ciNamwanga as M22. The study was undertaken to enrich the available scanty grammar by establishing and analysing the morphological micro-variations on verbal extensions and the accompanying vowel harmony. The descriptive framing informed the study. Qualitative design, descriptive approach, comparative, elicitation and document analysis methods were used. The study shows that micro-variations in the three languages exist in applicative, causative, reversive, stative, intensive, reciprocative, frequentative and perfective extensions. For instance, ciLungu and ciMambwe attach the semi-vowel [w] before the suffix marker [-ela] or [-ila] which is not applicable in ciNamwanga when forming applicative verbs. In addition, ciLungu uses the voiceless fricative glottal [h] in the suffix of long causatives and intensive extensions while ciMambwe and ciNamwanga use [i] which is realised as [y]. On the other hand, the study reveals that the reduplicative, passive and reflexive extensions have similar principles.

5 sitasi en

Detail DOI Sumber

arXiv Open Access 2024

Structural Optimization Ambiguity and Simplicity Bias in Unsupervised Neural Grammar Induction

Jinwook Park, Kangil Kim

Neural parameterization has significantly advanced unsupervised grammar induction. However, training these models with a traditional likelihood loss for all possible parses exacerbates two issues: 1) $\textit{structural optimization ambiguity}$ that arbitrarily selects one among structurally ambiguous optimal grammars despite the specific preference of gold parses, and 2) $\textit{structural simplicity bias}$ that leads a model to underutilize rules to compose parse trees. These challenges subject unsupervised neural grammar induction (UNGI) to inevitable prediction errors, high variance, and the necessity for extensive grammars to achieve accurate predictions. This paper tackles these issues, offering a comprehensive analysis of their origins. As a solution, we introduce $\textit{sentence-wise parse-focusing}$ to reduce the parse pool per sentence for loss evaluation, using the structural bias from pre-trained parsers on the same dataset. In unsupervised parsing benchmark tests, our method significantly improves performance while effectively reducing variance and bias toward overly simplistic parses. Our research promotes learning more compact, accurate, and consistent explicit grammars, facilitating better interpretability.

en cs.CL

Detail Sumber

S2 Open Access 2023

ChatGPT Evaluation: Can It Replace Grammarly and Quillbot Tools?

Hussein A. younis, Osamah Mohammed, Muthmainnah et al.

This study aims to assess the capabilities of ChatGPT in comparison to widely utilized writing assistance tools such as Grammarly and Quillbot. The evaluation involves an examination of ChatGPT's performance across multiple dimensions, encompassing grammar and style correction, plagiarism detection, and overall enhancement of writing quality. Employing a series of tests and comparative analyses, this research investigates the potential suitability of ChatGPT as a viable alternative to Grammarly and Quillbot in supporting writers and enhancing the quality of written content. The outcomes and insights derived from this evaluation offer valuable guidance for individuals and professionals seeking effective writing assistance tools. This article comprehensively reviewed 10 distinct cases of Question(, covering aspects like grammar, parts of speech, proofreading, rephrasing, plagiarism detection, citation, audience engagement, writing style, tone (with a focus on optimism), and the creation of a short story complete with a title. This evaluation encompassed an assessment of various artificial intelligence tools, with specific attention given to ChatGPT. The results of this assessment demonstrated enhanced efficiency and effectiveness in achieving desired writing goals, with notable exceptions observed in addressing plagiarism and citation-related issues.

20 sitasi en

Detail DOI Sumber

DOAJ Open Access 2023

Un nuovo modello per l’insegnamento della traduzione russo-italiano nei corsi universitari magistrali attraverso Moodle

Torresin, Linda

The author describes a new didactic model developed on the Moodle platform for teaching Russian‑Italian translation in the university context. The model, aimed at developing the translation competence of learners, is based on the cooperative learning method and general foreign language teaching principles. Using a longitudinal study involving the collection of qualitative and quantitative data, the model was tested in a 42‑hour asynchronous MA course unit. The findings indicate that the proposed didactic model, which brings together translation theory and practice, can successfully promote the development of translation competence in students.

Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

DOAJ Open Access 2023

Teachers’ Perspectives on Supportive and Inclusive Practices: A Study of Four Schools With Diverse Literacy Practices

Catharina Cesilia Tjernberg, Karin Forsling

Previous research states that accessibility and participation are prerequisites for students, regardless of ability, to be able to develop in school. The point of accessibility and participation is that everyone should be able to feel included and have access to learning. From an inclusive education perspective, the interest of this study is how teachers work in diverse literacy practices to address a variety of student conditions and needs. By studying how some primary education teachers describe how they design and stage learning environments, we want to contribute new knowledge about inclusive literacy practices. The data collection was carried out through focus group discussions. We analysed the data with Critical Literacy (CL) as a theoretical framework, and the related concepts of domination, access, diversity, and design were used. The general impression that emerges from our study is that teachers design literacy activities based on both supportive and inclusive aspects and with a focus on variation and diversity. In the study, more similarities than differences emerged, for instance a focus on multimodal methods and the importance of didactic flexibility. The teachers also highlighted the importance of producing text, not just consuming and processing existing texts. Sometimes the analysis reveals differences. Even so, the teachers’ intentions to include all students in the classroom education remain significant.

Education (General), Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

arXiv Open Access 2023

Grammar Boosting: A New Technique for Proving Lower Bounds for Computation over Compressed Data

Rajat De, Dominik Kempa

Grammar compression is a general compression framework in which a string $T$ of length $N$ is represented as a context-free grammar of size $n$ whose language contains only $T$. In this paper, we focus on studying the limitations of algorithms and data structures operating on strings in grammar-compressed form. Previous work focused on proving lower bounds for grammars constructed using algorithms that achieve the approximation ratio $ρ=\mathcal{O}(\text{polylog }N)$. Unfortunately, for the majority of grammar compressors, $ρ$ is either unknown or satisfies $ρ=ω(\text{polylog }N)$. In their seminal paper, Charikar et al. [IEEE Trans. Inf. Theory 2005] studied seven popular grammar compression algorithms: RePair, Greedy, LongestMatch, Sequential, Bisection, LZ78, and $α$-Balanced. Only one of them ($α$-Balanced) is known to achieve $ρ=\mathcal{O}(\text{polylog }N)$. We develop the first technique for proving lower bounds for data structures and algorithms on grammars that is fully general and does not depend on the approximation ratio $ρ$ of the used grammar compressor. Using this technique, we first prove that $Ω(\log N/\log \log N)$ time is required for random access on RePair, Greedy, LongestMatch, Sequential, and Bisection, while $Ω(\log\log N)$ time is required for random access to LZ78. All these lower bounds hold within space $\mathcal{O}(n\text{ polylog }N)$ and match the existing upper bounds. We also generalize this technique to prove several conditional lower bounds for compressed computation. For example, we prove that unless the Combinatorial $k$-Clique Conjecture fails, there is no combinatorial algorithm for CFG parsing on Bisection (for which it holds $ρ=\tildeΘ(N^{1/2})$) that runs in $\mathcal{O}(n^c\cdot N^{3-ε})$ time for all constants $c>0$ and $ε>0$. Previously, this was known only for $c<2ε$.

en cs.DS

Detail DOI Sumber

DOAJ Open Access 2022

Supramodal Sentence Processing in the Human Brain: fMRI Evidence for the Influence of Syntactic Complexity in More Than 200 Participants

Julia Uddén, Annika Hultén, Jan-Mathijs Schoffelen et al.

AbstractThis study investigated two questions. One is: To what degree is sentence processing beyond single words independent of the input modality (speech vs. reading)? The second question is: Which parts of the network recruited by both modalities is sensitive to syntactic complexity? These questions were investigated by having more than 200 participants read or listen to well-formed sentences or series of unconnected words. A largely left-hemisphere frontotemporoparietal network was found to be supramodal in nature, i.e., independent of input modality. In addition, the left inferior frontal gyrus (LIFG) and the left posterior middle temporal gyrus (LpMTG) were most clearly associated with left-branching complexity. The left anterior temporal lobe showed the greatest sensitivity to sentences that differed in right-branching complexity. Moreover, activity in LIFG and LpMTG increased from sentence onset to end, in parallel with an increase of the left-branching complexity. While LIFG, bilateral anterior temporal lobe, posterior MTG, and left inferior parietal lobe all contribute to the supramodal unification processes, the results suggest that these regions differ in their respective contributions to syntactic complexity related processing. The consequences of these findings for neurobiological models of language processing are discussed.

Language. Linguistic theory. Comparative grammar, Neurophysiology and neuropsychology

Detail DOI Sumber

DOAJ Open Access 2022

شگرد افعال گذشته درکدبرگردانی ترجمه رمان پرندگان می‌روند‌‌ در پرو می‌میرند اثر رومن گاری

فرناز ساسانی, مه کامه اینانلو

در این جستار سعی‌ بر آن بوده است تا بر مقوله کاربرد زمان گذشته در ترجمه متون روایی بپردازیم. بنا به نظر جاکوبسون (Jakobson) و نیدا (Nida) چگونگی تعبیر پیام نویسنده و برگزیدن معادل صحیح یکی از مشکلات عمده ترجمه متون ادبی از جمله ترجمه متون روایی است. مفهوم زمان از منظر روایت‌شناسی تعریف خاص خود را دارد و برای درک بهتر داستان از اهمیت بسیاری برخوردار است؛ از سوی دیگر کاربرد زمان‌ها بویژه زمان گذشته بدنه اصلی یک متن روایی را تشکیل می‌دهد که مطالعه نمودهای فعل و زمان‌های آن در تعامل با جنبه‌های روایت‌شناختی، مفاهیمی همچون نظم و ترتیب، تداوم یا دیرش و نیز بسامد افعال را در داستان جهت ارائه ترجمه صحیح با در نظر گرفتن نقش فعل در زمان پریشی روایت شفاف می‌سازد. در این پژوهش مشاهده شد که شگردهای فعل در متن روایت، تاثیر بسزایی در انتقال معنای داستان و ضرب آهنگ روایت داشته است که بطور کلی در ترجمه این متون می‌تواند مشکلاتی برای مترجم ایجاد نماید. به این منظور در ترجمه اثری فاخر از رومن گاری، نویسنده فرانسوی، به نام پرندگان می‌میروند در پرو می‌میرند به نحوه خوانش مترجم از متن اصلی و چگونگی نمودهای فعل در روایت متن مقصد پرداخته شده تا ظرفیت‌های فعل در بازسازی متون روایی در ترجمه را نمایان کند. این بررسی نشان داد که عملکرد افعال و مفهوم زمان تا چه اندازه در ترجمه، مترجم را به مسیر روایی متن نزدیک نموده است.

Language. Linguistic theory. Comparative grammar, Indo-Iranian languages and literature

Detail DOI Sumber

DOAJ Open Access 2022

O poema imortal de Christian Bök

Andressa Nunes Santos, Diego Grando

Christian Bök é um escritor e poeta conceitual canadense, professor de Escrita Criativa na Universidade de Melbourne, na Austrália. Na esteira de outros poetas conceituais que deram início ao campo de produção da arte transgênica, Bök engendrou um projeto híbrido entre as áreas da poesia e da biologia a fim de criar um poema vivo e imortal. Através de uma formação autodidata nas áreas de biologia molecular, biogenética e engenharia genética, o poeta desenvolveu, em teoria, uma forma de escrever um pequeno poema e introduzi-lo no DNA de uma bactéria resistente ao fogo, à água, à radiação e à própria explosão do Sol, capaz de sobreviver, inclusive, no espaço sideral. Além disso, parte do seu projeto era induzir a bactéria a ler o poema original e a produzir um verso poético em resposta. Este artigo descreve como Christian Bök comprovou em laboratório a viabilidade da sua teoria ao escrever um verso no DNA de uma bactéria modelo para estudos genéticos, a Escherichia coli, e obter dela, na sequência, um verso em resposta.

History of scholarship and learning. The humanities, Philology. Linguistics

Detail DOI Sumber

DOAJ Open Access 2022

The relation between metaphor and impoliteness in the context of interpersonal media gossip

Seyede Sahar Javid, Jalal Rahimian, Amirsaeid Moloodi et al.

The present study aims at examining the possible relation between metaphor and impoliteness in the context of interpersonal media gossip. To this end, we collected data related to 20 celebrities (10 actors and 10 actresses) regarding 60 linguistic situations that occurred from August 2018 to April 2020. Words under examination were 51449. According to the findings of the study, human conceptualization based on source domains such as shit/ animal waste matter, useless things, cartoon/ fictional characters, useless money, dirty substance, imaginary/ metaphysical beings, mushroom, bread, weekdays, fat, and disease/blight were accompanied by impoliteness. These findings justify the role of impoliteness in confirming the direct relation between metaphor and impoliteness to the extent that human is considered as one of the above domains. Interpersonal media gossip prepares the grounds for the use of 345 metaphors along with impoliteness which has its roots in special features of this type of gossip. Among all source domains, concepts such as shit/ animal waste matter are of the highest frequencies. Among impoliteness strategies related to naming, one must point to the negative impoliteness sub-strategy. Language users attempt to hurt the positive face of people from themselves by using positive impoliteness.

Language and Literature, Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

arXiv Open Access 2022

A generative grammar of cooking

Ganesh Bagler

Cooking is a uniquely human endeavor for transforming raw ingredients into delicious dishes. Over centuries, cultures worldwide have evolved diverse cooking practices ingrained in their culinary traditions. Recipes, thus, are cultural capsules that capture culinary knowledge in elaborate cooking protocols. While simple quantitative models have probed the patterns in recipe composition and the process of cuisine evolution, unlike other cultural quirks such as language, the principles of cooking remain hitherto unexplored. The fundamental rules that drive the act of cooking, shaping recipe composition and cuisine architecture, are unclear. Here we present a generative grammar of cooking that captures the underlying culinary logic. By studying an extensive repository of structured recipes, we identify core concepts and rules that together forge a combinatorial system for culinary synthesis. Building on the body of work done in the context of language, the demonstration of a logically consistent generative framework offers profound insights into the act of cooking. Given the central role of food in nutrition and lifestyle disorders, culinary grammar provides leverage to improve public health through dietary interventions beyond applications for creative pursuits such as novel recipe generation.

en physics.soc-ph, cs.AI

Detail Sumber

arXiv Open Access 2022

A Neural Model for Regular Grammar Induction

Peter Belcák, David Hofer, Roger Wattenhofer

Grammatical inference is a classical problem in computational learning theory and a topic of wider influence in natural language processing. We treat grammars as a model of computation and propose a novel neural approach to induction of regular grammars from positive and negative examples. Our model is fully explainable, its intermediate results are directly interpretable as partial parses, and it can be used to learn arbitrary regular grammars when provided with sufficient data. We find that our method consistently attains high recall and precision scores across a range of tests of varying complexity.

en cs.LG, cs.CL

Detail Sumber

arXiv Open Access 2021

Doing Natural Language Processing in A Natural Way: An NLP toolkit based on object-oriented knowledge base and multi-level grammar base

Yu Guo

We introduce an NLP toolkit based on object-oriented knowledge base and multi-level grammar base. This toolkit focuses on semantic parsing, it also has abilities to discover new knowledge and grammar automatically, new discovered knowledge and grammar will be identified by human, and will be used to update the knowledge base and grammar base. This process can be iterated many times to improve the toolkit continuously.

en cs.CL, cs.AI

Detail Sumber

DOAJ Open Access 2019

Análise crítica de produtos midiáticos: experiência de extensão universitária em escolas públicas de Porto Alegre

Isaura Generoso Mourão, Bruno Garcia Vinhola, Nicole Morás et al.

O artigo apresenta a experiência do Programa de Extensão Análise Crítica de Produtos Midiáticos desenvolvido em escolas públicas da cidade de Porto Alegre (RS) pelo Grupo de Pesquisa em Comunicação Organizacional, Cultura e Relações de Poder (GCCOP) da Universidade Federal do Rio Grande do Sul (UFRGS), cujo objetivo é debater com jovens do ensino médio a mídia e seus produtos a partir da análise crítica dos modos de produção e de veiculação desses conteúdos. Partindo teoricamente dos conceitos de midiatização e de letramento midiático, o artigo analisa o impacto do programa para a formação de jovens cidadãos, bem como reflete sobre o papel da universidade no desenvolvimento de ações de extensão que permitam ampliar o alcance das pesquisas e das práticas acadêmicas no campo da Comunicação junto à sociedade.

Discourse analysis

Detail DOI Sumber

DOAJ Open Access 2019

Imaginario social e ideología en discursos patronales de Sax (1890-2010) y Rubí (1890-2003) bajo la perspectiva retórico-pragmática

Xavier Laborda Gil

El artículo trata de la retórica de las salutaciones de las fiestas patronales en las localidades de Sax (Alicante, 1890–2010) y Rubí (Barcelona, 1890–2003). Estos corpus textuales proceden de los programa de las fiestas patronales. Los discursos reflejan vinculaciones institucionales, circunstancias históricas e influencias del periodismo y la publicidad. El estudio de las salutaciones, del subgénero epidíctico, muestra elementos de influencia ideológica y de creación del imaginario social. En los corpus de Sax y Rubí se aprecian las etapas informativa (1890–1930), ideológica (1931–1978) y publicitaria (1979–), en las que se pasa de una producción descriptiva (1890–1930) a otras de intención explícitamente ideológica (1931–1978) y, finalmente, publicitaria (1979–). Los discursos y los programas en que aparecen son constructos semióticos que reflejan discursos ideológicos y persuasivos sobre intereses sociales cambiantes, que cumplen finalidades de estabilización o de cuestionamiento de formas de poder.

Romanic languages, Philology. Linguistics

Detail DOI Sumber

Hasil untuk "Comparative grammar"