Hasil "Comparative grammar"

arXiv Open Access 2026

On the Reachability Problem for One-Dimensional Thin Grammar Vector Addition Systems

Chengfeng Xue, Yuxi Fu

Vector addition systems with states (VASS) are a classic model in concurrency theory. Grammar vector addition systems (GVAS), equivalently, pushdown VASS, extend VASS by using a context-free grammar to control addition. In this paper, our main focus is on the reachability problem for one-dimensional thin GVAS (thin 1-GVAS), a structurally restricted yet expressive subclass. By adopting the index measure for complexity, and by generalizing the decomposition technique developed in the study of VASS reachability to grammar-generated derivation trees of GVAS, an effective integer programming system is established for a thin 1-GVAS. In this way, a nondeterministic algorithm with $\mathbf{F}_{2k}$ complexity is obtained for the reachability of thin 1-GVAS with index $k$, yielding a tighter upper bound than the previous one.

en cs.LO

Detail Sumber

DOAJ Open Access 2025

La vita al tempo della crisi climatica: il romanzo, l'Antropocene e le forme del realismo

Niccolò Scaffai

In this issue of Between, which focuses on post-apocalyptic narratives, the In Discussion column is dedicated to several recent novels that place images of disaster in the foreground or background, but always in relation to the realistic portrayal of the daily lives of individuals and communities. The works in question are The Deluge by Stephen Markley (2022), The Bee Sting by Paul Murray (2023) and What We Can Now by Ian McEwan (2025).

Geography. Anthropology. Recreation, Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

arXiv Open Access 2025

Uniform Membership for Hyperedge Replacement Grammars and Related Decision Problems

Tikhon Pshenitsyn

This paper investigates complexity of the uniform membership problem for hyperedge replacement grammars in comparison with other mildly context-sensitive grammar formalisms. It turns out that the complexity of this problem depends on how one defines a hypergraph. There are two commonly used definitions in the field, which differ in whether repetitions of attachment nodes of a hyperedge are allowed in a hypergraph or not. We show that, in general, the problem under consideration is EXPTIME-complete, even for string-generating hyperedge replacement grammars, but it is NP-complete if repetitions are not allowed. We extend the developed proof techniques in order to prove a general meta-theorem: checking whether a given hyperedge replacement grammar generates a hypergraph satisfying a non-Parikh property is EXPTIME-hard. Non-Parikh properties are those that are not preimages of properties on Parikh vectors of hypergraphs. This includes any graph property relying significantly on structure of graphs, e.g. connectivity, Eulerianity, Hamiltonianity, acyclicity. A tight upper bound is established for EXPTIME-compatible properties via Filter Theorem.

en cs.FL

Detail Sumber

DOAJ Open Access 2024

The relationship between the coarticulatory source and effect in sound change: evidence from Italo-Romance metaphony in the Lausberg area

Jonathan Harrington, Michele Gubian, Pia Greca

In ongoing sound changes, a coarticulatory effect is often enhanced as the coarticulatory source that gives rise to it wanes. But quite how phonologisation and these reciprocal coarticulatory changes are connected is still poorly understood. The present study addresses this issue through an acoustic analysis of metaphony, which like umlaut has its phonetic origins in VCV coarticulation, and which was analysed in three geographically proximal varieties spoken in the so-called Lausberg area in Southern Italy. The corpus was of 35 speakers producing mostly disyllabic words with phonetically mid stem vowels and suffix vowels that varied in phonetic height. The results of functional principal components analysis applied to the stem vowels’ first two formant frequencies showed a progressively greater enhancement to the vowel stem across the three regions that was characterised by raising, diphthongisation, and then further raising and monophthongisation. Suffix erosion was quantified by counting deletions and the degree of vowel centralisation. The analysis showed a reciprocal relationship between stem enhancement and suffix erosion across, but not within, the three dialects. Overall, the results suggest that a trade-off of cues between suffix and stem vowel has progressed to different degrees between the three varieties.

Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

DOAJ Open Access 2024

Écrire L’État honteux en Afrique hispanophone et lusophone

Marthe OYANE METOGHO

Based on L’Etat honteux by Sony Labou Tansi, the study constructs the archetype of the African postcolonial state. The portrait that emerges is an aggregate of motifs drawn from colonial discourse. This State/state ends in post-transition novels, a space for the construction of authorial utopias. This problematizes the relevance of discourses carried by concepts that drain ideological postures

French literature - Italian literature - Spanish literature - Portuguese literature, Language. Linguistic theory. Comparative grammar

Detail Sumber

DOAJ Open Access 2024

The Translation of Sex-Related Language in TV Series: Analyzing the Fictional Speech of LGBTQ+ Characters

Sonia González Cruz

In recent years, there has been a notable rise in the portrayal of LGBTQ+ characters in TV series available on online platforms. This poses a challenge for translators from a linguistic, social and cultural perspective, as they need to deal with the transference of fictional speech according to diverse identities. In this respect, translators are not only in charge of translating the fictional speech for a given audiovisual product to be either subtitled or dubbed into a different language, but they have the role of conveying and preserving LGBTQ+ characters’ identities accurately. The objective of this paper is to analyze the translation of sex-related language in TV series with LGBTQ+ representation. On the basis of a selected corpus of two different English-language TV series (Euphoria and Sex Education), this descriptive study analyzes the fictional speech of several LGBTQ+ characters and focuses on the translation of sex-related language from English into Spanish in both their dubbed and subtitled versions. The translation strategies used to render sex-related conversations when translating audiovisual fiction are discussed throughout the study in order to show different ways of facing the translation of specific sexual expressions. In this respect, the study intends to highlight the fact that all decisions made when translating fictional conversations that LGBTQ+ characters have about sex may have an influence on the representation of several topics such as sexuality, gender or identity. The study also discusses how other aspects such as the translation of inclusive language and the expression of gender identity may also affect the portrayal of LGBTQ+ characters.

Language. Linguistic theory. Comparative grammar, Communication. Mass media

Detail DOI Sumber

arXiv Open Access 2024

Statistical properties of probabilistic context-sensitive grammars

Kai Nakaishi, Koji Hukushima

Probabilistic context-free grammars (PCFGs), which are commonly used to generate trees randomly, have been well analyzed theoretically, leading to applications in various domains. Despite their utility, the distributions that the grammar can express are limited to those in which the distribution of a subtree depends only on its root and not on its context. This limitation presents a challenge for modeling various real-world phenomena, such as natural languages. To overcome this limitation, a probabilistic context-sensitive grammar (PCSG) is introduced, where the distribution of a subtree depends on its context. Numerical analysis of a PCSG reveals that the distribution of a symbol does not constitute a qualitative difference from that in the context-free case, but mutual information does. Furthermore, a novel metric introduced to directly quantify the breaking of this limitation detects a distinct difference between PCFGs and PCSGs. This metric, applicable to an arbitrary distribution of a tree, allows for further investigation and characterization of various tree structures that PCFGs cannot express.

en cond-mat.dis-nn, cond-mat.stat-mech

Detail DOI Sumber

arXiv Open Access 2024

Back to School: Translation Using Grammar Books

Jonathan Hus, Antonios Anastasopoulos

Machine translation systems for high resource languages perform exceptionally well and produce high quality translations. Unfortunately, the vast majority of languages are not considered high resource and lack the quantity of parallel sentences needed to train such systems. These under-represented languages are not without resources, however, and bilingual dictionaries and grammar books are available as linguistic reference material. With current large language models (LLMs) supporting near book-length contexts, we can begin to use the available material to ensure advancements are shared among all of the world's languages. In this paper, we demonstrate incorporating grammar books in the prompt of GPT-4 to improve machine translation and evaluate the performance on 16 topologically diverse low-resource languages, using a combination of reference material to show that the machine translation performance of LLMs can be improved using this method.

en cs.CL

Detail Sumber

S2 Open Access 2023

THE IMPACT OF LANGUAGE CHANGES CAUSED BY TECHNOLOGY AND SOCIAL MEDIA

Nurasia Natsir, Nuraziza Aliah, Zulkhaeriyah Zulkhaeriyah et al.

This research discusses language change as a result of the influence of social media. In an increasingly advanced digital era, social media has become one of the primary communication tools for individuals worldwide. This study utilizes descriptive and comparative analysis methods to explore the influence of social media on language change. Firstly, the research identifies grammar, syntax, and vocabulary changes due to social media usage. Then, the study compares the language used in traditional communication with that used in social media communication. The findings of this research indicate significant language changes due to the use of social media. There is an increase in the use of abbreviations, emoticons, and distinctive terms specific to social media that affect the way humans communicate in a digital context. Additionally, casual writing styles, non-formal language use, and the adaptation of foreign words have become characteristics of social media communication. These language changes can have both positive and negative impacts. On the positive side, social media has enabled faster and more efficient communication between individuals across the globe. Using a more casual and non-formal language can also strengthen social bonds among social media users. However, on the other hand, these language changes can also pose challenges to understanding and communication between different generations or in formal contexts.

14 sitasi en

Detail DOI Sumber

S2 Open Access 2023

Comparison of Leading Language Parsers – ANTLR, JavaCC, SableCC, Tree-sitter, Yacc, Bison

Afshan Latif, F. Azam, Muhammad Waseem Anwar et al.

Software engineering applications in domains like embedded systems and health care have increased exponentially during the last few years. Developing, analyzing, and customization of languages is one of the core software engineering aspects. This usually involves lexical, syntactical, and semantic operations, technically termed parsing. For this, several parsers have been introduced in state-of-the-art. However, due to diverse features, selecting a parser for a particular operation during software engineering applications is always problematic. In this article, we identified six leading parsers (i.e., ANTLR, JavaCC, SableCC, Tree-sitter, Yacc, and Bison) from the state-of-the-art. Furthermore, we also identified significant parser features to perform meaningful comparative analysis. Results indicate that ANTLR and JavaCC provide enhanced parsing features, such as the parsing algorithm and the extended grammar notation. However, JavaCC is suitable for simple grammar definition, whereas ANTLR allows specifying complex grammar with multiple alternative paths. The findings of this article are highly beneficial for researchers and practitioners while selecting the right parser to perform specific software engineering tasks.

13 sitasi en

Detail DOI Sumber

DOAJ Open Access 2023

Memes: dos sentidos às multimodalidades em redes sociais digitais

Robério Pereira Barreto

RESUMO: Neste texto objetiva-se através de revisão e diálogo com a literatura da área mostrar que os memes, gênero discursivo híbrido nativo do ambiente digital é potente objeto de ensino de multimodalidade e construção de sentidos. Nos memes há design visual agregador de múltiplas linguagens que amplia a construção dos sentidos. Desse modo, reflete-se sobre a educação linguística e digital através do design visual e da distribuição dos sentidos socialmente reconhecidos no meme-texto. Ademais, promover multiletramentos na sala de aula da educação básica ao ensino superior, aplicando multimodalidade, a qual é uma marca da comunicação e interação online. Conclui-se que a maneira pedagógica da tradição escolar de ensino de leitura buscando os sentidos a partir de textos reconhecidos na cultura do letramento impresso, onde se privilegiou por décadas, o verbal em detrimento do visual, deve abrir as suas fronteiras e ampliá-las a ponto de que sejam incorporadas a esses repertórios, as múltiplas matrizes de linguagens correntes nas plataformas de comunicação e rede sociais digitais, onde a multimodalidade associada ao design do visual garante a ressignificação de sentidos e ideologias contidas nos memes.

Language. Linguistic theory. Comparative grammar, Literature (General)

Detail Sumber

arXiv Open Access 2023

Greedy Grammar Induction with Indirect Negative Evidence

Joseph Potashnik

This paper offers a fresh look at the pumping lemma constant as an upper bound on the information required for learning Context Free Grammars. An objective function based on indirect negative evidence considers the occurrences, and non-occurrences, of a finite number of strings, encountered after a sufficiently long presentation. This function has optimal substructure in the hypotheses space, giving rise to a greedy search learner in a branch and bound method. A hierarchy of learnable classes is defined in terms of the number of production rules that must be added to interim solutions in order to incrementally fit the input. Efficiency strongly depends on the position of the target grammar in the hierarchy and on the richness of the input.

en cs.CL

Detail Sumber

arXiv Open Access 2023

Differential operators, grammars and Young tableaux

Shi-Mei Ma, Jean Yeh, Yeong-Nan Yeh

In algebraic combinatorics and formal calculation, context-free grammar is defined by a formal derivative based on a set of substitution rules. In this paper, we investigate this issue from three related viewpoints. Firstly, we introduce a differential operator method. As one of the applications, we deduce a new grammar for the Narayana polynomials. Secondly, we investigate the normal ordered grammars associated with the Eulerian polynomials. Thirdly, motivated by the theory of differential posets, we introduce a box sorting algorithm which leads to a bijection between the terms in the expansion of $(cD)^nc$ and a kind of ordered weak set partitions, where $c$ is a smooth function in the indeterminate $x$ and $D$ is the derivative with respect to $x$. Using a map from ordered weak set partitions to standard Young tableaux, we find an expansion of $(cD)^nc$ in terms of standard Young tableaux. Combining this with the theory of context-free grammars, we provide a unified interpretations for the Ramanujan polynomials, André polynomials, left peak polynomials, interior peak polynomials, Eulerian polynomials of types $A$ and $B$, $1/2$-Eulerian polynomials, second-order Eulerian polynomials, and Narayana polynomials of types $A$ and $B$ in terms of standard Young tableaux. Along the same lines, we present an expansion of the powers of $c^kD$ in terms of standard Young tableaux, where $k$ is a positive integer. In particular, we provide four interpretations for the second-order Eulerian polynomials. All of the above apply to the theory of formal differential operator rings.

en math.CO

Detail Sumber

arXiv Open Access 2023

Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning

Saibo Geng, Martin Josifoski, Maxime Peyrard et al.

Despite their impressive performance, large language models (LMs) still struggle with reliably generating complex output structures when not finetuned to follow the required output format exactly. To address this issue, grammar-constrained decoding (GCD) can be used to control the generation of LMs, guaranteeing that the output follows a given structure. Most existing GCD methods are, however, limited to specific tasks, such as parsing or code generation. In this work, we demonstrate that formal grammars can describe the output space for a much wider range of tasks and argue that GCD can serve as a unified framework for structured NLP tasks in general. For increased flexibility, we introduce input-dependent grammars, which allow the grammar to depend on the input and thus enable the generation of different output structures for different inputs. We then empirically demonstrate the power and flexibility of GCD-enhanced LMs on (1) information extraction, (2) entity disambiguation, and (3) constituency parsing. Our results indicate that grammar-constrained LMs substantially outperform unconstrained LMs or even beat task-specific finetuned models. Grammar constraints thus hold great promise for harnessing off-the-shelf LMs for a wide range of structured NLP tasks, especially where training data is scarce or finetuning is expensive. Code and data: https://github.com/epfl-dlab/GCD.

en cs.CL, cs.AI

Detail Sumber

DOAJ Open Access 2022

Enhancing FL Learners' Perception of Non-native English Pronunciation with a Telecollaborative Project Work

Ricardo Casañ Pitarch, Miguel Ángel Candel Mora

Este trabajo tiene como objetivo analizar la opinión de los estudiantes hacia el inglés nativo y no nativo y su percepción de la pronunciación del inglés de hablantes no nativos. Para ello, en el marco de un proyecto telecolaborativo con estudiantes de la Universitat Politècnica de València (UPV), España, y del Instituto Politécnico de Kiev "Igor Sikorsky", Ucrania, los estudiantes se dividieron en un grupo experimental y otro de control. Ambos grupos completaron dos encuestas antes y después del proyecto de telecolaboración, y se midió su progreso. En la primera encuesta se preguntaba a los estudiantes sobre su percepción de otros acentos ingleses no nativos (adaptado de He & Li, 2009). En la segunda encuesta, los estudiantes evaluaron el acento de otros estudiantes internacionales no nativos de inglés (basado en Bayard, Weatherall, Gallois & Pittam, 2001; y Zahn & Hopper, 1985). Los resultados mostraron que los estudiantes que habían estado en contacto con otros hablantes no nativos cambiaron positivamente su percepción de esa variedad de inglés. En conclusión, la telecolaboración parece ser una herramienta valiosa para desarrollar la competencia cultural y evitar los prejuicios contra los hablantes no nativos.

Language. Linguistic theory. Comparative grammar

Detail Sumber

DOAJ Open Access 2021

Editorial

BERTHA RAMOS HOLGUÍN, Anna Carolina Peñaloza

In Enletawa Journal we have been fortunate to come out of strong professional networks that have nourished us over the years. Thanks to all the reviewers and authors that have worked with us over the past several years, we have been able to flourish as an academic journal that presents articles of teachers who have realized that writing is a possibility to act autonomously and echo their voices in the development of the pedagogical experience. In this volume, we share stories from diverse corners of Colombia. Thanks to the many teachers who care about their students and who work tirelessly to make publishing happen. We are glad you wanted to share your experiences with us because, without any doubt, teachers are an important part of the teaching processes because what teachers say and do impact students’ lives.

Language. Linguistic theory. Comparative grammar, Romanic languages

Detail DOI Sumber

arXiv Open Access 2021

Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction

Jonathan Dunn, Andrea Nini

This paper asks whether a distinction between production-based and perception-based grammar induction influences either (i) the growth curve of grammars and lexicons or (ii) the similarity between representations learned from independent sub-sets of a corpus. A production-based model is trained on the usage of a single individual, thus simulating the grammatical knowledge of a single speaker. A perception-based model is trained on an aggregation of many individuals, thus simulating grammatical generalizations learned from exposure to many different speakers. To ensure robustness, the experiments are replicated across two registers of written English, with four additional registers reserved as a control. A set of three computational experiments shows that production-based grammars are significantly different from perception-based grammars across all conditions, with a steeper growth curve that can be explained by substantial inter-individual grammatical differences.

en cs.CL

Detail Sumber

S2 Open Access 2016

Around the world in three alternations: Modeling syntactic variation in varieties of English

Benedikt Szmrecsanyi, Jason Grafmiller, B. Heller et al.

155 sitasi en Sociology

Detail DOI Sumber

DOAJ Open Access 2020

Language Norms of International Treaties

Natalia V. Alontseva, Yury A. Ermoshin

This article discusses features of the implementation of linguistic norms in international treaties.The proposed study has a purpose to identify linguistic means present in international document texts, i.e. treaties that are to fix the agreement that parties achieve with a view to establishing relations and regulating them in future. The research material is 1000 texts of international treaties. The total amount of factual material analyzed is over 6000 pages. Our methodology is based on the works by domestic and foreign authors on general theory of speech activity, laws of perception and understanding of speech, and the peculiarities of the generation of a statement, translation theory, and international law. One of the most important means of expressing information in a text is its lexical composition. International treaties texts comprise different types of vocabulary (common, terminological, specialized, etc.) that performs text- and style forming functions. From the point of view of grammar, compiling international treaties involves using particular grammatical forms and categories, syntactic structures and types of phrases. The essence of international treaties texts implies the presence of special clichs of a business style. In the preparation and editing of international treaties, the adequate use of appropriate vocabulary and grammatical means leads to a reduction of ambiguities and discrepancies in the texts of these documents.

Language. Linguistic theory. Comparative grammar, Semantics

Detail DOI Sumber

DOAJ Open Access 2020

PREFACE (FERNANDO PRIETO RAMOS, GUEST EDITOR)

Fernando PRIETO RAMOS

Language. Linguistic theory. Comparative grammar, Comparative law. International uniform law

Detail Sumber

Hasil untuk "Comparative grammar"