Hasil "Comparative grammar"

arXiv Open Access 2026

Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax

Mohammed Q. Shormani, Yehia A. AlSohbani

We aim to examine the extent to which Large Language Models (LLMs) can 'talk much' about grammar modules, providing evidence from syntax core properties translated by ChatGPT into Arabic. We collected 44 terms from generative syntax previous works, including books and journal articles, as well as from our experience in the field. These terms were translated by humans, and then by ChatGPT-5. We then analyzed and compared both translations. We used an analytical and comparative approach in our analysis. Findings unveil that LLMs still cannot 'talk much' about the core syntax properties embedded in the terms under study involving several syntactic and semantic challenges: only 25% of ChatGPT translations were accurate, while 38.6% were inaccurate, and 36.4.% were partially correct, which we consider appropriate. Based on these findings, a set of actionable strategies were proposed, the most notable of which is a close collaboration between AI specialists and linguists to better LLMs' working mechanism for accurate or at least appropriate translation.

en cs.CL

Detail Sumber

DOAJ Open Access 2025

Development of Higher Order Thinking Skills (HOTS) Based Inferential Reading Module

Uyun Nafiah, Wahyuni Fitria, Ayuliamita Abadi

Nowadays, literacy is very important and critical thinking should be added in the teaching and learning. Therefore, this research is aimed at developing Higher Order Thinking Skills (HOTS) Based Inferential Reading Module which is suitable and practical for students of English Education Study Program. This was developmental research which used ADDIE model that consisted of five stages: analyze, design, develop, implement, and evaluate. Based on the result of validity test by two expert validators, it gained mean percentage 84.24% categorized as very valid in term of content eligibility, linguistics, presentation, and graphics components. Besides, based on the result of students’ practicality in terms of ease of use, efficiency of learning time and benefits, it was found percentage 81.13% categorized as very practical. It could be concluded that Higher Order Thinking Skills (HOTS) Based Inferential Reading Module is suitable, practical, and beneficial for the students of English Education Study Program UIN Sulthan Thaha Saifuddin Jambi.

Language. Linguistic theory. Comparative grammar, English language

Detail DOI Sumber

DOAJ Open Access 2025

Pragmatic and Grammatical Competence Interface in Second Language Acquisition: Conceptual Framework and Construction

Andi Rustandi, R. Bunga Febriani, Bambang Ruby Sugiarto

This study investigated the interface between pragmatic and grammatical competence in second language acquisition. This study aims to discover the conceptual framework of how pragmatics competence interfaces with grammatical competence. This study employs three research questions such as (1) how pragmatics competence and grammatical competence are interfaced with each other, (2) what kind of pragmatics competence should be learned by the learner of a second language, and (3) to what extent grammatical construction interface with the pragmatic domain. This study used the groundwork of the library research method, which forms the theoretical framework of this study by searching, reading, and evaluating some journals from the online journal that deals with the topic. The result revealed that pragmatic competence and grammatical competence are interfaced with each other since grammatical construction in the utterance contributes to the language use expression.

Language. Linguistic theory. Comparative grammar, Theory and practice of education

Detail DOI Sumber

DOAJ Open Access 2025

Publicist Mihael Kunić i njegov prinos razvoju hrvatske biografistike

Iva Mandušić

Publicist Mihael Kunić slabo je poznat hrvatskoj kulturnoj javnosti. Podrijetlom Slovak, zanimanjem umirovljeni pedagog i školski nadzornik u zemljama Habsburške Monarhije, u Hrvatskoj je boravio u posljednjem dijelu svoga života te je, aktivno sudjelujući u društvenom životu Zagreba, Karlovca, Varaždina, ali i u ladanjskom životu hrvatskoga plemstva, osobito u Slavoniji, zabilježio brojne zanimljivosti, danas vrijedne za poznavanje građanske kulture u prvim desetljećima XIX. st. Iz razdoblja njegova života prije dolaska u hrvatske zemlje potječu brojni njegovi radovi na njemačkom jeziku s područja pedagogije, lingvistike, povijesti i hortikulture, a slijedeći i razrađujući vlastitu biografsku metodu, Kunić je i u europskim razmjerima ostvario velik leksikografski pothvat, objavivši višesveščani biografski leksikon znamenitih ljudi Habsburške Monarhije Biographien merkwürdiger Männer der Österreichischen Monarchie (1805–12). U domaćoj literaturi uglavnom su poznate njegove povijesno-topografske studije i radovi o građanskim, plemićkim i javnim perivojima i parkovnoj arhitekturi, kojima je znatno pridonio poznavanju povijesti vrtne umjetnosti i kulturnih prilika u Hrvatskoj na početku XIX. st., a osim njih, u skladu s prosvjetiteljskim nastojanjima da se opišu životi osoba koje djeluju za opće dobro, objavio je i niz biografskih priloga o istaknutim osobama pretpreporodne Hrvatske i brojne prigodne pjesme kojima je obilježio razna javna i privatna događanja. U radu se, uz povijesni pregled razvoja hrvatske biografistike, razmatraju kriteriji odabira obrađenih osoba u Kunićevu opusu, motivi i okolnosti njegova djelovanja te njegov prinos razvoju ove discipline, s naglaskom na biografiju Josipa Sermagea, u kojoj opisuje svoje metodološke postupke, i krug osoba povezanih sa zagrebačkim Kaptolom kao dijela građanskoga sloja u nastajanju.

Lexicography

Detail Sumber

arXiv Open Access 2025

On Quantum Context-Free Grammars

Merina Aruja, Lisa Mathew, Jayakrishna Vijayakumar

Quantum computing is a relatively new field of computing, which utilises the fundamental concepts of quantum mechanics to process data. The seminal paper of Moore et al. [2000] introduced quantum grammars wherein a set of amplitudes was attached to each production. However they did not study the final probability of the derived word. Aruja et al. [2025] considered conditions for the well-formedness of quantum context-free grammars (QCFGs), in order to ensure that the probabilty of the derived word does not exceed one. In this paper we propose certain necessary and sufficient conditions (also known as unitary conditions) for well-formedness of QCFGs

en cs.FL

Detail Sumber

arXiv Open Access 2025

Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models

Aruna Sankaranarayanan, Dylan Hadfield-Menell, Aaron Mueller

All natural languages are structured hierarchically. In humans, this structural restriction is neurologically coded: when two grammars are presented with identical vocabularies, brain areas responsible for language processing are only sensitive to hierarchical grammars. Using large language models (LLMs), we investigate whether such functionally distinct hierarchical processing regions can arise solely from exposure to large-scale language distributions. We generate inputs using English, Italian, Japanese, or nonce words, varying the underlying grammars to conform to either hierarchical or linear/positional rules. Using these grammars, we first observe that language models show distinct behaviors on hierarchical versus linearly structured inputs. Then, we find that the components responsible for processing hierarchical grammars are distinct from those that process linear grammars; we causally verify this in ablation experiments. Finally, we observe that hierarchy-selective components are also active on nonce grammars; this suggests that hierarchy sensitivity is not tied to meaning, nor in-distribution inputs.

en cs.CL, cs.AI

Detail Sumber

arXiv Open Access 2024

Grammar-based Game Description Generation using Large Language Models

Tsunehiko Tanaka, Edgar Simo-Serra

Game Description Language (GDL) provides a standardized way to express diverse games in a machine-readable format, enabling automated game simulation, and evaluation. While previous research has explored game description generation using search-based methods, generating GDL descriptions from natural language remains a challenging task. This paper presents a novel framework that leverages Large Language Models (LLMs) to generate grammatically accurate game descriptions from natural language. Our approach consists of two stages: first, we gradually generate a minimal grammar based on GDL specifications; second, we iteratively improve the game description through grammar-guided generation. Our framework employs a specialized parser that identifies valid subsequences and candidate symbols from LLM responses, enabling gradual refinement of the output to ensure grammatical correctness. Experimental results demonstrate that our iterative improvement approach significantly outperforms baseline methods that directly use LLM outputs. Our code is available at https://github.com/tsunehiko/ggdg

en cs.AI

Detail DOI Sumber

DOAJ Open Access 2023

Images of a Good and Evil Person in Russian and Chinese Paremiological Pictures of the World

Licheng Zhang, Olga P. Kormazina

The linguoculturological studies in paremiology make one of the most actively developing, researchers have repeatedly analyzed the concepts of GOOD and EVIL on the material of paremias of a particular language. However, the study of character traits «good» and «evil» on the basis of the proverbs of the Russian and Chinese languages has not yet been considered, which is the novelty of this study. The article studies paremiological units of the Russian and Chinese languages, representing a good and evil person. The goal of the work is to identify common and peculiar ethno-specific features reflected in the proverbs of the Russian language against the background of their counterparts in the Chinese language, as well as to establish their motivation. To achieve this goal, methods of comparative and linguoculturological analyses were applied. The sources of the material were the proverbs dictionary, as well as data from Russian and Chinese corpora. The paper concludes that there are much more different cultural attitudes contained in the proverbs we are interested in than common ones, which causes a big difference in understanding the character traits of «good» and «evil» between native Russian and Chinese speakers. Understanding the universal and nationally marked ideas of Russians and Chinese effectively contributes to cultural communication between peoples.

Language. Linguistic theory. Comparative grammar, Semantics

Detail DOI Sumber

arXiv Open Access 2023

On Some Closure Properties of nc-eNCE Graph Grammars

Jayakrishna Vijayakumar, Lisa Mathew

In the study of automata and grammars, closure properties of the associated languages have been studied extensively. In particular, closure properties of various types of graph grammars have been examined in (Rozenberg and Welzl, Inf. and Control,1986) and (Rozenberg and Welzl, Acta Informatica,1986). In this paper we examine some critical closure properties of the nc-eNCE graph grammars discussed in (Jayakrishna and Mathew, Symmetry 2023) and (Jayakrishna and Mathew, ICMICDS 2022).

en cs.FL, math.CO

Detail Sumber

arXiv Open Access 2023

Computational Semantics and Evaluation Benchmark for Interrogative Sentences via Combinatory Categorial Grammar

Hayate Funakura, Koji Mineshima

We present a compositional semantics for various types of polar questions and wh-questions within the framework of Combinatory Categorial Grammar (CCG). To assess the explanatory power of our proposed analysis, we introduce a question-answering dataset QSEM specifically designed to evaluate the semantics of interrogative sentences. We implement our analysis using existing CCG parsers and conduct evaluations using the dataset. Through the evaluation, we have obtained annotated data with CCG trees and semantic representations for about half of the samples included in QSEM. Furthermore, we discuss the discrepancy between the theoretical capacity of CCG and the capabilities of existing CCG parsers.

en cs.CL, cs.AI

Detail Sumber

arXiv Open Access 2023

Cyclic Operator Precedence Grammars for Parallel Parsing

Michele Chiari, Dino Mandrioli, Matteo Pradella

Operator precedence languages (OPL) enjoy the local parsability property, which essentially means that a code fragment enclosed within a pair of markers -- playing the role of parentheses -- can be compiled with no knowledge of its external context. Such a property has been exploited to build parallel compilers for languages formalized as OPLs. It has been observed, however, that when the syntax trees of the sentences have a linear substructure, its parsing must necessarily proceed sequentially making it impossible to split such a subtree into chunks to be processed in parallel. Such an inconvenience is due to the fact that so far much literature on OPLs has assumed the hypothesis that equality precedence relation cannot be cyclic. This hypothesis was motivated by the need to keep the mathematical notation as simple as possible. We present an enriched version of operator precedence grammars, called cyclic, that allows to use a simplified version of regular expressions in the right hand sides of grammar's rules; for this class of operator precedence grammars the acyclicity hypothesis of the equality precedence relation is no more needed to guarantee the algebraic properties of the generated languages. The expressive power of the cyclic grammars is now fully equivalent to that of other formalisms defining OPLs such as operator precedence automata, monadic second order logic and operator precedence expressions. As a result cyclic operator precedence grammars now produce also unranked syntax trees and sentences with flat unbounded substructures that can be naturally partitioned into chunks suitable for parallel parsing.

en cs.FL

Detail Sumber

DOAJ Open Access 2022

Estate, Capital and Province in the Alexander Potyomkin’s novel Man is canceled (2007)

Olga Bogdanova

The article analyzes the novel of the modern Russian writer A. Potyomkin Man is canceled (2007), which received a wide public response. The main idea of the work is the need for a radical change of the “mass man” of the turn of the XX-XXI centuries at the psychosomatic level. The ideological and compositional center is the specially built Rimushkino estate in the Oryol province, where the serf spirit of the Russian Empire at the turn of the XVIII-XIX centuries is reproduced. To answer the question of why the estate space of Russia is becoming the most representative field for anthropological experiments of the beginning of the XXI century, we consider the estate neo-myths of the Silver Age (the lost paradise on earth) and the Soviet period (the camp hell living in the mentality), as well as the imperial-colonial concept of the postmodern era (the estate as a frontier in the process of class-oriented internal colonization of the country). The multidimensional semiotics of the estate sets a new relationship between “metropolitan” and “provincial” concerning the other loci of the novel.

Language. Linguistic theory. Comparative grammar, Style. Composition. Rhetoric

Detail DOI Sumber

arXiv Open Access 2022

No Grammar to Rule Them All: A Survey of JSON-style DSLs for Visualization

Andrew McNutt

There has been substantial growth in the use of JSON-based grammars, as well as other standard data serialization languages, to create visualizations. Each of these grammars serves a purpose: some focus on particular computational tasks (such as animation), some are concerned with certain chart types (such as maps), and some target specific data domains (such as ML). Despite the prominence of this interface form, there has been little detailed analysis of the characteristics of these languages. In this study, we survey and analyze the design and implementation of 57 JSON-style DSLs for visualization. We analyze these languages supported by a collected corpus of examples for each DSL (consisting of 4395 instances) across a variety of axes organized into concerns related to domain, conceptual model, language relationships, affordances, and general practicalities. We identify tensions throughout these areas, such as between formal and colloquial specifications, among types of users, and within the composition of languages. Through this work, we seek to support language implementers by elucidating the choices, opportunities, and tradeoffs in visualization DSL design.

en cs.HC

Detail Sumber

arXiv Open Access 2022

Grammar Detection for Sentiment Analysis through Improved Viterbi Algorithm

Surya Teja Chavali, Charan Tej Kandavalli, Sugash T M

Grammar Detection, also referred to as Parts of Speech Tagging of raw text, is considered an underlying building block of the various Natural Language Processing pipelines like named entity recognition, question answering, and sentiment analysis. In short, forgiven a sentence, Parts of Speech tagging is the task of specifying and tagging each word of a sentence with nouns, verbs, adjectives, adverbs, and more. Sentiment Analysis may well be a procedure accustomed to determining if a given sentence's emotional tone is neutral, positive or negative. To assign polarity scores to the thesis or entities within phrase, in-text analysis and analytics, machine learning and natural language processing, approaches are incorporated. This Sentiment Analysis using POS tagger helps us urge a summary of the broader public over a specific topic. For this, we are using the Viterbi algorithm, Hidden Markov Model, Constraint based Viterbi algorithm for POS tagging. By comparing the accuracies, we select the foremost accurate result of the model for Sentiment Analysis for determining the character of the sentence.

en cs.CL, cs.LG

Detail DOI Sumber

arXiv Open Access 2022

A Grammar-Based Approach for Applying Visualization Taxonomies to Interaction Logs

Sneha Gathani, Shayan Monadjemi, Alvitta Ottley et al.

Researchers collect large amounts of user interaction data with the goal of mapping user's workflows and behaviors to their higher-level motivations, intuitions, and goals. Although the visual analytics community has proposed numerous taxonomies to facilitate this mapping process, no formal methods exist for systematically applying these existing theories to user interaction logs. This paper seeks to bridge the gap between visualization task taxonomies and interaction log data by making the taxonomies more actionable for interaction log analysis. To achieve this, we leverage structural parallels between how people express themselves through interactions and language by reformulating existing theories as regular grammars. We represent interactions as terminals within a regular grammar, similar to the role of individual words in a language, and patterns of interactions or non-terminals as regular expressions over these terminals to capture common language patterns. To demonstrate our approach, we generate regular grammars for seven visualization taxonomies and develop code to apply them to three interaction log datasets. In analyzing our results, we find that existing taxonomies at the low-level (i.e., terminals) show mixed results in expressing multiple interaction log datasets, and taxonomies at the high-level (i.e., regular expressions) have limited expressiveness, due to primarily two challenges: inconsistencies in interaction log dataset granularity and structure, and under-expressiveness of certain terminals. Based on our findings, we suggest new research directions for the visualization community for augmenting existing taxonomies, developing new ones, and building better interaction log recording processes to facilitate the data-driven development of user behavior taxonomies.

en cs.HC

Detail Sumber

DOAJ Open Access 2021

La co-costruzione dell’ambiente di apprendimento online. Etnografia di un quadrimestre in dad nella scuola secondaria di primo grado

Andreina Re

The co-construction of the online learning environment. Ethnography of first e-learning term in lower secondary school during the pandemic ------- Abstract --- This essay investigates how the online learning environment was built in Italian junior high school (age 11 to 14) during the first COVID-19 lockdown period (February-June 2020). Teachers and pupils found themselves projected into unknown virtual space, almost never explored at school before the surge of the pandemic. They had to engage in adventurous explorations of this new territory, conquering new spaces subduing the rigid structures of educational platforms and official regulatory codes.

Ethnology. Social and cultural anthropology, Language. Linguistic theory. Comparative grammar

Detail Sumber

arXiv Open Access 2021

Semantic Construction Grammar: Bridging the NL / Logic Divide

Dave Schneider, Michael Witbrock

In this paper, we discuss Semantic Construction Grammar (SCG), a system developed over the past several years to facilitate translation between natural language and logical representations. Crucially, SCG is designed to support a variety of different methods of representation, ranging from those that are fairly close to the NL structure (e.g. so-called 'logical forms'), to those that are quite different from the NL structure, with higher-order and high-arity relations. Semantic constraints and checks on representations are integral to the process of NL understanding with SCG, and are easily carried out due to the SCG's integration with Cyc's Knowledge Base and inference engine.

en cs.CL, cs.AI

Detail DOI Sumber

DOAJ Open Access 2020

Entrevista com Guilherme Gontijo Flores

Rodrigo Tadeu Gonçalves, Layla Gabriel de Oliveira

Entrevista com Guilherme Gontijo Flores

Language and Literature, Translating and interpreting

Detail DOI Sumber

DOAJ Open Access 2020

THE ANALYSIS OF CODE SWITCHING AND CODE MIXING PHENOMENON IN ENGLISH FACEBOOK GROUP OF YOUTH COMMUNITY

Juvrianto Chrissunday Jakob

The phenomenon of code switching and code mixing has appeared in our daily life as an event or a trend around us. Facebook as a social media has become of code switching and code mixing phenomenon. This study aimed to analyze and to compare the kind of code switching and code mixing in Facebook group of English Youth Community. The analyzing process of code switching and code mixing in English Youth Community was taken from some comments of the members in English Youth Community where the researcher took screenshot of the members’ comments on Facebook group then analyzed the code switching and code mixing phenomenon in the comments of the members. The resarch subjects of this study are the members of English Youth Community on Facebook group. The method of this study was descriptive qualitative since the researcher intended to obtain the conversations among the members of English Youth Community on Facebook. The result of this study showed that intra essential code switching is the most widely used in some comments in English Youth Community than intra essential code switching and extra essential code switching. Code switching and code mixing phenomenon are done consciously by the members because of the change situation or the change of the topic in the English Youth Community group.

Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

DOAJ Open Access 2020

SOCRATIC QUESTIONS USED BY THE SEVENTH SEMESTER STUDENTS OF ENGLISH DEPARTMENT OF FKIP NOMMENSEN IN SEMINAR CLASS

Fenty Debora Napitupulu

This study is aimed to know to find out the types of Socratic questions used by the students of seventh semester in English Department of FKIP UHN Medan. This research was conducted by descriptive qualitative where the subject is seventh English Department students in Nommensen University academic year 2019/2020 on Seminar Class and the data is students’questions. After analizing questions, it was found that Socratic questions used by the seventh semester of English students in FKIP UHN Medan are questions and clarification, questions that prope purposes, questions that probe assumptions, Question that Probe Information, Reason, Evidence, and Cause, Question about viewpoints or perspectives, Questions that Probe Implication and Consequences, Question about question, Question that Probe Concept, and Questions that probe Inferences and Interpretation. The most dominant is Question that Probe Information, Reason, Evidence, and Cause. It means that the students’ ability in making questions in seminar on ELT presentation is still on the level of getting information from the text, less to have capacity to view or to judge things from some other perspectives, less of preparing themselves reading the seminar paper before the presentation starts. The writer assumed that the students lack of reading.

Language. Linguistic theory. Comparative grammar

Detail DOI Sumber

Hasil untuk "Comparative grammar"