Notes on the Modern Greek Literary Model (From Its Beginnings to the 1930s)
Fotiny Christakoudy-Konstantinidou
This article examines the peculiarities of the modern Greek literary model, spanning from its emergence following the national liberation of 1821 to the transformative period of the 1930s. Particular emphasis is placed on the continuity between folk traditions and modern Greek literature, facilitated by the adoption of the dimotiki language form as the foundation for artistic literary expression. The study explores the interplay of internal and external cultural influences throughout this extended period, highlighting their impact on the aesthetic (self)definition of contemporary Greek literature. A central focus is placed on the developmental framework of national literature proposed by Georgios Theotokas (1905–1966) in his seminal essay “The Free Spirit” (1929). Original title in Bulgarian: Бележки върху новогръцкия литературен модел (от неговите начала до 30-те години на XX в.).
Slavic languages. Baltic languages. Albanian languages
Legal and Regulatory Framework for Cybersecurity in Arab Gulf Monarchies: Stages and Features of Development
G. N. Valiakhmetova, L. V. Tsukanov
This article examines the evolution of the legal and regulatory foundations of the national cybersecurity systems in the member states of the Gulf Cooperation Council (GCC). The authors identify and analyze a complex set of factors that have shaped the dynamics and distinctive features of the development of the legal framework in the Gulf monarchies within the context of their progress toward a secure digital environment. A periodization of the history of the cybersecurity sector's legal base is proposed, with a characterization of its main stages. Particular attention is paid to the analysis of new directions in the modernization of sector-specific legislation, including the fields of financial technology and artificial intelligence. The paper presents the results of a comparative analysis of the concepts for legal regulation and protection of the digital space in the Arabian monarchies. Key challenges hindering the harmonization of the legal framework within the GCC are identified. The authors conclude that the accelerated development of the cybersecurity legal and regulatory framework aligns with the national interests of the GCC states and their long-term development “Visions” (e.g., Saudi Vision 2030). It is substantiated that all the countries under consideration have developed their own unique legal practices in the realm of digital protection and strive to adhere to high international cybersecurity standards and a proactive approach to risk management.
Slavic languages. Baltic languages. Albanian languages
On the Complexity of Language Membership for Probabilistic Words
Antoine Amarilli, Mikaël Monet, Paul Raphaël
et al.
We study the membership problem to context-free languages L (CFLs) on probabilistic words, that specify for each position a probability distribution on the letters (assuming independence across positions). Our task is to compute, given a probabilistic word, what is the probability that a word drawn according to the distribution belongs to L. This problem generalizes the problem of counting how many words of length n belong to L, or of counting how many completions of a partial word belong to L. We show that this problem is in polynomial time for unambiguous context-free languages (uCFLs), but can be #P-hard already for unions of two linear uCFLs. More generally, we show that the problem is in polynomial time for so-called poly-slicewise-unambiguous languages, where given a length n we can tractably compute an uCFL for the words of length n in the language. This class includes some inherently ambiguous languages, and implies the tractability of bounded CFLs and of languages recognized by unambiguous polynomial-time counter automata; but we show that the problem can be #P-hard for nondeterministic counter automata, even for Parikh automata with a single counter. We then introduce classes of circuits from knowledge compilation which we use for tractable counting, and show that this covers the tractability of poly-slicewise-unambiguous languages and of some CFLs that are not poly-slicewise-unambiguous. Extending these circuits with negation further allows us to show tractability for the language of primitive words, and for the language of concatenations of two palindromes. We finally show the conditional undecidability of the meta-problem that asks, given a CFG, whether the probabilistic membership problem for that CFG is tractable or #P-hard.
Reactive Semantics for User Interface Description Languages
Basile Pesin, Celia Picard, Cyril Allignol
User Interface Description Languages (UIDLs) are high-level languages that facilitate the development of Human-Machine Interfaces, such as Graphical User Interface (GUI) applications. They usually provide first-class primitives to specify how the program reacts to an external event (user input, network message), and how data flows through the program. Although these domain-specific languages are now widely used to implement safety-critical GUIs, little work has been invested in their formalization and verification. In this paper, we propose a denotational semantic model for a core reactive UIDL, Smalite, which we argue is expressive enough to encode constructs from more realistic languages. This preliminary work may be used as a stepping stone to produce a formally verified compiler for UIDLs.
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States
Jurgita Kapočiūtė-Dzikienė, Toms Bergmanis, Mārcis Pinnis
Although large language models (LLMs) have transformed our expectations of modern language technologies, concerns over data privacy often restrict the use of commercially available LLMs hosted outside of EU jurisdictions. This limits their application in governmental, defence, and other data-sensitive sectors. In this work, we evaluate the extent to which locally deployable open-weight LLMs support lesser-spoken languages such as Lithuanian, Latvian, and Estonian. We examine various size and precision variants of the top-performing multilingual open-weight models, Llama~3, Gemma~2, Phi, and NeMo, on machine translation, multiple-choice question answering, and free-form text generation. The results indicate that while certain models like Gemma~2 perform close to the top commercially available models, many LLMs struggle with these languages. Most surprisingly, however, we find that these models, while showing close to state-of-the-art translation performance, are still prone to lexical hallucinations with errors in at least 1 in 20 words for all open-weight multilingual LLMs.
Left Branch Extraction in Lower Sorbian
Andreas Pankau
abstract:Corver (1990) and Bošković (2005) note that the availability of Left Branch Extraction–the extraction of material to the left of the head out of DPs, APs, and PPs–in a language correlates with the presence and absence of definite articles. Languages with definite articles do not allow Left Branch Extraction (like English), languages without definite articles do allow Left Branch Extraction (like Polish). They further suggest that the two phenomena are causally linked, claiming Left Branch Extraction in languages with definite articles is blocked by the DP projection. In this paper, I show that the presence of definite articles does not imply the absence of Left Branch Extraction. The evidence comes from Lower Sorbian, an endangered Slavic language spoken in South East Germany. I show that Lower Sorbian has both definite articles and Left Branch Extraction.
Data-Augmentation-Based Dialectal Adaptation for LLMs
FAHIM FAISAL, Antonios Anastasopoulos
This report presents gmnlp’s participation to the Dialect-Copa shared task at VarDial 2024 (Chifu et al., 2024), which focuses on evaluating the commonsense reasoning capabilities of large language models (LLMs) on South Slavic micro-dialects. The task aims to assess how well LLMs can handle non-standard dialectal varieties, as their performance on standard languages is already well-established. We propose an approach that combines the strengths of different types of language models and leverages data augmentation techniques to improve task performance on three South Slavic dialects: Chakavian, Cherkano, and Torlak. We conduct experiments using a language-family-focused encoder-based model (BERTić) and a domain-agnostic multilingual model (AYA-101). Our results demonstrate that the proposed data augmentation techniques lead to substantial performance gains across all three test datasets in the open-source model category. This work highlights the practical utility of data augmentation and the potential of LLMs in handling non-standard dialectal varieties, contributing to the broader goal of advancing natural language understanding in low-resource and dialectal settings.
4 sitasi
en
Computer Science
The Past and Present of Slavistics in China
Zhen Wu
The origins of Slavistics in China can be traced back to 1708. However, Slavistics so far cannot be called a popular scientific field, which is manifested in a fairly small number of scientific achievements directly related to Slavistics, and in a small number of organizations specializing in Slavistics. In recent years, under the influence of international and domestic factors, especially due to the promotion and implementation of the “the Belt and Road Initiative” the number of researchers interested in Slavic studies has shown an increasing trend, and relevant research organizations have been established consistently. This article provides a brief overview of the development of Slavic studies in China, introduces leading scientific organizations, scientific events and publications. In addition, the author pays special attention to the directions and results of research at the Institute of Slavic Languages (the only research institute called “Slavic Languages” in China) and the Research Center for Slavic Nations at Harbin Normal University under the Ministry of Education of the People's Republic of China, using it as a prism to reflect the current state of Slavic studies in China. The research team of the Institute and the Center conducts both traditional and innovative research on languages, education, language consciousness, literature, folklore, and art of Slavic countries. Based on their research, we attempt to get a whole picture of the current status of the Slavic studies in China. In general, focusing on high-quality training of specialists in Slavistics and the construction of the discipline “Slavistics”, Chinese Slavists make efforts and achieve success in scientific and educational fields.
Licensing of DP/PRO embedded subjects in Russian
Irina Burukina
abstract:The paper contributes to the ongoing discussion of DP/PRO alternation examining properties of Russian evaluative adjectival predicatives that embed a non-finite clause (i.e. važno ‘important’) and arguing that (i) sentences with these predicates and an embedded non-finite clause are ambiguous between obligatory control and overt embedded subject analyses, (ii) the DP/PRO alternation does not correlate with the feature specification or the structural size of an embedded clause, (iii) the alternation is not free and can be formally accounted for by an analysis in terms of cross-clausal licensing of embedded overt DP subjects. The novel data from Russian challenge existing approaches to DP/PRO distribution and non-finite subordination in Slavic languages and provide further support for (a version of) Case filter.
Сны о городах, литературная реальность: Центральная и Юго-Восточная Европа в лабиринте письменного слова
Адията Ибришимович-Шабич
В рецензии представлена коллективная монография под названием «Топос города в синхронии и диахронии: литературная парадигма Центральной и Юго-Восточной Европы». Монография посвящена вопросу функционирования топоса города и городского пространства в художественном тексте на примерах избранных произведений XX и XXI вв., тематизирующих пространство городов-«генераторов культуры» (Ю. Лотман): Москва, Минск, Харьков, Варшава, Прага, Братислава, Сараево, София, Будапешт, Бухарест, Париж, Новый Орлеан, Нью-Йорк. Работа существенно расширяет представление об урбанистической проблематике и особенностях поэтики пространства в литературах Центральной и Юго-Восточной Европы, вводит в научный оборот новый концептуально систематизированный литературный материал. Монография отличается междисциплинарным подходом и применением актуального теоретико-понятийного аппарата, что ставит тему городского пространства в современный научный контекст и освещает ее с самых разных точек зрения.
Рецензия поступила в редакцию 02.09.2024.
Цитирование
Ибришимович-Шабич А. Сны о городах, литературная реальность: Центральная и Юго-Восточная Европа в лабиринте письменного слова // Славянский альманах. 2024. No 3–4. С. 515–534. DOI: 10.31168/2073-5731.2024.3-4.28
History of Russia. Soviet Union. Former Soviet Republics, Slavic languages. Baltic languages. Albanian languages
I cantanti di strada degli anni Venti e le loro ‘canzoni-cronaca’: a proposito dei contesti sociali di funzionamento dei fenomeni (post)folklorici (traduzione di Riccardo Mini)
Michail Lur’e
Italian translation of Gorodskie ulichnye pevtsy 1920-kh godov i ikh “pesni-khroniki”: k voprosu o sotsialnykh kontekstakh funktsionirovaniia (post)folklornykh iavlenii by Mikhail Lurie.
Slavic languages. Baltic languages. Albanian languages
LPR: Large Language Models-Aided Program Reduction
Mengxiao Zhang, Yongqiang Tian, Zhenyang Xu
et al.
Program reduction is a prevalent technique to facilitate compilers' debugging by automatically minimizing bug-triggering programs. Existing program reduction techniques are either generic across languages (e.g., Perses and Vulcan) or specifically customized for one certain language by employing language-specific features, like C-Reduce. However, striking the balance between generality across multiple programming languages and specificity to individual languages in program reduction is yet to be explored. This paper proposes LPR, the first technique utilizing LLMs to perform language-specific program reduction for multiple languages. The core insight is to utilize both the language-generic syntax level program reduction (e.g., Perses) and the language-specific semantic level program transformations learned by LLMs. Alternately, language-generic program reducers efficiently reduce programs into 1-tree-minimality, which is small enough to be manageable for LLMs; LLMs effectively transform programs via the learned semantics to expose new reduction opportunities for the language-generic program reducers to further reduce the programs. Our extensive evaluation on 50 benchmarks across three languages (C, Rust, and JavaScript) has highlighted LPR's practicality and superiority over Vulcan, the state-of-the-art language-generic program reducer. For effectiveness, LPR surpasses Vulcan by producing 24.93%, 4.47%, and 11.71% smaller programs on benchmarks in C, Rust and JavaScript. Moreover, LPR and Vulcan have demonstrated their potential to complement each other. By using Vulcan on LPR's output for C programs, we achieve program sizes comparable to those reduced by C-Reduce. For efficiency, LPR takes 10.77%, 34.88%, 36.96% less time than Vulcan to finish all benchmarks in C, Rust and JavaScript, separately.
Swap distance minimization in SOV languages. Cognitive and mathematical foundations
Ramon Ferrer-i-Cancho, Savithry Namboodiripad
Distance minimization is a general principle of language. A special case of this principle in the domain of word order is swap distance minimization. This principle predicts that variations from a canonical order that are reached by fewer swaps of adjacent constituents are lest costly and thus more likely. Here we investigate the principle in the context of the triple formed by subject (S), object (O) and verb (V). We introduce the concept of word order rotation as a cognitive underpinning of that prediction. When the canonical order of a language is SOV, the principle predicts SOV < SVO, OSV < VSO, OVS < VOS, in order of increasing cognitive cost. We test the prediction in three flexible order SOV languages: Korean (Koreanic), Malayalam (Dravidian), and Sinhalese (Indo-European). Evidence of swap distance minimization is found in all three languages, but it is weaker in Sinhalese. Swap distance minimization is stronger than a preference for the canonical order in Korean and especially Malayalam.
The Narcissism of Minor Differences in the Context of Post-Imperial Macedonian Neighbouring
Katica Kulavkova
The Narcissism of Minor Differences in the Context of Post-Imperial Macedonian Neighbouring
The conflicting relations among neighbouring nations in the Balkans may very accurately be explained by S. Freud’s theory of the Narcissism of Minor Differences. Related identities among nations and the bordering zones between countries have always been and continue to be a generator of racial, national, religious and cultural tensions. Whenever the discourse of identities is radicalized, cultural and political hegemony comes to life: identities are ranked according to worth; borders are changed according to national identity; methods of physical and metaphysical violence are used; shared places of memory are appropriated, and those not shared are negated. Perception is in crisis and, as a result, promotes a kind of conflictual mutual misrecognition. This text aims to demystify such installations of hegemony in the (North) Macedonian neighbouring region, and to articulate some principles of a post-hegemonistic paradigm.
Narcyzm małych różnic w kontekście postimperialnego sąsiedztwa Macedonii
Konfliktowe relacje między sąsiednimi narodami na Bałkanach można bardzo trafnie wyjaśnić teorią narcyzmu małych różnic Z. Freuda. Pokrewne tożsamości w obrębie tych narodów oraz stref przygranicznych między poszczególnymi krajami były i są generatorem napięć na tle rasowym, narodowym, religijnym i kulturowym. Ilekroć dyskurs o tożsamościach ulega radykalizacji, ożywa kulturowa i polityczna hegemonia: tożsamości są szeregowane według wartości; granice są zmieniane zgodnie z tożsamością narodową; stosowane są metody przemocy fizycznej i metafizycznej; współdzielone miejsca pamięci są zawłaszczane, a te, które nie są współdzielone, są negowane. Percepcja znajduje się w kryzysie i w rezultacie sprzyja rozwojowi wzajemnego niezrozumienia, które prowadzi do konfliktów. Celem niniejszego tekstu jest demistyfikacja takich działań o charakterze hegemonicznym w sąsiednim regionie (północnej) Macedonii oraz wyartykułowanie pewnych zasad paradygmatu posthegemonistycznego.
Ethnology. Social and cultural anthropology, Slavic languages. Baltic languages. Albanian languages
Argumentative Values of Causality in French Constructions Like <i>s’il échoue, c’est qu’il n’apprend pas ses leçons</i>
Bauvarie Mounga
Our paper focuses on the expression of causality, in particular its argumentative values in French constructions like <i>s’il échoue, c’est qu’il n’apprend pas ses leçons</i>. It should be noted that this type of constructions <i>if P, Q</i>, is observable when <i>if P</i> is factual, i.e. when the proposition P does not necessarily express a hypothesis as is generally the case, but an event that has happened or is going to happen. Thus, we find concessional, adversarial, additive and emphatic <i>if P, Q</i> systems. However, in the context of our paper, we will only devote ourselves to causative if-constructions. Our work will be structured in three parts. First, we will highlight the functioning of causality in factual <i>if</i>-constructions, then the enunciative postures generated by causality and finally the self-presentation of the speaker in French causative non hypothetical constructions.
Slavic languages. Baltic languages. Albanian languages
От Редакции
Agata Piasecka
Philology. Linguistics, Slavic languages. Baltic languages. Albanian languages
Rust: The Programming Language for Safety and Performance
William Bugden, Ayman Alahmar
Rust is a young programming language gaining increased attention from software developers since it was introduced to the world by Mozilla in 2010. In this study, we attempt to answer several research questions. Does Rust deserve such increased attention? What is there in Rust that is attracting programmers to this new language? Safety and performance were among the very first promises of Rust, as was claimed by its early developers. Is Rust a safe language with high performance? Have these claims been achieved? To answer these questions, we surveyed and analyzed recent research on Rust and research that benchmarks Rust with other available prominent programming languages. The results show that Rust deserves the increased interest by programmers, and recent experimental results in benchmarking research show Rust's overall superiority over other well-established languages in terms of performance, safety, and security. Even though this study was not comprehensive (and more work must be done in this area), it informs the programming and research communities on the promising features of Rust as the language of choice for the future.
Formally Verified Native Code Generation in an Effectful JIT -- or: Turning the CompCert Backend into a Formally Verified JIT Compiler
Aurèle Barrière, Sandrine Blazy, David Pichardie
Modern Just-in-Time compilers (or JITs) typically interleave several mechanisms to execute a program. For faster startup times and to observe the initial behavior of an execution, interpretation can be initially used. But after a while, JITs dynamically produce native code for parts of the program they execute often. Although some time is spent compiling dynamically, this mechanism makes for much faster times for the remaining of the program execution. Such compilers are complex pieces of software with various components, and greatly rely on a precise interplay between the different languages being executed, including on-stack-replacement. Traditional static compilers like CompCert have been mechanized in proof assistants, but JITs have been scarcely formalized so far, partly due to their impure nature and their numerous components. This work presents a model JIT with dynamic generation of native code, implemented and formally verified in Coq. Although some parts of a JIT cannot be written in Coq, we propose a proof methodology to delimit, specify and reason on the impure effects of a JIT. We argue that the daunting task of formally verifying a complete JIT should draw on existing proofs of native code generation. To this end, our work successfully reuses CompCert and its correctness proofs during dynamic compilation. Finally, our prototype can be extracted and executed.
Les démons dans la critique littéraire des trente dernières années
Laetitia Decourt
One hundred and fifty years after the first instalment of Demons in The Russian Messenger, the novel is still being actively investigated, not least since the end of USSR made possible its return to the fore of dostoevskian studies. Rehabilitation of the novel as a whole also extends to specific areas, which were consistently denigrated before, such as novelistic structure and narrative techniques, seen as proof of an outstanding artistic success. Considering their very plot, Demons inspire very dynamic studies of the real prototypes behind the characters and situations depicted, as well as persistent analyses of metaphysical, religious and medical related subjects. Today, perhaps even more than thirty years ago, not only everyday readers but a fair number of critics still tend to consider Dostoevsky as a misunderstood prophet and his novels as moral guides, cathartic manuals.
Slavic languages. Baltic languages. Albanian languages
East Slavic language landscape and problems of its study
Irina A. Bukrinskaya, Olga E. Karmakova
The article presents the history of East Slavic isoglosses project (ESI). The authors show the possibilities of interpretation of its materials, analyze selected maps from the ESI issues, representative for classification of the East Slavic continuum. The article discusses topical issues of linguogeography: the opposition between centre and periphery, how vocal or consonantal dialects are, the presence of archaic and innovative phenomena.