Slavic Settlers and Conversion to Islam in Russian Central Asia
Paolo Sartori
Abstract In this article I set out to show that tsarist officials and churchmen attempted to discredit anyone who wanted to convert from Orthodoxy to Islam in Russian Central Asia. Surrounded by a Muslim-majority population which they perceived as intrinsically hostile by dint of its religious affiliation, imperial officials regarded any apostate not only as someone breaking the law, but also in moral terms on a par with anyone who passed over to the enemy. I shall devote particular attention to the language employed by colonial authorities to frame apostasy either as the work of Muslim proselytizers or as reflective of a personality disorder on the part of the convert, such as poor morals, mental deficiency, or greed. By proceeding in this fashion, I shall argue that Russian colonial officials pursued smear campaigns against all successful converts as a means of minimizing and dismissing the phenomenon of Orthodox Christians’ conversions to Islam.
Submorphemic Word Formation in Lithuanian Neology: Patterns in the Formation of Lexical Blends
Gabrielė Brazaitytė
The article examines the formation patterns of lexical blends—words created by non-morphemically combining at least two words—in the Lithuanian language. Although traditionally blends are not considered derivational formations, the phonetic regularities identified in this study suggest that blends are systematically motivated units that may be integrated into the general word-formation system. The aim of the study is to structurally analyze lexical blends recorded in the Database of Lithuanian Neologisms and to determine the phonetic regularities that govern their formation. The study reveals that blends are typically formed in two ways: through overlap (when phonemes from both source words coincide) and through concatenation (when the components are joined without phonemic overlap). The formation of blends is motivated by similarity to a prototype word— most often the second component—which determines the syllable count, stress pattern, and syllable structure of the blend. Regularities in breakpoint placement—the non-morphemic boundary where the two source words are joined—were also identified: the first source word most frequently breaks at the onset–nucleus boundary, while the second breaks at the syllable boundary. These findings suggest that blends are formed in accordance with Lithuanian phonotactic constraints and with the aim of preserving recognizability. Breakpoints at the nucleus–coda boundary are avoided, as the nucleus and coda form the rhyme, which is crucial to a word’s sound and recognizability. Thus, although lexical blends lack the morphemic structure typical of traditional word-formation, the study demonstrates that they follow consistent phonetic patterns and should therefore be considered a potential part of the word-formation system.
Slavic languages. Baltic languages. Albanian languages
Differenzierung mit digitalen Medien
Sonja Bacher
Der vorliegende Artikel gibt einen kurzen Überblick darüber, wie digitale Medien im Fremdsprachenunterricht zu Zwecken der Inklusion und Differenzierung eingesetzt werden können. Auf Grundlage des Kompetenzmodells DigCompEdu (2017) werden praxisnahe Maßnahmen diskutiert, wodurch die Selbsttätigkeit der Lernenden gefördert sowie ein inklusiver, handlungs- und medienbasierter Unterricht gewährleistet werden kann. Im Artikel wird außerdem auf potenzielle Grenzen von Online-Tools hingewiesen.
Slavic languages. Baltic languages. Albanian languages
Other and Alien: Faces of Medea in Dramatic and Stage Interpretations of 20th-21st Centuries
T. A. Sharypina
This study analyzes the discourse of the Other and the Alien in transformations of the Medea narrative from the 20th to the 21st centuries, prominently seen through binary dramaturgical parallels in the works of H. H. Jahnn, G. Butler, H. Müller, L. Gaudé, M. Kurochkin, V. Klimenko, J. Anouilh, and T. Lanoye. This contributes to a productive understanding of how the constant components of the myth are perceived by different national mentalities within the context of this opposition. The relevance of this work lies in examining artistic texts through the lens of the national code phenomenon within historical epochs. The Medea archetype is reinterpreted in the context of contemporary life, encapsulating the struggles of women for intellectual and social independence. Diachronic typological parallels of this discourse are identified across all levels of the ethological structure of dramatic and stage adaptations as modern art becomes increasingly visualized and dramatic works require both verbal and visual, auditory perception. It is argued that the interaction of artistic codes from various art forms enhances the multifaceted meanings of Medea’s narratives, intensifying emotional effects and contributing to a psychologically nuanced characterization of the heroine.
Slavic languages. Baltic languages. Albanian languages
VERB ANGLICISMS IN RUSSIAN AND CZECH LANGUAGES IN INFORMAL COMMUNICATION OF INTERNET USERS
E. Petruchina, M. De Poj
The main focus of the article is on the adaptation of verbs borrowed from English to the Russian and Czech language systems, which name the actions of Internet users in informal online communication. A corpus study of verb anglicisms in two closely related Slavic languages reveals a lower degree of their adaptation in Czech compared to Russian, first of all in terms of graphics and orthography, as well as a lower stylistic marking.
Named Entity Recognition for Low-Resource Languages - Profiting from Language Families
Sunna Torge, Andrei Politov, Christoph U. Lehmann
et al.
Machine learning drives forward the development in many areas of Natural Language Processing (NLP). Until now, many NLP systems and research are focusing on high-resource languages, i.e. languages for which many data resources exist. Recently, so-called low-resource languages increasingly come into focus. In this context, multi-lingual language models, which are trained on related languages to a target low-resource language, may enable NLP tasks on this low-resource language. In this work, we investigate the use of multi-lingual models for Named Entity Recognition (NER) for low-resource languages. We consider the West Slavic language family and the low-resource languages Upper Sorbian and Kashubian. Three RoBERTa models were trained from scratch, two mono-lingual models for Czech and Polish, and one bi-lingual model for Czech and Polish. These models were evaluated on the NER downstream task for Czech, Polish, Upper Sorbian, and Kashubian, and compared to existing state-of-the-art models such as RobeCzech, HerBERT, and XLM-R. The results indicate that the mono-lingual models perform better on the language they were trained on, and both the mono-lingual and language family models outperform the large multi-lingual model in downstream tasks. Overall, the study shows that low-resource West Slavic languages can benefit from closely related languages and their models.
Large Language Models for Multilingual Slavic Named Entity Linking
Rinalds Viksna, I. Skadina, Daiga Deksne
et al.
This paper describes our submission for the 4th Shared Task on SlavNER on three Slavic languages - Czech, Polish and Russian. We use pre-trained multilingual XLM-R Language Model (Conneau et al., 2020) and fine-tune it for three Slavic languages using datasets provided by organizers. Our multilingual NER model achieves 0.896 F-score on all corpora, with the best result for Czech (0.914) and the worst for Russian (0.880). Our cross-language entity linking module achieves F-score of 0.669 in the official SlavNER 2023 evaluation.
Is German secretly a Slavic language? What BERT probing can tell us about language groups
A. Mysiak, J. Cyranka
In the light of recent developments in NLP, the problem of understanding and interpreting large language models has gained a lot of urgency. Methods developed to study this area are subject to considerable scrutiny. In this work, we take a closer look at one such method, the structural probe introduced by Hewitt and Manning (2019). We run a series of experiments involving multiple languages, focusing principally on the group of Slavic languages. We show that probing results can be seen as a reflection of linguistic classification, and conclude that multilingual BERT learns facts about languages and their groups.
Ujemanje v zgradbah s količinskimi zvezami: slovenščina v sopostavitvi z drugimi zahodnimi južnoslovanskimi jeziki
Mladen Uhlik, Andreja Žele
Prispevek obravnava skladenjske lastnosti količinskih zvez z glavnimi števniki, ki jih glede na problematiko lahko okvirno razdelimo na dve skupini: števniki od 1 do 4 in števniki od 5 naprej. Kontrastivna obravnava opozarja predvsem na posebnosti rabe v slovenščini, štokavščini, v izbranem srednječakavskem in kajkavskem govoru. Obravnavana so razmerja na besednozvezni in predikacijski ravnini.
Literature (General), Slavic languages. Baltic languages. Albanian languages
A cross-linguistic study of the discourse functions of antonymy in Albanian and English
Ekaterina Strati, Ergys Bezhani
Antonymy is considered a binary lexical-semantic opposition of words, a lexical-sematic relation among other relations such as synonymy, hyponymy, and meronymy. Several authors have categorized the binary opposition of antonyms when considering the typology of antonyms. Issues such as the use of antonyms in discourse, how different languages use antonyms, patterns of distribution, etc., will be dealt with in this paper. This study aims to provide a comparative perspective of the Albanian and English patterns of co-occurrence of antonymous pairs in sentences. The co-occurrence of antonyms in sentences and the roles of antonymous pairs have been the object of cross-linguistic study in different languages, including French, Swedish, Chinese, Serbian, Rumanian, etc. This article adds to the list of cross-linguist studies, providing a broader database for further comparative studies and opening the way to other comparative studies between similar and different languages other than English. This article aims to identify examples of antonym pairs co-occurring in the same syntactic frame using an empirical, quantitative method. The Albanian National Corpus is used as a “clinical setting” for the investigation. A detailed analysis of the distribution of such pairs in the English and Albanian corpora, with examples illustrating similarities and differences. Both languages show a predominance of ancillary and coordinated antonymy. Also, coordinated antonyms are more significant in number in both languages and are followed by the second major group of ancillary antonyms. Regarding differences, far more examples were classified as residual in Albanian than in English. Key Words: antonymy, co-occurrence, pairs, corpus-based research, discourse functions
Borrowings from the Old Slavic Digenis Akritis in the Muscovite Buovo d’Antona (Bova Korolevich)
Robert Romanchuk
The Byzantine Greek “romantic epic” Digenis Akritis and the Venetian romance Buovo d’Antona, translated into different Slavic languages under different circumstances and traversing Slavic space by different routes, reached Muscovy in the sixteenth century – each undergoing significant editing there and ultimately becoming textually entangled. It has not been noticed before that the editor of a Muscovite redaction of Buovo borrowed a number of passages from an early redaction of the “formulaically styled” Slavic Digenis. The encounter of Buovo with Digenis may help explain the former work’s subsequent popularity in Russia, as it transformed the courtly hero Buovo d’Antona into the Russian folk hero Bova Korolevich. It also helps sketch a context for the direct ancestor of the most important extant manuscript of the Slavic Digenis.
Proto-Slavic *čerěnъ and *černъ. I. Vault (of the hearth)
M. Saenko
In etymological literature, one can come across the reconstruction of several homonymous words *černъ and/or *čerěnъ in the Proto-Slavic language. The highest number of reconstructed words is four while the lowest number is two. The reconstructions differ both in form and semantics. This article attempts to determine which position is best supported by facts and to clarify the reconstructed meanings. The analysed material suggests that we should differentiate between *černъ ‘handle’ and *čerěnъ / *čerěnь / *čerenъ / *čerenь (?), which denoted a certain part of the hearth, most likely the vault. It is highly probable that certain designations of nets in West Slavic languages are also continuants of the latter word. Etymologically, *čerěnъ / *čerěnь / *čerenъ / *čerenь is most probably related to the verb *kuriti (sę), although alternative explanations are possible.
East Slavic Contact Interaction in Primorsky Krai (According to the Linguistic Expedition of 2023)
G. Pilipenko
The paper presents the first results of the field research carried out in June 2023 in the Khorol and Chernigov districts of Primorsky Krai of Russia among the descendants of immigrants from Ukrainian and Belarusian lands. The purpose of the expedition survey was to record the speech of dialect speakers of the Ukrainian and Belarusian languages, to study the functioning of dialects in contact with closely related Russian language, to identify the preservation of traditional folk culture. The main problem during field research was related to the difficulties of informants when trying to record spontaneous dialect speech. Relics of Belarusian dialects were not found in this expedition. The utterances in Ukrainian were recorded in cases of quoting speech of older relatives, in memories about childhood and in folklore. The paper presents some transcribed texts, reveals phonetic and morphological features, analyzes individual lexical units. Narratives on the following topics are presented: the linguistic situation among migrants at the micro and macro levels, metalanguage observations, stories about local place names and ethnic nicknames. In addition to audio recordings, a written source is analyzed, the history of the formation of the Khorol village, written in Russian with Ukrainian dialect items. This text reveals all the linguistic features noted in the oral speech of the informants. More-over, it contains valuable information about the daily activities of the first settlers, about their holidays, customs, contacts with neighbors of different ethnicity. In the future, it is planned to work on transcribing the entire corpus of recorded texts for their publication and analysis, as well as to publish a written history of the village of Khorol.
Albanian Fake News Detection
Ercan Canhasi, Rexhep Shijaku, Erblin Berisha
Recent years have witnessed the vast increase of the phenomenon known as the fake news. Among the main reasons for this increase are the continuous growth of internet and social media usage and the real-time information dissemination opportunity offered by them. Deceiving, misleading content, such as the fake news, especially the type made by and for social media users, is becoming eminently hazardous. Hence, the fake news detection problem has become an important research topic. Despite the recent advances in fake news detection, the lack of fake news corpora for the under-resourced languages is compromising the development and the evaluation of existing approaches in these languages. To fill this huge gap, in this article, we investigate the issue of fake news detection for the Albanian language. In it, we present a new public dataset of labeled true and fake news in Albanian and perform an extensive analysis of machine learning methods for fake news detection. We performed a comprehensive feature engineering and feature selection experiments. In doing so, we explored the Albanian language-related feature categories such as the lexical, syntactic, lying-detection, and psycho-linguistic features. Each article was also modeled in four different ways: with the traditional bag-of-words (BoW) and with three distributed text representations using the state-of-the-art Word2Vec, FastText, and BERT methods. Additionally, we investigated the best combination of features and various types of classification methods. The conducted experiments and obtained results from evaluations are finally used to draw some conclusions. They shed light on the potentiality of the methods and the challenges that the Albanian fake news detection presents.
27 sitasi
en
Computer Science
Rapid radiation of the inner Indo-European languages: an advanced approach to Indo-European lexicostatistics
A. Kassian, M. Zhivlov, George Starostin
et al.
Abstract In this article we present a new reconstruction of Indo-European phylogeny based on 13 110-item basic wordlists for protolanguages of IE subgroups (Proto-Germanic, Proto-Slavic, etc.) or ancient languages of the corresponding subgroups (Hittite, Ancient Greek, etc.). We apply reasonably formal techniques of linguistic data collection and post-processing (onomasiological reconstruction, derivational drift elimination, homoplastic optimization) that have been recently proposed or specially developed for the present study. We use sequential phylogenetic workflow and obtain a consensus tree based on several algorithms (Bayesian inference, maximum parsimony, neighbor joining; without topological constraints applied). The resulting tree topology and datings are entirely compatible with established expert views. Our main finding is the multifurcation of the Inner IE clade into four branches ca. 3357–2162 bc: (1) Greek-Armenian, (2) Albanian, (3) Italic-Germanic-Celtic, (4) Balto-Slavic–Indo-Iranian. The proposed radiation scenario may be reconciled with diverse opinions on Inner IE branchings previously expressed by Indo-Europeanists.
A miracle in a dairy shop: On a typology of an “emigrant” novel
Tatiana Marchenko
Alja Rachmanowa (real name Galina Dyuryagina-Hoyer) is a Russian writer with a European success in 1930s. Her books were published in German translation made by her husband Arnulf Hoyer, and still remain obscure in Russia. The phenomenon is rather fascinating from the point of view of typology of émigré prose. A novel “Milchfrau in Ottakring” (1933) of the prolific author of three dozen books was extremely popular, not only it remains relevant, but looks very modern as an “emigrant” novel of special type. In a diary form based on a personal experience, the writer sets out a story of success. A qualified philologist, Alja Rachmanowa (her literary pseudonym is usually referred to) was forced to become for a couple of years a saleswoman in a rented dairy shop. This experience of a “foreigner”, her national and sociocultural identity, adaptation, and ultimately successful integration are reflected in the diary autobiographical novel. The Russian component of the book in German whose author / heroine balances between the spheres of “own” and “stranger,” has driven a success of the “Milchfrau in Ottakring”. Russian realities, Russian mentality, nostalgie for the native country permeating the narration, especially attracted the readership. One of the important markers of “Russianness” is a citation of Russian literature, not in the form of a mere quotation, but as a rethinking, re-interpretation, a dispute with the classics. The article deals with some examples of such citing (F.M. Dostoevsky, A.V. Koltsov, A.N. Pleshcheev, Z.N. Gippius). A fragment of the novel in Russian translation is given in Annex.
Literature (General), Slavic languages. Baltic languages. Albanian languages
Key Approaches to the Study of History of Altai in 18th Century in Modern Russian Historiography
D. S. Bobrov, L. G. Zaitseva
The article is devoted to a comprehensive examination of theoretical approaches to the study of the history of Altai in the 18th century, developed by Russian specialists over the past three decades. The relevance of the study is associated with the emerging trend in domestic science towards a holistic understanding of the retrospective of individual regions, including through understanding the interdependent role of various social and administrative factors in the colonization process. Special attention is paid to the conceptual assessment of the nature of the initial development of Altai. The methodological grounds for applying the theories of military colonization, frontier, frontier modernization to the history of the region are outlined. The authors state the presence in the scientific literature of two non-identical approaches to the characterization of the Russian border in the south of Western Siberia. The authors demonstrate the obvious desire of historians to carry out comprehensive studies of regional and local levels of civil and mining management, which has emerged against the background of the preservation of the heuristic significance of microhistorical analysis of the role of individual fortified (fortresses, forts) or production facilities (factories). The diversity and conceptual heterogeneity of modern Russian historiography of the history of Altai in the 18th century is summarized. Some forecasts of the development of the historiographic situation are formulated.
Slavic languages. Baltic languages. Albanian languages
Od specijaliziranih mrežnih korpusa do rječnika za neizvorne govornike
Irena Srdanović
U ovom radu predstavljena su dva pristupa u stvaranju specijaliziranih mrežnih korpusa o hrvatskom turizmu na japanskome jeziku za potrebe izrade specijaliziranoga dvojezičnog japansko-hrvatskog rječnika za učenike japanskoga jezika. Oba pristupa koriste se tehnologijom WebBootCat unutar platforme Sketch Engine za automatsko stvaranje specijaliziranih mrežnih korpusa (Baroni i dr. 2006, Kilgarriff i dr. 2014). Prvi pristup stvara korpus iz odabranih riječi, odn. polaznih pojmova (eng. <i>seed words</i>) koje su najrelevantnije za temu. Drugi pristup skuplja brojne mrežne stranice sa sadržajima o turizmu u određenim regijama, gradovima i mjestima u Hrvatskoj napisanim na japanskome jeziku, koje se zatim upotrebljavaju za stvaranje mrežnih korpusa. Oba pristupa pružaju specijalizirane internetske korpuse koji su, bez obzira na malu veličinu, korisni za profiliranje leksika u određenome području, u ovome slučaju u području turizma. U procesu stvaranja rječnika drugi se pristup pokazao osobito korisnim za odabir natuknica, dok su se oba pristupa pokazala vrlo korisnim za istraživanje i odabir autentičnih primjera iz korpusa. Uočeni su i određeni nedostatci u jezičnoj obradi podataka na japanskome jeziku, npr. pogreške u lematizaciji nekih pojmova i naziva svojstvenih određenoj kulturi te je ukazano na potrebu za usavršavanjem postojećih alata za jezičnu obradu podataka na japanskome jeziku. Japansko-hrvatski dvojezični rječnik namijenjen učenicima japanskoga jezika trenutačno je u eksperimentalnoj fazi te se učenici i nastavnici njime koriste i grade ga s pomoću otvorene platforme za izradu mrežnih rječnika Lexonomy (Mechura 2017). Osim što je rad na dvojezičnome rječniku koristan kao sredstvo za stručnu obuku u analizi i opisu jezika s pomoću suvremenih tehnologija (npr. korpusa, platforma za pretraživanje korpusa i izradu rječnika), važnost rječnika vidi se i u izobrazbi novih stručnjaka osposobljenih za rad u turizmu na japanskome jeziku, što je izrazito potrebno. U budućnosti bi se mogao primijeniti isti pristup za stvaranje specijaliziranih korpusa i rječnika za japanski i druge jezične parove.
Slavic languages. Baltic languages. Albanian languages
Določitev spola anglizmov v slovenščini
Eva Sicherl
Prispevek osvetljuje principe, ki delujejo na določitev spola pri prevzemanju angleških leksemov v slovenščino. Pri določitvi spola samostalnikov se namreč pri prehodu iz jezikovnega sistema, ki uporablja naravni spol (angleščina), v jezikovni sistem s slovničnim spolom (slovenščina) praviloma ohranja biološki spol pri samostalnikih za bitja, spol samostalnikov, ki poimenujejo neživo, pa se oblikuje po drugačnih kriterijih.
Literature (General), Slavic languages. Baltic languages. Albanian languages
Problematics and Composition Originality of P. Ackroyd’s Novel “The Plato Papers”
E. V. Ushakova
The composition of the novel by the modern English writer P. Ackroyd “The Plato Papers” in connection with the problems of the work is considered. The review of the last researches devoted to creativity of this author is given. Attention is paid to how the work reveals the problem of interpretation of history, its ambiguity, the concept of the movement of time, progress, the relationship of the past, present and future. The author shows that when shifting the time of the novel in the future, Ackroyd uses the documentary imitation technique, changes narrators and types of narration. The composition of the work consisting of separate small parts connected in a key way by the speaker Plato is analyzed. Such properties of the composition of the analyzed novel as fragmentation and nonlinearity are noted. It is shown how the novel realizes postmodern idea about the relativity of the interpretations of the events of the past. The position of the main character endowed with the gift of insight, able to penetrate into the past, to look from an unusual point of view at the present, to doubt and seek the truth is revealed.
Slavic languages. Baltic languages. Albanian languages