Hasil untuk "Ural-Altaic languages"

Menampilkan 20 dari ~218059 hasil · dari DOAJ, arXiv, CrossRef

JSON API
arXiv Open Access 2026
Automating the Analysis and Improvement of Dynamic Programming Algorithms with Applications to Natural Language Processing

Tim Vieira

This thesis develops a system for automatically analyzing and improving dynamic programs, such as those that have driven progress in natural language processing and computer science, more generally, for decades. Finding a correct program with the optimal asymptotic runtime can be unintuitive, time-consuming, and error-prone. This thesis aims to automate this laborious process. To this end, we develop an approach based on 1. a high-level, domain-specific language called Dyna for concisely specifying dynamic programs 2. a general-purpose solver to efficiently execute these programs 3. a static analysis system that provides type analysis and worst-case time/space complexity analyses 4. a rich collection of meaning-preserving transformations to programs, which systematizes the repeated insights of numerous authors when speeding up algorithms in the literature 5. a search algorithm for identifying a good sequence of transformations that reduce the runtime complexity, given an initial, correct program We show that, in practice, automated search -- like the mental search performed by human programmers -- can find substantial improvements to the initial program. Empirically, we show that many speed-ups described in the NLP literature could have been discovered automatically by our system. We provide a freely available prototype system at https://github.com/timvieira/dyna-pi.

en cs.PL, cs.CL
arXiv Open Access 2026
NILE: Formalizing Natural-Language Descriptions of Formal Languages

Tristan Kneisel, Marko Schmellenkamp, Fabian Vehlken et al.

This paper explores how natural-language descriptions of formal languages can be compared to their formal representations and how semantic differences can be explained. This is motivated from educational scenarios where learners describe a formal language (presented, e.g., by a finite state automaton, regular expression, pushdown automaton, context-free grammar or in set notation) in natural language, and an educational support system has to (1) judge whether the natural-language description accurately describes the formal language, and to (2) provide explanations why descriptions are not accurate. To address this question, we introduce a representation language for formal languages, Nile, which is designed so that Nile expressions can mirror the syntactic structure of natural-language descriptions of formal languages. Nile is sufficiently expressive to cover a broad variety of formal languages, including all regular languages and fragments of context-free languages typically used in educational contexts. Generating Nile expressions that are syntactically close to natural-language descriptions then allows to provide explanations for inaccuracies in the descriptions algorithmically. In experiments on an educational data set, we show that LLMs can translate natural-language descriptions into equivalent, syntactically close Nile expressions with high accuracy - allowing to algorithmically provide explanations for incorrect natural-language descriptions. Our experiments also show that while natural-language descriptions can also be translated into regular expressions (but not context-free grammars), the expressions are often not syntactically close and thus not suitable for providing explanations.

en cs.FL, cs.CL
DOAJ Open Access 2025
Mehis Heinsaare mistahes-ruumid. Igatsus tõelise tundmuse välu järele ja selle kriitiline potentsiaal

Sven Vabar

The any-spaces-whatever of Mehis Heinsaar: A longing for the field of authentic feeling and its critical potential The works of Mehis Heinsaar have seldom been examined through the philosophical or theoretical frameworks prevalent in Western humanities and social sciences. One notable exception is the philosophy of Gilles Deleuze, for whom Heinsaar seems to have a particular affinity: Deleuze’s ideas have been repeatedly employed in interpreting Heinsaar’s works, and the author himself has expressed interest in them. However, previous engagements with Deleuze’s philosophy have primarily drawn on the concepts of desiring-machines and rhizomes, as introduced in Kafka by Deleuze and Guattari – concepts that are well suited to analyzing only one aspect of Heinsaar’s writing. This article instead turns to the concept of the any-space-whatever, developed in Deleuze’s books Cinema 1 and Cinema 2, to extend the analysis of Heinsaar’s work beyond the point at which the desiring-machines described in Kafka break down and the individual’s sensorimotor schema begins to disintegrate. It emerges that any-spaces-whatever are strikingly prevalent in Heinsaar’s writing; indeed, one could argue that a significant portion of his literary output is oriented towards discovering and creating such spaces.

Other Finnic languages and dialects
DOAJ Open Access 2025
Tähenduste seletamine leksikograafias: kuivõrd on abi suurtest keelemudelitest?

Maria Tuulik, Ene Vainik, Esta Prangel et al.

Artiklis uurime, kas ja mil määral saavad suured keelemudelid (SKMid) leksikograafi aidata sõnastikuseletuste loomisel ja sõnatähendus(t)e esitusel. Katsetatud SKMidest jäid lõplikku hindamisküsitlusse GPT-4o ja Gemini 1.5 Pro tähendusseletused 60 sõna kohta. Kasutatud näitetu promptimise (zero-shot) meetod võimaldas selgitada leksikograafide eelistusi ja nõudmisi seletustele ilma nende vormi ennetavalt piiramata. Tähendusseletusi hindas 21 leksikograafi. Vastajate üldine rahulolu oli suhteliselt kõrge: leksikograafid hindasid oma töö jaoks väga kasulikuks või mõnevõrra kasulikuks 83% seletustest. Mudelite sooritus eesti keeles enamasti rahuldas vastajaid: 75% juhtudel hinnati seletuste keelt grammatiliselt korrektseks. Rahulolu semantilise esitusega oli mõnevõrra madalam: väljatoodud tähendusi peeti õigeks 56% juhtudel. Kahest SKM-ist hinnati kõrgemalt GPT-4o tähendusseletusi. Leksikograafide hinnangud varieerusid. Hinnangutele lisatud kommentaaride analüüs selgitas leksikograafide rahulolu/rahulolematuse sisemist struktuuri, millele toetudes on edaspidi võimalik luua täpsemaid prompte eri tüüpi sõnastikuseletuste tarvis. Abstract. Maria Tuulik, Ene Vainik, Esta Prangel, Margit Langemets, Eleri Aedmaa, Kristina Koppel, Lydia Risberg: Describing senses for lexicography: how helpful are large language models? In this article, we investigate whether and to what extent large language models (LMMs) can assist lexicographers in writing definitions and presenting word meaning(s). Of the LLMs tested, the GPT-4o and Gemini 1.5 Pro meaning descriptions for 60 words were selected for the final evaluation. The zero-shot prompting method used made it possible to identify the lexicographers’ preferences and demands for the descriptions without preemptively limiting their form. The meaning descriptions were evaluated by 21 lexicographers. The overall satisfaction of respondents was relatively high: lexicographers rated 83% of the meaning descriptions as very or somewhat useful for their work. The performance of the models in Estonian was mostly satisfactory for the respondents: in 75% of the cases, the language of the meaning descriptions was judged to be grammatically correct. Satisfaction with the semantic presentation was somewhat lower: the meanings given were considered correct in 56% of cases. Of the two LLM-s, GPT-4o’s output was rated higher. Lexicographers’ evaluations varied. The analysis of the comments accompanying the evaluations clarified the internal structure of the lexicographers’ satisfaction/dissatisfaction and will allow the findings to be used in the future to create detailed prompts providing basis for different types of dictionary definitions.

Philology. Linguistics, Finnic. Baltic-Finnic
arXiv Open Access 2025
Dual-Language General-Purpose Self-Hosted Visual Language and new Textual Programming Language for Applications

Mahmoud Samir Fayed

Most visual programming languages (VPLs) are domain-specific, with few general-purpose VPLs like Programming Without Coding Technology (PWCT). These general-purpose VPLs are developed using textual programming languages and improving them requires textual programming. In this thesis, we designed and developed PWCT2, a dual-language (Arabic/English), general-purpose, self-hosting visual programming language. Before doing so, we specifically designed a textual programming language called Ring for its development. Ring is a dynamically typed language with a lightweight implementation, offering syntax customization features. It permits the creation of domain-specific languages through new features that extend object-oriented programming, allowing for specialized languages resembling Cascading Style Sheets (CSS) or Supernova language. The Ring Compiler and Virtual Machine are designed using the PWCT visual programming language where the visual implementation is composed of 18,945 components that generate 24,743 lines of C code, which increases the abstraction level and hides unnecessary details. Using PWCT to develop Ring allowed us to realize several issues in PWCT, which led to the development of the PWCT2 visual programming language using the Ring textual programming language. PWCT2 provides approximately 36 times faster code generation and requires 20 times less storage for visual source files. It also allows for the conversion of Ring code into visual code, enabling the creation of a self-hosting VPL that can be developed using itself. PWCT2 consists of approximately 92,000 lines of Ring code and comes with 394 visual components. PWCT2 is distributed to many users through the Steam platform and has received positive feedback, On Steam, 1772 users have launched the software, and the total recorded usage time exceeds 17,000 hours, encouraging further research and development.

en cs.PL, cs.SE
DOAJ Open Access 2024
After Integration Training: Variation in profiles and investment in Finnish as a second language

Tanja Seppälä

This study examines adult migrants’ situation in Finland one year after completing Integration Training, a year-long full-time labour market training. Narrative inquiry, influenced by the core story approach, is applied in analysing semi-structured interviews and questionnaires. Participants (n = 30) are clustered into nine profiles based on their work or study status and occupational orientation. In addition, the concept of investment is used in analysing the participants’ small stories within these profiles. The most participants recognise a need to improve their Finnish proficiency, but their opportunities to invest in Finnish language learning and use are often limited. Many participants compensate for the lack of Finnish learning opportunities by investing in self-study. Imagined futures as students, employees, or Finnish citizens seem to drive the investment in second language learning for most participants. The results shed light on the diversity of migrants’ life situations after Integration Training and make their challenges in second language investment visible. "Kotoutumiskoulutuksen jälkeen: variaatio profiileissa ja investointi suomeen toisena kielenä." Tässä artikkelissa tarkastellaan aikuisten Suomeen muuttaneiden (n = 30) työ- ja opiskelutilannetta sekä suomi toisena kielenä -investointia vuosi työvoimakoulutuksena järjestetyn, noin vuoden mittaisen kotoutumiskoulutuksen jälkeen. Kotoutumiskoulutuksen tavoitteena on tukea maahanmuuttajia suomen tai ruotsin kielen oppimisessa ja koulutukseen tai työelämään pääsemisessä. Aineisto on kerätty vuosina 2019–2021, ja siihen kuuluu puolistrukturoituja haastatteluja (n = 27) ja kyselylomakevastauksia (n = 3). Aineiston analysoinnin lähtökohtana oli tyyppitarinamainen tarkastelu ja osallistujat ryhmiteltiin työ- ja opiskelutilanteen sekä urasuuntautumisen mukaan, minkä tuloksena osallistujat jaettiin yhdeksään profiiliin. Lisäksi profiileittain on tarkasteltu small stories -analyysillä eli pieniä kertomuksia tutkimalla suomen kieleen investointia eli ajan ja energian käyttämistä toisen kielen oppimiseen ja käyttöön. Tulosten perusteella useimmilla osallistujilla on tarve investoida suomen kieleen ja kehittää suomen kielen taitoaan, mutta monilla on erittäin rajalliset mahdollisuudet siihen. Monet osallistujat kompensoivat suomen kielen oppimis- ja käyttämistilanteiden puutetta investoimalla nonformaalisti kotiopiskeluun. Toisaalta osalle, etenkin korkeakoulutuksessa, englannin taito on tärkeä toivotulle uralle pääsemisessä. Kuvitellut tulevaisuudet työntekijöinä, opiskelijoina tai kansalaisina näyttivät olevan useimpien investoinnin kannustimena. Vain erittäin harva oli kuitenkaan vakaassa työsuhteessa. Lisäksi monet ovat päätyneet vaihtamaan uransa suuntaa, mikä voi johtaa pääoman ja osaamisen menettämiseen. Koulutuksen kielitaitovaatimukset näyttävät vaikuttavan joidenkin kohdalla koulutuksessa eteenpäin pääsemiseen ja uran suunnan vaihtamiseen. Osallistujat ovat saaneet kotoutumiskoulutuksesta monenlaista, olennaista pääomaa, mutta monien polut työelämään näyttävät siitä huolimatta monivaiheisilta. Tämä tutkimus edistää toisen kielen alan tutkimusta tekemällä näkyväksi maahanmuuttajien toisen kielen investoinnin jännitteet sekä moninaiset, erittäin yksilölliset tilanteet, joissa maahanmuuttajat ovat kotoutumiskoulutuksen jälkeen. Tulokset myös osoittavat, että kotoutumiskoulutuksen jälkeen suomen kielen kehittämiseen ja käyttöön tarvitaan tukea vertaisilta, syntyperäisiltä puhujilta, oppilaitoksilta ja työpaikoilta.

Finnic. Baltic-Finnic
DOAJ Open Access 2024
Mõrtsukatöö Liiva järvel

Juhan Kreem

Renowned Estonian novelist Karl Ristikivi published a gothic short story titled Luige­laul (“Swansong”) in 1968, drawing inspiration from a true event from the Baltic Middle Ages. In 1428, Goswin von Aschenberg, a Vogt of the Teutonic Order stationed at Grobiņa Castle in the south-western corner of present-day Latvia, perpetrated the murder of Livonian bishops’ envoys en route to Rome to denounce the Order’s tyranny. The precise source of Ristikivi’s inspiration remains obscure. This article endeavours to trace the massacre on Lake Liepāja as a motif in Baltic German and Estonian fiction, exploring its significance as a lieu de mémoire. The historical episode gained widespread recognition through the publication of the famous Wandalia by Albertus Krantz in 1519. Baltic Enlightenment authors (Arndt, Bergmann, Merkel, Küttner) utilized it in their general polemics against medieval feudal barbarism. By the nineteenth century, the event had firmly entrenched itself in the collective understanding of the Livonian Middle Ages. Baltic German writers crafted ballads (Andrejanoff, Hirschheydt) and light fiction (Schneider) around the incident. In 1866 the first literary adaptation of the massacre on Lake Liepāja emerged as a sentimental novella in Estonian, akin to the tales of Genevieve of Brabant and Robinson Crusoe popular among Estonian readers at the time. Although the event took place relatively far from Estonia, it remained embedded in the memory of Estonian readers throughout the first half of the twentieth century, primarily through sporadic newspaper articles as well as history textbooks. Thus, the massacre on Lake Liepāja provides a good example of the intertwined cultural memories of Estonians and Baltic Germans. While these cultural memories have often been perceived as conflicting or, at best, mutually unaware, there are notable instances of overlap and consensus that warrant further exploration and consideration in future research.

Other Finnic languages and dialects
arXiv Open Access 2024
Cocobo: Exploring Large Language Models as the Engine for End-User Robot Programming

Yate Ge, Yi Dai, Run Shan et al.

End-user development allows everyday users to tailor service robots or applications to their needs. One user-friendly approach is natural language programming. However, it encounters challenges such as an expansive user expression space and limited support for debugging and editing, which restrict its application in end-user programming. The emergence of large language models (LLMs) offers promising avenues for the translation and interpretation between human language instructions and the code executed by robots, but their application in end-user programming systems requires further study. We introduce Cocobo, a natural language programming system with interactive diagrams powered by LLMs. Cocobo employs LLMs to understand users' authoring intentions, generate and explain robot programs, and facilitate the conversion between executable code and flowchart representations. Our user study shows that Cocobo has a low learning curve, enabling even users with zero coding experience to customize robot programs successfully.

en cs.HC, cs.AI
CrossRef Open Access 2024
Theory of Altaic language and grammatical relationship between Turkic and Mongolian languages

Bold L

Алтай хэлний онол ба түрэг, монгол хэлний хэл зүйн харьцаа Зохиогч энэхүү илтгэлдээ алтай хэлний онолыг зөвшөөрөгчид хийгээд үл зөвшөөрөгчдийн байр суурийн талаар тодорхой баримтад түшиглэн өгүүлээд, улмаар түрэг, монгол хэлний хэл зүйн харьцааны асуудлыг уг хэлнүүдэд буй зүг чиг заах тийн ялгалын айгаар төлөөлүүлэн, эртний түрэг, монгол хэл хийгээд орчин цагийн түрэг, монгол хэлний жишээ баримтаар тодруулан үзээд дараах дүгнэлтэд хүрчээ. Үүнд, алтай хэл судлалын хүрээнд багагүй чухал байр эзэлдэг түрэг, монгол хэлний харьцааны асуудлыг уг хэлнүүдэд буй нийтлэг зүйлийг нь гол болгон гарал зүйнхээ хувьд холбоотой гэх, эсвэл уг нийтлэг зүйлийг нь нэг хэлнээсээ нөгөө хэлэндээ үзүүлсэн нөлөөгөөр тайлбарласаар нэлээд хугацаа өнгөрч, энэ хоёр эсрэг тэсрэг онолын аль нэгийг зөвшөөрч, нөгөөг нь үгүйсгэсэн бүтээл туурвил гарсаар байна. Гэвч энд зохиогчийн задлан шинжилсэн баримтаас үзэхэд түрэг, монгол хэлний харьцааг уг хоёр эсрэг тэсрэг онолын аль нэгийг нь баримтлан нөгөөг нь үгүйсгэн тайлбарлах нь учир дутагдалтай, уг хоёр онолыг тэгш харгалзан үзэх шаардлагатай нь тодорхой байна. Энэ нь эдгээр эсрэг тэсрэг онолууд нь харилцан бие биеэ үгүйсгэсэн боловч бас харилцан бие биеэ нөхсөн шинжтэй гэдгийг харуулж байна. Өөрөөр хэлбэл, түрэг, монгол хэл нь гарал зүйнхээ үүднээс холбогдож болох авч бас бие биедээ үлэмж нөлөө үзүүлсэн, гэхдээ зөвхөн түрэг хэл монгол хэлэнд биш бас орчин цагийн түрэг хэлнүүдийг бүрэлдэн тогтоход монгол хэл багагүй чухал үүрэг гүйцэтгэжээ хэмээн дүгнэжээ.

DOAJ Open Access 2023
Eesti NSV kirjanike liidu kirjanduslikud konsultandid kui väravavalvurid

Eve Annuk

Literature was a major ideological tool in the cultural policy of the Soviet Union. Literature played an important social role, as it facilitated the upbringing of a new kind of people – Soviet people. Due to the importance of literature, writers, as representatives of the cultural elite, were also considered major cultural influencers. Writers’ work was guided by the Writers’ Union, which was deeply interwoven with the rules of the Soviet cultural policy. The ideological role of the Writers’ Union is evidenced, among other things, by the fact that from 1945 onwards it employed literary consultants who were appointed by a decision of the management board of the Writers’ Union. The article looks at the role of the literary consultants of the Writers’ Union of the Estonian SSR as guides for young/inexperienced authors. Literary consultants were writers hired by the Writers’ Union to evaluate the works of novice writers as well as to offer guidance and instruction. Although there were separate consultants for poetry and prose, several consultants handled both poems and prose writings. Various writers, such as Debora Vaarandi, Ellen Niit, Paul Kuusberg, Eduard Männik and others, served as consultants. In addition to reading manuscripts and providing feedback, the consultants were tasked with organizing the work of the young authors’ section, actively seeking out new talented authors, advising experienced writers, etc. Novice authors would send their works to the consultants, who would provide both written and oral feedback. The authors who sent their texts to the consultants included regular working people as well as those who went on to become well-known writers, such as Ilmi Kolla, Enn Vetemaa, Ly Seppel, Kalju Kangur, Jüri and Ülo Tuulik, among others. The communication between a consultant and an author would sometimes turn tense, if the author did not agree with the consultant’s assessment. A consultant could endorse a work for publication or reject it. Consultants were required to base their assessments on both aesthetic and ideological criteria. From the ideological aspect, they had to rely on the officially approved Soviet views and rhetoric; in other words, a consultant was expected to perform a certain role. The authors had to follow ideological guidelines in their work as well. The article also looks at the extent to which both the consultants and the authors had wiggle room in this matter.

Other Finnic languages and dialects
DOAJ Open Access 2023
Extralinguistic arguments in 21st century language planning discourse: a “superdictionary” between language standardization from above and below

Kadri Koreinik

Language standardization has historically been a critical area of inquiry in language policy and planning (LPP) research. This is a political matter, which contributes to “more (and hierarchical) heterogeneity” rather than linguistic homogeneity (Gal 2006: 171). The paper empirically explores extralinguistic arguments, which are used by language professionals (planners, academics, educators) in mainstream media discourse. This public discourse is initiated by the launch of an Estonian “superdictionary” in 2019 (see Tavast et al. 2020), and its public reception. By using Critical Discourse Analysis (CDA) as a method, the paper also provides insight into the discursive construction of language as such and (Standard) Estonian by different LPP actors. Above all, it aims to understand the issues of power and authority in language standardization. The discourse illustrates the paradigmatic change in standardization and lexicography: from including selected language samples to the acceptance of non-elite language variants and varieties. This change has generated a polarization of stance among language professionals, and similar discursive moves, e.g., references to the past and future dangers, metaphors and other comparisons are used. Kokkuvõte. Kadri Koreinik: Keelevälised argumendid 21. sajandi keelekorraldusdiskursuses: „supersõnaraamat“ ülalt-alla ja alt-üles keelekorralduse vahel. Keele standardimist peetakse keelepoliitika ja -korraldusuuringute üheks keskseks huviobjektiks. Standardimine ehk normimine on poliitiline valik, mis selmet keelelist homogeensust luua, tekitab hoopis heterogeensust, mis on olemuselt hierarhiline: normikeelt kas osatakse või mitte (Gal 2006). Artiklis uuritakse keeleväliseid argumente, mida kasutavad peavoolumeedias normimise üle arutlemiseks keelega professionaalselt seotud inimesed (keelekorraldajad, -teadlased, haridustegelased). Uuritav diskursus lähtub nn sõnastikureformist ja selle retseptsioonist. Analüüsimeetodina kasutatakse kriitilist diskursusanalüüsi, mis võimaldab süvitsi uurida, kuidas avalikkuses konstrueeritakse keelega, sh eesti kirjakeelega seotud tähendusi. Lisaks aitab analüüs mõista võimu ja domineerimise küsimusi keele normimisel. Diskursuses kajastub paradigmaatiline muutus, mis väärtustab kasutus- ja korpuspõhist lähenemist leksikograafias, aga ka keele normimisel. Kuigi muutus on löönud „keeleinimesed“ kahte lehte, kasutatakse poolt- või vastuargumentides samu võtteid: viiteid autoriteetidele, keele ohustatusele, metafoore ja teisi võrdlusi.

Philology. Linguistics, Finnic. Baltic-Finnic
DOAJ Open Access 2022
Irrealis moods in southwestern (Oghuz) Turkic dialects according to Russian turkologist

LIRA MAIRAMBEK KYZY

Turkic moods are divided into two categories: indicative and irrealis. While indicative moods include structures that give meaning to time, irrealis moods contain structures that express request, order, condition and necessity. It has been observed that these irrealis moods which indicate the attitude, intent and emotion of the speaker against the action, sometimes vary from dialect to dialect in terms of form, meaning, and function, and sometimes intertwine with each other. This paper deals with how Russian Turkologists such as Baskakov, Kononov, Shcherbak, Serebrennikov, Gadzhieva Pokrovskaya, Grunina, Budagova, and Guzev have examined the state, structure, and use of irrealis moods in southwestern Turkic dialects.

Language and Literature, Ural-Altaic languages
arXiv Open Access 2022
Annotating Norwegian Language Varieties on Twitter for Part-of-Speech

Petter Mæhlum, Andre Kåsen, Samia Touileb et al.

Norwegian Twitter data poses an interesting challenge for Natural Language Processing (NLP) tasks. These texts are difficult for models trained on standardized text in one of the two Norwegian written forms (Bokmål and Nynorsk), as they contain both the typical variation of social media text, as well as a large amount of dialectal variety. In this paper we present a novel Norwegian Twitter dataset annotated with POS-tags. We show that models trained on Universal Dependency (UD) data perform worse when evaluated against this dataset, and that models trained on Bokmål generally perform better than those trained on Nynorsk. We also see that performance on dialectal tweets is comparable to the written standards for some models. Finally we perform a detailed analysis of the errors that models commonly make on this data.

en cs.CL
DOAJ Open Access 2021
An etymologıcal evaluatıon on the +gil suffıx

Süleyman Kaan YALÇIN

The "+gil" suffix, which comes to names in Turkish and derives the names of kinship, family and relationship from them, is one of the controversial suffixes due to its origin. Although there is a general consensus on the use and function of the suffix, there is no agreement on the origin of it. Various researches have put forward ideas about the origin of the +gil suffix. Some of them have supposed that the suffix is a compound of two suffixes and the others have been on the opinion that the suffix originates from a single word or a particle. While these studies generally do not give determinations about the word or particle on which the suffix is based; only the relationship situation of the suffix and the word "kil", which means house in Chuvash Turkic, has been discussed in detail. In this study, by defending the opinion that the +gil suffix originates from an independent structure in ancient times, which has been also voiced by many researchers, it also will be defended that the +gil suffix was originated from the same root with the +lAr suffix which has similar functions with the +gil and thought to derive from the a-lar / an-lar (personal pronoun + plural suffix) structure. It will be tried to prove that a structure originating from *+l plural suffix and a pronoun-based word in both Turkic and Altaic languages are in the structure of +gil suffix.

Language and Literature, Ural-Altaic languages
DOAJ Open Access 2021
Window to a world beyond: Göran Schildt’s journey to Bulgaria and Romania in 1963 and some multilingual and multicultural strategies

Sabira Ståhlberg

International traveller and acclaimed Swedish-language author Göran Schildt sailed in the Black Sea in the summer of 1963. He was a well-read scholar with a deep interest in the Antiquity and a seasoned traveller with a vast experience of multilingual and multicultural situations. This was the first and last visit of his yacht Daphne to the Black Sea and the Eastern Bloc. Through the eyes of this keen observer, a small aperture can be detected among the bricks in the walls dividing Europe. A window had been opened by world politicians in the Iron Curtain at the end of the 1950s. Although there were periods of high global tension, new possibilities for travel and tourism were created in some Eastern Bloc countries, among them Bulgaria and Romania. Visits by dozens of journalists, writers and artists and thousands of charter tourists from the Western Bloc over the next few decades opened up new windows to the world beyond the Iron Curtain. Göran Schildt stands out among the Nordic cultural visitors to Bulgaria and Romania in the post-war period. His desire to get acquainted with everyday life and ordinary people, capability to see behind facades and analysing experiences could be defined as journalistic, but his travel writing went deeper. In comparison with some other writers from Finland, who visited Bulgaria or Romania during the Cold War, such as the poet Lassi Nummi or comic fiction writer Arto Paasilinna, and the Bulgarian author Yordan Radichkov who visited Sweden, Schildt’s background, interests and multilingual and multicultural strategies supported the discovery and collection of extensive information and the processing of it into a multidimensional travel book. This article discusses the journey and travel narrative of Göran Schildt from the perspective of multilingual and multicultural strategies for encountering other languages, societies and cultures, and the processing of experiences as recorded in his diary and his popular travel narrative.

Finnic. Baltic-Finnic, Social Sciences
arXiv Open Access 2021
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures

Jonathan Dunn

This paper measures similarity both within and between 84 language varieties across nine languages. These corpora are drawn from digital sources (the web and tweets), allowing us to evaluate whether such geo-referenced corpora are reliable for modelling linguistic variation. The basic idea is that, if each source adequately represents a single underlying language variety, then the similarity between these sources should be stable across all languages and countries. The paper shows that there is a consistent agreement between these sources using frequency-based corpus similarity measures. This provides further evidence that digital geo-referenced corpora consistently represent local language varieties.

en cs.CL
DOAJ Open Access 2020
Katse kirjeldada artefaktinimesid

Tiina Laansalu, Peeter Päll, Tõnu Tender

Artefaktinimed on traditsiooniliselt liigitatud kultuurnimede alaliigiks, mis hõlmab inimese loodud rajatiste nimesid. Inimtekkeliste objektide hulk on tänapäeval aga üha suurem ja erinäolisem, eriti linlikus keskkonnas, mistõttu vajab traditsioonilise kohanimede klassifikatsiooni artefaktinimede rühma kirjeldus täpsustamist ja täiendamist. Artikkel vaatleb artefaktinimesid kolmes kasutussfääris: maapiirkonnas, linnapiirkonnas ja slängis. Ruraalsete artefaktinimede puhul on analüüsitud Harjumaa Jüri ja Kose kihelkonna kohanimistuid, urbonüümide alusandmed pärinevad KNABist ning keskenduvad suures osas Tallinna nimedele. Slänginimede puhul on esitatud valdavalt Tallinna ja Tartu näited, mis pärinevad eri aegadel valminud teadustöödest, artiklitest ning internetist. Abstract. Tiina Laansalu, Peeter Päll, and Tõnu Tender: An attempt to describe artefact names. Artefact names have been traditionally classified as a subtype of cultural names designating names of man-made facilities. The number of human-constructed objects is growing rapidly and they are very diverse, especially in the urban environment, therefore the classical description of artefact names needs updating and elaborating. The article looks into artefact names in three domains: the rural environment, the urban environment and slang expressions. Examples of rural artefact names have been taken from the Jüri and Kose parishes in Harjumaa, urban names are based on the data from Tallinn of the Place Names Database (KNAB) of the Institute of the Estonian Language, slang names are taken from various sources covering mainly Tallinn and Tartu.

Philology. Linguistics, Finnic. Baltic-Finnic
arXiv Open Access 2020
Towards Large-Scale Data Mining for Data-Driven Analysis of Sign Languages

Boris Mocialov, Graham Turner, Helen Hastie

Access to sign language data is far from adequate. We show that it is possible to collect the data from social networking services such as TikTok, Instagram, and YouTube by applying data filtering to enforce quality standards and by discovering patterns in the filtered data, making it easier to analyse and model. Using our data collection pipeline, we collect and examine the interpretation of songs in both the American Sign Language (ASL) and the Brazilian Sign Language (Libras). We explore their differences and similarities by looking at the co-dependence of the orientation and location phonological parameters

en cs.CL
DOAJ Open Access 2019
Setting the boundaries: Partitive verbs in Estonian verb classifications

Anne Tamm, Natalia Vaiss

The paper examines a class of Estonian transitive verbs referred to as ‘partitive verbs’: verbs that appear with partitive-marked objects. This class is more heterogeneous than previously assumed. Firstly, there are verbs that cannot have total (accusative) objects. Secondly, there are verbs that can have total objects in lexically restricted combinations. Thirdly, some verbs combine freely with a boundarymarking element. Finally, there is a group of less studied verbs that normally represent unbounded situations but nevertheless allow total object marking if the context highlights the boundary of a situation or result. We clarify the conditions of partitive verbs appearing with total objects, hoping to enhance Estonian L2 instruction and lexicography. *** "Piiride seadmisest: partitiivverbid eesti verbiklassifikatsioonides" Artiklis vaatleme eesti keele transitiivverbide klassifikatsioonides partitiivverbideks liigitatud verbe, keskendudes nn pehmetele partitiivverbidele. Kasutusnäidetest ilmneb, et pehmete partitiivverbide näol on tegemist heterogeensema verbiklassiga, kui on eelnevate uurimuste alusel järeldatud. Täiustame seniseid klassifikatsioone eesmärgiga täpsustada verbide esitust eesti keele õppesõnaraamatuis. Eristame partitiivverbide klassi, mis tüüpjuhtudel väljendavad tulemuseta ja piiritlemata situatsioone, ent esinevad siiski ka totaalobjektiga, kui diskursuse kontekst tõstab esile situatsiooni piirid ja tulemuse.

Philology. Linguistics, Finnic. Baltic-Finnic
DOAJ Open Access 2018
PLACE NAMES IN KADI BURHANEDDIN’S POETRY

Nevzat Özkan

Although Kadı Burhaneddin was a poet and a statesman who lived in the Central Anatolia, his fame spread beyond the boundaries of Anatolia, and he made great contributions to establishing the Turkish Language as a written language in the whole Anatolia, the Balkans and the Caucasus. However, different ideas were put forward in Anatolia and Azerbaijan about the language he employed in his poems. Kadı Burhanettin’s geography is important not just because it determined the language his poems were written in but also it shaped the poems. In this respect, the study of the place names used in his poetry will provide us with important insights into his style, and it will also reveal the poet’s conception of the place and his perception of the geography.

Language and Literature, Ural-Altaic languages

Halaman 8 dari 10903