Hasil untuk "Germanic languages. Scandinavian languages"

Menampilkan 20 dari ~329562 hasil · dari arXiv, CrossRef, Semantic Scholar, DOAJ

JSON API
arXiv Open Access 2025
Unitary Expressions: A Necessary Abstraction for Extensible Quantum Programming Languages and Systems

Ed Younis

Quantum gates are the fundamental instructions of digital quantum computers. Current programming languages, systems, and software development toolkits identify these operational gates by their titles, which requires a shared understanding of their meanings. However, in the continuously developing software ecosystem surrounding quantum computing -- spanning high-level programming systems to low-level control stacks -- this identification process is often error-prone, challenging to debug, maintenance-heavy, and resistant to change. In this paper, we propose replacing this nominal gate representation with a functional one. We introduce the OpenQudit system for describing, parsing, optimizing, analyzing, and utilizing programs comprising gates described as symbolic unitary expressions. As part of this effort, we design the Qudit Gate Language (QGL), a unitary-specific expression language, and implement a differentiating just-in-time compiler in OpenQudit towards embedding this language in quantum programming languages and systems. Additionally, we have precisely designed and implemented the Qudit Virtual Machine (QVM) to evaluate quantum programs and their gradients efficiently. This evaluation is performed millions of times during the compilation of quantum programs. Our QVM can compute gradients approximately ten times faster than current leading numerical quantum compilation frameworks in the most common use cases. Altogether, the OpenQudit system is envisioned to (1) support many-level or qudit-based quantum systems, (2) enable the safe composition of program transformation tools, (3) accelerate circuit optimizers and transpilers, (4) enable compiler extensibility, and (5) provide a productive, simple-to-use interface to quantum practitioners.

en cs.PL, quant-ph
arXiv Open Access 2025
ZeroML: A Next Generation AutoML Language

Monirul Islam Mahmud

ZeroML is a new generation programming language for AutoML to drive the ML pipeline in a compiled and multi-paradigm way, with a pure functional core. Meeting the shortcomings introduced by Python, R, or Julia such as slow-running time, brittle pipelines or high dependency cost ZeroML brings the Microservices-based architecture adding the modular, reusable pieces such as DataCleaner, FeatureEngineer or ModelSelector. As a native multithread and memory-aware search optimized toolkit, and with one command deployability ability, ZeroML ensures non-coders and ML professionals to create high-accuracy models super fast and in a more reproducible way. The verbosity of the language ensures that when it comes to dropping into the backend, the code we will be creating is extremely clear but the level of repetition and boilerplate required when developing on the front end is now removed.

en cs.PL, cs.AI
CrossRef Open Access 2025
Anaphoric polar answers in Gallo‑Romance and West Germanic

Michiel de Vaan

Abstract In several medieval and modern varieties of Gallo-Romance and West Germanic, answers to polar questions may consist of ‘yes’ or ‘no’ followed by a personal or demonstrative pronoun with anaphoric reference. This type of answers is typologically rare, which begs the question of its origin and its possible spread via language contact. The present article reviews the dialectal evidence, especially on the Germanic side, discusses the etymology of the anaphoric responsives, and evaluates the possibilty of their contact-induced origin and spread. In passing, I propose a novel etymology for English yes .

DOAJ Open Access 2025
Film – Körper

Marc Kudlowski, Alexandra L. Zepter

Im Rahmen einer Einzelfallstudie mit dem Viertklässler Boris (Pseudonym) wurden bezüglich seiner literarästhetischen Rezeptionsprozesse zum Momo-Medienverbund unerwartet oft selbst initiierte Rollenspiele beobachtet. Darin bezog sich Boris mit ganzem Körpereinsatz auf seine Rezeptionen, indem er Inhaltsaspekte nachspielte oder potenziell Mögliches innerhalb der Regeln der fiktionalen Welt durch Substitution gestaltete. Ausgelöst wurden die Rollenspiele vorrangig im Kontext der Veräußerung von Imaginationen zu den eingesetzten Filmen (Realspielfilm, Zeichentrickspielfilm und -serie). Zudem wurde beobachtet, dass der Schüler die entstandenen inneren Bilder nicht in Sprache übersetzte, sondern beim szenisch-spielerischen Transfer die Kongruenz der referenzierten Modalitäten im Innen und Außen nutzte. Auf diese Weise brachte er latente Bedeutungsschichten körpersprachlich zum Ausdruck, die er – so unsere These – verbalsprachlich noch nicht explizieren bzw. in Worte übersetzen konnte. Im Beitrag stellen wir die Potenziale des leiblich-ästhetischen Respondierens auf Film für die Didaktik zur Diskussion und argumentieren für eine Erweiterung der Leihkörperschafts-Theorie nach Voss (2006).   Abstract (English): Film – Body. On the Human-Body Dimension of the Medium as a Resource for Literary-Aesthetic Processes As part of an individual case study with fourth-grader Boris (pseudonym), self-initiated role-playing games were observed unexpectedly often with regard to his literary-aesthetic reception processes for the Momo media mix. In these games, Boris referred to his receptions with his entire body by acting out aspects of the content or by creating potential possibilities within the rules of the fictional world through substitution. The role-playing games were primarily triggered in the context of making himself imaginations about the films used (real-life feature film, animated feature film and animated feature series). It was also observed that the student did not translate the resulting inner images into language, but used the congruence of the referenced modalities inside and outside during the role-playing games. In this way, he expressed latent layers of meaning through body language, which, according to our thesis, he was not yet able to translate into words. In this article, we discuss the potential of bodily-aesthetic responses to film for didactics and argue for an extension of Voss's (2006) theory of ‘borrowed bodies’ (Leihkörperschaft).  

Education, Communication. Mass media
arXiv Open Access 2024
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators

Indraneil Paul, Goran Glavaš, Iryna Gurevych

Code understanding and generation have fast become some of the most popular applications of language models (LMs). Nonetheless, research on multilingual aspects of Code-LMs (i.e., LMs for code generation) such as cross-lingual transfer between different programming languages, language-specific data augmentation, and post-hoc LM adaptation, alongside exploitation of data sources other than the original textual content, has been much sparser than for their natural language counterparts. In particular, most mainstream Code-LMs have been pre-trained on source code files alone. In this work, we investigate the prospect of leveraging readily available compiler intermediate representations (IR) - shared across programming languages - to improve the multilingual capabilities of Code-LMs and facilitate cross-lingual transfer. To this end, we first compile SLTrans, a parallel dataset consisting of nearly 4M self-contained source code files coupled with respective intermediate representations. Next, starting from various base Code-LMs (ranging in size from 1.1B to 7.3B parameters), we carry out continued causal language modelling training on SLTrans, forcing the Code-LMs to (1) learn the IR language and (2) align the IR constructs with respective constructs of various programming languages. Our resulting models, dubbed IRCoder, display sizeable and consistent gains across a wide variety of code generation tasks and metrics, including prompt robustness, multilingual code completion, code understanding, and instruction following.

en cs.AI, cs.CL
arXiv Open Access 2024
A Uniform Framework for Language Inclusion Problems

Kyveli Doveri, Pierre Ganty, Chana Weil-Kennedy

We present a uniform approach for solving language inclusion problems. Our approach relies on a least fixpoint characterization and a quasiorder to compare words of the "smaller" language, reducing the inclusion check to a finite number of membership queries in the "larger" language. We present our approach in detail on the case of inclusion of a context-free language given by a grammar into a regular language. We then explore other inclusion problems and discuss how to apply our approach.

en cs.FL, cs.LO
DOAJ Open Access 2023
Woyzeck is Back!: A Comparative Reading of Traumatised Soldiers in Georg Büchner’s Woyzeck and Anthony Neilson’s Penetrator

Tuğba Aygan

Following its official recognition by the American Psychiatric Association in 1980, posttraumatic stress disorder (PTSD) has paved the way for modern trauma studies. Since the diagnosis was primarily framed around war-related experiences of veteran soldiers, PTSD subsequently dominated literary war narratives in portraying soldiers’ lives in the trenches and life post-discharge. Long before it was diagnosed and entered the literature, German playwright Georg Büchner delineated a character who embodied PTSD in his masterpiece Woyzeck (1913). Although the character had long been impeached for madness in literary circles due to his bizarre behaviours, this paper argues that Woyzeck, the protagonist who is constantly abused, is actually a victim of PTSD. Indeed, 157 years after Woyzeck, British playwright Anthony Neilson introduced another deranged soldier who can similarly be surmised as a victim of PTSD in his play Penetrator (1993). Building on this common ground, the present study aims to offer a comparative analysis of these two traumatised soldiers by drawing on the symptoms of PTSD resulting not fromwar but from ill-treatment in the army. By reflecting on these plays, the study comments on the deleterious effects of army life on soldiers, and how, as victims of different forms of violence, these soldiers become perpetrators of violence themselves.

German literature, Germanic languages. Scandinavian languages
arXiv Open Access 2022
Decision trees for regular factorial languages

Mikhail Moshkov

In this paper, we study arbitrary regular factorial languages over a finite alphabet $Σ$. For the set of words $L(n)$ of the length $n$ belonging to a regular factorial language $L$, we investigate the depth of decision trees solving the recognition and the membership problems deterministically and nondeterministically. In the case of recognition problem, for a given word from $L(n)$, we should recognize it using queries each of which, for some $ i\in \{1,\ldots ,n\}$, returns the $i$th letter of the word. In the case of membership problem, for a given word over the alphabet $Σ$ of the length $n$, we should recognize if it belongs to the set $L(n)$ using the same queries. For a given problem and type of trees, instead of the minimum depth $h(n)$ of a decision tree of the considered type solving the problem for $L(n)$, we study the smoothed minimum depth $H(n)=\max\{h(m):m\le n\}$. With the growth of $n$, the smoothed minimum depth of decision trees solving the problem of recognition deterministically is either bounded from above by a constant, or grows as a logarithm, or linearly. For other cases (decision trees solving the problem of recognition nondeterministically, and decision trees solving the membership problem deterministically and nondeterministically), with the growth of $n$, the smoothed minimum depth of decision trees is either bounded from above by a constant or grows linearly. As corollaries of the obtained results, we study joint behavior of smoothed minimum depths of decision trees for the considered four cases and describe five complexity classes of regular factorial languages. We also investigate the class of regular factorial languages over the alphabet $\{0,1\}$ each of which is given by one forbidden word.

en cs.FL, cs.CC
DOAJ Open Access 2022
Hand an sich legen : über die Alltäglichkeit des (unliterarischen) Suizids

Břetislav Horyna

The article Committing suicide. On the everydayness of (non-literary) suicide deals out criticism on the handling of suicides. It subjects sociological positions and also the statistical knowledge and develops a noetic perspective on suicide with the help of historical philosophical explanations and the current social developments, such as the emerging Euthanasia debate.

Germanic languages. Scandinavian languages, History of Northern Europe. Scandinavia
DOAJ Open Access 2022
Britta Olrik Frederiksen (red.): "Dansk Editionshistorie 2. Udgivelse af norrøn og gammeldansk litteratur"

Oliver Blomqvist

Utgivare av källtexter har gärna som ambition att ställa sig utanför historien: de är endast förmedlare av en text (eller en handskrift) ur det förflutna i sin mest ursprungliga form, med anspråk på att göra den tillgänglig för samtida läsare och forskare. Men utgivaren är också själv ett led i historien och återspeglar oundvikligen sin tids rådande vetenskapliga strömningar och ideologier. Det finns alltid teoretiska och metodologiska stridsspörsmål, falanger och aktörer, och varje utgåva utgör antingen implicit eller explicit en partsinlaga i en pågående debatt. Forskningsområdet editionshistoria kan därför ses som en gren av vetenskapshistorien i stort men även av idéhistorien. Editionshistorian ser de lärda och vetenskapliga utgåvorna som historiska artefakter. Betraktar man dem i ljuset av personhistoriska, institutionshistoriska eller litteratursociologiska upplysningar tecknar sig tidsandan vid deras tillblivelse med ny skärpa. Ställda på rad bildar de istället en teoretisk och utgivarteknisk utvecklingslinje för den vetenskapliga källpublikationen som verksamhet. Detta är i korta drag utgångspunkten för det nyutkomna verket Dansk Editionshistorie som i sin tur utgör kulmen av ett tvärvetenskapligt forskningsprojekt med samma namn under åren 2011–2016; se vidare inledningen till band 1 (Kondrup, Troelsgård & Bloch 2021: 31–48) och projektets webbsida.

Philology. Linguistics, Germanic languages. Scandinavian languages
DOAJ Open Access 2021
Norsk språkhistorie IV: Tidslinjer

Eva Skafte Jensen

I 2018 så fjerde og sidste bind af det store værk Norsk språkhistorie dagens lys. De første tre udkom i 2016 (I: Mønster), 2018 (II: Praksis) og 2018 (III: Ideologi). Det samlede værk er planlagt til at bestå af i alt fire bind, og dette fjerde udgør altså det afsluttende. Som de tre tidligere er dette fjerde bind tematiseret, hvilket kommer til udtryk i bindets undertitel Tidslinjer. Et værk som dette indeholder naturligt et eller – som i dette tilfælde – flere forord. I disse beskrives formålet med at skrive en ny sproghistorie: Hvad kan dette nye værk som andre tidligere værker ikke kan? Hvorfor er det overhovedet relevant at skrive en ny sproghistorie? Des­uden sættes indholdet af det pågældende værk ind i den større ramme som det samlede firebindsværk udgør.

Philology. Linguistics, Germanic languages. Scandinavian languages
S2 Open Access 2020
A brief overview of Marollien dialect features

Liubov Ulianitckaia

The article provides an overview of the lexical and grammatical features as well as the sociopolitical environment of Marollien that originated in the 18th century as a dialect on the territory of Brussels. Marollien is essentially the Dutch language in its Brabantian dialect, strongly influenced by French. There are literary works, performances, and musicals written and staged in Marollien, as well as dictionaries and journals published in it. Historically, the Marollien dialect is a sociolect: it was generally used by Belgians coming to Brussels from Wallonia in search of a job and settling in one of the districts of Brussels — Marolles. A special emphasis is placed on lexical features of the dialect: gastronomic and everyday vocabulary are looked at and the examples of French loanwords and Southern Dutch language norm deviations are provided. Standard Dutch calques in French, when translating idioms in particular, are also identified. The differences between Dutch, French, and Marollien place names are illustrated. In the field of morphology and word formation, there is a regular mixture of Germanic and Romanic stems which is indicated. Examples of Marollien phonetic features are also provided. The article acknowledges frequent code switching in Marollien speech, which by and large resembles the phenomenon of linguistic interference. Due to the fact that Marollien is rapidly disappearing, the Brussels-Capital region is trying to support the dialect: various activities are being organized in order to propagate its use and enhance its prestige. Nevertheless, Marollien is not included in the well-known citizen initiative “Marnix Plan”, aimed at developing the methodology for the sequential study of several languages for all segments of the population in Brussels. This initiative is also discussed in the article.

1 sitasi en
arXiv Open Access 2020
German's Next Language Model

Branden Chan, Stefan Schweter, Timo Möller

In this work we present the experiments which lead to the creation of our BERT and ELECTRA based German language models, GBERT and GELECTRA. By varying the input training data, model size, and the presence of Whole Word Masking (WWM) we were able to attain SoTA performance across a set of document classification and named entity recognition (NER) tasks for both models of base and large size. We adopt an evaluation driven approach in training these models and our results indicate that both adding more data and utilizing WWM improve model performance. By benchmarking against existing German models, we show that these models are the best German models to date. Our trained models will be made publicly available to the research community.

en cs.CL, cs.LG
arXiv Open Access 2020
Really Embedding Domain-Specific Languages into C++

Hal Finkel, Alexander McCaskey, Tobi Popoola et al.

Domain-specific languages (DSLs) are both pervasive and powerful, but remain difficult to integrate into large projects. As a result, while DSLs can bring distinct advantages in performance, reliability, and maintainability, their use often involves trading off other good software-engineering practices. In this paper, we describe an extension to the Clang C++ compiler to support syntax plugins, and we demonstrate how this mechanism allows making use of DSLs inside of a C++ code base without needing to separate the DSL source code from the surrounding C++ code.

en cs.PL
arXiv Open Access 2019
Attention-based method for categorizing different types of online harassment language

Christos Karatsalos, Yannis Panagiotakis

In the era of social media and networking platforms, Twitter has been doomed for abuse and harassment toward users specifically women. Monitoring the contents including sexism and sexual harassment in traditional media is easier than monitoring on the online social media platforms like Twitter, because of the large amount of user generated content in these media. So, the research about the automated detection of content containing sexual or racist harassment is an important issue and could be the basis for removing that content or flagging it for human evaluation. Previous studies have been focused on collecting data about sexism and racism in very broad terms. However, there is no much study focusing on different types of online harassment attracting natural language processing techniques. In this work, we present an multi-attention based approach for the detection of different types of harassment in tweets. Our approach is based on the Recurrent Neural Networks and particularly we are using a deep, classification specific multi-attention mechanism. Moreover, we tackle the problem of imbalanced data, using a back-translation method. Finally, we present a comparison between different approaches based on the Recurrent Neural Networks.

en cs.CL, cs.LG
arXiv Open Access 2019
Spacetime Programming: A Synchronous Language for Composable Search Strategies

Pierre Talbot

Search strategies are crucial to efficiently solve constraint satisfaction problems. However, programming search strategies in the existing constraint solvers is a daunting task and constraint-based languages usually have compositionality issues. We propose spacetime programming, a paradigm extending the synchronous language Esterel and timed concurrent constraint programming with backtracking, for creating and composing search strategies. In this formalism, the search strategies are composed in the same way as we compose concurrent processes. Our contributions include the design and behavioral semantics of spacetime programming, and the proofs that spacetime programs are deterministic, reactive and extensive functions. Moreover, spacetime programming provides a bridge between the theoretical foundations of constraint-based concurrency and the practical aspects of constraint solving. We developed a prototype of the compiler that produces search strategies with a small overhead compared to the hard-coded ones.

en cs.PL
arXiv Open Access 2019
The Mathematical Specification of the Statebox Language

Statebox Team, Fabrizio Genovese, Jelle Herold

This document defines the mathematical backbone of the Statebox programming language. In the simplest way possible, Statebox can be seen as a clever way to tie together different theoretical structures to maximize their benefits and limit their downsides. Since consistency and correctness are central requisites for our language, it became clear from the beginning that such tying could not be achieved by just hacking together different pieces of code representing implementations of the structures we wanted to leverage: Rigorous mathematics is employed to ensure both conceptual consistency of the language and reliability of the code itself. The mathematics presented here is what guided the implementation process, and we deemed very useful to release it to the public to help people wanting to audit our work to better understand the code itself.

en cs.PL, cs.DC

Halaman 46 dari 16479