Hasil untuk "Oriental languages and literatures"

Menampilkan 20 dari ~7066 hasil · dari DOAJ, arXiv, Semantic Scholar

JSON API
DOAJ Open Access 2025
Enhancing Translation Services in Jordan through Technology: A Study of Specialization Trends and Language Adaptations

Elham Salem Almakatrah

This study investigates the influence of advanced technological integration, particularly artificial intelligence (AI) and neural machine translation (NMT), on the specialization trends and language adaptation strategies within Jordan's translation industry. Despite technology’s promising potential to enhance translation efficiency and expand market reach, concerns persist regarding AI's limited ability to accurately grasp contextual nuances, especially in linguistically and culturally diverse languages like Arabic and English. The study aims to identify current technological usage levels among translation service providers, examine how technology influences specialization within legal, medical and technical translation sectors, and explore language adaptation strategies employed to manage AI-driven translations. The Technology Acceptance Model (TAM), introduced by Davis (1989), serves as the theoretical framework to assess translators' perceived usefulness and ease of use of these technologies. A qualitative approach employing purposive sampling involved ten participants representing diverse roles in Jordan’s translation industry. The obtained findings indicate significant shifts from traditional manual translation to specialized roles emphasizing quality assurance, post-editing, and terminology management; reveal increased emphasis on cultural localization strategies; and highlight challenges such as AI’s limited cultural comprehension, extensive post-editing requirements and data privacy concerns. The study provides recommendations for educational and operational adjustments to enhance technological integration in translation practices.

Language and Literature, English language
DOAJ Open Access 2025
الروابط الزمنية وأثرها الحجاجي في قصة موسى وفرعون في سورة طه

د.غالية المسند

تَتَبَّعْتُ في بحثي الموسوم بـ"الروابط الزمنيّة وأثرها الحجاجي في قصة موسى وفرعون في سورة طه"، أثر الروابط الزمنية في تشييد البنية الحجاجية للخطاب القرآني، بِعَدِّها أدواتٍ لتنظيم الحدث، وكونها آليات فاعلة في هندسة الحِجاج القرآني وبناء منطق الإقناع، وقمت بتحليل تفصيلي لسياق قصة موسى وفرعون، بوصفها أنموذجًا حيًّا لصراع الحق والباطل في نسق زمنيّ محكم، وسَبَرْتُ أغْوارَ النَّصِّ، لإماطة اللُّثُم عن الوظائف العميقة للروابط الزمنية، وأبنت كيف تسهم في ترتيب الأحداث، وتصعيد المواقف، وتوجيه المتلقي نحو استيعاب الحقائق، واستبطان الحجج، وعوّلت على المنهج الحجاجي، للكشف عن دلالات الأدوات الزمنية من مثل: (إذا، إذ، لمّا، الفاء، ثم، حتى…)، وقد أفضى البحث إلى أن هذه الروابط الزمنيَّة لا تؤدي وظيفة سردية فحسب؛ بل تسهم في بناء نسيجٍ دلاليٍّ عميقٍ، تُستثمر فيه لحظات الزمن لخدمة غايات الحجاج، فتتحوّل من أدوات تعاقب زمني إلى أدوات ضغط خطابي وتوجيه إستراتيجي للفهم، تبرز منطق الخطاب، وتحقق الأثرَ الإقناعيَّ الذي يعكس انسجام البنية الزمنيّة مع مَقْصديَّة الرسالة الإلهية.

Oriental languages and literatures
arXiv Open Access 2025
Languages of Boundedly-Ambiguous Vector Addition Systems with States

Wojciech Czerwiński, Łukasz Orlikowski

The aim of this paper is to deliver broad understanding of a class of languages of boundedly-ambiguous VASS, that is k-ambiguous VASS for some natural k. These are languages of Vector Addition Systems with States with the acceptance condition defined by the set of accepting states such that each accepted word has at most k accepting runs. We develop tools for proving that a given language is not accepted by any k-ambiguous VASS. Using them we show a few negative results: lack of some closure properties of languages of k-ambiguous VASS and undecidability of the k-ambiguity problem, namely the question whether a given VASS language is a language of some k-ambiguous VASS. Finally, we show that the regularity problem is decidable for k-ambiguous VASS.

en cs.FL
arXiv Open Access 2024
Evaluating LLM-driven User-Intent Formalization for Verification-Aware Languages

Shuvendu K. Lahiri

Verification-aware programming languages such as Dafny and F* provide means to formally specify and prove properties of a program. Although the problem of checking an implementation against a specification can be defined mechanically, there is no algorithmic way of ensuring the correctness of the {\it user-intent formalization for programs}, expressed as a formal specification. This is because intent or requirement is expressed {\it informally} in natural language and the specification is a formal artefact. Despite, the advent of large language models (LLMs) has made tremendous strides bridging the gap between informal intent and formal program implementations recently, driven in large parts by benchmarks and automated metrics for evaluation. Recent work has proposed a framework for evaluating the {\it user-intent formalization} problem for mainstream programming languages~\cite{endres-fse24}. However, such an approach does not readily extend to verification-aware languages that support rich specifications (using quantifiers and ghost variables) that cannot be evaluated through dynamic execution. Previous work also required generating program mutants using LLMs to create the benchmark. We advocate an alternate, perhaps simpler approach of {\it symbolically testing specifications} to provide an intuitive metric for evaluating the quality of specifications for verification-aware languages. We demonstrate that our automated metric agrees closely on a human-labeled dataset of Dafny specifications for the popular MBPP code-generation benchmark, yet demonstrates cases where the human labeling is not perfect. We also outline formal verification challenges that need to be addressed to apply the technique more widely. We believe our work provides a stepping stone to enable the establishment of a benchmark and research agenda for the problem of user-intent formalization for programs.

en cs.PL, cs.LG
arXiv Open Access 2024
Minuska: Towards a Formally Verified Programming Language Framework

Jan Tušil, Jan Obdržálek

Programming language frameworks allow us to generate language tools (e.g., interpreters) just from a formal description of the syntax and semantics of a programming language. As these frameworks tend to be quite complex, an issue arises whether we can trust the generated tools. To address this issue, we introduce a practical formal programming language framework called Minuska, which always generates a provably correct interpreter given a valid language definition. This is achieved by (1) defining a language MinusLang for expressing programming language definitions and giving it formal semantics and (2) using the Coq proof assistant to implement an interpreter parametric in a MinusLang definition and to prove it correct. Minuska provides strong correctness guarantees and can support nontrivial languages while performing well. This is the extended version of the SEFM24 paper of the same name.

en cs.PL
arXiv Open Access 2024
vitaLITy 2: Reviewing Academic Literature Using Large Language Models

Hongye An, Arpit Narechania, Emily Wall et al.

Academic literature reviews have traditionally relied on techniques such as keyword searches and accumulation of relevant back-references, using databases like Google Scholar or IEEEXplore. However, both the precision and accuracy of these search techniques is limited by the presence or absence of specific keywords, making literature review akin to searching for needles in a haystack. We present vitaLITy 2, a solution that uses a Large Language Model or LLM-based approach to identify semantically relevant literature in a textual embedding space. We include a corpus of 66,692 papers from 1970-2023 which are searchable through text embeddings created by three language models. vitaLITy 2 contributes a novel Retrieval Augmented Generation (RAG) architecture and can be interacted with through an LLM with augmented prompts, including summarization of a collection of papers. vitaLITy 2 also provides a chat interface that allow users to perform complex queries without learning any new programming language. This also enables users to take advantage of the knowledge captured in the LLM from its enormous training corpus. Finally, we demonstrate the applicability of vitaLITy 2 through two usage scenarios. vitaLITy 2 is available as open-source software at https://vitality-vis.github.io.

en cs.HC
arXiv Open Access 2024
The Equivalence Problem of E-Pattern Languages with Regular Constraints is Undecidable

Dirk Nowotka, Max Wiedenhöft

Patterns are words with terminals and variables. The language of a pattern is the set of words obtained by uniformly substituting all variables with words that contain only terminals. Regular constraints restrict valid substitutions of variables by associating with each variable a regular language representable by, e.g., finite automata. Pattern languages with regular constraints contain only words in which each variable is substituted according to a set of regular constraints. We consider the membership, inclusion, and equivalence problems for erasing and non-erasing pattern languages with regular constraints. Our main result shows that the erasing equivalence problem, one of the most prominent open problems in the realm of patterns, becomes undecidable if regular constraints are allowed in addition to variable equality.

en cs.FL, cs.CC
DOAJ Open Access 2023
The Role of Maulana Abul Kalam Azad in the Development and Stability of Urdu Journalism

Zekai Kardaş

Maulana Abul Kalam Azad was born in 1888 and played an important role in shaping India’s struggle for freedom. For this reason, his name has an important place in India’s history. Abul Kalam Azad was known for his sympathy and admiration of the Turks and additionally took an active role in raising awareness of Indian Muslims in the context of Ottoman-Indian relations. The Indian Muslims under British rule had anxiously been following the Ottoman Empire, which had faced troubling situations during the Crimean War, the 1887-1888 Ottoman-Russian War, the Battle of Tripoli, and the Balkan Wars, as Indian Muslims saw the Turks as their protector, especially due to the Ottoman claim to the Islamic Caliphate. During this period, some personalities such as Maulana Zafar Ali Khan, the brothers Muhammed Ali and Shauqat Ali, as well as Abul Kalam Azad attempted to raise awareness of the Indian Muslims by closely watching the situation of the Turks. To maintain this purpose, they began publishing as products of this idea the following newspapers: Al-Hilal was published by Abul Kalam Azad, Zamindar was published by Maulana Zafar Ali Khan, and Comrade was published by Muhammed Ali. These newspapers later became very popular and were called the Freedom Newspapers. Maulana Abul Kalam Azad had a versatile personality and constantly provided news about the Turks through his newspaper al-Hilal. At the same time, he organized aid campaigns through al-Hilal in order to support the Turkish people. Al-Hilal soon became known throughout India through its fierce criticism of the British government. Published weekly, it soon became one of India’s best-selling Urdu newspapers. Abul Kalam Azad started a new understanding of journalism in the Indian subcontinent with his newspaper al-Hilal and was instrumental in gaining an identity for Indian Muslim journalism in particular.

Oriental languages and literatures
DOAJ Open Access 2023
Formal and functional approaches to subordination in Kazakh

Uldanay Jumabay

This paper investigates clausal subordination in Kazakh and its functional and formal properties. Kazakh subordinate clauses manifest typical Turkic nominalization, where the dependent predicate and the first argument (if overtly expressed) differ from those of main clauses. Such differences can be seen in three grammatical aspects: syntax, semantics and prosody. Regarding the semantic-syntactic features, subordinate clauses are morphologically reduced and display various degrees of syntactic downgrading; they can display first argument co-reference and semantic integration with their superordinate clauses. Prosodically, subordinate clauses have either a separate intonation pattern or one that follows the intonation of main clauses. The aim of the paper is to describe the syntactic, semantic and prosodic features of subordinate clauses in Kazakh and to compare these peculiarities in a hierarchical order.

Philology. Linguistics, Oriental languages and literatures
DOAJ Open Access 2023
فهم معنى "بلدة طيبة و رب غفور" لبناء مجتمع إسلامي في المجتمعات مختلفة الأديان

Yulius bin Abdul Muis

This paper aims to reveal the broader and deeper meaning of the sentence: baldatun thayyibatun wa rabbun ghafur. this sentence contained in surah Saba:15, which is very popular in the Islamic community when they refer to a safe, prosperous country and it’s people are blessed by God with forgiveness. Disclosure of the meaning of baldatun thayyibatun wa rabbun ghafur more broadly and deeply is in order to find an operational foundation to build an Islamic society in a society of different religions, both for the present and the future. The method used in this study is the study of literature by trying to understand more deeply, in detail and comprehensive the sentence Baldatun Tahayyibatu wa Rabbun Ghafur through the explanation of various books of tafseer, then the meanings are summed up into several main points. Contextually, the verse 15 of the surah Saba is Allah addressed it to the land of Saba and its people. With the title "baldatun thayyibatun wa rabbun ghafur", which is Allah fond of for the country of Saba and its inhabitants shows that the inhabitants of the country of Saba at that time were believed to always maintain and behave with Islamic values in their daily lives under the mukmin leadership and fair. The main values contained in the sentence of “baldatun thayyibatun wa rabbun ghafur” such as tawhed, gratitude, ukhwah and advising fellow Muslims, and the values are relevant to be used as a reference and basis in building an Islamic society in the midst of different religious communities, in the present and future. Therefore, a strong commitment must be built from each Muslim individual in practicing and upholding what are the main values contained in the sentence of “baldatun thayyibatun wa rabbun ghafur”, so that Islamic society will be realized wherever Muslims areز

Oriental languages and literatures, Islam
DOAJ Open Access 2022
Ta’alum Al-Lughah Al-Arabiyyah Li Aghradl Khashah: Akadimi Wa Mihni

Novita Rahmi, Albarra Sarbaini, J. Sutarjo et al.

Arabic has a function and essence for the life of Islamic communication, but because of its nature which is in the midst of an ongoing educational tradition today, it requires various innovations, as a logical consequence of the development of science and technology. The development of science and technology is so rapid for human life, it is necessary to make efforts on Arabic language teaching technicians who adapt to their respective fields and competencies. Then the purpose of learning Arabic with this specific purpose is to develop the professional abilities and academic potential of students.This type of research is descriptive qualitative research, namely a research conducted systematically by using library data using data collection tools, namely, observation, and documentation. This study provides a goal-based Arabic learning model to be achieved which is called Arabic learning for special purposes. This learning is more directed at learning Arabic for a certain scope and context, for example for work or professions and also in the academic field.

Education, Education (General)
DOAJ Open Access 2022
Digital Literacy: Arabic Teacher Competencies in Distance Learning

Aulia Mustika Ilmiani Aulia, Hamidah Hamidah, Adelina Dewi Nuryaman et al.

The transformation of learning statically requires teachers to adapt to technology to face significant changes, especially in terms of adoption of digital education. What the teacher provides is not only the transmission of learning materials, but also digital literacy to facilitate the needs of students according to conditions in the field. Based on preliminary data, it is known that Arabic language teachers at Madrasah Aliyah Nuruzholam, Seruyan Regency, Central Kalimantan experience problems in terms of limited ability in the field of learning technology. The research method used was descriptive qualitative method which aimed to describe the digital literacy competence of Arabic language teachers at Madrasah Aliyah Nuruzholam. The results of the study showed that the digital literacy competence of Arabic language teachers can be seen from the teacher's activities such as; First, teachers can use the internet and use Google as a medium for finding information. Second, teachers obtain information by using the internet, Third, familiar teachers use the YouTube application as a means of finding information or learning Arabic resources. Fourth, the teacher uses the information obtained as a source of learning Arabic without any validation of data processing.

Oriental languages and literatures
DOAJ Open Access 2022
Xiyu wenjian lu by Qishiyi: Materials on the History of Central Asian Peoples in Mid-to-Late 18th Century Revisited

Natalya E. Karimova, Temur E. Tulibayev

Introduction. The article examines the Chinese written source Xiyu wenjian lu 西域闻见录 (‘Record of Things Seen and Heard in the Western Regions’) by the Manchu official Qishiyi (Chunyuan) and its data on the history and ethnography of Central Asian peoples in the mid-to-late 18th century. Goals. The work seeks to introduce new data on the history of Central Asia contained in Xiyu wenjian lu, analyze various copies and versions of the written source, and provide new information about the author of the composition. Materials and methods. The study explores the Japanese woodblock print edition of 1801 and one modern Chinese edition of 2016, translates some extracts. The work considers various works by Russian and foreign authors regarding the written source, employs the generalizing and comparative research methods, while historical/chronological analysis proves instrumental in investigating certain interesting facts from Xiyu wenjian lu. Results. The paper introduces into scientific circulation new translations of extracts from Qishiyi’s work that significantly supplement and expand data presented in the famous work of N. Ya. Bichurin (Hyacinth) titled ‘Description of Dzungaria and East Turkestan in Ancient and Present Times’. The article contains the most famous copies and versions of Xiyu wenjian lu. The biography of the author is supplemented with new facts of his life that have remained little-known to Russian-speaking researchers. Conclusions. The revealed factual materials from Xiyu wenjian lu show that the work of Manchu official Qishiyi contains original data on historical geography, socioeconomic life of Central Asian peoples, and political situation in the region. Despite its fame, Xiyu wenjian lu is to be thoroughly explored since lots of historical accounts from this source remain unexplored.

History (General), Oriental languages and literatures
arXiv Open Access 2022
CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language Understanding

Milan Gritta, Ruoyu Hu, Ignacio Iacobacci

Task-oriented personal assistants enable people to interact with a host of devices and services using natural language. One of the challenges of making neural dialogue systems available to more users is the lack of training data for all but a few languages. Zero-shot methods try to solve this issue by acquiring task knowledge in a high-resource language such as English with the aim of transferring it to the low-resource language(s). To this end, we introduce CrossAligner, the principal method of a variety of effective approaches for zero-shot cross-lingual transfer based on learning alignment from unlabelled parallel data. We present a quantitative analysis of individual methods as well as their weighted combinations, several of which exceed state-of-the-art (SOTA) scores as evaluated across nine languages, fifteen test sets and three benchmark multilingual datasets. A detailed qualitative error analysis of the best methods shows that our fine-tuned language models can zero-shot transfer the task knowledge better than anticipated.

en cs.CL
arXiv Open Access 2022
Enumerating Regular Languages with Bounded Delay

Antoine Amarilli, Mikaël Monet

We study the task, for a given language $L$, of enumerating the (generally infinite) sequence of its words, without repetitions, while bounding the delay between two consecutive words. To allow for delay bounds that do not depend on the current word length, we assume a model where we produce each word by editing the preceding word with a small edit script, rather than writing out the word from scratch. In particular, this witnesses that the language is orderable, i.e., we can write its words as an infinite sequence such that the Levenshtein edit distance between any two consecutive words is bounded by a value that depends only on the language. For instance, $(a+b)^*$ is orderable (with a variant of the Gray code), but $a^* + b^*$ is not. We characterize which regular languages are enumerable in this sense, and show that this can be decided in PTIME in an input deterministic finite automaton (DFA) for the language. In fact, we show that, given a DFA $A$, we can compute in PTIME automata $A_1, \ldots, A_t$ such that $L(A)$ is partitioned as $L(A_1) \sqcup \ldots \sqcup L(A_t)$ and every $L(A_i)$ is orderable in this sense. Further, we show that the value of $t$ obtained is optimal, i.e., we cannot partition $L(A)$ into less than $t$ orderable languages. In the case where $L(A)$ is orderable (i.e., $t=1$), we show that the ordering can be produced by a bounded-delay algorithm: specifically, the algorithm runs in a suitable pointer machine model, and produces a sequence of bounded-length edit scripts to visit the words of $L(A)$ without repetitions, with bounded delay -- exponential in $|A|$ -- between each script. In fact, we show that we can achieve this while only allowing the edit operations push and pop at the beginning and end of the word, which implies that the word can in fact be maintained in a double-ended queue.

en cs.FL, cs.DS
arXiv Open Access 2021
Benchmarking the Status of Default Pseudorandom Number Generators in Common Programming Languages

Nils van den Honert, Diederick Vermetten, Anna V. Kononova

The ever-increasing need for random numbers is clear in many areas of computer science, from neural networks to optimization. As such, most common programming language provide easy access to Pseudorandom Number Generators. However, these generators are not all made equal, and empirical verification has previously shown some to be flawed in key ways. Because of the constant changes in programming languages, we perform the same empirical benchmarking using large batteries of statistcal tests on a wide array of PRNGs, and identify that while some languages have improved significantly over the years, there are still cases where the default PRNG fails to deliver sufficiently random results.

en cs.PL
arXiv Open Access 2021
DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen et al.

Word meaning is notoriously difficult to capture, both synchronically and diachronically. In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four different languages, based on 100,000 human semantic proximity judgments. We thoroughly describe the multi-round incremental annotation process, the choice for a clustering algorithm to group usages into senses, and possible - diachronic and synchronic - uses for this dataset.

en cs.CL
DOAJ Open Access 2020
The Ancestors of Labarna I and the Cruciform Seal

Zsolt Simon

This paper argues that the evidence of the Offering List C and the Cruciform Seal on the early Hittite rulers can only be reconciled with each other, if the former’s entry on Labarna refers to the ancestors of Labarna I and not of ?attušili I, as hitherto assumed.

History of Asia, Oriental languages and literatures
arXiv Open Access 2020
ThingML+ Augmenting Model-Driven Software Engineering for the Internet of Things with Machine Learning

Armin Moin, Stephan Rössler, Stephan Günnemann

In this paper, we present the current position of the research project ML-Quadrat, which aims to extend the methodology, modeling language and tool support of ThingML - an open source modeling tool for IoT/CPS - to address Machine Learning needs for the IoT applications. Currently, ThingML offers a modeling language and tool support for modeling the components of the system, their communication interfaces as well as their behaviors. The latter is done through state machines. However, we argue that in many cases IoT/CPS services involve system components and physical processes, whose behaviors are not well understood in order to be modeled using state machines. Hence, quite often a data-driven approach that enables inference based on the observed data, e.g., using Machine Learning is preferred. To this aim, ML-Quadrat integrates the necessary Machine Learning concepts into ThingML both on the modeling level (syntax and semantics of the modeling language) and on the code generators level. We plan to support two target platforms for code generation regarding Stream Processing and Complex Event Processing, namely Apache SAMOA and Apama.

en cs.SE, cs.LG

Halaman 34 dari 354