SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia
Panuthep Tasawong, Jian Gang Ngui, Alham Fikri Aji
et al.
Culturally aware safeguards are crucial for AI alignment in real-world settings, where safety extends beyond common sense and encompasses diverse local values, norms, and region-specific regulations. However, building large-scale, culturally grounded datasets is challenging due to limited resources and a scarcity of native annotators. Consequently, many safeguard models rely on machine translation of English datasets, often missing regional and cultural nuances. We present a novel agentic data-generation framework to scalably create authentic, region-specific safety datasets for Southeast Asia (SEA). On this foundation, we introduce the SEA-Guard family, the first multilingual safeguard models grounded in SEA cultural contexts. Evaluated across multiple benchmarks and cultural variants, SEA-Guard consistently outperforms existing safeguards at detecting regionally sensitive or harmful content while maintaining strong general safety performance.
MARLINA SI PEMBUNUH DALAM EMPAT BABAK: MORALITAS DALAM FILM MENURUT ANALISIS SEMIOTIKA
Khirjan Nahdi, Dewi Milandary, Roni Amrulloh
This study aims to determine the meaning of morality in the film Marlina The Killer In Four Acts by Mouly Surya using semiotic analysis. Using descriptive-qualitative research. Data and data sources in this study were obtained from primary data in the form of soft copies of the film Marlina The Killer in four acts, while secondary data were obtained from books, the internet, and journals. Data collection techniques used are documentation and observation techniques. Data analysis using content analysis and semiotic analysis of Charles Sanders Pierce. Validity of data using the power of observers and triagulation. Based on the analysis of the data used, there are 10 categories of morality from 27 data on the film Marlina The Killer in Four Acts: (1) manners in receiving guests as much as 4 data; (2) loyalty to a partner as much as 2 data; (3) courage that Tinggi sebayak 5 data; (4) never give up in all conditions sebayak 1 data; (5) have a polite attitude sebayak 2 data; (6) filial piety to parents sebayak 2 data; (7) the main responsibility as much as 2 data; (8) feel the loss of as much as 1 data; (9) Spread love to each other sebayak 3 data; and (10) Mutual Assistance sebayak 5 data.
Theory and practice of education, Languages and literature of Eastern Asia, Africa, Oceania
The Arrival of Fast Internet and Employment in Africa: Comment
David Roodman
Hjort and Poulsen (2019) frames the staggered arrival of submarine Internet cables on the shores of Africa circa 2010 as a difference-in-differences natural experiment. The paper finds positive impacts of broadband on individual- and firm-level employment -- with a bias toward skilled employment -- and on nighttime light emissions. These results largely are not robust to alternative geocoding of survey locations, to correcting for a satellite changeover at end-2009, and to revisiting a definition of the treated zone that has no clear technological basis, is narrower than the spatial resolution of nearly all the data sources, and is empirically suboptimal as a representation of the geography of broadband.
Social AI and The Equation of Wittgenstein's Language User With Calvino's Literature Machine
W. J. T. Mollema
Is it sensical to ascribe psychological predicates to AI systems like chatbots based on large language models (LLMs)? People have intuitively started ascribing emotions or consciousness to social AI ('affective artificial agents'), with consequences that range from love to suicide. The philosophical question of whether such ascriptions are warranted is thus very relevant. This paper advances the argument that LLMs instantiate language users in Ludwig Wittgenstein's sense but that ascribing psychological predicates to these systems remains a functionalist temptation. Social AIs are not full-blown language users, but rather more like Italo Calvino's literature machines. The ideas of LLMs as Wittgensteinian language users and Calvino's literature-producing writing machine are combined. This sheds light on the misguided functionalist temptation inherent in moving from equating the two to the ascription of psychological predicates to social AI. Finally, the framework of mortal computation is used to show that social AIs lack the basic autopoiesis needed for narrative façons de parler and their role in the sensemaking of human (inter)action. Such psychological predicate ascriptions could make sense: the transition 'from quantity to quality' can take place, but its route lies somewhere between life and death, not between affective artifacts and emotion approximation by literature machines.
Data Ethics in the Era of Healthcare Artificial Intelligence in Africa: An Ubuntu Philosophy Perspective
Abdoul Jalil Djiberou Mahamadou, Aloysius Ochasi, Russ B. Altman
Data are essential in developing healthcare artificial intelligence (AI) systems. However, patient data collection, access, and use raise ethical concerns, including informed consent, data bias, data protection and privacy, data ownership, and benefit sharing. Various ethical frameworks have been proposed to ensure the ethical use of healthcare data and AI, however, these frameworks often align with Western cultural values, social norms, and institutional contexts emphasizing individual autonomy and well-being. Ethical guidelines must reflect political and cultural settings to account for cultural diversity, inclusivity, and historical factors such as colonialism. Thus, this paper discusses healthcare data ethics in the AI era in Africa from the Ubuntu philosophy perspective. It focuses on the contrast between individualistic and communitarian approaches to data ethics. The proposed framework could inform stakeholders, including AI developers, healthcare providers, the public, and policy-makers about healthcare data ethical usage in AI in Africa.
NihonGO NOW!
Robert Joseph Del Greco
-
Language and Literature, Japanese language and literature
Contributors
Hiroshi Nara
-
Language and Literature, Japanese language and literature
From Outcome-Based to Language-Based Preferences
Valerio Capraro, Joseph Y. Halpern, Matjaz Perc
We review the literature on models that try to explain human behavior in social interactions described by normal-form games with monetary payoffs. We start by covering social and moral preferences. We then focus on the growing body of research showing that people react to the language in which actions are described, especially when it activates moral concerns. We conclude by arguing that behavioral economics is in the midst of a paradigm shift towards language-based preferences, which will require an exploration of new models and experimental setups.
Kwame for Science: An AI Teaching Assistant Based on Sentence-BERT for Science Education in West Africa
George Boateng, Samuel John, Andrew Glago
et al.
Africa has a high student-to-teacher ratio which limits students' access to teachers. Consequently, students struggle to get answers to their questions. In this work, we extended Kwame, our previous AI teaching assistant, adapted it for science education, and deployed it as a web app. Kwame for Science answers questions of students based on the Integrated Science subject of the West African Senior Secondary Certificate Examination (WASSCE). Kwame for Science is a Sentence-BERT-based question-answering web app that displays 3 paragraphs as answers along with a confidence score in response to science questions. Additionally, it displays the top 5 related past exam questions and their answers in addition to the 3 paragraphs. Our preliminary evaluation of the Kwame for Science with a 2.5-week real-world deployment showed a top 3 accuracy of 87.5% (n=56) with 190 users across 11 countries. Kwame for Science will enable the delivery of scalable, cost-effective, and quality remote education to millions of people across Africa.
Machine Translation from Signed to Spoken Languages: State of the Art and Challenges
Mathieu De Coster, Dimitar Shterionov, Mieke Van Herreweghe
et al.
Automatic translation from signed to spoken languages is an interdisciplinary research domain, lying on the intersection of computer vision, machine translation and linguistics. Nevertheless, research in this domain is performed mostly by computer scientists in isolation. As the domain is becoming increasingly popular - the majority of scientific papers on the topic of sign language translation have been published in the past three years - we provide an overview of the state of the art as well as some required background in the different related disciplines. We give a high-level introduction to sign language linguistics and machine translation to illustrate the requirements of automatic sign language translation. We present a systematic literature review to illustrate the state of the art in the domain and then, harking back to the requirements, lay out several challenges for future research. We find that significant advances have been made on the shoulders of spoken language machine translation research. However, current approaches are often not linguistically motivated or are not adapted to the different input modality of sign languages. We explore challenges related to the representation of sign language data, the collection of datasets, the need for interdisciplinary research and requirements for moving beyond research, towards applications. Based on our findings, we advocate for interdisciplinary research and to base future research on linguistic analysis of sign languages. Furthermore, the inclusion of deaf and hearing end users of sign language translation applications in use case identification, data collection and evaluation is of the utmost importance in the creation of useful sign language translation models. We recommend iterative, human-in-the-loop, design and development of sign language translation models.
Annotating Norwegian Language Varieties on Twitter for Part-of-Speech
Petter Mæhlum, Andre Kåsen, Samia Touileb
et al.
Norwegian Twitter data poses an interesting challenge for Natural Language Processing (NLP) tasks. These texts are difficult for models trained on standardized text in one of the two Norwegian written forms (Bokmål and Nynorsk), as they contain both the typical variation of social media text, as well as a large amount of dialectal variety. In this paper we present a novel Norwegian Twitter dataset annotated with POS-tags. We show that models trained on Universal Dependency (UD) data perform worse when evaluated against this dataset, and that models trained on Bokmål generally perform better than those trained on Nynorsk. We also see that performance on dialectal tweets is comparable to the written standards for some models. Finally we perform a detailed analysis of the errors that models commonly make on this data.
Fujiwara House Biography: Continental Prototypes
E. B. Sakharova
The purpose of this article is to research the continental prototypesfor Fujiwara-no Kamatari Biography (the main part of “Toshi Kaden”/“TheFujiwara House Biography”). Kamatari was the key political figure of his timeand one of the Japanese statehood fathers. Whereas the Chinese prototypes arewell investigated, we could not say so about the Korean ones. However, Kamatariis compared with both Chinese and Korean statesmen. It is for this reason, thatthe paper focuses on the analysis of the possible Korean prototypes.
Japanese language and literature
Towards Large-Scale Data Mining for Data-Driven Analysis of Sign Languages
Boris Mocialov, Graham Turner, Helen Hastie
Access to sign language data is far from adequate. We show that it is possible to collect the data from social networking services such as TikTok, Instagram, and YouTube by applying data filtering to enforce quality standards and by discovering patterns in the filtered data, making it easier to analyse and model. Using our data collection pipeline, we collect and examine the interpretation of songs in both the American Sign Language (ASL) and the Brazilian Sign Language (Libras). We explore their differences and similarities by looking at the co-dependence of the orientation and location phonological parameters
Development in astronomy in Ethiopia and East-Africa through nuclear activity in galaxies
Mirjana Pović
In this paper we summarise the research that is currently going on in Ethiopia and East-Africa in extragalactic astronomy and physics of active galaxies and active galactic nuclei (AGN). The study is focused on some of the still open questions such as: what are the stellar ages and populations of ultra hard X-ray detected AGN and connection between AGN and their host galaxies?, what are the properties of AGN in galaxy clusters and the role that environment has in triggering nuclear activity?, what are the morphological properties of AGN and how precisely we can deal with morphological classification of active galaxies?, what are the properties of galaxies in the green valley and the role of AGN in galaxy evolution?, and what are the properties of radio-loud and radio-quiet quasars (QSO) and dichotomy between the two?. Each of these questions has been developed under one specific project that will be briefly introduced. These projects involve 6 PhD and 3 MSc students and collaborations between Ethiopia, Rwanda, South Africa, Uganda, Tanzania, Spain, Italy, and Chile. With all projects we aim: first, to contribute to our general knowledge about AGN, and second, to contribute to the development in astronomy and science in Ethiopia and East-Africa.
Impact of crop diversification on tribal farmer's income: A case study from Eastern ghats of India
Sadasiba Tripathy, Sandhyarani Das
In this investigation we analyze impact of diversification of agriculture on farmer's income, a study from primitive tribal groups from eastern ghats of India. We have taken crop diversification index to measure the extent and regression formalism to analyze the impact, of crop diversification. Descriptive statistics is employed to know the average income of the farmers, paired results of crop diversification index. We observed a positive impact on crop diversification in scheduled areas and investigated reasons where it did not work.
Use of Available Data To Inform The COVID-19 Outbreak in South Africa: A Case Study
Vukosi Marivate, Herkulaas MvE Combrink
The coronavirus disease (COVID-19), caused by the SARS-CoV-2 virus, was declared a pandemic by the World Health Organization (WHO) in February 2020. Currently, there are no vaccines or treatments that have been approved after clinical trials. Social distancing measures, including travel bans, school closure, and quarantine applied to countries or regions are being used to limit the spread of the disease and the demand on the healthcare infrastructure. The seclusion of groups and individuals has led to limited access to accurate information. To update the public, especially in South Africa, announcements are made by the minister of health daily. These announcements narrate the confirmed COVID-19 cases and include the age, gender, and travel history of people who have tested positive for the disease. Additionally, the South African National Institute for Communicable Diseases updates a daily infographic summarising the number of tests performed, confirmed cases, mortality rate, and the regions affected. However, the age of the patient and other nuanced data regarding the transmission is only shared in the daily announcements and not on the updated infographic. To disseminate this information, the Data Science for Social Impact research group at the University of Pretoria, South Africa, has worked on curating and applying publicly available data in a way that is computer-readable so that information can be shared to the public - using both a data repository and a dashboard. Through collaborative practices, a variety of challenges related to publicly available data in South Africa came to the fore. These include shortcomings in the accessibility, integrity, and data management practices between governmental departments and the South African public. In this paper, solutions to these problems will be shared by using a publicly available data repository and dashboard as a case study.
KEMAMPUAN MENULIS HASIL OBSERVASI DALAM BENTUK PARAGRAF DESKRIPSI
Dwi Fitriyani, Ana Wahyu Kusniati
<p><em>The result of</em><em> preliminary</em><em> research in SMA N 1 Air Naningan showed that</em><em> students tenth class ability in writing paragraph have difficulties in determining topic, producing and developing frame of thinking, error in spelling Indonesian language. Objective of this research is to know students’ ability in writing paragraph observation result in writing descriptive paragraph. Methodology of research used is descriptive.</em><em> </em><em>T</em><em>he data analysis used was essay test which is measured by using the formula which related to the assessment indicator. The result of the research showed percentage score 67 from average score of students’ abilities at eight grade of first semester at SMA N I Air Naningan Tanggamus in the academic years </em>2017<em>-</em>2018<em> was enough categorized.</em><em> </em><em>The students of SMAN 1 Air Naningan’s ability in writing descriptive text was categories as enough because they were not enthusiasm to read the books which were related to descriptive text, uninterested media and method used by the teacher in the learning process, and limit reference which was available in the library. </em></p>
Languages and literature of Eastern Asia, Africa, Oceania
A fekete hajók előtt
Balázs Szabó
It is a well-known fact in Japanese historiography that the country was opened up to Western powers by the American Perry expedition in 1853, dubbed in Japan ’the arrival of the black ships’. However, those ships did not come out of nowhere; there had been various earlier attempts to establish diplomatic and commercial relations with Japan, first by Russia and later by other Western powers. These attempts sparked heated debates in Japan regarding the danger caused by the foreign ships and the best way to avoid this menace. In this paper, after providing a brief description of earlier initiatives aimed at establishing diplomatic relations with Japan, I concentrate on United States foreign policy vis-à-vis Japan: the USA came into contact with Japan through its thriving whaling industry, as in the 1820s US whalers frequented the waters close to Japan. Later various economic considerations led to the decision to organize an expedition to Japan to effectively ensure that Japanese ports would receive US ships and to open the country for commerce. The success of the US expedition was due to various factors, not least the motivation and determination of its leader, Matthew C. Perry.
Chinese language and literature
Proceedings of the 11th Asia-Europe Workshop on Concepts in Information Theory
A. J. Han Vinck, Kees A. Schouhamer Immink, Tadashi Wadayama
et al.
This year, 2019 we celebrate 30 years of our friendship between Asian and European scientists at the AEW11 in Rotterdam, the Netherlands. Many of the 1989 participants are also present at the 2019 event. This year we have many participants from different parts of Asia and Europe. It shows the importance of this event. It is a good tradition to pay a tribute to a special lecturer in our community. This year we selected Hiroyoshi Morita, who is a well known information theorist with many original contributions.
George Weideman (1947–2008)
Wium van Zyl
Hy het meer as watter Afrikaanse skrywer nie een nie, maar baie stemme gegee aan die sondeurbakte, droë dele van ons land. Streekskrywer kan hy genoem word, maar hy was ook as digter, dramaturg en aksievoerder gereeld te vinde in die middel van die nimmereindigende worstelstryd teen die legio gedaantes wat die onreg in Suid-Afrika aanneem. Hoe trots sou hy nie gewees het op die mense van Pella wat onlangs in eenvoudige, maar helder taal vir Mammon die hek gewys het nie!
African languages and literature