LengClaro2023: A Dataset of Administrative Texts in Spanish with Plain Language adaptations
Belén Agüera-Marco, Itziar Gonzalez-Dios
In this work, we present LengClaro2023, a dataset of legal-administrative texts in Spanish. Based on the most frequently used procedures from the Spanish Social Security website, we have created for each text two simplified equivalents. The first version follows the recommendations provided by arText claro. The second version incorporates additional recommendations from plain language guidelines to explore further potential improvements in the system. The linguistic resource created in this work can be used for evaluating automatic text simplification (ATS) systems in Spanish.
LITERA: An LLM Based Approach to Latin-to-English Translation
Paul Rosu
This paper introduces an LLM-based Latin-to-English translation platform designed to address the challenges of translating Latin texts. We named the model LITERA, which stands for Latin Interpretation and Translations into English for Research Assistance. Through a multi-layered translation process utilizing a fine-tuned version of GPT-4o-mini and GPT-4o, LITERA offers an unprecedented level of accuracy, showcased by greatly improved BLEU scores, particularly in classical Latin, along with improved BLEURT scores. The development of LITERA involved close collaboration with Duke University's Classical Studies Department, which was instrumental in creating a small, high-quality parallel Latin-English dataset. This paper details the architecture, fine-tuning methodology, and prompting strategies used in LITERA, emphasizing its ability to produce literal translations.
3CEL: A corpus of legal Spanish contract clauses
Nuria Aldama García, Patricia Marsà Morales, David Betancur Sánchez
et al.
Legal corpora for Natural Language Processing (NLP) are valuable and scarce resources in languages like Spanish due to two main reasons: data accessibility and legal expert knowledge availability. INESData 2024 is a European Union funded project lead by the Universidad Politécnica de Madrid (UPM) and developed by Instituto de Ingeniería del Conocimiento (IIC) to create a series of state-of-the-art NLP resources applied to the legal/administrative domain in Spanish. The goal of this paper is to present the Corpus of Legal Spanish Contract Clauses (3CEL), which is a contract information extraction corpus developed within the framework of INESData 2024. 3CEL contains 373 manually annotated tenders using 19 defined categories (4 782 total tags) that identify key information for contract understanding and reviewing.
HISPASpoof: A New Dataset For Spanish Speech Forensics
Maria Risques, Kratika Bhagtani, Amit Kumar Singh Yadav
et al.
Zero-shot Voice Cloning (VC) and Text-to-Speech (TTS) methods have advanced rapidly, enabling the generation of highly realistic synthetic speech and raising serious concerns about their misuse. While numerous detectors have been developed for English and Chinese, Spanish-spoken by over 600 million people worldwide-remains underrepresented in speech forensics. To address this gap, we introduce HISPASpoof, the first large-scale Spanish dataset designed for synthetic speech detection and attribution. It includes real speech from public corpora across six accents and synthetic speech generated with six zero-shot TTS systems. We evaluate five representative methods, showing that detectors trained on English fail to generalize to Spanish, while training on HISPASpoof substantially improves detection. We also evaluate synthetic speech attribution performance on HISPASpoof, i.e., identifying the generation method of synthetic speech. HISPASpoof thus provides a critical benchmark for advancing reliable and inclusive speech forensics in Spanish.
SANTOS, Fernando Pereira dos. A conduta marcial inglesa na Guerra dos Cem Anos: um estudo sobre os ditames morais do conflito ao final da Idade Média (1400-1453). Florianópolis: UDESC, 2022.
Paloma Catelan
Resenha da obra "A conduta marcial inglesa na Guerra dos Cem Anos: um estudo sobre os ditames morais do conflito ao final da Idade Média (1400-1453)", de autoria de Fernando Pereira dos Santos, Doutor em História pela Universidade Estadual Paulista Júlio de Mesquita Filho (UNESP) - FCHS/Franca, especialista em história medieval inglesa, atuando sobre o tema das prescrições morais sobre a atividade marcial na Guerra dos Cem Anos (1337-1453).
History (General), Latin America. Spanish America
El ejército de Estrada. Disección de una rebelión frustrada en la frontera norte, 1926-1927
Miguel Ángel Grijalva Dávila
Esta es la historia del ejército de Enrique Estrada, un cuerpo de exiliados rebeldes con base en California, quienes pretendieron cruzar a México y derrocar a la presidencia de Plutarco Elías Calles en agosto de 1926. Fueron detenidos por las autoridades estadunidenses cuando se dirigían a cruzar la frontera, arrestados y enjuiciados. El objetivo, a grandes rasgos, es conocer las circunstancias en las que los exiliados de la revolución organizaban incursiones armadas desde E.U., cómo eran las relaciones gubernamentales para evitar esas incursiones y cómo se componía el grupo de Estrada. Para este objetivo, se dio prioridad a la documentación depositada en los National Archives of Record Administration (NARA), en el estado de Maryland, donde hay documentos sobre la investigación y juicio del grupo armado sobre el que nos enfocamos. Cabe señalar, es documentación poco explorada por la historiografía mexicana. Los aportes más significativos es el uso de dichas fuentes, así como la desvelación de los obstáculos, retos y complicaciones que tenían los exiliados al planear una rebelión desde E.U., algo que se repitió en muchos casos durante la revolución, pero se ha investigado poco. Por último, el texto propone que las relaciones entre agencias de ambos gobiernos, de seguridad e inteligencia en este caso, tenían una relación autónoma e independiente de la relación entre cancillerías.
History America, Latin America. Spanish America
Magic squares: Latin, Semiclassical and Quantum
Gemma De las Cuevas, Tim Netzer, Inga Valentiner-Branth
Quantum magic squares were recently introduced as a 'magical' combination of quantum measurements. In contrast to quantum measurements, they cannot be purified (i.e. dilated to a quantum permutation matrix) -- only the so-called semiclassical ones can. Purifying establishes a relation to an ideal world of fundamental theoretical and practical importance; the opposite of purifying is described by the matrix convex hull. In this work, we prove that semiclassical magic squares can be purified to quantum Latin squares, which are 'magical' combinations of orthonormal bases. Conversely, we prove that the matrix convex hull of quantum Latin squares is larger than the semiclassical ones. This tension is resolved by our third result: We prove that the quantum Latin squares that are semiclassical are precisely those constructed from a classical Latin square. Our work sheds light on the internal structure of quantum magic squares, on how this is affected by the matrix convex hull, and, more generally, on the nature of the 'magical' composition rule, both at the semiclassical and quantum level.
Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Xiliang Zhu, Shayna Gardiner, David Rossouw
et al.
Automatic Speech Recognition (ASR) systems typically produce unpunctuated transcripts that have poor readability. In addition, building a punctuation restoration system is challenging for low-resource languages, especially for domain-specific applications. In this paper, we propose a Spanish punctuation restoration system designed for a real-time customer support transcription service. To address the data sparsity of Spanish transcripts in the customer support domain, we introduce two transfer-learning-based strategies: 1) domain adaptation using out-of-domain Spanish text data; 2) cross-lingual transfer learning leveraging in-domain English transcript data. Our experiment results show that these strategies improve the accuracy of the Spanish punctuation restoration system.
Another Face of Empire
D. Castro
The Spanish cleric Bartolome de Las Casas is a key figure in the history of Spain’s conquest of the Americas. Las Casas condemned the torture and murder of natives by the conquistadores in reports to the Spanish royal court and in tracts such as A Short Account of the Destruction of the Indies (1552). For his unrelenting denunciation of the colonialists’ atrocities, Las Casas has been revered as a noble protector of the Indians and as a pioneering anti-imperialist. He has become a larger-than-life figure invoked by generations of anticolonialists in Europe and Latin America. Separating historical reality from myth, Daniel Castro provides a nuanced, revisionist assessment of the friar’s career, writings, and political activities. Castro argues that Las Casas was very much an imperialist. Intent on converting the Indians to Christianity, the religion of the colonizers, Las Casas simply offered the natives another face of empire: a paternalistic, ecclesiastical imperialism. Castro contends that while the friar was a skilled political manipulator, influential at what was arguably the world’s most powerful sixteenth-century imperial court, his advocacy on behalf of the natives had little impact on their lives. Analyzing Las Casas’s extensive writings, Castro points out that in his many years in the Americas, Las Casas spent very little time among the indigenous people he professed to love, and he made virtually no effort to learn their languages. He saw himself as an emissary from a superior culture with a divine mandate to impose a set of ideas and beliefs on the colonized. He differed from his compatriots primarily in his antipathy to violence as the means for achieving conversion.
The Discovery and Conquest of Peru
P. León
Dazzled by the sight of the vast treasure of gold and silver being unloaded at Seville’s docks in 1537, a teenaged Pedro de Cieza de Leon vowed to join the Spanish effort in the New World, become an explorer, and write what would become the earliest historical account of the conquest of Peru. Available for the first time in English, this history of Peru is based largely on interviews with Cieza’s conquistador compatriates, as well as with Indian informants knowledgeable of the Incan past. Alexandra Parma Cook and Noble David Cook present this recently discovered third book of a four-part chronicle that provides the most thorough and definitive record of the birth of modern Andean America. It describes with unparalleled detail the exploration of the Pacific coast of South America led by Francisco Pizarro and Diego de Almagro, the imprisonment and death of the Inca Atahualpa, the Indian resistance, and the ultimate Spanish domination. Students and scholars of Latin American history and conquest narratives will welcome the publication of this volume.
A cena musical “blues” de Fortaleza 1989-1992: os primeiros momentos do Blues Pai D’Égua
Leopoldo de Macedo Barbosa
Este texto tem o objetivo de analisar os primeiros momentos da chamada cena musical “blues” em Fortaleza, entre os anos de 1989 e 1992. Nesse espaço de atuação marcado por uma produção musical e que posteriormente, na segunda metade da década de 1990, seria simbolicamente denominada de Blues Pai D’Égua, evidenciamos sujeitos os quais contribuíam para o desenvolvimento e efetivação do “blues” em Fortaleza. Quanto aos seus primeiros anos, a cena musical gradualmente se integrava e se destacava no contexto musical fortalezense por meio do crescimento do público, aumento de shows ou surgimento de artistas que participavam de diferentes shows em lugares como a barraca Kafua (na região litorânea da cidade, especificamente na Praia do Futuro). Ainda sobre essa cena musical identificamos sua ligação com as movimentações vinculadas ao “rock ‘n’ roll”, porque a presença de agentes que produziam esses dois gêneros musicais significava atuação para essas produções. Interligados, eles (por meio dos sujeitos envolvidos) construíam uma Fortaleza jovem e inquieta. Para fomentar teoricamente o texto apresentamos o conceito de cena musical; já em termos metodológicos evidenciamos principalmente os materiais hemerográficos com base em reportagens dos jornais O Povo e Diário do Nordeste (jornais de grande circulação no Ceará).
History (General), Latin America. Spanish America
Responsabilidad empresarial en la última dictadura argentina: aportes a partir de casos de empresas agroindustriales en las regiones Noroeste y Noreste (1974-1983)
Victoria Basualdo, María Alejandra Esponda, Silvia Nassif
Este artículo analiza procesos de investigación y judicialización de casos de responsabilidad empresarial en delitos de lesa humanidad durante la última dictadura en Argentina (1976-1983). En particular, estudia cuatro casos de empresas agroindustriales: dos ingenios azucareros ubicados en las provincias de Tucumán (ingenios La Fronterita y Concepción) y otro en Jujuy (ingenio Ledesma), y una empresa yerbatera y de producción de té en la provincia de Corrientes (Las Marías). El artículo subraya que todos estos casos incluyen no solo trabajo fabril sino también rural, caracterizado por una mayor precarización de las relaciones laborales y una mayor explotación. Se enfatizan las potencialidades de estos estudios, pero también los obstáculos y limitaciones de estos procesos de Memoria, Verdad y Justicia debido al fuerte poder empresarial y las dificultades de la organización obrera y sindical.
History of Civilization, History (General) and history of Europe
A simple proof for the chromatic number of cyclic Latin squares of even order
Zahra Naghdabadi
The chromatic number of a cyclic Latin square of order 2n is 2n+2. The available proof for this statement includes a coloring that is rather lengthy. Here, we introduce a coloring of cyclic Latin square of even order 2n (the Latin square of a cyclic group's Cayley table) with 2n+2 colors using a simple method supported by a graphical presentation.
Large deviations in random Latin squares
Matthew Kwan, Ashwin Sah, Mehtaab Sawhney
In this note, we study large deviations of the number $\mathbf{N}$ of intercalates ($2\times2$ combinatorial subsquares which are themselves Latin squares) in a random $n\times n$ Latin square. In particular, for constant $δ>0$ we prove that $\Pr(\mathbf{N}\le(1-δ)n^{2}/4)\le\exp(-Ω(n^{2}))$ and $\Pr(\mathbf{N}\ge(1+δ)n^{2}/4)\le\exp(-Ω(n^{4/3}(\log n)^{2/3}))$, both of which are sharp up to logarithmic factors in their exponents. As a consequence, we deduce that a typical order-$n$ Latin square has $(1+o(1))n^{2}/4$ intercalates, matching a lower bound due to Kwan and Sudakov and resolving an old conjecture of McKay and Wanless.
Automatic Counting of Generalized Latin Rectangles and Trapezoids
George Spahn, Doron Zeilberger
In this case study in ``fully automated enumeration'', we illustrate how to take full advantage of symbolic computation by developing (what we call) `symbolic-dynamical-programming' algorithms for computing many terms of `hard to compute sequences', namely the number of Latin trapezoids, generalized derangements, and generalized three-rowed Latin rectangles. At the end we also sketch the proof of a generalization of Ira Gessel's 1987 theorem that says that for any number of rows, k, the number of Latin rectangles with k rows and n columns is P-recursive in n. Our algorithms are fully implemented in Maple, and generated quite a few terms of such sequences.
Quito, Ecuador
Francisco De Quito Usfq
Reseña de Alejandro Eujanian, El pasado en el péndulo de la política. Rosas, la provincia y la nación en el debate político de Buenos Aires, 1852-1861
Fabio Wasserman
History America, Latin America. Spanish America
Moçambique em perspectiva
Maria Cristina Cortez Wissenbach, Juliana Paiva Magalhães, Lia Dias Laranjeira
Uma possível perspectiva analítica para o devir dos estudos africanos no Brasil é pensá-los a partir de suas inflexões, uma vez que o desenvolvimento do campo parece ter dado saltos qualitativos nos últimos anos. Nessa direção, a configuração e a própria concepção do dossiê Moçambique em perspectiva: histórias conectadas, interdisciplinaridade e novos sujeitos históricos, desde o diálogo estabelecido entre as organizadoras até a publicação dos artigos, estão relacionadas à trajetória de implantação e consolidação da área cujos marcos políticos, legais e institucionais são retomados no escopo desta apresentação.
History (General), Latin America. Spanish America
Latin squares with maximal partial transversals of many lengths
Anthony B. Evans, Adam Mammoliti, Ian Wanless
A partial transversal $T$ of a Latin square $L$ is a set of entries of $L$ in which each row, column and symbol is represented at most once. A partial transversal is maximal if it is not contained in a larger partial transversal. Any maximal partial transversal of a Latin square of order $n$ has size at least $\lceil\frac{n}{2}\rceil$ and at most $n$. We say that a Latin square is omniversal if it possesses a maximal partial transversal of all feasible sizes and is near-omniversal if it possesses a maximal partial transversal of all feasible sizes except one. Evans showed that omniversal Latin squares of order $n$ exist for any odd $n \neq 3$. By extending this result, we show that an omniversal Latin square of order $n$ exists if and only if $n\notin\{3,4\}$ and $n \not\equiv 2 \mod 4$. Furthermore, we show that near-omniversal Latin squares exist for all orders $n \equiv 2 \mod 4$. Finally, we show that no non-trivial group has an omniversal Cayley table, and only 15 groups have a near-omniversal Cayley table. In fact, as $n$ grows, Cayley tables of groups of order $n$ miss a constant fraction of the feasible sizes of maximal partial transversals. In the course of proving this, we are led to consider the following interesting problem in combinatorial group theory. Suppose that we have two subsets $R,C\subseteq G$ of a finite group $G$ such that $|\{rc:r\in R,c\in C\}|=m$. How large do $|R|$ and $|C|$ need to be (in terms of $m$) to be certain that $R\subseteq xH$ and $C\subseteq Hy$ for some subgroup $H$ of order $m$ in $G$, and $x,y\in G$.
Enumeration of Sets of Mutually Orthogonal Latin Rectangles
Gerold Jäger, Klas Markström, Denys Shcherbak
et al.
We study sets of mutually orthogonal Latin rectangles (MOLR), and a natural variation of the concept of self-orthogonal Latin squares which is applicable on larger sets of mutually orthogonal Latin squares and MOLR, namely that each Latin rectangle in a set of MOLR is isotopic to each other rectangle in the set. We call such a set of MOLR \emph{homogeneous}. In the course of doing this, we perform a complete enumeration of non-isotopic sets of $t$ mutually orthogonal $k\times n$ Latin rectangles for $k\leq n \leq 7$, for all $t < n$. Specifically, we keep track of homogeneous sets of MOLR, as well as sets of MOLR where the autotopism group acts transitively on the rectangles, and we call such sets of MOLR \emph{transitive}. We build the sets of MOLR row by row, and in this process we also keep track of which of the MOLR are homogeneous and/or transitive in each step of the construction process. We use the prefix \emph{stepwise} to refer to sets of MOLR with this property. Sets of MOLR are connected to other discrete objects, notably finite geometries and certain regular graphs. Here we observe that all projective planes of order at most 9 except the Hughes plane can be constructed from a stepwise transitive MOLR.