{"results":[{"id":"ss_ef2659c8d5e83817b44c8b255304436e39ce60fb","title":"Ancient Admixture in Human History","authors":[{"name":"N. Patterson"},{"name":"Priya Moorjani"},{"name":"Yontao Luo"},{"name":"Swapan Mallick"},{"name":"N. Rohland"},{"name":"Yiping Zhan"},{"name":"Teri Genschoreck"},{"name":"Teresa A. Webster"},{"name":"D. Reich"}],"abstract":"Population mixture is an important process in biology. We present a suite of methods for learning about population mixtures, implemented in a software package called ADMIXTOOLS, that support formal tests for whether mixture occurred and make it possible to infer proportions and dates of mixture. We also describe the development of a new single nucleotide polymorphism (SNP) array consisting of 629,433 sites with clearly documented ascertainment that was specifically designed for population genetic analyses and that we genotyped in 934 individuals from 53 diverse populations. To illustrate the methods, we give a number of examples that provide new insights about the history of human admixture. The most striking finding is a clear signal of admixture into northern Europe, with one ancestral population related to present-day Basques and Sardinians and the other related to present-day populations of northeast Asia and the Americas. This likely reflects a history of admixture between Neolithic migrants and the indigenous Mesolithic population of Europe, consistent with recent analyses of ancient bones from Sweden and the sequencing of the genome of the Tyrolean “Iceman.”","source":"Semantic Scholar","year":2012,"language":"en","subjects":["Biology","Medicine"],"doi":"10.1534/genetics.112.145037","url":"https://www.semanticscholar.org/paper/ef2659c8d5e83817b44c8b255304436e39ce60fb","pdf_url":"https://www.genetics.org/content/genetics/192/3/1065.full.pdf","is_open_access":true,"citations":2465,"published_at":"","score":86},{"id":"ss_21a5f17e38a0209aebdef0a9e5e75ae6dc6b70ad","title":"An algorithm based on lightweight semantic features for ancient mural element object detection","authors":[{"name":"Jiaquan Shen"},{"name":"Ningzhong Liu"},{"name":"Han Sun"},{"name":"Deguang Li"},{"name":"Yongxin Zhang"},{"name":"Lulu Han"}],"abstract":"The ancient mural paintings unearthed in China are precious world cultural heritages, which record the historical information of various eras and serve as valuable image materials for studying ancient Chinese society. The elements of the murals include figures, carriages, flowers, birds, and auspicious clouds. The digital research on these elements can better help us understand history and culture. In this paper, we have established a large-scale target detection dataset for mural elements excavated from ancient China, featuring a rich variety of labeled sample categories that span across different historical periods and regions, which provides significant value for the study of ancient Chinese history. Meanwhile, to address the defects present in the mural paintings, we have developed an adaptive random erasing augmentation algorithm, which forces the model to learn more comprehensive feature information, enabling it to adapt to the defective scenarios of the mural paintings. Moreover, we have created a target semantic feature extraction model for elements of ancient Chinese murals, which utilizes contextual information and residual attention mechanism to capture the semantic information, thereby enhancing the accuracy of element target detection. Finally, we have conducted a comparative analysis of the detection results of our proposed method with several other state-of-the-art target detection algorithms on the created mural dataset, and the visualization results validated the superiority of our proposed method.","source":"Semantic Scholar","year":2025,"language":"en","subjects":null,"doi":"10.1038/s40494-025-01565-6","url":"https://www.semanticscholar.org/paper/21a5f17e38a0209aebdef0a9e5e75ae6dc6b70ad","pdf_url":"https://doi.org/10.1038/s40494-025-01565-6","is_open_access":true,"citations":100,"published_at":"","score":72},{"id":"arxiv_2505.03836","title":"Explainable Coarse-to-Fine Ancient Manuscript Duplicates Discovery","authors":[{"name":"Chongsheng Zhang"},{"name":"Shuwen Wu"},{"name":"Yingqi Chen"},{"name":"Yi Men"},{"name":"Gaojuan Fan"},{"name":"Matthias Aßenmacher"},{"name":"Christian Heumann"},{"name":"João Gama"}],"abstract":"Ancient manuscripts are the primary source of ancient linguistic corpora. However, many ancient manuscripts exhibit duplications due to unintentional repeated publication or deliberate forgery. The Dead Sea Scrolls, for example, include counterfeit fragments, whereas Oracle Bones (OB) contain both republished materials and fabricated specimens. Identifying ancient manuscript duplicates is of great significance for both archaeological curation and ancient history study. In this work, we design a progressive OB duplicate discovery framework that combines unsupervised low-level keypoints matching with high-level text-centric content-based matching to refine and rank the candidate OB duplicates with semantic awareness and interpretability. We compare our model with state-of-the-art content-based image retrieval and image matching methods, showing that our model yields comparable recall performance and the highest simplified mean reciprocal rank scores for both Top-5 and Top-15 retrieval results, and with significantly accelerated computation efficiency. We have discovered over 60 pairs of new OB duplicates in real-world deployment, which were missed by domain experts for decades. Code, model and real-world results are available at: https://github.com/cszhangLMU/OBD-Finder/.","source":"arXiv","year":2025,"language":"en","subjects":["cs.IR","cs.AI","cs.CV"],"url":"https://arxiv.org/abs/2505.03836","pdf_url":"https://arxiv.org/pdf/2505.03836","is_open_access":true,"published_at":"2025-05-04T20:35:15Z","score":69},{"id":"arxiv_2503.04313","title":"Episodes from the history of infinitesimals","authors":[{"name":"Mikhail G. Katz"}],"abstract":"Infinitesimals have seen ups and downs in their tumultuous history. In the 18th century, d'Alembert set the tone by describing infinitesimals as chimeras. Some adversaries of infinitesimals, including Moigno and Connes, picked up on the term. We highlight the work of Cauchy, Noël, Poisson and Riemann. We also chronicle reactions by Moigno, Lamarle and Cantor, and signal the start of a revival with Peano.","source":"arXiv","year":2025,"language":"en","subjects":["math.HO"],"doi":"10.1080/26375451.2025.2474811","url":"https://arxiv.org/abs/2503.04313","pdf_url":"https://arxiv.org/pdf/2503.04313","is_open_access":true,"published_at":"2025-03-06T10:58:17Z","score":69},{"id":"arxiv_2505.02983","title":"Logits-Constrained Framework with RoBERTa for Ancient Chinese NER","authors":[{"name":"Wenjie Hua"},{"name":"Shenghan Xu"}],"abstract":"This paper presents a Logits-Constrained (LC) framework for Ancient Chinese Named Entity Recognition (NER), evaluated on the EvaHan 2025 benchmark. Our two-stage model integrates GujiRoBERTa for contextual encoding and a differentiable decoding mechanism to enforce valid BMES label transitions. Experiments demonstrate that LC improves performance over traditional CRF and BiLSTM-based approaches, especially in high-label or large-data settings. We also propose a model selection criterion balancing label complexity and dataset size, providing practical guidance for real-world Ancient Chinese NLP tasks.","source":"arXiv","year":2025,"language":"en","subjects":["cs.CL"],"url":"https://arxiv.org/abs/2505.02983","pdf_url":"https://arxiv.org/pdf/2505.02983","is_open_access":true,"published_at":"2025-05-05T19:23:16Z","score":69},{"id":"doaj_10.34024/herodoto.2024.v9.20080","title":"O Culto e a destruição das estátuas antigas nas sociedades árabe-islâmicas contemporâneas","authors":[{"name":"Jorge Elices Ocón"}],"abstract":"\nEste trabalho analisa a recepção das estátuas antigas nas sociedades árabe-islâmicas considerando seis estudos de caso que evidenciam seu valor e vigência, da época medieval até os dias atuais: a construção de uma estátua faraônica, de Ramsés II, no Cairo, durante o governo de Nasser, e sua recente transferência, entre as massas, ao novo museu arqueológico; a descoberta de uma estátua de Dario, em 1972, em Susa e seu papel simbólico como peça de destaque do museu arqueológico de Teerã; a estátua moderna de Zenóbia e sua exibição em Damasco, no ano de 2015, no contexto da guerra civil na Síria; a construção de uma estátua dedicada a Kahina, em 2003, no território argelino de Baghai e seu incêndio, em 2016, resultante de conflitos políticos e religiosos entre comunidades (árabes e berberes) e países (Argélia e França); o vídeo de destruição do Museu Arqueológico de Mossul gravado pelos militantes do DAESH em 2015; a exibição inaugural do Abu Dabi Louvre Museum, em 2017, com uma destacada presença da estatuária clássica. A estátua desempenha um papel determinante na reafirmação identitária de certos coletivos ou regimes políticos, e seu significado se constrói tanto a partir de discursos historiográficos ancorados no suposto rechaço, por parte do Islã, às representações figuradas quanto pelo conjunto de respostas que a estatuária suscita. Venerada ou destruída, a estátua forma parte de uma onda iconoclasta atual e que ressoa sobre os debates globais sobre o patrimônio, as identidades, sua representatividade e a revisão da História.\n","source":"DOAJ","year":2025,"language":"","subjects":["Ancient history"],"doi":"10.34024/herodoto.2024.v9.20080","url":"https://periodicos.unifesp.br/index.php/herodoto/article/view/20080","is_open_access":true,"published_at":"","score":69},{"id":"doaj_10.12797/CC.28.2025.28.12","title":"Two Weddings and a Funeral","authors":[{"name":"Michael Edward Stewart"}],"abstract":"\nThe marriage of Germanus, nephew of Emperor Justin I (r. 518–527), to Matasuintha, former Gothic queen and granddaughter of Theoderic the Great (r. 475–526), in late 549 or early 550, was a significant yet often overlooked moment in the later stages of the Gothic War. Scholars generally interpret the marriage as a pragmatic alliance shaped by immediate strategic concerns – either a political manoeuvre by Justinian or a personal initiative by Germanus following his appointment as commander in Italy. This article revisits that assumption by exploring three related questions. First, did the marriage and military appointment signal a reconciliation between Justinian and Germanus, or a calculated attempt by the emperor to stabilize a deteriorating political situation? Second, how did their relationship evolve in the years leading up to the union, particularly after Theodora’s death in 548? Finally, more speculatively, was Germanus’ earlier decision to marry his daughter to the general John in 545 connected to his own dynastic ambitions?\n","source":"DOAJ","year":2025,"language":"","subjects":["Ancient history","Greek language and literature. Latin language and literature"],"doi":"10.12797/CC.28.2025.28.12","url":"https://journals.akademicka.pl/cc/article/view/6771","is_open_access":true,"published_at":"","score":69},{"id":"ss_e61960ac0dda186bb55567aac78c195a48f65ff3","title":"Ancient herbal therapy: A brief history of Panax ginseng","authors":[{"name":"Maria Assunta Potenza"},{"name":"M. Montagnani"},{"name":"L. Santacroce"},{"name":"Ioannis Alexandros Charitos"},{"name":"L. Bottalico"}],"abstract":"Ginseng was the most revered of the herbs in ancient times in China, Korea, Japan, America. Ginseng was discovered over 5000 years ago in the mountains of Manchuria, China. References to ginseng are found in books dating back more than two millennia. It is revered by the Chinese people as it is considered a herb for everything use and therefore for a wide range of diseases (currently its Latin name derived from the Greek panacea, meanings, that is, for everything). So, it was used exclusively by the Chinese Emperor's, and they were willing to pay the price without problems. Increasing its fame, ginseng brought a flourishing international trade that allowed Korea to supply China with silk and medicines in exchange for wild ginseng and later along with what grows in America.","source":"Semantic Scholar","year":2022,"language":"en","subjects":["Medicine"],"doi":"10.1016/j.jgr.2022.03.004","url":"https://www.semanticscholar.org/paper/e61960ac0dda186bb55567aac78c195a48f65ff3","pdf_url":"https://doi.org/10.1016/j.jgr.2022.03.004","is_open_access":true,"citations":86,"published_at":"","score":68.58},{"id":"ss_2f031f337f7f97b82db98030c5aee26b457015c2","title":"CHisIEC: An Information Extraction Corpus for Ancient Chinese History","authors":[{"name":"Xuemei Tang"},{"name":"Zekun Deng"},{"name":"Qi Su"},{"name":"Haoxia Yang"},{"name":"Jun Wang"}],"abstract":"Natural Language Processing (NLP) plays a pivotal role in the realm of Digital Humanities (DH) and serves as the cornerstone for advancing the structural analysis of historical and cultural heritage texts. This is particularly true for the domains of named entity recognition (NER) and relation extraction (RE). In our commitment to expediting ancient history and culture, we present the “Chinese Historical Information Extraction Corpus”(CHisIEC). CHisIEC is a meticulously curated dataset designed to develop and evaluate NER and RE tasks, offering a resource to facilitate research in the field. Spanning a remarkable historical timeline encompassing data from 13 dynasties spanning over 1830 years, CHisIEC epitomizes the extensive temporal range and text heterogeneity inherent in Chinese historical documents. The dataset encompasses four distinct entity types and twelve relation types, resulting in a meticulously labeled dataset comprising 14,194 entities and 8,609 relations. To establish the robustness and versatility of our dataset, we have undertaken comprehensive experimentation involving models of various sizes and paradigms. Additionally, we have evaluated the capabilities of Large Language Models (LLMs) in the context of tasks related to ancient Chinese history. The dataset and code are available at https://github.com/tangxuemei1995/CHisIEC.","source":"Semantic Scholar","year":2024,"language":"en","subjects":["Computer Science"],"doi":"10.48550/arXiv.2403.15088","url":"https://www.semanticscholar.org/paper/2f031f337f7f97b82db98030c5aee26b457015c2","is_open_access":true,"citations":18,"published_at":"","score":68.53999999999999},{"id":"ss_885e8989d2976a38e4708d44627e3bc9a2577f50","title":"Old and ancient trees are life history lottery winners and vital evolutionary resources for long-term adaptive capacity","authors":[{"name":"C. Cannon"},{"name":"G. Piovesan"},{"name":"S. Munné‐Bosch"}],"abstract":"","source":"Semantic Scholar","year":2022,"language":"en","subjects":["Medicine"],"doi":"10.1038/s41477-021-01088-5","url":"https://www.semanticscholar.org/paper/885e8989d2976a38e4708d44627e3bc9a2577f50","is_open_access":true,"citations":76,"published_at":"","score":68.28},{"id":"arxiv_2407.00475","title":"Classifier identification in Ancient Egyptian as a low-resource sequence-labelling task","authors":[{"name":"Dmitry Nikolaev"},{"name":"Jorke Grotenhuis"},{"name":"Haleli Harel"},{"name":"Orly Goldwasser"}],"abstract":"The complex Ancient Egyptian (AE) writing system was characterised by widespread use of graphemic classifiers (determinatives): silent (unpronounced) hieroglyphic signs clarifying the meaning or indicating the pronunciation of the host word. The study of classifiers has intensified in recent years with the launch and quick growth of the iClassifier project, a web-based platform for annotation and analysis of classifiers in ancient and modern languages. Thanks to the data contributed by the project participants, it is now possible to formulate the identification of classifiers in AE texts as an NLP task. In this paper, we make first steps towards solving this task by implementing a series of sequence-labelling neural models, which achieve promising performance despite the modest amount of training data. We discuss tokenisation and operationalisation issues arising from tackling AE texts and contrast our approach with frequency-based baselines.","source":"arXiv","year":2024,"language":"en","subjects":["cs.CL"],"url":"https://arxiv.org/abs/2407.00475","pdf_url":"https://arxiv.org/pdf/2407.00475","is_open_access":true,"published_at":"2024-06-29T15:40:25Z","score":68},{"id":"arxiv_2408.11903","title":"Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy","authors":[{"name":"Priyanka Mandikal"}],"abstract":"LLMs have revolutionized the landscape of information retrieval and knowledge dissemination. However, their application in specialized areas is often hindered by factual inaccuracies and hallucinations, especially in long-tail knowledge distributions. We explore the potential of retrieval-augmented generation (RAG) models for long-form question answering (LFQA) in a specialized knowledge domain. We present VedantaNY-10M, a dataset curated from extensive public discourses on the ancient Indian philosophy of Advaita Vedanta. We develop and benchmark a RAG model against a standard, non-RAG LLM, focusing on transcription, retrieval, and generation performance. Human evaluations by computational linguists and domain experts show that the RAG model significantly outperforms the standard model in producing factual and comprehensive responses having fewer hallucinations. In addition, a keyword-based hybrid retriever that emphasizes unique low-frequency terms further improves results. Our study provides insights into effectively integrating modern large language models with ancient knowledge systems. Project page with dataset and code: https://sites.google.com/view/vedantany-10m","source":"arXiv","year":2024,"language":"en","subjects":["cs.CL","cs.CY","cs.IR"],"url":"https://arxiv.org/abs/2408.11903","pdf_url":"https://arxiv.org/pdf/2408.11903","is_open_access":true,"published_at":"2024-08-21T18:00:21Z","score":68},{"id":"doaj_10.21041/ra.v14i2.717","title":"Numerical-vector succession for the graphic structural analysis of masonry historic buildings with arches and symmetrical systems","authors":[{"name":"Carlos Alberto  Torres Montes de Oca"},{"name":"José Eduardo  Rosas Valencia"},{"name":"Oswaldo Aldair  Pérez Jarquín"}],"abstract":"\nMany historic buildings have symmetry in their geometric configuration. The objective of this research is to denote the application of numerical-vector succession in the structural analysis of historical masonry buildings, with arches and symmetrical systems, including mathematical processes in ancient graphic analysis, emphasizing the importance of loads in the structural stability. We based the analysis on three fundamental stages: recognition of the construction system of the heritage object, geometric discretization of the system and vector analysis under different physical considerations. Hence, the thrust lines are affected by the loads, boundary conditions and history of structural behaviour. Numerical and computational tools offer faster and more accurate graphic analysis processes.\n\r\n\nMany historic buildings have symmetry in their geometric configuration. The objective of this research is to denote the application of numerical-vector succession in the structural analysis of historical masonry buildings, with arches and symmetrical systems, including mathematical processes in ancient graphic analysis, emphasizing the importance of loads in the structural stability. We based the analysis on three fundamental stages: recognition of the construction system of the heritage object, geometric discretization of the system and vector analysis under different physical considerations. Hence, the thrust lines are affected by the loads, boundary conditions and history of structural behaviour. Numerical and computational tools offer faster and more accurate graphic analysis processes.\n","source":"DOAJ","year":2024,"language":"","subjects":["Building construction"],"doi":"10.21041/ra.v14i2.717","url":"https://www.revistaalconpat.org/index.php/RA/article/view/717","is_open_access":true,"published_at":"","score":68},{"id":"doaj_10.31383/ga.vol8iss1ga04","title":"Uncovering the Past: DNA Analysis of Skeletal Remains from  the Medieval Bosnian City of Bobovac","authors":[{"name":"Mirela Džehverović"},{"name":"Amela Pilav"},{"name":"Belma Jusić"},{"name":"Edin Bujak"},{"name":"Naris Pojskić"},{"name":"Jasmina Čakar"}],"abstract":"\nNumerous archaeological sites in Bosnia and Herzegovina represent a historical heritage and testify to the rich cultural, social, and political life of medieval Bosnia. Bobovac, the capital of the Bosnian Kingdom after King Tvrtko I's coronation in 1377, featured a royal complex with a palace, church, and fortification. Recent molecular-genetic research on skeletal remains from Bobovac aims to uncover medieval ancestors' customs and genetic origins. Fifteen well-preserved teeth samples from Bobovac were processed. STR amplification employed PowerPlex® Fusion and Investigator® 24plex QS Kits, with Y-STR profiles generated using the PowerPlex® Y23 System. Fourteen partial autosomal STR profiles were obtained, enabling sex determination and kinship analysis. STR amplification success varied due to ancient DNA degradation, with larger loci showing lower amplification rates. Kinship analysis confirmed appropriate marker selection, demonstrating high reliability for determining close relationships. Integrating aDNA analysis with archaeological research enhances our understanding of historical populations, connecting archaeology and forensic genetics to contribute to the broader narrative of human history.\n","source":"DOAJ","year":2024,"language":"","subjects":["Genetics"],"doi":"10.31383/ga.vol8iss1ga04","url":"https://genapp.ba/editions/index.php/journal/article/view/216","is_open_access":true,"published_at":"","score":68},{"id":"ss_1b5874bd3d0f7999e25a3363e3004f8d4c32aea0","title":"The ancient history of kissing","authors":[{"name":"T. P. Arbøll"},{"name":"S. Rasmussen"}],"abstract":"Description Sources from Mesopotamia contextualize the emergence of kissing and its role in disease transmission Recent studies maintain that the first known record of human romantic-sexual kissing originates in a Bronze Age manuscript deriving from South Asia (India), tentatively dated to 1500 BCE (1). Yet, a substantial corpus of overlooked evidence challenges this premise because lip kissing was documented in ancient Mesopotamia and Egypt from at least 2500 BCE onward. Because this behavior did not emerge abruptly or in a specific society but appears to have been practiced in multiple ancient cultures over several millennia, the kiss cannot be regarded as a sudden biological trigger causing a spread of specific pathogens, as recently proposed (2). Further understanding of the history of kissing in human societies—and its secondary effect on disease transmission—can be gained from a case study of sources from ancient Mesopotamia (modern-day Iraq and Syria).","source":"Semantic Scholar","year":2023,"language":"en","subjects":["Medicine"],"doi":"10.1126/science.adf0512","url":"https://www.semanticscholar.org/paper/1b5874bd3d0f7999e25a3363e3004f8d4c32aea0","pdf_url":"https://www.science.org/doi/pdf/10.1126/science.adf0512?download=true","is_open_access":true,"citations":9,"published_at":"","score":67.27000000000001},{"id":"ss_238fa5c7fd325218e05bbc8334a9f5f13d64d942","title":"Insights into human history from the first decade of ancient human genomics","authors":[{"name":"Yichen Liu"},{"name":"Xiaowei Mao"},{"name":"J. Krause"},{"name":"Qiaomei Fu"}],"abstract":"Description Recent advancements in DNA sequencing technologies and laboratory preparation protocols have rapidly expanded the scope of ancient DNA research over the past decade, both temporally and geographically. Discoveries include interactions between archaic and modern humans as well as modern human population dynamics, including those coinciding with the Last Glacial Maximum and the settlement history of most world regions. This new type of data allows us to examine the deep past of human population dynamics and sharpen the current understanding of our present. The continued development in the ancient DNA field has transformed our understanding of human genetic history and will keep uncovering the further mysteries of our recent evolutionary past.","source":"Semantic Scholar","year":2021,"language":"en","subjects":["Medicine"],"doi":"10.1126/science.abi8202","url":"https://www.semanticscholar.org/paper/238fa5c7fd325218e05bbc8334a9f5f13d64d942","is_open_access":true,"citations":68,"published_at":"","score":67.03999999999999},{"id":"crossref_10.1017/9781108620420.003","title":"The History of Ancient Christian History","authors":[{"name":"David E. Wilhite"}],"abstract":"","source":"CrossRef","year":2023,"language":"en","subjects":null,"doi":"10.1017/9781108620420.003","url":"https://doi.org/10.1017/9781108620420.003","is_open_access":true,"citations":1,"published_at":"","score":67.03},{"id":"arxiv_2306.17647","title":"A Brief History of Space VLBI","authors":[{"name":"Leonid I. Gurvits"}],"abstract":"Space Very Long Baseline Interferometry is a radio astronomy technique distinguished by a record-high angular resolution reaching single-digit microseconds of arc. The paper provides a brief account of the history of developments of this technique over the period 1960s-2020s.","source":"arXiv","year":2023,"language":"en","subjects":["astro-ph.IM"],"doi":"10.1109/HISTELCON56357.2023.10365962","url":"https://arxiv.org/abs/2306.17647","pdf_url":"https://arxiv.org/pdf/2306.17647","is_open_access":true,"published_at":"2023-06-30T13:34:32Z","score":67},{"id":"arxiv_2308.13116","title":"Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation","authors":[{"name":"Kevin Krahn"},{"name":"Derrick Tate"},{"name":"Andrew C. Lamicela"}],"abstract":"Contextual language models have been trained on Classical languages, including Ancient Greek and Latin, for tasks such as lemmatization, morphological tagging, part of speech tagging, authorship attribution, and detection of scribal errors. However, high-quality sentence embedding models for these historical languages are significantly more difficult to achieve due to the lack of training data. In this work, we use a multilingual knowledge distillation approach to train BERT models to produce sentence embeddings for Ancient Greek text. The state-of-the-art sentence embedding approaches for high-resource languages use massive datasets, but our distillation approach allows our Ancient Greek models to inherit the properties of these models while using a relatively small amount of translated sentence data. We build a parallel sentence dataset using a sentence-embedding alignment method to align Ancient Greek documents with English translations, and use this dataset to train our models. We evaluate our models on translation search, semantic similarity, and semantic retrieval tasks and investigate translation bias. We make our training and evaluation datasets freely available at https://github.com/kevinkrahn/ancient-greek-datasets .","source":"arXiv","year":2023,"language":"en","subjects":["cs.CL"],"url":"https://arxiv.org/abs/2308.13116","pdf_url":"https://arxiv.org/pdf/2308.13116","is_open_access":true,"published_at":"2023-08-24T23:38:44Z","score":67},{"id":"arxiv_2308.12008","title":"Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature","authors":[{"name":"Frederick Riemenschneider"},{"name":"Anette Frank"}],"abstract":"Intertextual allusions hold a pivotal role in Classical Philology, with Latin authors frequently referencing Ancient Greek texts. Until now, the automatic identification of these intertextual references has been constrained to monolingual approaches, seeking parallels solely within Latin or Greek texts. In this study, we introduce SPhilBERTa, a trilingual Sentence-RoBERTa model tailored for Classical Philology, which excels at cross-lingual semantic comprehension and identification of identical sentences across Ancient Greek, Latin, and English. We generate new training data by automatically translating English texts into Ancient Greek. Further, we present a case study, demonstrating SPhilBERTa's capability to facilitate automated detection of intertextual parallels. Our models and resources are available at https://github.com/Heidelberg-NLP/ancient-language-models.","source":"arXiv","year":2023,"language":"en","subjects":["cs.CL"],"url":"https://arxiv.org/abs/2308.12008","pdf_url":"https://arxiv.org/pdf/2308.12008","is_open_access":true,"published_at":"2023-08-23T08:54:05Z","score":67}],"total":7184983,"page":1,"page_size":20,"sources":["CrossRef","arXiv","DOAJ","Semantic Scholar"],"query":"Ancient history"}