arXiv Open Access 2025

Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation

Abdessalam Bouchekif, Samer Rashwani, Heba Sbahi, Shahd Gaben, Mutaz Al-Khatib, +1 more

Abstract

This paper evaluates the knowledge and reasoning capabilities of Large Language Models in Islamic inheritance law, known as 'ilm al-mawarith. We assess the performance of seven LLMs using a benchmark of 1,000 multiple-choice questions covering diverse inheritance scenarios, designed to test models' ability to understand the inheritance context and compute the distribution of shares prescribed by Islamic jurisprudence. The results reveal a significant performance gap: o3 and Gemini 2.5 achieved accuracies above 90%, whereas ALLaM, Fanar, LLaMA, and Mistral scored below 50%. These disparities reflect important differences in reasoning ability and domain adaptation. We conduct a detailed error analysis to identify recurring failure patterns across models, including misunderstandings of inheritance scenarios, incorrect application of legal rules, and insufficient domain knowledge. Our findings highlight limitations in handling structured legal reasoning and suggest directions for improving performance in Islamic legal reasoning. Code: https://github.com/bouchekif/inheritance_evaluation
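To give a sense of the share computation the abstract refers to, here is a minimal sketch of one narrow, textbook scenario: a deceased man survived by one wife and by children including at least one son. It assumes the standard rules that the wife takes a fixed 1/8 when there are children and that the remainder is divided so each son receives twice the share of each daughter. The scenario and the function name `distribute` are illustrative assumptions; they are not taken from the paper's benchmark or its code repository.

```python
from fractions import Fraction


def distribute(sons: int, daughters: int) -> dict[str, Fraction]:
    """Shares of the estate for one wife plus sons and daughters.

    Hypothetical helper covering only this single scenario:
    the wife takes a fixed 1/8, and the children share the
    remainder with each son counted as two units and each
    daughter as one unit.
    """
    if sons < 1 or daughters < 0:
        raise ValueError("this sketch requires at least one son")

    wife = Fraction(1, 8)            # fixed share when children exist
    residue = 1 - wife               # what is left for the children
    units = 2 * sons + daughters     # son = 2 units, daughter = 1 unit
    unit = residue / units

    return {
        "wife": wife,
        "each_son": 2 * unit,
        "each_daughter": unit,
    }


shares = distribute(sons=1, daughters=1)
for heir, share in shares.items():
    print(f"{heir}: {share}")
```

Exact fractions are used instead of floats so that the shares can be checked to sum to the whole estate without rounding error. Real cases involve many more heirs and rules than this sketch handles.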

Topics & Keywords

cs.CL cs.AI

Authors (6)

Abdessalam Bouchekif

Samer Rashwani

Heba Sbahi

Shahd Gaben

Mutaz Al-Khatib

Mohammed Ghaly

Citation Format

Bouchekif, A., Rashwani, S., Sbahi, H., Gaben, S., Al-Khatib, M., & Ghaly, M. (2025). Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation. https://arxiv.org/abs/2509.01081

Journal Information

Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓