arXiv Open Access 2025

Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation

Abdessalam Bouchekif Samer Rashwani Heba Sbahi Shahd Gaben Mutaz Al-Khatib +1 lainnya
Lihat Sumber

Abstrak

This paper evaluates the knowledge and reasoning capabilities of Large Language Models in Islamic inheritance law, known as 'ilm al-mawarith. We assess the performance of seven LLMs using a benchmark of 1,000 multiple-choice questions covering diverse inheritance scenarios, designed to test models' ability to understand the inheritance context and compute the distribution of shares prescribed by Islamic jurisprudence. The results reveal a significant performance gap: o3 and Gemini 2.5 achieved accuracies above 90%, whereas ALLaM, Fanar, LLaMA, and Mistral scored below 50%. These disparities reflect important differences in reasoning ability and domain adaptation. We conduct a detailed error analysis to identify recurring failure patterns across models, including misunderstandings of inheritance scenarios, incorrect application of legal rules, and insufficient domain knowledge. Our findings highlight limitations in handling structured legal reasoning and suggest directions for improving performance in Islamic legal reasoning. Code: https://github.com/bouchekif/inheritance_evaluation

Topik & Kata Kunci

Penulis (6)

A

Abdessalam Bouchekif

S

Samer Rashwani

H

Heba Sbahi

S

Shahd Gaben

M

Mutaz Al-Khatib

M

Mohammed Ghaly

Format Sitasi

Bouchekif, A., Rashwani, S., Sbahi, H., Gaben, S., Al-Khatib, M., Ghaly, M. (2025). Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation. https://arxiv.org/abs/2509.01081

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓