arXiv Open Access 2025

Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning

Libo Wang

Abstract

Long chain-of-thought (CoT) reasoning suffers from computational redundancy and delayed reward assignment, both of which raise cost and computing resource consumption. To reduce them, this research proposes dynamic chain-of-thought (D-CoT), which adapts reasoning time and the number of reasoning steps. The researcher ran a simulation experiment that integrates D-CoT using Python 3.13 IDLE combined with a Python simulator based on GPTs. DeepSeek R1 served as the control group, and both were tested on linear algebra exam questions from MIT OpenCourseWare. The experimental results show that D-CoT outperforms DeepSeek R1, which relies on long CoT, on three indicators: reasoning time, CoT length (number of reasoning steps), and token count, which amounts to a significant reduction in computing resource consumption. This research may also serve as a reference for future dynamic deep reasoning frameworks.

Topics & Keywords

cs.AI cs.LG

Authors (1)

Libo Wang

Citation Format

Wang, L. (2025). Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning. https://arxiv.org/abs/2502.10428

Journal Information

Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓