DOAJ Open Access 2025

N-Gram and Full-Text Search Algorithm Testing for Pattern Recognition in a Chatbot Engine

I Made Sukarsa Deden Witarsyah I Putu Agung Bayupati Putu Wira Buana Ni Wayan Wisswani +5 lainnya

Abstrak

The development of chatbots to access database services and information systems has triggered a lot of research on frameworks for service development, including the development of ISONER (Information System On Internet Messenger). This framework consists of multiple phases including pattern recognition, query processing, and response generation. In its implementation, the framework develops pattern recognition services that are currently based on Natural Language Processing (NLP). Improved pattern recognition algorithms enhance the system’s ability to accurately interpret user intent. The pattern recognition used in this research utilizes built-in plugins from MySQL, namely N-gram and Full-Text Search, which can be run directly on the MySQL engine to reduce latency and do not require another programming language. The FTS and fourgram algorithms gave the best results when applied on 100 test data points, with a threshold of 0.91, accuracy of 91%, precision of 99%, and recall of 92%; the average computation time was 19 s for 100 test data points and 2 min 49 s for 1000 data points tested simultaneously.

Penulis (10)

I

I Made Sukarsa

D

Deden Witarsyah

I

I Putu Agung Bayupati

P

Putu Wira Buana

N

Ni Wayan Wisswani

I

I Ketut Adi Purnawan

I

I Putu Adi Putra Setiawan

I

I Putu Ngurah Krisna Dana

I

I Wayan Darmika Esa Krissayoga

E

Eko Prasetyo

Format Sitasi

Sukarsa, I.M., Witarsyah, D., Bayupati, I.P.A., Buana, P.W., Wisswani, N.W., Purnawan, I.K.A. et al. (2025). N-Gram and Full-Text Search Algorithm Testing for Pattern Recognition in a Chatbot Engine. https://doi.org/10.3390/engproc2025107086

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3390/engproc2025107086
Informasi Jurnal
Tahun Terbit
2025
Sumber Database
DOAJ
DOI
10.3390/engproc2025107086
Akses
Open Access ✓