Towards AI-Assisted Motorcycle Safety: Multi-Modal Video Analysis for Hazard Detection and Contextual Risk Assessment
Abstrak
Motorcyclists face a disproportionately high risk of severe injury or death compared to other road users, highlighting the need for intelligent rider assistance technologies. This paper presents an initial, modular, and interpretable AI pipeline that generates context-aware safety advice from first-person motorcycle videos with practical inference latency suitable for on-device deployment, framing large language models as interpretable cognitive support agents for motorcycle safety. The system integrates lightweight perception and reasoning components to emulate the function of an Advanced Rider Assistance System (ARAS). Video frames are processed at 1 FPS using Pixtral, a Mistral-based multimodal large language model (MLLM), to produce descriptive scene captions, while YOLOv8 identifies key objects such as vehicles, pedestrians, and road hazards. A Mistral-small language model then fuses this information to generate concise, imperative safety tips. Preliminary evaluations on publicly available motorcycle POV datasets demonstrate promising performance in terms of contextual accuracy, interpretability, and scalability, suggesting potential for real-world deployment in low-resource or embedded environments. The proposed framework offers interpretable, context-aware safety assistance that is particularly valuable for young and newly licensed riders during the transition from supervised training to independent riding, where real-time hazard interpretation support is most needed.
Topik & Kata Kunci
Penulis (4)
Fatemeh Ghorbani
Augustin Hym
Mohammed Elhenawy
Andry Rakotonirainy
Akses Cepat
- Tahun Terbit
- 2026
- Sumber Database
- DOAJ
- DOI
- 10.3390/vehicles8020039
- Akses
- Open Access ✓