VS-BIM: a cognitive map-driven framework enhancing MLLM for automatic safety inspection in construction

The rise of Multimodal Large Language Models (MLLMs) offers new potential for automated construction safety inspection. However, current discriminative vision-language alignment approaches struggle with spatial understanding and complex reasoning, limiting proactive risk detection. To address this,...

Bibliographic Details
Main Authors: Wang, Lei; Liu, Yu; Wang, Cunrui; An, Hongda; Li, Yiting
Format: Article
Language: English
Published: Elsevier 2026
Online Access: http://psasir.upm.edu.my/id/eprint/122268/1/122268.pdf
http://psasir.upm.edu.my/id/eprint/122268/
https://linkinghub.elsevier.com/retrieve/pii/S147403462500878X