VS-BIM: a cognitive map-driven framework enhancing MLLM for automatic safety inspection in construction
The rise of Multimodal Large Language Models (MLLMs) offers new potential for automated construction safety inspection. However, current discriminative vision-language alignment approaches struggle with spatial understanding and complex reasoning, limiting proactive risk detection. To address this,...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
Elsevier
2026
|
| Subjects: | |
| Online Access: | http://psasir.upm.edu.my/id/eprint/122268/1/122268.pdf http://psasir.upm.edu.my/id/eprint/122268/ https://linkinghub.elsevier.com/retrieve/pii/S147403462500878X |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
