Text this: CMMF-Net: A Generative network based on CLIP-guided multi-modal feature fusion for thermal infrared image colorization