Text this: A fine-tuned large language model for domain-specific with reinforcement learning