Text this: Enhanced normalization approach to address stop-word complexity in compound-word schema labels