Framework for a semantic data transformation in solving data quality issues in big data