Reduced-reference video quality metric using spatio-temporal activity information