Data quality, scaling assumptions and reliability of crowd behaviour scale