Text this: A vision-based deep learning approach for non-contact vibration measurement using (2+1)D CNN and optical flow