Text this: Extreme action recognition from real-time video using time-series deep learning model