Efficient Recurrent Residual Networks Improved by Feature Transfer