LSTM(Long Short Term Memory) -The model complemented RNN. -It can over 1000 steps. -It can capture long term dependency. σ is sigmoid(between 0 and 1). C is a highway, h is a national road. Therefore, there is less change to occur vanishing gradient. https://www.coursera.org/learn/cnns-and-rnns/lecture/RJpfH/new-video 1. Input layer that receives 10 inputs Input(shape=(10,)) 2. Words to dense ve..