Video-Recognition-using-LSTM

An LSTM model trained for action recognition on a video dataset. The project is implemented in TensorFlow.

Dataset

The dataset contains 588 videos, each 4 to 6 seconds long. Each video consists of 50 frames of 2048 pixels each. The videos are divided into 5 categories:

Cricket Bowling: Consists of bowling videos from cricket matches.
Cricket Shot: Consists of videos of batsmen hitting balls in cricket games.
Pizza Tossing: Consists of videos of people tossing pizzas in the air.
Playing Cello: Consists of videos of people playing the cello.
Playing Sitar: Consists of videos of people playing the sitar.
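
As a rough illustration of the data layout described above, the sketch below records the stated dimensions and builds one-hot labels for the five categories. The names `CATEGORIES`, `DATASET_SHAPE`, and `one_hot` are hypothetical, and the class index order is an assumption, not something taken from the project.

```python
import numpy as np

# Category names from the list above; the index order is an assumption.
CATEGORIES = [
    "Cricket Bowling",
    "Cricket Shot",
    "Pizza Tossing",
    "Playing Cello",
    "Playing Sitar",
]

# Dimensions stated in the Dataset section: videos, frames per video, values per frame.
DATASET_SHAPE = (588, 50, 2048)


def one_hot(label_ids, num_classes):
    """Convert integer class ids of shape (n,) to one-hot rows of shape (n, num_classes)."""
    label_ids = np.asarray(label_ids)
    encoded = np.zeros((label_ids.shape[0], num_classes), dtype=np.float32)
    encoded[np.arange(label_ids.shape[0]), label_ids] = 1.0
    return encoded


# Encode two example labels by looking up their category names.
example_ids = [CATEGORIES.index("Pizza Tossing"), CATEGORIES.index("Playing Sitar")]
labels = one_hot(example_ids, len(CATEGORIES))
```

One-hot rows like these pair naturally with a softmax output layer that has one unit per category.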

Preprocessing

All videos were split into frames, and each frame was passed through a convolutional neural network (VGG16). The extracted features were stored as sequential data in an npz file. The data was then divided into a training set and a validation set.
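
The storage and splitting steps can be sketched with NumPy as follows. This is a minimal sketch under stated assumptions: random arrays stand in for the CNN features, and the file name `features.npz`, the array keys, the helper `split_train_val`, and the 80/20 split are all hypothetical, since the project does not state them.

```python
import os
import tempfile

import numpy as np


def split_train_val(features, labels, val_fraction=0.2, seed=0):
    """Shuffle videos and split them into training and validation subsets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(features.shape[0])
    n_val = int(round(features.shape[0] * val_fraction))
    val_idx, train_idx = order[:n_val], order[n_val:]
    return (features[train_idx], labels[train_idx]), (features[val_idx], labels[val_idx])


# Random stand-in for extracted features: 10 videos, 50 frames, 2048 values per frame.
rng = np.random.default_rng(0)
features = rng.standard_normal((10, 50, 2048)).astype(np.float32)
labels = rng.integers(0, 5, size=10)

# Store the sequences in an .npz archive, then load them back.
path = os.path.join(tempfile.mkdtemp(), "features.npz")
np.savez(path, features=features, labels=labels)
with np.load(path) as data:
    loaded_features, loaded_labels = data["features"], data["labels"]

(train_x, train_y), (val_x, val_y) = split_train_val(loaded_features, loaded_labels)
```

Shuffling before the split keeps each subset from being dominated by a single category when the videos are stored in category order.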

Model

The model consists of a basic LSTM cell with 128 hidden units, unrolled over the frame sequence by a dynamic RNN layer. The whole model was implemented in TensorFlow.
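
To make the computation concrete, here is a NumPy sketch of a generic LSTM forward pass over a batch of frame sequences, followed by a linear layer that produces one score per category. It is not the TensorFlow code from the notebook: the function names, the gate ordering, the random weight initialization, and the output layer are assumptions for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_forward(inputs, kernel, bias, num_units):
    """Run a single LSTM layer over a batch of sequences.

    inputs: (batch, time, input_size)
    kernel: (input_size + num_units, 4 * num_units)
    bias:   (4 * num_units,)
    Returns the hidden state after the last time step, shape (batch, num_units).
    """
    batch, steps, _ = inputs.shape
    h = np.zeros((batch, num_units))
    c = np.zeros((batch, num_units))
    for t in range(steps):
        # One matrix product computes all four gates from [x_t, h_{t-1}].
        gates = np.concatenate([inputs[:, t, :], h], axis=1) @ kernel + bias
        i, g, f, o = np.split(gates, 4, axis=1)  # gate order is an assumption
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
    return h


NUM_UNITS, INPUT_SIZE, NUM_CLASSES = 128, 2048, 5
rng = np.random.default_rng(0)

# Random stand-ins for a batch of 2 videos and for the model weights.
inputs = rng.standard_normal((2, 50, INPUT_SIZE))
kernel = 0.01 * rng.standard_normal((INPUT_SIZE + NUM_UNITS, 4 * NUM_UNITS))
bias = np.zeros(4 * NUM_UNITS)
w_out = 0.01 * rng.standard_normal((NUM_UNITS, NUM_CLASSES))
b_out = np.zeros(NUM_CLASSES)

last_hidden = lstm_forward(inputs, kernel, bias, NUM_UNITS)
logits = last_hidden @ w_out + b_out  # one score per category
```

Only the final hidden state feeds the classifier here; using the state from every time step would be a different design.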

Training

Training was done in batches of 32. The Adam optimizer was used with learning rate = 1e-4, beta1 = 0.9, beta2 = 0.999, and epsilon = 1e-8. Softmax classification was used with cross-entropy loss. Training reached approximately 90% accuracy on the validation set.
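
The loss and the optimizer update can be sketched in NumPy as below. This is an illustrative sketch, not the project's training loop: `softmax_cross_entropy` and `adam_step` are hypothetical helpers, the update follows the standard textbook form of Adam, and framework implementations may differ in small details such as where epsilon is applied.

```python
import numpy as np


def softmax_cross_entropy(logits, one_hot_labels):
    """Mean cross-entropy between softmax(logits) and one-hot labels."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # for numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-(one_hot_labels * log_probs).sum(axis=1).mean())


def adam_step(param, grad, m, v, t, lr=1e-4, beta1=0.9, beta2=0.999, epsilon=1e-8):
    """Apply one Adam update; t is the 1-based step count."""
    m = beta1 * m + (1.0 - beta1) * grad        # running mean of gradients
    v = beta2 * v + (1.0 - beta2) * grad ** 2   # running mean of squared gradients
    m_hat = m / (1.0 - beta1 ** t)              # bias correction
    v_hat = v / (1.0 - beta2 ** t)
    param = param - lr * m_hat / (np.sqrt(v_hat) + epsilon)
    return param, m, v


# With all-zero logits over 5 classes, the loss equals ln(5).
logits = np.zeros((1, 5))
target = np.array([[0.0, 0.0, 1.0, 0.0, 0.0]])
loss = softmax_cross_entropy(logits, target)

# One update of a single parameter with gradient 2.0.
param, m, v = adam_step(np.array(1.0), np.array(2.0), m=0.0, v=0.0, t=1)
```

On the first step the bias-corrected ratio has magnitude close to 1, so the parameter moves by roughly the learning rate regardless of the gradient's scale.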
