Text Classification Using RNN on IMDB Dataset

This text classification tutorial demonstrates the implementation of a Recurrent Neural Network (RNN) on the IMDB large movie review dataset for sentiment analysis. The dataset comprises movie reviews labeled as either positive or negative sentiment.

Purpose

The code showcases:

Setup and initialization using TensorFlow and TensorFlow Datasets (TFDS).
Preprocessing of the IMDB dataset for binary sentiment classification.
Building an RNN-based model using TensorFlow/Keras for sentiment analysis.
Model training, evaluation, and visualization of training metrics.

Setup

This code requires TensorFlow and TensorFlow Datasets. Use the provided setup to install the necessary packages.

Input Pipeline

The dataset is split into training and test sets and processed using TensorFlow Datasets. The code demonstrates:

Dataset loading with tfds.load.
Shuffle and batch setup for training and test datasets.
Visualization of text and label pairs.

Text Encoding

The raw text from the dataset is preprocessed using the TextVectorization layer. This layer adapts to the text and encodes it into indices for model input. The process involves setting vocabulary size, encoding text to indices, and reversing the encoding.

Model Architecture

The model architecture consists of the following layers:

TextVectorization layer for encoding text.
Embedding layer for word representation.
Bidirectional LSTM layer for sequence processing.
Dense layers for final classification.

Training and Evaluation

The code compiles and trains the model using a binary cross-entropy loss function and Adam optimizer. It tracks training accuracy, loss, and evaluates model performance on the test set.

Additional Techniques

Demonstrates the use of stacking multiple LSTM layers in the model architecture for improved performance. It visualizes training metrics using Matplotlib.

Usage

Setup: Install required packages.
Execution: Run code blocks sequentially to observe the training process and model evaluation.
Model Customization: Explore changing the model architecture or hyperparameters for different results.
Visualizations: Analyze training and validation metric plots to understand model performance.

Sample Predictions

The code includes examples of predicting sentiment for custom input sentences using the trained model.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
text_classification_using___RNN_using_IMDB_Dataset.ipynb		text_classification_using___RNN_using_IMDB_Dataset.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Classification Using RNN on IMDB Dataset

Purpose

Setup

Input Pipeline

Text Encoding

Model Architecture

Training and Evaluation

Additional Techniques

Usage

Sample Predictions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Text Classification Using RNN on IMDB Dataset

Purpose

Setup

Input Pipeline

Text Encoding

Model Architecture

Training and Evaluation

Additional Techniques

Usage

Sample Predictions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages