PlaygroundRL

PlaygroundRL is a real-time reinforcement learning playground that runs entirely in the browser. Watch autonomous agents explore stylized grid worlds, adapt to obstacles, and chase rewards using Proximal Policy Optimization (PPO).

Overview

PlaygroundRL turns PPO training into an interactive visual experience. Multiple agents learn concurrently inside richly lit Three.js environments, helping you understand how policy gradients behave under different levels of difficulty.

Features

Real-time AI Training: Watch PPO agents improve directly in the browser
Multiple Difficulty Levels: Two distinct environments with increasing complexity
Smooth 3D Visualization: Powered by React Three Fiber for performant 3D graphics
Multi-Agent System: Ten agents learn simultaneously for richer dynamics
Dynamic Environments: Level 2 introduces moving obstacles for an added challenge

Tech Stack

Frontend: Next.js 14, React, TypeScript
3D Graphics: React Three Fiber, Three.js
AI/ML: ONNX Runtime Web for in-browser inference
Styling: Tailwind CSS, shadcn/ui components
State Management: Zustand
Animation: React Spring

Getting Started

Prerequisites

Node.js 18.17+ (the minimum version supported by Next.js 14)
npm or yarn

Installation

# Clone the repository
git clone https://github.com/yourusername/playgroundrl.git

# Navigate to project directory
cd playgroundrl

# Install dependencies
npm install
# or
yarn install

# Run the development server
npm run dev
# or
yarn dev

Open http://localhost:3000 to see the application.

Build for Production

npm run build
npm start

How It Works

The Environment

Grid World: 25x25 tile-based environment
Agents: Bunny agents start from random positions
Goal: Find the pink reward tile while avoiding hologram tiles
Obstacles:
- Level 1: Static hologram tiles (instant failure)
- Level 2: Moving hologram tiles + vision-based navigation
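The Level 1 rules above can be sketched as a tiny grid world. This is an illustrative sketch, not the project's actual environment code: the class name `GridWorld`, the action ordering in `MOVES`, the status strings, and the choice to clamp moves at the grid edge are all assumptions.

```python
import random
from dataclasses import dataclass

GRID_SIZE = 25  # 25x25 grid, as described above

# Action index -> (dx, dy). The ordering is an assumption for illustration.
MOVES = {0: (0, -1), 1: (0, 1), 2: (-1, 0), 3: (1, 0)}  # up, down, left, right


@dataclass
class GridWorld:
    goal: tuple[int, int]             # the pink reward tile
    holograms: set[tuple[int, int]]   # static obstacle tiles (Level 1)
    agent: tuple[int, int] = (0, 0)

    def reset(self, rng: random.Random) -> tuple[int, int]:
        """Place the agent on a random tile that is neither goal nor hologram."""
        free = [
            (x, y)
            for x in range(GRID_SIZE)
            for y in range(GRID_SIZE)
            if (x, y) != self.goal and (x, y) not in self.holograms
        ]
        self.agent = rng.choice(free)
        return self.agent

    def step(self, action: int) -> tuple[tuple[int, int], str]:
        """Move one tile; moves off the grid are clamped (an assumption)."""
        dx, dy = MOVES[action]
        x = min(max(self.agent[0] + dx, 0), GRID_SIZE - 1)
        y = min(max(self.agent[1] + dy, 0), GRID_SIZE - 1)
        self.agent = (x, y)
        if self.agent in self.holograms:
            return self.agent, "failed"        # hologram: instant failure
        if self.agent == self.goal:
            return self.agent, "reached_goal"
        return self.agent, "running"
```

Level 2 would additionally update `holograms` between steps; how the obstacles move is not specified here, so it is left out of the sketch.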

The AI

The bunnies use PPO (Proximal Policy Optimization) to learn optimal policies:

State Space: Agent position, target position, distance to goal (+ vision in Level 2)
Action Space: 4 discrete actions (up, down, left, right)
Reward Structure: Positive reward for reaching the goal, negative for hitting obstacles
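As a rough illustration of the state and reward description above, the Level 1 observation and reward could look like the following. The function names, the normalization to [0, 1], the use of Manhattan distance, and the reward magnitudes are assumptions made for this sketch; the actual values live in the training scripts.

```python
GRID_SIZE = 25

GOAL_REWARD = 1.0         # assumed magnitude
HOLOGRAM_PENALTY = -1.0   # assumed magnitude


def build_observation(agent, goal):
    """Return [agent_x, agent_y, goal_x, goal_y, distance], each scaled to [0, 1]."""
    ax, ay = agent
    gx, gy = goal
    distance = abs(ax - gx) + abs(ay - gy)  # Manhattan distance (assumption)
    scale = GRID_SIZE - 1
    return [ax / scale, ay / scale, gx / scale, gy / scale, distance / (2 * scale)]


def reward(agent, goal, holograms):
    """Positive for reaching the goal, negative for touching a hologram tile."""
    if agent in holograms:
        return HOLOGRAM_PENALTY
    if agent == goal:
        return GOAL_REWARD
    return 0.0
```

For example, `build_observation((0, 0), (24, 24))` describes an agent in one corner with the goal in the opposite corner, so the distance component is at its maximum of 1.0.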

Model Details

Architecture: Actor-Critic neural network
Training: Python implementation with stable-baselines3
Deployment: ONNX models running in-browser via ONNX Runtime Web
Hyperparameters: See in-app "Model Details" for complete configuration
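An actor-critic policy over four discrete actions typically outputs one logit per action, which is turned into a probability distribution with a softmax. The sketch below shows that step and a greedy action choice in plain Python. It is only an illustration: the `ACTIONS` ordering and the function names are assumptions, and the in-browser logic in runModel.ts may differ.

```python
import math

ACTIONS = ["up", "down", "left", "right"]  # ordering is an assumption


def softmax(logits):
    """Convert raw actor logits into action probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(logits):
    """Pick the most probable action (deterministic, greedy inference)."""
    probs = softmax(logits)
    return max(range(len(probs)), key=probs.__getitem__)
```

With logits `[0.1, 2.0, -1.0, 0.5]`, `select_action` returns index 1, which maps to "down" under the assumed ordering.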

Project Structure

├── app/
│   ├── (game)/
│   │   ├── page.tsx          # Main game page
│   │   ├── LevelOne.tsx      # Level 1 implementation
│   │   ├── LevelTwo.tsx      # Level 2 implementation
│   │   ├── Player.tsx        # Player bunny component
│   │   ├── runModel.ts       # ONNX inference logic
│   │   └── store/            # Zustand stores
│   └── components/           # UI components
├── public/
│   └── models/               # 3D models and ONNX files
└── train/                    # Python training scripts

Training Your Own Model

The train/ directory contains Python scripts for training new PPO models:

cd train
python ppo.py  # Train the model
python torch2onnx.py  # Convert to ONNX format
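For readers new to PPO, the quantity that training maximizes per sample is the clipped surrogate objective. stable-baselines3 computes this internally, so the function below is only an illustration and is not part of ppo.py; the name `clipped_surrogate` is hypothetical, and the default `clip_range` of 0.2 matches the stable-baselines3 default.

```python
def clipped_surrogate(ratio, advantage, clip_range=0.2):
    """PPO clipped objective for one sample.

    ratio is new_policy_prob / old_policy_prob for the action taken;
    advantage estimates how much better that action was than average.
    """
    clipped_ratio = min(max(ratio, 1.0 - clip_range), 1.0 + clip_range)
    # Taking the minimum removes any incentive to move the policy
    # further than clip_range away from the old policy.
    return min(ratio * advantage, clipped_ratio * advantage)
```

When the ratio is 1.5 and the advantage is positive, the objective is capped as if the ratio were 1.2, which keeps each policy update small.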

License

This project is licensed under the MIT License - see the LICENSE file for details.
