TacoSkill LAB

stable-baselines3

by davila7

126 Favorites
310 Upvotes
0 Downvotes

Use this skill for reinforcement learning tasks including training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.), creating custom Gym environments, implementing callbacks for monitoring and control, using vectorized environments for parallel training, and integrating with deep RL workflows. This skill should be used when users request RL algorithm implementation, agent training, environment design, or RL experimentation.
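
As a rough illustration of the "custom Gym environments" part of this description, the sketch below shows the reset/step contract that Gymnasium-style environments follow, written in plain Python with no dependencies. `LineWorldEnv` and `rollout` are hypothetical names invented for this example and are not taken from the skill. A real environment would normally subclass `gymnasium.Env`, declare `observation_space` and `action_space`, and be validated with Stable Baselines3's `check_env` before training.

```python
class LineWorldEnv:
    """Hypothetical 1-D corridor: start at cell 0, reach the last cell."""

    def __init__(self, size=5, max_steps=20):
        self.size = size
        self.max_steps = max_steps
        self.position = 0
        self.steps = 0

    def reset(self, seed=None):
        # Gymnasium-style reset returns (observation, info).
        # The seed is accepted for interface compatibility; this
        # environment is deterministic, so it is not used.
        self.position = 0
        self.steps = 0
        return self.position, {}

    def step(self, action):
        # Action 1 moves right, any other action moves left.
        delta = 1 if action == 1 else -1
        self.position = min(max(self.position + delta, 0), self.size - 1)
        self.steps += 1
        # terminated: the task's own end condition (goal reached).
        terminated = self.position == self.size - 1
        # truncated: an external limit (step budget) ended the episode.
        truncated = self.steps >= self.max_steps and not terminated
        reward = 1.0 if terminated else -0.01
        return self.position, reward, terminated, truncated, {}


def rollout(env, policy, seed=0):
    """Run one episode and return the undiscounted sum of rewards."""
    obs, _ = env.reset(seed=seed)
    total = 0.0
    done = False
    while not done:
        obs, reward, terminated, truncated, _ = env.step(policy(obs))
        total += reward
        done = terminated or truncated
    return total


if __name__ == "__main__":
    env = LineWorldEnv()
    print(rollout(env, lambda obs: 1))  # policy that always moves right
```

Keeping `terminated` and `truncated` separate matters in practice, because learning algorithms generally treat a goal state differently from a time-limit cutoff when bootstrapping value estimates.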

reinforcement learning

Rating: 8.7

Installs: 0

Category: Machine Learning

Quick Review

An excellent reinforcement learning skill with comprehensive coverage of Stable Baselines3 capabilities. The description clearly states when to use the skill (RL training, custom environments, callbacks, vectorization). Task knowledge is outstanding, with detailed code examples, gotchas, and workflow guidance covering training, evaluation, custom environments, and advanced features. The structure is clean: a well-organized SKILL.md provides inline examples and delegates detailed references and templates to separate files. Novelty is high because RL workflows involve complex multi-step processes (environment validation, algorithm selection, hyperparameter tuning, callback configuration) that would consume many tokens if a CLI agent built them from scratch. Minor room for improvement: the description could mention evaluation and monitoring capabilities explicitly, though current coverage is strong.

LLM Signals

Description coverage: 9
Task knowledge: 10
Structure: 9
Novelty: 9

GitHub Signals

18,073
1,635
132
71
Last commit 0 days ago

Publisher

davila7 (Skill Author)

Related Skills

ml-pipeline · Jeffallan · 6.4
sparse-autoencoder-training · zechenzhangAGI · 7.6
huggingface-accelerate · zechenzhangAGI · 7.6
moe-training · zechenzhangAGI · 7.6