Files
CompanionGuard-RL/requirements.txt
wangyu 7d4345c29d feat: initial CompanionGuard-RL framework
Two-module pipeline for AI companion safety:
- Module B: context-aware risk detector with CrossAttention fusion
- Module C: PPO-based adaptive intervention policy

Includes CompanionRisk Taxonomy (10 primary + 14 fine-grained labels),
dataset generation/annotation pipeline, training scripts, and eval suite.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-09 17:21:11 +08:00

36 lines
440 B
Plaintext

torch>=2.0.0
transformers>=4.40.0
peft>=0.10.0
accelerate>=0.27.0
datasets>=2.18.0
tokenizers>=0.15.0
# RL
gymnasium>=0.29.0
stable-baselines3>=2.2.0
# LLM API
openai>=1.20.0
anthropic>=0.25.0
dashscope>=1.18.0
# Experiment tracking
wandb>=0.16.0
# Data processing
pandas>=2.0.0
numpy>=1.24.0
scikit-learn>=1.3.0
tqdm>=4.66.0
# Evaluation
scipy>=1.11.0
# Config
pyyaml>=6.0
omegaconf>=2.3.0
# Utilities
jsonlines>=4.0.0
rich>=13.0.0