CompanionGuard-RL/configs/data_generation.yaml
wangyu 7d4345c29d feat: initial CompanionGuard-RL framework
Two-module pipeline for AI companion safety:
- Module B: context-aware risk detector with CrossAttention fusion
- Module C: PPO-based adaptive intervention policy

Includes CompanionRisk Taxonomy (10 primary + 14 fine-grained labels),
dataset generation/annotation pipeline, training scripts, and eval suite.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-09 17:21:11 +08:00

23 lines
398 B
YAML

api:
  type: "qwen" # "qwen" or "openai"
  model: "qwen-max"
generation:
  total_samples: 3000
  samples_per_category: 300
  delay: 0.5 # seconds between API calls
output:
  raw_dir: "data/raw"
  output_file: "data/raw/generated.jsonl"
annotation:
  judge_model: "qwen-max"
  output_file: "data/processed/annotated.jsonl"
split:
  train: 0.8
  val: 0.1
  test: 0.1
seed: 42
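
The split ratios and seed above suggest a deterministic train/val/test partition of the annotated samples. The following is a minimal sketch of how such a split could be applied, not the repository's actual implementation: the function name `split_records` is hypothetical, the ratio and seed values are copied from this config, and the way the project's own scripts consume them is not shown here.

```python
import random

# Values copied from the config above. How the real pipeline reads them
# is an assumption; this sketch hard-codes them to stay self-contained.
SPLIT = {"train": 0.8, "val": 0.1, "test": 0.1}
SEED = 42


def split_records(records, ratios=SPLIT, seed=SEED):
    """Shuffle records with a fixed seed, then cut into train/val/test."""
    if abs(sum(ratios.values()) - 1.0) > 1e-9:
        raise ValueError("split ratios must sum to 1.0")

    shuffled = list(records)
    # A private Random instance keeps the shuffle reproducible without
    # touching the global random state.
    random.Random(seed).shuffle(shuffled)

    n = len(shuffled)
    n_train = round(n * ratios["train"])
    n_val = round(n * ratios["val"])
    return {
        "train": shuffled[:n_train],
        "val": shuffled[n_train:n_train + n_val],
        # The test split takes the remainder so no record is dropped.
        "test": shuffled[n_train + n_val:],
    }


if __name__ == "__main__":
    parts = split_records(range(3000))
    print({name: len(items) for name, items in parts.items()})
```

With `total_samples: 3000`, this yields 2400 training, 300 validation, and 300 test records, and the same seed always reproduces the same partition.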