Monitoring training progress through visualized metrics is essential for understanding model behavior and tuning hyperparameters effectively.
Supported Visualization Tools
| Tool | Description |
|---|---|
| SwanLab ⭐ | Modern experiment tracking platform designed for AI research. Recommended. |
| WandB | Weights & Biases experiment tracking platform. |
| Console | Simple text-based logging to standard output. |
Quick Start with SwanLab
Step 1: Configure SwanLab
Set the logger backend to `swanlab` in your YAML configuration.
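A minimal sketch of what this might look like is shown below; the key names (`trainer.logger`, `project_name`, `experiment_name`) are assumptions and may differ in your framework's configuration schema:

```yaml
# Minimal sketch: enable SwanLab alongside console logging.
# The key names below are assumptions; adapt them to your framework's schema.
trainer:
  logger: ["console", "swanlab"]   # logger backends to enable
  project_name: my_rl_project      # SwanLab project that groups related runs
  experiment_name: grpo_baseline   # run name shown in the SwanLab dashboard
```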
Step 2: Start Training
Launch your training as usual:
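The command below assumes a hypothetical `train.py` entry point and the configuration file from Step 1; substitute your framework's actual launch command.

```bash
# Hypothetical entry point and config path; replace with your framework's
# actual launch command. If you have not authenticated with the SwanLab
# service yet, run `swanlab login` first.
python train.py --config configs/grpo_swanlab.yaml
```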
Step 3: View Training Curves
Automatic Tracking
Once training starts, SwanLab will automatically:
- Track key metrics (reward, success rate, loss, etc.)
- Generate real-time training curves
- Provide a web dashboard for visualization
You can access the SwanLab dashboard through the URL printed in the training logs.
Understanding Training Curves
Key Metrics to Monitor
| Metric | Description |
|---|---|
| Reward | Average reward per episode, indicating task performance |
| Success Rate | Percentage of successfully completed tasks |
| Loss | Training loss from the policy optimization algorithm |
| Response Length | Average length of model responses |
| KL Divergence | Divergence between current and reference policy |
Interpreting the Curves
A typical reward curve shows:
| Phase | Description |
|---|---|
| Initial | Reward may be low or unstable as the model explores |
| Learning | Reward gradually increases as the model learns better strategies |
| Convergence | Reward plateaus when the model reaches optimal performance |
What to Look For
- Rising trend: Indicates successful learning
- Plateaus: May indicate convergence or need for hyperparameter adjustment
- Sudden drops: Could signal instability or overfitting
Best Practices
Monitor Multiple Runs
Compare different hyperparameter settings by running multiple experiments and comparing their curves side-by-side.
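As a sketch, assuming the same hypothetical launch script and a framework that accepts command-line overrides, you might launch two runs that differ only in learning rate and distinguish them by experiment name:

```bash
# Hypothetical override syntax; adapt it to your framework's CLI.
# Two runs differing only in learning rate, distinguished by experiment name:
python train.py --config configs/grpo_swanlab.yaml \
    actor.lr=1e-6 trainer.experiment_name=grpo_lr1e-6
python train.py --config configs/grpo_swanlab.yaml \
    actor.lr=5e-6 trainer.experiment_name=grpo_lr5e-6
```

Runs that share the same project name appear together in the SwanLab dashboard, making side-by-side curve comparison straightforward.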
Set Appropriate Logging Frequency
Balance logging detail against training overhead: more frequent logging gives finer-grained curves but adds I/O cost.
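A sketch of what the relevant settings might look like follows; the parameter names are assumptions, not framework-defined keys:

```yaml
# Sketch: parameter names below are assumptions; map them to your framework.
trainer:
  log_freq: 10    # log scalar metrics every 10 training steps
  val_freq: 50    # run (and log) evaluation every 50 training steps
```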
Save Checkpoints at Key Points
Configure checkpoint saving to preserve models at peak performance:
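The keys below are assumptions; map them to whatever checkpointing options your framework exposes.

```yaml
# Sketch: checkpointing keys are assumptions, shown only to illustrate the idea.
trainer:
  save_freq: 100                              # save a checkpoint every 100 steps
  checkpoint_dir: ./checkpoints/grpo_baseline # where checkpoints are written
  save_best: true                             # also keep the best-reward checkpoint, if supported
```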