Research
May 2, 20267 min readJeTech Lab

Agent Model Version Changelog: PPO iTransformer v1

Runtime preset and registry gate notes for the first JeTech iTransformer v1 candidate model family.

Agent Model
iTransformer
PPO
Registry Gate

Summary

ppo_itransformer_v1 is the first iTransformer runtime preset used to produce JeTech agent registry candidates. The run name intentionally carries only the minimum identity fields. Full training parameters and gate results are stored in the run directory config.json, W&B config, and uploaded model sidecar metadata.

Run identity

FieldValue
Canonical run nameppo_itransformer_v1_{symbol}_{YYYYMMDD_HHMMSS}
AlgorithmPPO, Stable-Baselines3
BackboneiTransformer feature extractor
Versionv1
Current batch symbolsBTC, ETH, SOL, XRP, DOGE
Registry artifact scopeOnly gate-passing models are uploaded as S3 registry candidates

Runtime preset

ParameterValue
window_size128
encoder_hidden_size64
encoder_layers2
encoder_dropout0.1
transformer_num_heads4
transformer_ffn_dim128
features_dim64
policy_networkmlp
value_networkmlp
policy_hidden_dims[64, 64]
value_hidden_dims[64, 64]

Training defaults

FieldValue
Synthetic reference period5y
Reference interval1d
Episode length1000
GAN innovation blend0.75
Target rollout batch1024
Eval frequencySame as rollout batch
Total timesteps102400
Current batch scripttrading/agents/scripts/train_crypto_batch.sh
Batch repeat count30 runs per symbol

n_envs and n_steps are resolved automatically from the runtime CPU count. The current batch logs show n_envs=4, n_steps=256, and rollout_batch=1024.

Risk and action controls

ParameterValue
target_exposure_deadband0.05
rebalance_deadband0.05
cooldown_bars2
drawdown_stop0.40
reward_turnover_penalty0.0005
reward_downside_penalty0.05
reward_running_drawdown_penalty0.005
reward_drawdown_increment_penalty0.50

Real evaluation and gate

Real evaluation uses recent 5y, 1d data during training. The system still records returns, MDD, and Sharpe across 20 three-month diagnostic windows, but registry promotion is decided by one continuous metric.

Gate fieldValue
Gate mode5y_30d_avg_sharpe
Pass condition5Y 30d Avg Sharpe >= 1.0
Rolling window30 daily returns
Annualization365 periods/year
Diagnostic windows20 x 3M, report-only

For passing models, the uploaded .meta.json includes gate.backtest_5y_rolling_sharpe_avg_30d, gate.min_backtest_5y_rolling_sharpe_avg_30d, and gate.research_metrics. JeTech Lab reads that sidecar metadata to show the candidate model table's 5Y 30d Avg Sharpe column.

W&B logging

W&B receives real-evaluation metrics rather than the full training scalar set.

  • real_eval/num_timesteps
  • real_eval/backtest_5y_rolling_sharpe_avg_30d
  • real_eval/backtest_5y_return_count
  • real_eval/mean_strategy_window_return
  • real_eval/mean_market_window_return
  • real_eval/mean_strategy_window_mdd
  • real_eval/mean_market_window_mdd
  • real_eval/mean_strategy_window_sharpe
  • real_eval/sharpe_window_count
  • real_eval/sharpe_window_ratio

Change log

DateChange
2026-05-02Documented the ppo_itransformer_v1 batch preset
2026-05-02Switched the registry gate from three-month window count to 5Y 30d Avg Sharpe >= 0.5
2026-05-04Raised the registry gate pass threshold to 5Y 30d Avg Sharpe >= 1.0
2026-05-02Standardized candidate table display around 5Y 30d Avg Sharpe