developer_uid: chai_backend_admin
submission_id: function_runis_2026-02-03
model_name: function_runis_2026-02-03
model_group:
status: torndown
timestamp: 2026-02-06T18:01:39+00:00
num_battles: 10706
num_wins: 5345
celo_rating: 1302.88
family_friendly_score: 0.5674
family_friendly_standard_error: 0.007006528955196004
submission_type: function
display_name: function_runis_2026-02-03
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-02-03
win_ratio: 0.49925275546422565
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': 'CUSTOM', 'prompt_template': 'CUSTOM', 'bot_template': 'CUSTOM', 'user_template': 'CUSTOM', 'response_template': 'CUSTOM', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9867465496063232s
Received healthy response to inference request in 1.8533151149749756s
Received healthy response to inference request in 1.9420719146728516s
Received healthy response to inference request in 1.241844892501831s
Received healthy response to inference request in 1.5361061096191406s
Received healthy response to inference request in 1.7335364818572998s
Received healthy response to inference request in 1.4959297180175781s
Received healthy response to inference request in 4.073848009109497s
Received healthy response to inference request in 1.3242404460906982s
Received healthy response to inference request in 1.5541353225708008s
10 requests
0 failed requests
5th percentile: 1.2789228916168214
10th percentile: 1.3160008907318115
20th percentile: 1.461591863632202
30th percentile: 1.524053192138672
40th percentile: 1.5469236373901367
50th percentile: 1.6438359022140503
60th percentile: 1.78144793510437
70th percentile: 1.8799421548843385
80th percentile: 1.9510068416595459
90th percentile: 2.19545669555664
95th percentile: 3.1346523523330667
99th percentile: 3.886008877754212
mean time: 1.8741774559020996
Pipeline stage StressChecker completed in 19.97s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_runis_2026-02-03 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2002.94s
Shutdown handler de-registered
function_runis_2026-02-03 status is now inactive due to auto deactivation removed underperforming models
function_runis_2026-02-03 status is now torndown due to DeploymentManager action