developer_uid: chai_backend_admin
submission_id: function_ditor_2026-01-22
model_name: function_ditor_2026-01-22
model_group:
status: torndown
timestamp: 2026-01-26T02:18:17+00:00
num_battles: 10237
num_wins: 5097
celo_rating: 1303.05
family_friendly_score: 0.6564
family_friendly_standard_error: 0.006716234659390632
submission_type: function
display_name: function_ditor_2026-01-22
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-22
win_ratio: 0.4978997753248022
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': 'CUSTOM', 'prompt_template': 'CUSTOM', 'bot_template': 'CUSTOM', 'user_template': 'CUSTOM', 'response_template': 'CUSTOM', 'truncate_by_message': False}
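The derived fields above can be cross-checked from the raw counts. The sketch below recomputes win_ratio from num_wins and num_battles. It also shows that family_friendly_standard_error matches a binomial standard error if the score was estimated from 5000 samples. That sample size and the binomial formula are assumptions inferred from the numbers, not something the record states.

```python
import math

# Values copied from the metadata above.
num_battles = 10237
num_wins = 5097
family_friendly_score = 0.6564

# ASSUMPTION: the record does not give the number of scored samples;
# 5000 is inferred only because it reproduces the reported standard error.
assumed_sample_size = 5000

# Win ratio is wins divided by total battles.
win_ratio = num_wins / num_battles

# Binomial standard error of a proportion: sqrt(p * (1 - p) / n).
binomial_se = math.sqrt(
    family_friendly_score * (1.0 - family_friendly_score) / assumed_sample_size
)

print(f"win_ratio: {win_ratio}")
print(f"binomial standard error (n={assumed_sample_size}): {binomial_se}")
```

If the scorer uses a different estimator or sample count, the second figure would only agree by coincidence, so treat it as a plausibility check rather than a description of the pipeline.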
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.2420904636383057s
Received healthy response to inference request in 2.7928073406219482s
Received healthy response to inference request in 3.758976459503174s
Received healthy response to inference request in 3.2247815132141113s
Received healthy response to inference request in 5.154137134552002s
Received healthy response to inference request in 3.1984493732452393s
Received healthy response to inference request in 3.32869029045105s
Received healthy response to inference request in 2.9870855808258057s
Received healthy response to inference request in 4.397495746612549s
Received healthy response to inference request in 4.441090106964111s
10 requests
0 failed requests
5th percentile: 2.880232548713684
10th percentile: 2.96765775680542
20th percentile: 3.1561766147613524
30th percentile: 3.2168818712234497
40th percentile: 3.235166883468628
50th percentile: 3.2853903770446777
60th percentile: 3.500804758071899
70th percentile: 3.950532245635986
80th percentile: 4.406214618682862
90th percentile: 4.5123948097229
95th percentile: 4.8332659721374505
99th percentile: 5.089962902069092
mean time: 3.6525604009628294
Pipeline stage StressChecker completed in 38.13s
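The StressChecker percentiles above are consistent with linear interpolation between sorted samples (the default behavior of numpy.percentile). A minimal sketch that reproduces them from the ten logged response times follows. The helper name percentile is hypothetical, and whether the pipeline computes them this way is an assumption.

```python
# Response times (seconds) copied from the StressChecker log lines above.
latencies = [
    3.2420904636383057, 2.7928073406219482, 3.758976459503174,
    3.2247815132141113, 5.154137134552002, 3.1984493732452393,
    3.32869029045105, 2.9870855808258057, 4.397495746612549,
    4.441090106964111,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Hypothetical helper: interpolates between the two nearest ranks
    of the sorted sample.
    """
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)                      # index of the lower neighbor
    upper = min(lower + 1, len(ordered) - 1)   # index of the upper neighbor
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten requests, the upper percentiles are dominated by the single slowest response (5.15 s), so the 95th and 99th values say little about tail latency under real load.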
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_ditor_2026-01-22 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2994.23s
Shutdown handler de-registered
function_ditor_2026-01-22 status is now inactive due to auto deactivation removed underperforming models
function_ditor_2026-01-22 status is now torndown due to DeploymentManager action