developer_uid: chai_backend_admin
submission_id: function_fudut_2025-12-02
model_name: function_fudut_2025-12-02
model_group:
status: torndown
timestamp: 2025-12-12T18:28:58+00:00
num_battles: 8113
num_wins: 4315
celo_rating: 1316.7
family_friendly_score: 0.5426
family_friendly_standard_error: 0.007045356484948083
submission_type: function
display_name: function_fudut_2025-12-02
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-02
win_ratio: 0.5318624429927277
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
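The formatter entry above defines string templates but does not say how they are combined. The following is a minimal sketch of one plausible assembly, assuming the memory block comes first, then the prompt block, then the chat turns, and finally the response header. The function `build_prompt`, its arguments, and that ordering are hypothetical and are not taken from the record; only the template strings are.

```python
# Template strings copied from the formatter entry in the record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with the bot name so the model continues as the bot.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="You are Ava.",
    prompt="A friendly chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` in the generation parameters, a prompt that ends in `{bot_name}:` would yield a single-line reply, which is consistent with the one-line-per-message templates.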
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0703306198120117s
Received healthy response to inference request in 1.913755178451538s
Received healthy response to inference request in 1.9361422061920166s
Received healthy response to inference request in 2.0647623538970947s
Received healthy response to inference request in 2.181915760040283s
Received healthy response to inference request in 2.087214946746826s
Received healthy response to inference request in 2.013807535171509s
Received healthy response to inference request in 1.779144048690796s
Received healthy response to inference request in 2.0893445014953613s
Received healthy response to inference request in 1.793278694152832s
10 requests
0 failed requests
5th percentile: 1.7855046391487122
10th percentile: 1.7918652296066284
20th percentile: 1.8896598815917969
30th percentile: 1.929426097869873
40th percentile: 1.982741403579712
50th percentile: 2.0392849445343018
60th percentile: 2.0669896602630615
70th percentile: 2.075395917892456
80th percentile: 2.0876408576965333
90th percentile: 2.0986016273498533
95th percentile: 2.1402586936950683
99th percentile: 2.17358434677124
mean time: 1.9929695844650268
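The percentile and mean figures above can be reproduced from the ten logged response times. The sketch below assumes percentiles are computed by linear interpolation between the closest ranks of the sorted samples; the log does not state the method, and the `percentile` helper is hypothetical, but this assumption matches the reported values.

```python
# Response times in seconds, copied from the StressChecker log lines.
samples = [
    2.0703306198120117, 1.913755178451538, 1.9361422061920166,
    2.0647623538970947, 2.181915760040283, 2.087214946746826,
    2.013807535171509, 1.779144048690796, 2.0893445014953613,
    1.793278694152832,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile by linear interpolation.

    The fractional rank is (n - 1) * q / 100; the result is interpolated
    between the two neighbouring sorted values.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 99):
    print(f"{q}th percentile: {percentile(samples, q)}")
print(f"mean time: {sum(samples) / len(samples)}")
```

With only 10 requests, the high percentiles are interpolated almost entirely from the two slowest samples, so the 95th and 99th values are a rough indication rather than a stable tail estimate.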
Pipeline stage StressChecker completed in 22.81s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_fudut_2025-12-02 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3384.38s
Shutdown handler de-registered
function_fudut_2025-12-02 status is now torndown due to DeploymentManager action