developer_uid: chai_backend_admin
submission_id: function_depak_2025-12-03
model_name: function_depak_2025-12-03
model_group:
status: protected
timestamp: 2025-12-03T00:58:42+00:00
num_battles: 3994
num_wins: 2101
celo_rating: 1311.75
family_friendly_score: 0.5436000000000001
family_friendly_standard_error: 0.007044132877792696
submission_type: function
display_name: function_depak_2025-12-03
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-02
win_ratio: 0.5260390585878818
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8953542709350586s
Received healthy response to inference request in 2.0255637168884277s
Received healthy response to inference request in 2.5025527477264404s
Received healthy response to inference request in 2.325140953063965s
Received healthy response to inference request in 1.7791988849639893s
Received healthy response to inference request in 2.3030452728271484s
Received healthy response to inference request in 5.529642581939697s
Received healthy response to inference request in 2.368825912475586s
Received healthy response to inference request in 1.9364237785339355s
Received healthy response to inference request in 2.352520227432251s
10 requests
0 failed requests
5th percentile: 1.8314688086509705
10th percentile: 1.8837387323379517
20th percentile: 1.9282098770141602
30th percentile: 1.99882173538208
40th percentile: 2.19205265045166
50th percentile: 2.3140931129455566
60th percentile: 2.3360926628112795
70th percentile: 2.3574119329452516
80th percentile: 2.3955712795257567
90th percentile: 2.8052617311477652
95th percentile: 4.167452156543728
99th percentile: 5.2572044968605045
mean time: 2.50182683467865
Pipeline stage StressChecker completed in 26.59s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_depak_2025-12-03 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2729.22s
Shutdown handler de-registered