developer_uid: chai_evaluation_service
submission_id: function_gojam_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T18:51:15+00:00
num_battles: 10761
num_wins: 5422
celo_rating: 1295.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.5038565189108819
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.0582337379455566s
Received healthy response to inference request in 3.0532724857330322s
Received healthy response to inference request in 2.0225956439971924s
Received healthy response to inference request in 2.6805436611175537s
Received healthy response to inference request in 3.2405495643615723s
Received healthy response to inference request in 3.939683675765991s
Received healthy response to inference request in 3.1019067764282227s
Received healthy response to inference request in 3.8911051750183105s
Received healthy response to inference request in 1.7633869647979736s
Received healthy response to inference request in 2.664269208908081s
10 requests
0 failed requests
5th percentile: 1.880030870437622
10th percentile: 1.9966747760772705
20th percentile: 2.535934495925903
30th percentile: 2.675661325454712
40th percentile: 2.904180955886841
50th percentile: 3.0557531118392944
60th percentile: 3.075702953338623
70th percentile: 3.1434996128082275
80th percentile: 3.37066068649292
90th percentile: 3.895963025093079
95th percentile: 3.917823350429535
99th percentile: 3.9353116106987
mean time: 2.9415546894073485
Pipeline stage StressChecker completed in 30.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_gojam_2025-12-17 status is now deployed due to DeploymentManager action
function_gojam_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_gojam_2025-12-17 status is now torndown due to DeploymentManager action