developer_uid: chai_evaluation_service
submission_id: function_josik_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T13:51:21+00:00
num_battles: 8863
num_wins: 4446
celo_rating: 1294.46
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.501636014893377
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.3315837383270264s
Received healthy response to inference request in 3.1149914264678955s
Received healthy response to inference request in 2.545869827270508s
Received healthy response to inference request in 2.9626898765563965s
Received healthy response to inference request in 2.3230721950531006s
Received healthy response to inference request in 2.0565972328186035s
Received healthy response to inference request in 2.3188624382019043s
Received healthy response to inference request in 1.856903314590454s
Received healthy response to inference request in 2.074476957321167s
Received healthy response to inference request in 2.770695686340332s
10 requests
0 failed requests
5th percentile: 1.9467655777931214
10th percentile: 2.0366278409957888
20th percentile: 2.0709010124206544
30th percentile: 2.245546793937683
40th percentile: 2.321388292312622
50th percentile: 2.3273279666900635
60th percentile: 2.417298173904419
70th percentile: 2.613317584991455
80th percentile: 2.809094524383545
90th percentile: 2.9779200315475465
95th percentile: 3.0464557290077208
99th percentile: 3.1012842869758606
mean time: 2.435574269294739
Pipeline stage StressChecker completed in 26.09s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_josik_2025-12-16 status is now deployed due to DeploymentManager action
function_josik_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_josik_2025-12-16 status is now torndown due to DeploymentManager action