developer_uid: chai_evaluation_service
submission_id: function_pinum_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T20:21:16+00:00
num_battles: 11098
num_wins: 5508
celo_rating: 1256.4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.49630564065597405
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
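As a consistency check on the header fields above, win_ratio appears to be num_wins divided by num_battles. The sketch below assumes that definition, which the record itself does not state, and reuses the field names from the header as variable names.

```python
# Values copied from the metadata header above.
num_battles = 11098
num_wins = 5508

# Assumption: win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

print(f"win_ratio = {win_ratio:.6f}")
```

A ratio just under 0.5 means the model lost slightly more battles than it won.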
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 7.664183139801025s
Received healthy response to inference request in 3.991219997406006s
Received healthy response to inference request in 5.792365789413452s
Received healthy response to inference request in 3.27521014213562s
Received healthy response to inference request in 7.536757946014404s
Received healthy response to inference request in 6.582979440689087s
Received healthy response to inference request in 2.6097419261932373s
Received healthy response to inference request in 9.616914510726929s
Received healthy response to inference request in 9.931254386901855s
Received healthy response to inference request in 2.912679672241211s
10 requests
0 failed requests
5th percentile: 2.7460639119148254
10th percentile: 2.8823858976364134
20th percentile: 3.2027040481567384
30th percentile: 3.77641704082489
40th percentile: 5.071907472610474
50th percentile: 6.1876726150512695
60th percentile: 6.964490842819213
70th percentile: 7.574985504150391
80th percentile: 8.054729413986207
90th percentile: 9.648348498344422
95th percentile: 9.789801442623139
99th percentile: 9.902963798046112
mean time: 5.991330695152283
Pipeline stage StressChecker completed in 61.38s
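The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below assumes percentiles are taken by linear interpolation between sorted samples. The log does not say how they were computed, but the reported figures are consistent with that method. The helper name `percentile` is hypothetical.

```python
import math

# Response times in seconds, copied from the ten "healthy response" lines above.
latencies = [
    7.664183139801025, 3.991219997406006, 5.792365789413452,
    3.27521014213562, 7.536757946014404, 6.582979440689087,
    2.6097419261932373, 9.616914510726929, 9.931254386901855,
    2.912679672241211,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) by linear interpolation.

    Assumption: this matches the method used by the pipeline; the log
    only reports the resulting numbers.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are dominated by the two slowest requests, so the 95th and 99th values are rough indications rather than stable estimates.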
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_pinum_2025-12-14 status is now deployed due to DeploymentManager action
function_pinum_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_pinum_2025-12-14 status is now torndown due to DeploymentManager action