developer_uid: chai_evaluation_service
submission_id: function_reson_2025-12-15
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-15T06:26:56+00:00
num_battles: 7221
num_wins: 3552
celo_rating: 1256.46
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.49189862899875364
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4210855960845947s
Received healthy response to inference request in 1.9124505519866943s
Received healthy response to inference request in 3.288682222366333s
Received healthy response to inference request in 4.343205451965332s
Received healthy response to inference request in 4.107394218444824s
Received healthy response to inference request in 3.6671142578125s
Received healthy response to inference request in 1.9795572757720947s
Received healthy response to inference request in 5.065374374389648s
Received healthy response to inference request in 2.3905816078186035s
Received healthy response to inference request in 3.3294196128845215s
10 requests
0 failed requests
5th percentile: 1.9426485776901246
10th percentile: 1.9728466033935548
20th percentile: 2.308376741409302
30th percentile: 3.019252038002014
40th percentile: 3.313124656677246
50th percentile: 3.375252604484558
60th percentile: 3.5194970607757567
70th percentile: 3.7991982460021974
80th percentile: 4.154556465148926
90th percentile: 4.415422344207763
95th percentile: 4.740398359298705
99th percentile: 5.00037917137146
mean time: 3.3504865169525146
Pipeline stage StressChecker completed in 35.01s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_reson_2025-12-15 status is now deployed due to DeploymentManager action
function_reson_2025-12-15 status is now inactive due to auto deactivation removed underperforming models