developer_uid: chai_evaluation_service
submission_id: function_dabob_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T06:51:17+00:00
num_battles: 8280
num_wins: 4095
celo_rating: 1289.73
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4945652173913043
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
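For orientation, here is a minimal sketch of how the formatter templates above might be combined into a single model input. The assembly order (memory, prompt, chat turns, then the response header), the helper name `build_prompt`, and the sample names and messages are assumptions for illustration only. The log does not show the service's actual assembly code, and truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter line above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order."""
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    # Each turn is a (speaker, message) pair, with speaker "bot" or "user".
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    formatter,
    memory="M",
    prompt="P",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Bot",
    user_name="User",
)
print(example)
```

Since `stopping_words` is `['\n']`, generation would stop at the first newline, which fits a layout where each message occupies one line.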
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8056387901306152s
Received healthy response to inference request in 2.5942623615264893s
Received healthy response to inference request in 2.2950313091278076s
Received healthy response to inference request in 1.6944901943206787s
Received healthy response to inference request in 2.649509906768799s
Received healthy response to inference request in 3.4009921550750732s
Received healthy response to inference request in 3.034317970275879s
Received healthy response to inference request in 2.677769422531128s
Received healthy response to inference request in 2.4751646518707275s
Received healthy response to inference request in 2.5314290523529053s
10 requests
0 failed requests
5th percentile: 1.7445070624351502
10th percentile: 1.7945239305496217
20th percentile: 2.1971528053283693
30th percentile: 2.4211246490478513
40th percentile: 2.508923292160034
50th percentile: 2.5628457069396973
60th percentile: 2.616361379623413
70th percentile: 2.6579877614974974
80th percentile: 2.749079132080078
90th percentile: 3.070985388755798
95th percentile: 3.2359887719154354
99th percentile: 3.367991478443146
mean time: 2.51586058139801
Pipeline stage StressChecker completed in 26.76s
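The percentile and mean figures reported by StressChecker can be reproduced from the ten latencies logged above. The sketch below assumes linear interpolation between sorted samples, which matches the logged values. Whether the service computes them this way is not stated in the log, and the `percentile` helper is a hypothetical name.

```python
import math

# Latencies in seconds, copied from the ten "healthy response" lines above.
latencies = [
    1.8056387901306152,
    2.5942623615264893,
    2.2950313091278076,
    1.6944901943206787,
    2.649509906768799,
    3.4009921550750732,
    3.034317970275879,
    2.677769422531128,
    2.4751646518707275,
    2.5314290523529053,
]


def percentile(values, q):
    """Percentile with linear interpolation between sorted samples (q in 0..100)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lo = math.floor(rank)
    hi = math.ceil(rank)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are interpolated between the two slowest requests, so the 95th and 99th percentile figures say little about tail latency.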
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_dabob_2025-12-18 status is now deployed due to DeploymentManager action
function_dabob_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_dabob_2025-12-18 status is now torndown due to DeploymentManager action