developer_uid: chai_evaluation_service
submission_id: function_fibir_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T20:41:22+00:00
num_battles: 10452
num_wins: 5269
celo_rating: 1256.4
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5041140451588213
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4204580783843994s
Received healthy response to inference request in 2.8966078758239746s
Received healthy response to inference request in 2.703726291656494s
Received healthy response to inference request in 3.9133777618408203s
Received healthy response to inference request in 2.2151288986206055s
Received healthy response to inference request in 2.6167335510253906s
Received healthy response to inference request in 2.9950716495513916s
Received healthy response to inference request in 3.505579948425293s
Received healthy response to inference request in 2.553589105606079s
Received healthy response to inference request in 2.4795892238616943s
10 requests
0 failed requests
5th percentile: 2.3075270295143127
10th percentile: 2.39992516040802
20th percentile: 2.467762994766235
30th percentile: 2.5313891410827636
40th percentile: 2.591475772857666
50th percentile: 2.6602299213409424
60th percentile: 2.7808789253234862
70th percentile: 2.9261470079421996
80th percentile: 3.097173309326172
90th percentile: 3.5463597297668454
95th percentile: 3.7298687458038327
99th percentile: 3.876675958633423
mean time: 2.829986238479614
Pipeline stage StressChecker completed in 30.43s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_fibir_2025-12-14 status is now deployed due to DeploymentManager action
function_fibir_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_fibir_2025-12-14 status is now torndown due to DeploymentManager action