developer_uid: chai_evaluation_service
submission_id: function_gifit_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T16:31:14+00:00
num_battles: 5398
num_wins: 2738
celo_rating: 1303.31
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5072248981104113
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.977449893951416s
Received healthy response to inference request in 1.9959933757781982s
Received healthy response to inference request in 2.761265516281128s
Received healthy response to inference request in 2.018561363220215s
Received healthy response to inference request in 2.4246082305908203s
Received healthy response to inference request in 2.1443474292755127s
Received healthy response to inference request in 2.756155252456665s
Received healthy response to inference request in 2.069897413253784s
Received healthy response to inference request in 2.6022329330444336s
Received healthy response to inference request in 2.7800252437591553s
10 requests
0 failed requests
5th percentile: 1.985794460773468
10th percentile: 1.99413902759552
20th percentile: 2.0140477657318114
30th percentile: 2.0544965982437136
40th percentile: 2.114567422866821
50th percentile: 2.2844778299331665
60th percentile: 2.4956581115722654
70th percentile: 2.648409628868103
80th percentile: 2.757177305221558
90th percentile: 2.7631414890289308
95th percentile: 2.771583366394043
99th percentile: 2.7783368682861327
mean time: 2.353053665161133
Pipeline stage StressChecker completed in 24.88s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_gifit_2025-12-14 status is now deployed due to DeploymentManager action
function_gifit_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_gifit_2025-12-14 status is now torndown due to DeploymentManager action