developer_uid: chai_evaluation_service
submission_id: function_tajef_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T17:21:16+00:00
num_battles: 8208
num_wins: 4051
celo_rating: 1288.81
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.4935428849902534
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1632089614868164s
Received healthy response to inference request in 3.13582444190979s
Received healthy response to inference request in 2.5969603061676025s
Received healthy response to inference request in 1.7448265552520752s
Received healthy response to inference request in 2.1729907989501953s
Received healthy response to inference request in 2.894066333770752s
Received healthy response to inference request in 2.4323248863220215s
Received healthy response to inference request in 2.5715856552124023s
Received healthy response to inference request in 3.2005972862243652s
Received healthy response to inference request in 2.5567941665649414s
10 requests
0 failed requests
5th percentile: 1.9375004649162293
10th percentile: 2.1301743745803834
20th percentile: 2.3804580688476564
30th percentile: 2.5194533824920655
40th percentile: 2.565669059753418
50th percentile: 2.5842729806900024
60th percentile: 2.715802717208862
70th percentile: 2.9665937662124633
80th percentile: 3.1413013458251955
90th percentile: 3.1669477939605715
95th percentile: 3.1837725400924684
99th percentile: 3.197232336997986
mean time: 2.646917939186096
Pipeline stage StressChecker completed in 28.51s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.99s
Shutdown handler de-registered
function_tajef_2025-12-18 status is now deployed due to DeploymentManager action
function_tajef_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_tajef_2025-12-18 status is now torndown due to DeploymentManager action