developer_uid: chai_evaluation_service
submission_id: function_ririt_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T08:51:23+00:00
num_battles: 7766
num_wins: 3877
celo_rating: 1293.01
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.49922740149369044
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.756758689880371s
Received healthy response to inference request in 1.8603639602661133s
Received healthy response to inference request in 2.294097661972046s
Received healthy response to inference request in 2.261693239212036s
Received healthy response to inference request in 2.09574556350708s
Received healthy response to inference request in 1.852281093597412s
Received healthy response to inference request in 2.4652674198150635s
Received healthy response to inference request in 1.8980536460876465s
Received healthy response to inference request in 2.571341037750244s
Received healthy response to inference request in 2.8657853603363037s
10 requests
0 failed requests
5th percentile: 1.8559183835983277
10th percentile: 1.8595556735992431
20th percentile: 1.8905157089233398
30th percentile: 2.03643798828125
40th percentile: 2.1953141689300537
50th percentile: 2.277895450592041
60th percentile: 2.3625655651092528
70th percentile: 2.4970895051956177
80th percentile: 2.6302299022674562
90th percentile: 2.95488269329071
95th percentile: 3.35582069158554
99th percentile: 3.676571090221405
mean time: 2.3921387672424315
Pipeline stage StressChecker completed in 25.33s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_ririt_2025-12-16 status is now deployed due to DeploymentManager action
function_ririt_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_ririt_2025-12-16 status is now torndown due to DeploymentManager action