developer_uid: chai_evaluation_service
submission_id: function_dalir_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T10:17:22+00:00
num_battles: 10294
num_wins: 5189
celo_rating: 1256.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5040800466291043
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8234596252441406s
Received healthy response to inference request in 2.3478689193725586s
Received healthy response to inference request in 2.720689535140991s
Received healthy response to inference request in 1.6770708560943604s
Received healthy response to inference request in 2.220637798309326s
Received healthy response to inference request in 3.2656795978546143s
Received healthy response to inference request in 2.8576462268829346s
Received healthy response to inference request in 2.6322827339172363s
Received healthy response to inference request in 2.6755363941192627s
Received healthy response to inference request in 3.1960020065307617s
10 requests
0 failed requests
5th percentile: 1.7429458022117614
10th percentile: 1.8088207483291625
20th percentile: 2.1412021636962892
30th percentile: 2.3096995830535887
40th percentile: 2.5185172080993654
50th percentile: 2.6539095640182495
60th percentile: 2.693597650527954
70th percentile: 2.7617765426635743
80th percentile: 2.9253173828125
90th percentile: 3.202969765663147
95th percentile: 3.2343246817588804
99th percentile: 3.2594086146354675
mean time: 2.5416873693466187
Pipeline stage StressChecker completed in 27.04s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.80s
Shutdown handler de-registered
function_dalir_2025-12-13 status is now deployed due to DeploymentManager action
function_dalir_2025-12-13 status is now inactive due to auto deactivation removed underperforming models