developer_uid: chai_evaluation_service
submission_id: function_nedat_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T04:47:38+00:00
num_battles: 6246
num_wins: 3099
celo_rating: 1286.59
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.49615754082612873
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.658113718032837s
Received healthy response to inference request in 3.058713912963867s
Received healthy response to inference request in 1.6191885471343994s
Received healthy response to inference request in 1.6503288745880127s
Received healthy response to inference request in 1.9686026573181152s
Received healthy response to inference request in 2.073223829269409s
Received healthy response to inference request in 1.9114797115325928s
Received healthy response to inference request in 1.6381855010986328s
Received healthy response to inference request in 2.0948758125305176s
Received healthy response to inference request in 1.9611270427703857s
10 requests
0 failed requests
5th percentile: 1.6277371764183044
10th percentile: 1.6362858057022094
20th percentile: 1.6479001998901368
30th percentile: 1.6557782649993897
40th percentile: 1.8101333141326905
50th percentile: 1.9363033771514893
60th percentile: 1.9641172885894775
70th percentile: 1.9999890089035035
80th percentile: 2.077554225921631
90th percentile: 2.191259622573852
95th percentile: 2.624986767768859
99th percentile: 2.971968483924866
mean time: 1.963383960723877
Pipeline stage StressChecker completed in 21.32s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
function_nedat_2025-12-13 status is now deployed due to DeploymentManager action
function_nedat_2025-12-13 status is now inactive due to auto deactivation removed underperforming models