developer_uid: chai_evaluation_service
submission_id: function_tipem_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T20:46:35+00:00
num_battles: 6577
num_wins: 3255
celo_rating: 1256.32
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49490649232172723
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.041083574295044s
Received healthy response to inference request in 3.5124573707580566s
Received healthy response to inference request in 3.180450439453125s
Received healthy response to inference request in 3.5059609413146973s
Received healthy response to inference request in 4.520386457443237s
Received healthy response to inference request in 4.391113519668579s
Received healthy response to inference request in 3.110328197479248s
Received healthy response to inference request in 2.1348025798797607s
Received healthy response to inference request in 3.618903636932373s
Received healthy response to inference request in 5.916974782943726s
10 requests
0 failed requests
5th percentile: 2.57378910779953
10th percentile: 3.0127756357192994
20th percentile: 3.1664259910583494
30th percentile: 3.4083077907562256
40th percentile: 3.509858798980713
50th percentile: 3.565680503845215
60th percentile: 3.787775611877441
70th percentile: 4.146092557907105
80th percentile: 4.416968107223511
90th percentile: 4.660045289993286
95th percentile: 5.288510036468504
99th percentile: 5.791281833648682
mean time: 3.7932461500167847
Pipeline stage StressChecker completed in 39.22s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_tipem_2025-12-13 status is now deployed due to DeploymentManager action
function_tipem_2025-12-13 status is now inactive due to auto deactivation removed underperforming models