developer_uid: chai_evaluation_service
submission_id: function_repel_2025-12-15
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-15T06:16:45+00:00
num_battles: 6696
num_wins: 3362
celo_rating: 1256.45
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5020908004778972
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
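The formatter templates above can be illustrated with a minimal sketch of how a prompt might be assembled from them. The template strings are copied from the `formatter` entry; the `build_prompt` helper, the assembly order (memory, prompt, chat turns, response header), and the example names and messages are hypothetical and are not taken from the evaluation service. Truncation to `max_input_tokens` is not modelled.

```python
# Template strings copied from the formatter entry in this log.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: concatenate the filled templates in order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; the '\n' stopping word then ends the reply.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these appear in the log.
example = build_prompt(
    memory="Bot is a helpful companion.",
    prompt="A casual chat.",
    turns=[("user", "Hi there"), ("bot", "Hello!")],
    bot_name="Bot",
    user_name="User",
)
print(example)
```

Because `response_template` leaves the prompt ending on the bot's name and a colon, a single-line completion cut at the first newline fits the `stopping_words: ['\n']` setting in `generation_params`.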
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.9314451217651367s
Received healthy response to inference request in 1.9175090789794922s
Received healthy response to inference request in 1.654930830001831s
Received healthy response to inference request in 2.6754260063171387s
Received healthy response to inference request in 2.4776906967163086s
Received healthy response to inference request in 3.0958070755004883s
Received healthy response to inference request in 2.3737196922302246s
Received healthy response to inference request in 1.9415557384490967s
Received healthy response to inference request in 3.130171298980713s
Received healthy response to inference request in 3.0190722942352295s
10 requests
0 failed requests
5th percentile: 1.7730910420417785
10th percentile: 1.8912512540817261
20th percentile: 1.9286579132080077
30th percentile: 1.9385225534439088
40th percentile: 2.2008541107177733
50th percentile: 2.4257051944732666
60th percentile: 2.5567848205566404
70th percentile: 2.778519892692566
80th percentile: 3.0344192504882814
90th percentile: 3.099243497848511
95th percentile: 3.1147073984146116
99th percentile: 3.1270785188674926
mean time: 2.421732783317566
Pipeline stage StressChecker completed in 25.81s
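The percentile and mean figures reported by StressChecker can be reproduced from the ten response times above. The sketch below assumes linear interpolation between order statistics (the default method of NumPy's `percentile`); the log does not state which method the pipeline uses, but this one matches the reported values. The `percentile` helper is illustrative and uses only the standard library.

```python
import math

# Response times in seconds, copied from the ten healthy responses above.
latencies = [
    1.9314451217651367,
    1.9175090789794922,
    1.654930830001831,
    2.6754260063171387,
    2.4776906967163086,
    3.0958070755004883,
    2.3737196922302246,
    1.9415557384490967,
    3.130171298980713,
    3.0190722942352295,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation.

    The fractional rank is q/100 * (n - 1); the result is interpolated
    between the two nearest sorted values.
    """
    ordered = sorted(values)
    rank = (q / 100) * (len(ordered) - 1)
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")

mean_time = sum(latencies) / len(latencies)
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles (95th, 99th) are interpolated between the two slowest requests, so they say little about tail latency beyond the observed maximum.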
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_repel_2025-12-15 status is now deployed due to DeploymentManager action
function_repel_2025-12-15 status is now inactive due to auto deactivation removed underperforming models