developer_uid: chai_evaluation_service
submission_id: function_jirol_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T10:51:15+00:00
num_battles: 8161
num_wins: 4005
celo_rating: 1286.67
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-21
win_ratio: 0.49074868275946576
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
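A minimal sketch of how the formatter templates above might be assembled into a single prompt. The templates are copied from the formatter field. The helper name build_prompt, its arguments, and the concatenation order (memory, prompt, chat turns, response header) are assumptions for illustration and are not confirmed by this record. Truncation (truncate_by_message, max_input_tokens) is not modeled.

```python
# Templates copied verbatim from the formatter field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    turns is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Example values are made up for illustration.
    print(build_prompt("You are richard.", "A casual chat.", [("user", "hi")], "richard", "Sam"))
```

Because stopping_words is ['\n'] in generation_params, a completion against a prompt of this shape would end at the first newline, which would yield a single bot message per request.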
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.870689868927002s
Received healthy response to inference request in 2.3775389194488525s
Received healthy response to inference request in 2.3826186656951904s
Received healthy response to inference request in 2.3959758281707764s
Received healthy response to inference request in 3.771854877471924s
Received healthy response to inference request in 3.205580472946167s
Received healthy response to inference request in 2.5264296531677246s
Received healthy response to inference request in 3.254168748855591s
Received healthy response to inference request in 2.060182809829712s
Received healthy response to inference request in 1.9153482913970947s
10 requests
0 failed requests
5th percentile: 1.8907861590385437
10th percentile: 1.9108824491500855
20th percentile: 2.0312159061431885
30th percentile: 2.2823320865631103
40th percentile: 2.380586767196655
50th percentile: 2.3892972469329834
60th percentile: 2.4481573581695555
70th percentile: 2.7301748991012573
80th percentile: 3.215298128128052
90th percentile: 3.305937361717224
95th percentile: 3.5388961195945736
99th percentile: 3.725263125896454
mean time: 2.5760388135910035
Pipeline stage StressChecker completed in 26.99s
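The StressChecker percentiles and mean above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between the two nearest ranks, which matches the logged values; the function name percentile is illustrative and is not taken from the pipeline's code.

```python
# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    1.870689868927002,
    2.3775389194488525,
    2.3826186656951904,
    2.3959758281707764,
    3.771854877471924,
    3.205580472946167,
    2.5264296531677246,
    3.254168748855591,
    2.060182809829712,
    1.9153482913970947,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the upper percentiles are dominated by the single slowest request (3.77 s), so they are a coarse indicator of tail latency.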
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_jirol_2025-12-18 status is now deployed due to DeploymentManager action
function_jirol_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_jirol_2025-12-18 status is now torndown due to DeploymentManager action