developer_uid: chai_evaluation_service
submission_id: function_banan_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T09:27:07+00:00
num_battles: 8365
num_wins: 4259
celo_rating: 1285.93
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5091452480573819
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4856176376342773s
Received healthy response to inference request in 2.5006260871887207s
Received healthy response to inference request in 1.9157440662384033s
Received healthy response to inference request in 2.970468521118164s
Received healthy response to inference request in 2.388849973678589s
Received healthy response to inference request in 2.549470901489258s
Received healthy response to inference request in 2.2625863552093506s
Received healthy response to inference request in 2.1851861476898193s
Received healthy response to inference request in 2.6152822971343994s
Received healthy response to inference request in 2.0827465057373047s
10 requests
0 failed requests
5th percentile: 1.990895164012909
10th percentile: 2.0660462617874145
20th percentile: 2.1646982192993165
30th percentile: 2.239366292953491
40th percentile: 2.3383445262908937
50th percentile: 2.437233805656433
60th percentile: 2.4916210174560547
70th percentile: 2.515279531478882
80th percentile: 2.562633180618286
90th percentile: 2.650800919532776
95th percentile: 2.81063472032547
99th percentile: 2.938501760959625
mean time: 2.395657849311829
Pipeline stage StressChecker completed in 25.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
function_banan_2025-12-14 status is now deployed due to DeploymentManager action
function_banan_2025-12-14 status is now inactive due to auto deactivation removed underperforming models