developer_uid: chai_evaluation_service
submission_id: function_nasom_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T11:51:21+00:00
num_battles: 9808
num_wins: 4971
celo_rating: 1298.06
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.5068311582381729
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2875192165374756s
Received healthy response to inference request in 2.2769265174865723s
Received healthy response to inference request in 1.862607717514038s
Received healthy response to inference request in 1.8415820598602295s
Received healthy response to inference request in 1.920435905456543s
Received healthy response to inference request in 2.3839216232299805s
Received healthy response to inference request in 1.7418715953826904s
Received healthy response to inference request in 1.9535043239593506s
Received healthy response to inference request in 2.974924325942993s
Received healthy response to inference request in 2.643195390701294s
10 requests
0 failed requests
5th percentile: 1.786741304397583
10th percentile: 1.8316110134124757
20th percentile: 1.8584025859832765
30th percentile: 1.9030874490737915
40th percentile: 1.9402769565582276
50th percentile: 2.1152154207229614
60th percentile: 2.2811635971069335
70th percentile: 2.3164399385452272
80th percentile: 2.435776376724243
90th percentile: 2.6763682842254637
95th percentile: 2.825646305084228
99th percentile: 2.94506872177124
mean time: 2.1886488676071165
Pipeline stage StressChecker completed in 23.26s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
function_nasom_2025-12-16 status is now deployed due to DeploymentManager action
function_nasom_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_nasom_2025-12-16 status is now torndown due to DeploymentManager action