developer_uid: chai_evaluation_service
submission_id: function_raran_2025-12-17
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-20T10:51:19+00:00
num_battles: 8684
num_wins: 4330
celo_rating: 1292.25
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4986181483187471
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.015116930007935s
Received healthy response to inference request in 3.9763760566711426s
Received healthy response to inference request in 3.4438657760620117s
Received healthy response to inference request in 5.330164909362793s
Received healthy response to inference request in 1.7767024040222168s
Received healthy response to inference request in 5.556697368621826s
Received healthy response to inference request in 3.370649814605713s
Received healthy response to inference request in 2.0732510089874268s
Received healthy response to inference request in 2.102588653564453s
Received healthy response to inference request in 5.256750106811523s
10 requests
0 failed requests
5th percentile: 1.9101492762565613
10th percentile: 2.0435961484909058
20th percentile: 2.096721124649048
30th percentile: 2.9902314662933347
40th percentile: 3.414579391479492
50th percentile: 3.710120916366577
60th percentile: 3.9918724060058595
70th percentile: 4.387606883049011
80th percentile: 5.2714330673217775
90th percentile: 5.352818155288697
95th percentile: 5.454757761955261
99th percentile: 5.536309447288513
mean time: 3.690216302871704
Pipeline stage StressChecker completed in 38.18s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_raran_2025-12-17 status is now deployed due to DeploymentManager action
function_raran_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_raran_2025-12-17 status is now torndown due to DeploymentManager action