developer_uid: chai_evaluation_service
submission_id: function_jupis_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T18:36:21+00:00
num_battles: 5397
num_wins: 2668
celo_rating: 1301.67
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49434871224754495
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8893773555755615s
Received healthy response to inference request in 2.779503345489502s
Received healthy response to inference request in 4.582485914230347s
Received healthy response to inference request in 2.5096564292907715s
Received healthy response to inference request in 2.074989080429077s
Received healthy response to inference request in 5.038901090621948s
Received healthy response to inference request in 4.380368232727051s
Received healthy response to inference request in 3.1314501762390137s
Received healthy response to inference request in 2.875675678253174s
Received healthy response to inference request in 3.1420023441314697s
10 requests
0 failed requests
5th percentile: 2.2705893874168397
10th percentile: 2.4661896944046022
20th percentile: 2.725533962249756
30th percentile: 2.846823978424072
40th percentile: 2.8838966846466065
50th percentile: 3.0104137659072876
60th percentile: 3.135671043395996
70th percentile: 3.5135121107101437
80th percentile: 4.42079176902771
90th percentile: 4.6281274318695065
95th percentile: 4.833514261245727
99th percentile: 4.997823724746704
mean time: 3.3404409646987916
Pipeline stage StressChecker completed in 34.80s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_jupis_2025-12-13 status is now deployed due to DeploymentManager action
function_jupis_2025-12-13 status is now inactive due to auto deactivation removed underperforming models