developer_uid: chai_evaluation_service
submission_id: function_bafam_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T07:16:57+00:00
num_battles: 9160
num_wins: 4619
celo_rating: 1293.57
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.5042576419213973
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4648964405059814s
Received healthy response to inference request in 4.111429214477539s
Received healthy response to inference request in 2.73244571685791s
Received healthy response to inference request in 2.9427735805511475s
Received healthy response to inference request in 3.513641357421875s
Failed to get response for submission chaiml-llama31-mer-v2-t_44570_v4: ('http://guanaco-model-mesh-load-balancer.model-mesh.k2.chaiverse.com/models/chaiml-7b07-69d4-linear-w01_v7/predict', '{"detail":"1 validation error for RuntimeResponse\\npredictions\\n Field required [type=missing, input_value={\'detail\': \'[Errno 104] Connection reset by peer\'}, input_type=dict]\\n For further information visit https://errors.pydantic.dev/2.11/v/missing"}')
Received healthy response to inference request in 3.070011615753174s
Received healthy response to inference request in 3.1241307258605957s
Received healthy response to inference request in 3.1820507049560547s
Received healthy response to inference request in 3.2593777179718018s
Received healthy response to inference request in 2.933804512023926s
10 requests
0 failed requests
5th percentile: 2.823057174682617
10th percentile: 2.913668632507324
20th percentile: 2.940979766845703
30th percentile: 3.031840205192566
40th percentile: 3.1024830818176268
50th percentile: 3.153090715408325
60th percentile: 3.2129815101623533
70th percentile: 3.3210333347320558
80th percentile: 3.47464542388916
90th percentile: 3.573420143127441
95th percentile: 3.8424246788024896
99th percentile: 4.057628307342529
mean time: 3.2334561586380004
Pipeline stage StressChecker completed in 34.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_bafam_2025-12-13 status is now deployed due to DeploymentManager action
function_bafam_2025-12-13 status is now inactive due to auto deactivation removed underperforming models