developer_uid: chai_backend_admin
submission_id: function_gijuk_2025-07-15
model_name: function_gijuk_2025-07-15
model_group:
status: inactive
timestamp: 2025-07-15T21:59:56+00:00
num_battles: 2107
num_wins: 1114
celo_rating: 1295.84
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_gijuk_2025-07-15
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-15
win_ratio: 0.5287138111058377
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
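The formatter entry above can be read as a set of string templates that are filled in and concatenated into one model prompt. The sketch below shows one plausible assembly. The ordering (memory, prompt, chat turns, response header), the `build_prompt` helper, and the sample names are assumptions for illustration, not the pipeline's confirmed behaviour. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the filled-in templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the '\n' stopping word.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("Be kind.", "A chat.", [("user", "Hi")], "Bot", "Ann"))
```

Because the only stopping word is `'\n'`, each generation under this layout would yield a single bot line of at most 64 output tokens.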
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.047364950180054s
Received healthy response to inference request in 3.0756685733795166s
Received healthy response to inference request in 2.802739381790161s
Received healthy response to inference request in 2.828052520751953s
5 requests
1 failed requests
5th percentile: 2.8078020095825194
10th percentile: 2.812864637374878
20th percentile: 2.822989892959595
30th percentile: 2.877575731277466
40th percentile: 2.976622152328491
50th percentile: 3.0756685733795166
60th percentile: 3.4643471240997314
70th percentile: 3.8530256748199463
80th percentile: 7.262033367156985
90th percentile: 13.691370201110841
95th percentile: 16.906038618087766
99th percentile: 19.47777335166931
mean time: 6.574906492233277
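The summary above lists 5 requests but only four healthy latencies. The figures are consistent with the timed-out request being counted as a fifth sample of roughly 20.12 s, just over the 20 s read timeout. The sketch below back-solves that value from the reported mean and checks it against the reported 99th percentile. That the checker includes the failed request this way is an inference from the numbers, not something the log states.

```python
# Healthy latencies reported in the first StressChecker batch (seconds).
healthy = [
    4.047364950180054,
    3.0756685733795166,
    2.802739381790161,
    2.828052520751953,
]
reported_mean = 6.574906492233277
reported_p99 = 19.47777335166931
n = 5  # "5 requests" in the summary

# If the mean covers all 5 requests, the missing sample follows directly.
missing = reported_mean * n - sum(healthy)

# Cross-check with the 99th percentile under linear interpolation:
# rank = 0.99 * (n - 1) = 3.96, i.e. 96% of the way from the 4th to the
# 5th sorted value.
ordered = sorted(healthy + [missing])
p99 = ordered[3] + 0.96 * (ordered[4] - ordered[3])

if __name__ == "__main__":
    print(f"inferred latency of failed request: {missing:.4f}s")
    print(f"recomputed p99: {p99:.6f}s (reported {reported_p99:.6f}s)")
```

The single slow sample accounts for the jump between the 70th and 80th percentiles and pulls the mean well above the median.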
%s, retrying in %s seconds...
Received healthy response to inference request in 2.959808588027954s
Received healthy response to inference request in 2.8291587829589844s
Received healthy response to inference request in 3.379267454147339s
Received healthy response to inference request in 2.5864367485046387s
Received healthy response to inference request in 2.302277088165283s
5 requests
0 failed requests
5th percentile: 2.3591090202331544
10th percentile: 2.4159409523010256
20th percentile: 2.5296048164367675
30th percentile: 2.634981155395508
40th percentile: 2.732069969177246
50th percentile: 2.8291587829589844
60th percentile: 2.8814187049865723
70th percentile: 2.93367862701416
80th percentile: 3.043700361251831
90th percentile: 3.211483907699585
95th percentile: 3.295375680923462
99th percentile: 3.3624890995025636
mean time: 2.8113897323608397
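The percentile and mean figures in the summary above can be reproduced from the five healthy latencies with linear interpolation between sorted samples. The `percentile` helper below is a minimal illustrative sketch. The log does not say which library or method the StressChecker actually uses.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of empty sequence")
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into ordered
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies from the second StressChecker batch (seconds).
latencies = [
    2.959808588027954,
    2.8291587829589844,
    3.379267454147339,
    2.5864367485046387,
    2.302277088165283,
]

if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile above the 75th is an interpolation between the two slowest requests, so the tail figures say little about real tail latency.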
Pipeline stage StressChecker completed in 50.20s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
Shutdown handler de-registered
function_gijuk_2025-07-15 status is now deployed due to DeploymentManager action
function_gijuk_2025-07-15 status is now inactive due to admin request
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
function_gijuk_2025-07-15 status is now torndown due to DeploymentManager action