developer_uid: chai_backend_admin
submission_id: function_kibib_2025-07-29
model_name: function_kibib_2025-07-29
model_group:
status: torndown
timestamp: 2025-07-29T23:13:15+00:00
num_battles: 5753
num_wins: 2969
celo_rating: 1290.41
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_kibib_2025-07-29
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-29
win_ratio: 0.5160785677038067
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
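The `formatter` templates above can be read as building blocks for the text sent to the model. The sketch below shows one way they might be combined. The assembly order (memory, prompt, chat turns, response header), the `render_prompt` helper, and the sample conversation are all assumptions made for illustration, not the platform's confirmed implementation.

```python
# Templates copied verbatim from the `formatter` entry above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def render_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample data, for illustration only.
example = render_prompt(
    memory="Ava is a cheerful barista.",
    prompt="A quiet morning at the cafe.",
    turns=[("user", "Hi, what do you recommend?"), ("bot", "Our flat white is great!")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because `stopping_words` is `['\n']` and `max_output_tokens` is 64, a completion appended after the final `Ava:` would presumably end at the first newline, giving a single-line reply.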
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.5216119289398193s
Received healthy response to inference request in 4.777735948562622s
Received healthy response to inference request in 3.417940139770508s
Received healthy response to inference request in 3.9693603515625s
Received healthy response to inference request in 3.6468024253845215s
5 requests
0 failed requests
5th percentile: 3.43867449760437
10th percentile: 3.4594088554382325
20th percentile: 3.500877571105957
30th percentile: 3.54665002822876
40th percentile: 3.5967262268066404
50th percentile: 3.6468024253845215
60th percentile: 3.775825595855713
70th percentile: 3.904848766326904
80th percentile: 4.131035470962525
90th percentile: 4.454385709762573
95th percentile: 4.616060829162597
99th percentile: 4.745400924682617
mean time: 3.866690158843994
%s, retrying in %s seconds...
Received healthy response to inference request in 3.016814947128296s
Received healthy response to inference request in 2.7976186275482178s
Received healthy response to inference request in 4.687144994735718s
Received healthy response to inference request in 3.260746479034424s
Received healthy response to inference request in 2.6586294174194336s
5 requests
0 failed requests
5th percentile: 2.6864272594451903
10th percentile: 2.714225101470947
20th percentile: 2.769820785522461
30th percentile: 2.841457891464233
40th percentile: 2.9291364192962646
50th percentile: 3.016814947128296
60th percentile: 3.114387559890747
70th percentile: 3.2119601726531983
80th percentile: 3.546026182174683
90th percentile: 4.1165855884552
95th percentile: 4.401865291595459
99th percentile: 4.630089054107666
mean time: 3.2841908931732178
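The percentile and mean figures reported by StressChecker are consistent with linear interpolation between sorted samples (the default behavior of `numpy.percentile`). The sketch below recomputes them from the five latencies in each batch. The `percentile` helper is a hypothetical stand-in written with the standard library, not the pipeline's own code.

```python
from statistics import mean


def percentile(values, p):
    """Percentile with linear interpolation between closest ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


# Latencies in seconds, copied from the two StressChecker batches above.
first_batch = [
    3.5216119289398193,
    4.777735948562622,
    3.417940139770508,
    3.9693603515625,
    3.6468024253845215,
]
second_batch = [
    3.016814947128296,
    2.7976186275482178,
    4.687144994735718,
    3.260746479034424,
    2.6586294174194336,
]

for name, batch in (("first", first_batch), ("second", second_batch)):
    print(
        name,
        "p5:", percentile(batch, 5),
        "p50:", percentile(batch, 50),
        "p90:", percentile(batch, 90),
        "mean:", mean(batch),
    )
```

With only five samples per batch, the upper percentiles are dominated by the single slowest request, so the 95th and 99th percentile values say little about tail latency.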
Pipeline stage StressChecker completed in 38.68s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_kibib_2025-07-29 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Received signal 15, running shutdown handler
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
clean up pipeline due to error=DeploymentChecksError('None: None')
Shutdown handler de-registered
function_kibib_2025-07-29 status is now inactive due to auto deactivation removed underperforming models
function_kibib_2025-07-29 status is now torndown due to DeploymentManager action