developer_uid: chai_backend_admin
submission_id: function_sinib_2025-07-05
model_name: function_sinib_2025-07-05
model_group:
status: torndown
timestamp: 2025-07-05T05:49:38+00:00
num_battles: 10632
num_wins: 5671
celo_rating: 1313.5
family_friendly_score: 0.509
family_friendly_standard_error: 0.007069922206078367
submission_type: function
display_name: function_sinib_2025-07-05
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-04
win_ratio: 0.5333897667419112
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
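
The formatter templates above can be read as pieces of a single prompt string. A minimal sketch of how they might be combined follows. The assembly order (memory, prompt, conversation turns, response header) and the helper name `render_prompt` are assumptions for illustration, not taken from the log. Truncation to `max_input_tokens` and the `truncate_by_message` flag are not modeled.

```python
# Templates copied from the formatter entry above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def render_prompt(formatter, memory, prompt, bot_name, user_name, turns):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues as the bot.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = render_prompt(
    formatter,
    memory="M",
    prompt="P",
    bot_name="Bot",
    user_name="User",
    turns=[("user", "hi"), ("bot", "hello")],
)
```

Because `stopping_words` is `['\n']` in the generation parameters, a completion under this layout would plausibly end at the first newline, i.e. one bot message per request.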
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4505653381347656s
Received healthy response to inference request in 3.8978404998779297s
Received healthy response to inference request in 4.271065950393677s
Received healthy response to inference request in 6.365395545959473s
Received healthy response to inference request in 4.017943620681763s
5 requests
0 failed requests
5th percentile: 3.5400203704833983
10th percentile: 3.629475402832031
20th percentile: 3.808385467529297
30th percentile: 3.9218611240386965
40th percentile: 3.9699023723602296
50th percentile: 4.017943620681763
60th percentile: 4.1191925525665285
70th percentile: 4.220441484451294
80th percentile: 4.689931869506836
90th percentile: 5.527663707733154
95th percentile: 5.9465296268463135
99th percentile: 6.281622362136841
mean time: 4.400562191009522
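
The percentile and mean figures in the block above are consistent with linear interpolation between sorted samples, applied to the five response times logged just before it. A minimal sketch that recomputes them is shown here. The function name `percentile` is hypothetical, and the interpolation rule is inferred from the numbers rather than confirmed by the log.

```python
import math


def percentile(values, q):
    """Return the q-th percentile (0-100) of a non-empty list,
    interpolating linearly between the two nearest ranks."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return ordered[lo] + (ordered[hi] - ordered[lo]) * frac


# The five response times (seconds) from the first StressChecker run.
latencies = [
    3.4505653381347656,
    3.8978404998779297,
    4.271065950393677,
    6.365395545959473,
    4.017943620681763,
]

summary = {q: percentile(latencies, q) for q in (5, 50, 90, 99)}
mean_time = sum(latencies) / len(latencies)
```

With only five samples, the upper percentiles are dominated by the single slowest request (about 6.37 s), so they are best read as rough indicators rather than stable tail estimates.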
%s, retrying in %s seconds...
Received healthy response to inference request in 5.756726026535034s
Received healthy response to inference request in 4.561340808868408s
Received healthy response to inference request in 2.626582145690918s
Received healthy response to inference request in 3.5033576488494873s
Received healthy response to inference request in 3.286151885986328s
5 requests
0 failed requests
5th percentile: 2.75849609375
10th percentile: 2.890410041809082
20th percentile: 3.1542379379272463
30th percentile: 3.32959303855896
40th percentile: 3.4164753437042235
50th percentile: 3.5033576488494873
60th percentile: 3.9265509128570555
70th percentile: 4.349744176864624
80th percentile: 4.800417852401734
90th percentile: 5.278571939468383
95th percentile: 5.517648983001709
99th percentile: 5.708910617828369
mean time: 3.946831703186035
%s, retrying in %s seconds...
Received healthy response to inference request in 2.810164213180542s
Received healthy response to inference request in 3.514655351638794s
Received healthy response to inference request in 3.5288455486297607s
Received healthy response to inference request in 3.518338918685913s
Received healthy response to inference request in 3.709784984588623s
5 requests
0 failed requests
5th percentile: 2.9510624408721924
10th percentile: 3.0919606685638428
20th percentile: 3.3737571239471436
30th percentile: 3.5153920650482178
40th percentile: 3.5168654918670654
50th percentile: 3.518338918685913
60th percentile: 3.522541570663452
70th percentile: 3.5267442226409913
80th percentile: 3.565033435821533
90th percentile: 3.637409210205078
95th percentile: 3.6735970973968506
99th percentile: 3.7025474071502686
mean time: 3.4163578033447264
Pipeline stage StressChecker completed in 63.43s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.88s
Shutdown handler de-registered
function_sinib_2025-07-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3701.34s
Shutdown handler de-registered
function_sinib_2025-07-05 status is now inactive due to auto deactivation removed underperforming models
function_sinib_2025-07-05 status is now torndown due to DeploymentManager action