developer_uid: chai_backend_admin
submission_id: function_nigul_2025-04-19
model_name: function_nigul_2025-04-19
model_group:
status: torndown
timestamp: 2025-04-19T22:44:15+00:00
num_battles: 6184
num_wins: 3119
celo_rating: 1290.32
family_friendly_score: 0.5232
family_friendly_standard_error: 0.007063451847361883
submission_type: function
display_name: function_nigul_2025-04-19
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-19
win_ratio: 0.5043661060802069
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
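The formatter entry above only lists templates; it does not say how they are combined. A minimal sketch of one plausible assembly order (memory, prompt, chat turns, then the response header) follows. The function name `build_prompt`, the assembly order, and the sample persona and messages are assumptions for illustration and are not taken from the submission. Truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry of this submission.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Join the templates in an assumed order: memory, prompt, turns, response header.

    `turns` is a list of (speaker, message) pairs, where speaker is "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(FORMATTER["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(FORMATTER["user_template"].format(user_name=user_name, message=message))
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Hypothetical conversation, for illustration only.
example_text = build_prompt(
    memory="Ada is a friendly guide.",
    prompt="A short chat.",
    bot_name="Ada",
    user_name="Sam",
    turns=[("user", "Hello"), ("bot", "Hi there")],
)
```

Because `stopping_words` is `['\n']` in generation_params, a completion of a prompt built this way would end at the first newline, which is consistent with a single-line bot reply.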
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3559422492980957s
Received healthy response to inference request in 4.76038122177124s
Received healthy response to inference request in 4.351449728012085s
Received healthy response to inference request in 3.623445987701416s
Received healthy response to inference request in 2.9856832027435303s
5 requests
0 failed requests
5th percentile: 3.0597350120544435
10th percentile: 3.1337868213653564
20th percentile: 3.2818904399871824
30th percentile: 3.40944299697876
40th percentile: 3.5164444923400877
50th percentile: 3.623445987701416
60th percentile: 3.9146474838256835
70th percentile: 4.205848979949951
80th percentile: 4.433236026763916
90th percentile: 4.596808624267578
95th percentile: 4.678594923019409
99th percentile: 4.744023962020874
mean time: 3.8153804779052733
%s, retrying in %s seconds...
Received healthy response to inference request in 2.9303815364837646s
Received healthy response to inference request in 2.882758140563965s
Received healthy response to inference request in 3.950457811355591s
Received healthy response to inference request in 3.6355326175689697s
Received healthy response to inference request in 3.813253164291382s
5 requests
0 failed requests
5th percentile: 2.892282819747925
10th percentile: 2.901807498931885
20th percentile: 2.9208568572998046
30th percentile: 3.0714117527008056
40th percentile: 3.353472185134888
50th percentile: 3.6355326175689697
60th percentile: 3.7066208362579345
70th percentile: 3.7777090549468992
80th percentile: 3.8406940937042235
90th percentile: 3.8955759525299074
95th percentile: 3.923016881942749
99th percentile: 3.9449696254730227
mean time: 3.4424766540527343
%s, retrying in %s seconds...
Received healthy response to inference request in 3.4221956729888916s
Received healthy response to inference request in 3.5820014476776123s
Received healthy response to inference request in 3.3050663471221924s
Received healthy response to inference request in 3.8269155025482178s
Received healthy response to inference request in 3.148655414581299s
5 requests
0 failed requests
5th percentile: 3.1799376010894775
10th percentile: 3.2112197875976562
20th percentile: 3.2737841606140137
30th percentile: 3.328492212295532
40th percentile: 3.375343942642212
50th percentile: 3.4221956729888916
60th percentile: 3.48611798286438
70th percentile: 3.550040292739868
80th percentile: 3.6309842586517336
90th percentile: 3.7289498805999757
95th percentile: 3.7779326915740965
99th percentile: 3.8171189403533936
mean time: 3.4569668769836426
Pipeline stage StressChecker completed in 56.86s
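The percentile and mean lines reported by StressChecker are consistent with linear interpolation between sorted samples. A minimal sketch that reproduces the first batch of five latencies is shown below. The helper name `percentile` is hypothetical, and the interpolation rule is inferred from the logged numbers rather than taken from the pipeline's code.

```python
import math

# The five response times (seconds) from the first StressChecker batch.
latencies = [
    3.3559422492980957,
    4.76038122177124,
    4.351449728012085,
    3.623445987701416,
    2.9856832027435303,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (q / 100) * (len(ordered) - 1)  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


p5 = percentile(latencies, 5)    # log reports 3.0597350120544435
p50 = percentile(latencies, 50)  # log reports 3.623445987701416
p90 = percentile(latencies, 90)  # log reports 4.596808624267578
mean_time = sum(latencies) / len(latencies)  # log reports 3.8153804779052733
```

With only five samples per batch, the high percentiles are interpolated between the two slowest requests, so they say little about tail latency.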
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.78s
Shutdown handler de-registered
function_nigul_2025-04-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 7229.54s
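The log does not say how family_friendly_standard_error is derived. The reported value is numerically consistent with the standard error of a proportion, sqrt(p * (1 - p) / n), with n = 5000 scored samples. Both the formula and the sample count are assumptions inferred from the numbers, not confirmed by the pipeline. A minimal check:

```python
import math

family_friendly_score = 0.5232  # from the submission metadata
assumed_sample_count = 5000     # assumption: not stated anywhere in the log

# Standard error of a proportion under a binomial model (assumed, not confirmed).
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / assumed_sample_count
)

reported_standard_error = 0.007063451847361883  # from the submission metadata
difference = abs(standard_error - reported_standard_error)
```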
Shutdown handler de-registered
function_nigul_2025-04-19 status is now inactive due to auto deactivation removed underperforming models
function_nigul_2025-04-19 status is now torndown due to DeploymentManager action