developer_uid: NischayDnk
submission_id: function_kirok_2025-06-25
model_name: function_kirok_2025-06-25
model_group:
status: torndown
timestamp: 2025-06-25T20:25:02+00:00
num_battles: 7306
num_wins: 3567
celo_rating: 1285.83
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_kirok_2025-06-25
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-06-25
win_ratio: 0.48822885299753627
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
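The `formatter` entry above is a set of string templates. The record does not show how the serving stack combines them, so the following is only a minimal sketch under assumptions: the function `build_prompt`, its argument names, and the ordering (memory, then prompt, then chat turns, then the response header) are hypothetical. It ignores `truncate_by_message` and the `max_input_tokens` limit.

```python
# Templates copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical assembly: memory, prompt, chat turns, response header.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering is an assumption, not taken from the log.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="You are a guide.",
    prompt="A forest.",
    turns=[("user", "Hi"), ("bot", "Hello")],
    bot_name="Bot",
    user_name="User",
)
```

With `stopping_words` set to `['\n']` in `generation_params`, a prompt ending in `{bot_name}:` would yield a single-line reply, which is consistent with the chat-turn templates each ending in a newline.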
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 17.618516206741333s
Received healthy response to inference request in 3.25471830368042s
Received healthy response to inference request in 18.952202796936035s
Received healthy response to inference request in 5.410636901855469s
Received healthy response to inference request in 2.9594311714172363s
5 requests
0 failed requests
5th percentile: 3.0184885978698732
10th percentile: 3.0775460243225097
20th percentile: 3.195660877227783
30th percentile: 3.6859020233154296
40th percentile: 4.548269462585449
50th percentile: 5.410636901855469
60th percentile: 10.293788623809814
70th percentile: 15.176940345764159
80th percentile: 17.885253524780275
90th percentile: 18.418728160858155
95th percentile: 18.685465478897093
99th percentile: 18.898855333328246
mean time: 9.6391010761261
%s, retrying in %s seconds...
Received healthy response to inference request in 2.8240833282470703s
Received healthy response to inference request in 2.632939100265503s
Received healthy response to inference request in 2.149380683898926s
Received healthy response to inference request in 3.7577059268951416s
Received healthy response to inference request in 18.824248790740967s
5 requests
0 failed requests
5th percentile: 2.246092367172241
10th percentile: 2.3428040504455567
20th percentile: 2.5362274169921877
30th percentile: 2.6711679458618165
40th percentile: 2.747625637054443
50th percentile: 2.8240833282470703
60th percentile: 3.1975323677062986
70th percentile: 3.5709814071655273
80th percentile: 6.771014499664309
90th percentile: 12.797631645202639
95th percentile: 15.8109402179718
99th percentile: 18.221587076187134
mean time: 6.0376715660095215
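The percentile and mean figures reported by StressChecker can be reproduced from the five response times in each run using linear interpolation between sorted samples. The sketch below is a reconstruction, not the pipeline's own code: the helper name `percentile` is hypothetical, and the interpolation rule is inferred from the logged values matching it.

```python
def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation
    between the two nearest sorted samples."""
    ordered = sorted(samples)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times in seconds, copied from the two StressChecker runs in this log.
run_1 = [
    17.618516206741333,
    3.25471830368042,
    18.952202796936035,
    5.410636901855469,
    2.9594311714172363,
]
run_2 = [
    2.8240833282470703,
    2.632939100265503,
    2.149380683898926,
    3.7577059268951416,
    18.824248790740967,
]

for name, run in (("run 1", run_1), ("run 2", run_2)):
    mean_time = sum(run) / len(run)
    print(name, "median:", percentile(run, 50), "mean:", mean_time)
```

In both runs the mean is well above the median because one or two slow responses of roughly 18 to 19 seconds dominate a five-sample average, so the median is the steadier indicator at this sample size.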
Pipeline stage StressChecker completed in 81.26s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.72s
Shutdown handler de-registered
function_kirok_2025-06-25 status is now deployed due to DeploymentManager action
function_kirok_2025-06-25 status is now inactive due to auto deactivation removed underperforming models
function_kirok_2025-06-25 status is now torndown due to DeploymentManager action