developer_uid: NischayDnk
submission_id: function_kinab_2025-08-22
model_name: function_kinab_2025-08-22
model_group:
status: torndown
timestamp: 2025-08-22T18:04:07+00:00
num_battles: 7063
num_wins: 4082
celo_rating: 1282.84
family_friendly_score: 0.5751999999999999
family_friendly_standard_error: 0.0069906360225661865
submission_type: function
display_name: function_kinab_2025-08-22
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-22
win_ratio: 0.5779413846807305
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
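The formatter entry above lists five string templates. The sketch below shows one plausible way they could be joined into a single model input. The assembly order (memory, prompt, chat turns, response header) and the `render_prompt` helper are assumptions made for illustration and are not taken from the pipeline's code. Token truncation (`max_input_tokens`, `truncate_by_message`) is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def render_prompt(fmt, memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates into one input string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering used here is an assumption.
    """
    parts = [
        fmt["memory_template"].format(memory=memory),
        fmt["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(fmt["bot_template"].format(bot_name=bot_name, message=message))
        else:
            parts.append(fmt["user_template"].format(user_name=user_name, message=message))
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; the '\n' stopping word would end it after one line.
    parts.append(fmt["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Example values are invented purely for illustration.
example = render_prompt(
    formatter,
    memory="You are Ava.",
    prompt="Greet the user.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```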
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.4973530769348145s
Received healthy response to inference request in 2.762279748916626s
Received healthy response to inference request in 2.8119375705718994s
Received healthy response to inference request in 3.8368139266967773s
Received healthy response to inference request in 2.480660915374756s
5 requests
0 failed requests
5th percentile: 2.53698468208313
10th percentile: 2.593308448791504
20th percentile: 2.705955982208252
30th percentile: 2.7722113132476807
40th percentile: 2.79207444190979
50th percentile: 2.8119375705718994
60th percentile: 3.0861037731170655
70th percentile: 3.360269975662231
80th percentile: 3.565245246887207
90th percentile: 3.701029586791992
95th percentile: 3.7689217567443847
99th percentile: 3.823235492706299
mean time: 3.077809047698975
Pipeline stage StressChecker completed in 16.45s
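The percentile and mean figures reported by StressChecker can be reproduced from the five response times logged above. The sketch below assumes linear interpolation between the sorted samples, which matches the logged values. The `percentile` helper is written here for illustration and is not the pipeline's own code.

```python
import math

# Latencies in seconds, copied from the five "healthy response" log lines.
latencies = [
    3.4973530769348145,
    2.762279748916626,
    2.8119375705718994,
    3.8368139266967773,
    2.480660915374756,
]


def percentile(values, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    pos = (len(ordered) - 1) * q / 100.0
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


for q in (5, 50, 95):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile is an interpolation between two neighbouring measurements, so the tail values (95th, 99th) sit close to the single slowest request.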
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_kinab_2025-08-22 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4434.96s
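The reported family_friendly_standard_error is consistent with the usual binomial standard error, sqrt(p * (1 - p) / n), if the score is treated as a proportion over n = 5000 scored samples. The log does not state the sample count, so n = 5000 is an inferred assumption and not a documented value. The sketch below only checks that arithmetic.

```python
import math

# Value copied from the metadata above.
family_friendly_score = 0.5751999999999999

# ASSUMPTION: the sample count is not given in the log; 5000 is inferred
# because it reproduces the reported standard error.
assumed_num_samples = 5000


def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n independent samples."""
    return math.sqrt(p * (1.0 - p) / n)


standard_error = binomial_standard_error(family_friendly_score, assumed_num_samples)
print(f"standard error: {standard_error}")
```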
Shutdown handler de-registered
function_kinab_2025-08-22 status is now inactive due to auto deactivation removed underperforming models
function_kinab_2025-08-22 status is now torndown due to DeploymentManager action