developer_uid: NischayDnk
submission_id: function_loguk_2025-08-22
model_name: function_loguk_2025-08-22
model_group:
status: torndown
timestamp: 2025-08-22T18:04:26+00:00
num_battles: 6418
num_wins: 3784
celo_rating: 1282.91
family_friendly_score: 0.575
family_friendly_standard_error: 0.006991065727054781
submission_type: function
display_name: function_loguk_2025-08-22
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-22
win_ratio: 0.5895917731380492
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.360957145690918s
Received healthy response to inference request in 1.6956672668457031s
Received healthy response to inference request in 1.3213810920715332s
Received healthy response to inference request in 1.1955173015594482s
Received healthy response to inference request in 0.5708014965057373s
5 requests
0 failed requests
5th percentile: 0.6957446575164795
10th percentile: 0.8206878185272217
20th percentile: 1.0705741405487061
30th percentile: 1.2206900596618653
40th percentile: 1.2710355758666991
50th percentile: 1.3213810920715332
60th percentile: 1.337211513519287
70th percentile: 1.353041934967041
80th percentile: 1.427899169921875
90th percentile: 1.561783218383789
95th percentile: 1.628725242614746
99th percentile: 1.6822788619995117
mean time: 1.228864860534668
Pipeline stage StressChecker completed in 7.58s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_loguk_2025-08-22 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2681.88s
Shutdown handler de-registered
function_loguk_2025-08-22 status is now inactive due to auto deactivation removed underperforming models
function_loguk_2025-08-22 status is now torndown due to DeploymentManager action