developer_uid: chai_backend_admin
submission_id: function_lokak_2025-05-23
model_name: function_lokak_2025-05-23
model_group:
status: torndown
timestamp: 2025-05-23T11:00:52+00:00
num_battles: 8023
num_wins: 4148
celo_rating: 1294.95
family_friendly_score: 0.5316000000000001
family_friendly_standard_error: 0.007056931911248684
submission_type: function
display_name: function_lokak_2025-05-23
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-23
win_ratio: 0.5170135859404212
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 5.501901626586914s
Received healthy response to inference request in 4.460310935974121s
Received healthy response to inference request in 3.646841049194336s
Received healthy response to inference request in 5.134077310562134s
Received healthy response to inference request in 4.338506460189819s
5 requests
0 failed requests
5th percentile: 3.7851741313934326
10th percentile: 3.9235072135925293
20th percentile: 4.200173377990723
30th percentile: 4.36286735534668
40th percentile: 4.4115891456604
50th percentile: 4.460310935974121
60th percentile: 4.729817485809326
70th percentile: 4.999324035644531
80th percentile: 5.2076421737670895
90th percentile: 5.354771900177002
95th percentile: 5.428336763381958
99th percentile: 5.487188653945923
mean time: 4.616327476501465
%s, retrying in %s seconds...
Received healthy response to inference request in 3.2692573070526123s
Received healthy response to inference request in 2.9302947521209717s
Received healthy response to inference request in 2.4971718788146973s
Received healthy response to inference request in 4.68435001373291s
Received healthy response to inference request in 4.0242109298706055s
5 requests
0 failed requests
5th percentile: 2.5837964534759523
10th percentile: 2.670421028137207
20th percentile: 2.8436701774597166
30th percentile: 2.9980872631073
40th percentile: 3.133672285079956
50th percentile: 3.2692573070526123
60th percentile: 3.5712387561798096
70th percentile: 3.873220205307007
80th percentile: 4.156238746643067
90th percentile: 4.420294380187988
95th percentile: 4.552322196960449
99th percentile: 4.657944450378418
mean time: 3.481056976318359
%s, retrying in %s seconds...
Received healthy response to inference request in 3.6517577171325684s
Received healthy response to inference request in 2.918252468109131s
Received healthy response to inference request in 1.99155855178833s
Received healthy response to inference request in 4.3625335693359375s
Received healthy response to inference request in 2.8013856410980225s
5 requests
0 failed requests
5th percentile: 2.1535239696502684
10th percentile: 2.315489387512207
20th percentile: 2.639420223236084
30th percentile: 2.824759006500244
40th percentile: 2.8715057373046875
50th percentile: 2.918252468109131
60th percentile: 3.2116545677185058
70th percentile: 3.5050566673278807
80th percentile: 3.793912887573242
90th percentile: 4.07822322845459
95th percentile: 4.220378398895264
99th percentile: 4.334102535247803
mean time: 3.145097589492798
Pipeline stage StressChecker completed in 59.26s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_lokak_2025-05-23 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3376.64s
Shutdown handler de-registered
function_lokak_2025-05-23 status is now inactive due to auto deactivation removed underperforming models
function_lokak_2025-05-23 status is now torndown due to DeploymentManager action