developer_uid: chai_backend_admin
submission_id: function_lemus_2025-07-09
model_name: function_lemus_2025-07-09
model_group:
status: torndown
timestamp: 2025-07-09T23:39:49+00:00
num_battles: 7080
num_wins: 3587
celo_rating: 1290.86
family_friendly_score: 0.5307999999999999
family_friendly_standard_error: 0.007057639265363454
submission_type: function
display_name: function_lemus_2025-07-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-09
win_ratio: 0.506638418079096
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3760786056518555s
Received healthy response to inference request in 3.6380224227905273s
Received healthy response to inference request in 3.0858752727508545s
Received healthy response to inference request in 2.9502463340759277s
Received healthy response to inference request in 4.3507163524627686s
5 requests
0 failed requests
5th percentile: 2.9773721218109133
10th percentile: 3.0044979095458983
20th percentile: 3.058749485015869
30th percentile: 3.1439159393310545
40th percentile: 3.259997272491455
50th percentile: 3.3760786056518555
60th percentile: 3.4808561325073244
70th percentile: 3.585633659362793
80th percentile: 3.7805612087249756
90th percentile: 4.065638780593872
95th percentile: 4.20817756652832
99th percentile: 4.322208595275879
mean time: 3.4801877975463866
Pipeline stage StressChecker completed in 20.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.87s
Shutdown handler de-registered
function_lemus_2025-07-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3568.24s
Shutdown handler de-registered
function_lemus_2025-07-09 status is now inactive due to auto deactivation removed underperforming models
function_lemus_2025-07-09 status is now torndown due to DeploymentManager action