developer_uid: chai_backend_admin
submission_id: function_numel_2025-12-22
model_name: function_numel_2025-12-22
model_group:
status: torndown
timestamp: 2025-12-25T14:21:37+00:00
num_battles: 16599
num_wins: 8691
celo_rating: 1309.74
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_numel_2025-12-22
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.5235857581782035
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.2428100109100342s
Received healthy response to inference request in 1.3206608295440674s
Received healthy response to inference request in 2.649121046066284s
Received healthy response to inference request in 1.2687692642211914s
Received healthy response to inference request in 1.3727970123291016s
Received healthy response to inference request in 1.6536588668823242s
Received healthy response to inference request in 1.345703125s
Received healthy response to inference request in 1.3710806369781494s
Received healthy response to inference request in 1.8299286365509033s
Received healthy response to inference request in 1.879713773727417s
10 requests
0 failed requests
5th percentile: 1.254491674900055
10th percentile: 1.2661733388900758
20th percentile: 1.3102825164794922
30th percentile: 1.3381904363632202
40th percentile: 1.3609296321868896
50th percentile: 1.3719388246536255
60th percentile: 1.4851417541503904
70th percentile: 1.706539797782898
80th percentile: 1.839885663986206
90th percentile: 1.9566545009613034
95th percentile: 2.302887773513793
99th percentile: 2.5798743915557862
mean time: 1.5934243202209473
Pipeline stage StressChecker completed in 18.53s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_numel_2025-12-22 status is now deployed due to DeploymentManager action
function_numel_2025-12-22 status is now inactive due to admin request
function_numel_2025-12-22 status is now torndown due to DeploymentManager action