developer_uid: chai_backend_admin
submission_id: function_mamos_2026-01-16
model_name: function_mamos_2026-01-16
model_group:
status: torndown
timestamp: 2026-01-19T04:28:06+00:00
num_battles: 11772
num_wins: 4571
celo_rating: 1225.93
family_friendly_score: 0.7018
family_friendly_standard_error: 0.006469571237725109
submission_type: function
display_name: function_mamos_2026-01-16
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-01-15
win_ratio: 0.3882942575603126
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.7117290496826172s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.6018106937408447s
Received healthy response to inference request in 1.9534127712249756s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.4361767768859863s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.5152428150177002s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.8561785221099854s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.5251965522766113s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.5660135746002197s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.5495164394378662s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Received healthy response to inference request in 1.8514208793640137s
10 requests
0 failed requests
5th percentile: 1.4717564940452577
10th percentile: 1.5073362112045288
20th percentile: 1.5232058048248291
30th percentile: 1.5422204732894897
40th percentile: 1.5594147205352784
50th percentile: 1.5839121341705322
Falling back to EndpointApi.from_submission implementation
60th percentile: 1.6457780361175536
Falling back to EndpointApi.from_submission implementation
70th percentile: 1.7536365985870361
80th percentile: 1.852372407913208
90th percentile: 1.8659019470214844
95th percentile: 1.90965735912323
99th percentile: 1.9446616888046264
mean time: 1.656669807434082
Pipeline stage StressChecker completed in 18.19s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.65s
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation
function_mamos_2026-01-16 status is now deployed due to DeploymentManager action
Falling back to EndpointApi.from_submission implementation
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2395.28s
Shutdown handler de-registered
function_mamos_2026-01-16 status is now inactive due to auto deactivation removed underperforming models
admin requested tearing down of function_mamos_2026-01-16
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
Falling back to EndpointApi.from_submission implementation
Shutdown handler de-registered
Falling back to EndpointApi.from_submission implementation
Falling back to EndpointApi.from_submission implementation