developer_uid: chai_evaluation_service
submission_id: function_bigot_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T02:26:37+00:00
num_battles: 5268
num_wins: 2584
celo_rating: 1288.49
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.4905087319665907
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
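The `win_ratio` field above appears to be the quotient of `num_wins` and `num_battles`. A minimal sketch of that arithmetic follows. The record does not say how the evaluation service computes the field, so the plain division is an assumption, and the variable names simply mirror the metadata keys.

```python
# Values copied from the metadata fields above.
num_battles = 5268
num_wins = 2584

# Assumption: win_ratio is wins divided by battles, with no smoothing or weighting.
win_ratio = num_wins / num_battles

print(f"win_ratio = {win_ratio:.6f}")  # about 0.490509, in line with the logged field
```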
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0120174884796143s
Received healthy response to inference request in 1.904939889907837s
read tcp 127.0.0.1:58608->127.0.0.1:8080: read: connection reset by peer
Received unhealthy response to inference request!
Received healthy response to inference request in 2.167863130569458s
Received healthy response to inference request in 2.4047415256500244s
Received healthy response to inference request in 2.0235707759857178s
Received healthy response to inference request in 2.448071241378784s
Received healthy response to inference request in 1.910193681716919s
Received healthy response to inference request in 2.658428192138672s
Received healthy response to inference request in 2.8343989849090576s
10 requests
1 failed requests
5th percentile: 0.9610314011573792
10th percentile: 1.733320164680481
20th percentile: 1.9091429233551025
30th percentile: 1.9814703464508057
40th percentile: 2.0189494609832765
50th percentile: 2.095716953277588
60th percentile: 2.2626144886016846
70th percentile: 2.4177404403686524
80th percentile: 2.4901426315307615
90th percentile: 2.6760252714157104
95th percentile: 2.755212128162384
99th percentile: 2.818561613559723
mean time: 2.055296754837036
%s, retrying in %s seconds...
Received healthy response to inference request in 1.97698974609375s
Received healthy response to inference request in 2.3067500591278076s
Received healthy response to inference request in 2.3033251762390137s
Received healthy response to inference request in 2.257240056991577s
Received healthy response to inference request in 2.256472587585449s
Received healthy response to inference request in 3.203000783920288s
Received healthy response to inference request in 1.680201530456543s
Received healthy response to inference request in 2.192760705947876s
Received healthy response to inference request in 2.616265296936035s
Received healthy response to inference request in 2.7683534622192383s
10 requests
0 failed requests
5th percentile: 1.8137562274932861
10th percentile: 1.9473109245300293
20th percentile: 2.1496065139770506
30th percentile: 2.2373590230941773
40th percentile: 2.256933069229126
50th percentile: 2.2802826166152954
60th percentile: 2.3046951293945312
70th percentile: 2.3996046304702756
80th percentile: 2.646682929992676
90th percentile: 2.811818194389343
95th percentile: 3.0074094891548153
99th percentile: 3.163882524967194
mean time: 2.356135940551758
Pipeline stage StressChecker completed in 47.59s
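The percentile and mean figures from the second StressChecker run can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between sorted samples (the default behaviour of `numpy.percentile`). The log does not state which method StressChecker actually uses, and the `percentile` helper is hypothetical, written here only for illustration.

```python
import math

def percentile(samples, q):
    """Return the q-th percentile (0-100) by linear interpolation between ranks."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("no samples")
    rank = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac

# Response times (seconds) copied from the second run in the log above.
second_run = [
    1.97698974609375, 2.3067500591278076, 2.3033251762390137,
    2.257240056991577, 2.256472587585449, 3.203000783920288,
    1.680201530456543, 2.192760705947876, 2.616265296936035,
    2.7683534622192383,
]

p5 = percentile(second_run, 5)
p50 = percentile(second_run, 50)
p95 = percentile(second_run, 95)
mean_time = sum(second_run) / len(second_run)

print(f"5th: {p5:.4f}  50th: {p50:.4f}  95th: {p95:.4f}  mean: {mean_time:.4f}")
```

With these inputs the interpolated values agree with the logged 5th, 50th, and 95th percentiles and the mean time, which is consistent with the interpolation assumption but does not prove it.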
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_bigot_2025-12-13 status is now deployed due to DeploymentManager action
function_bigot_2025-12-13 status is now inactive due to auto deactivation removed underperforming models