developer_uid: rirv938
submission_id: function_nurir_2025-06-12
model_name: dpo_data_collection
model_group:
status: torndown
timestamp: 2025-06-12T06:51:06+00:00
num_battles: 11785
num_wins: 6321
celo_rating: 1304.09
family_friendly_score: 0.5356000000000001
family_friendly_standard_error: 0.007053121861984238
submission_type: function
display_name: dpo_data_collection
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-06-11
win_ratio: 0.5363597793805686
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['</s>', '\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
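The derived fields in the metadata above can be cross-checked from the raw counts. The sketch below recomputes win_ratio from num_wins and num_battles, and family_friendly_standard_error from family_friendly_score. The standard-error formula and the sample size of 5000 are assumptions inferred from the numbers, not values stated in this record.

```python
import math

# Values copied from the metadata above.
num_battles = 11785
num_wins = 6321
family_friendly_score = 0.5356

# Win ratio is simply wins divided by battles.
win_ratio = num_wins / num_battles

# ASSUMPTION: the score is the mean of binary labels over n samples, so its
# standard error is sqrt(p * (1 - p) / n). n_samples = 5000 is inferred from
# the reported figures; it does not appear anywhere in the record.
n_samples = 5000
standard_error = math.sqrt(
    family_friendly_score * (1 - family_friendly_score) / n_samples
)

print(f"win_ratio = {win_ratio:.6f}")
print(f"standard_error = {standard_error:.6f}")
```

Under these assumptions both results agree with the logged win_ratio and family_friendly_standard_error to within rounding.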
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 6.704310178756714s
Received healthy response to inference request in 3.088205337524414s
Received healthy response to inference request in 3.4535582065582275s
Received healthy response to inference request in 3.6344048976898193s
5 requests
1 failed requests
5th percentile: 3.1612759113311766
10th percentile: 3.2343464851379395
20th percentile: 3.380487632751465
30th percentile: 3.4897275447845457
40th percentile: 3.5620662212371825
50th percentile: 3.6344048976898193
60th percentile: 4.862367010116577
70th percentile: 6.090329122543334
80th percentile: 9.391672801971438
90th percentile: 14.76639804840088
95th percentile: 17.453760671615598
99th percentile: 19.60365077018738
mean time: 7.4043203830719
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9334030151367188s
Received healthy response to inference request in 3.4167234897613525s
Received healthy response to inference request in 3.516444683074951s
Received healthy response to inference request in 3.976961135864258s
Received healthy response to inference request in 3.7240052223205566s
5 requests
0 failed requests
5th percentile: 2.2300671100616456
10th percentile: 2.5267312049865724
20th percentile: 3.1200593948364257
30th percentile: 3.436667728424072
40th percentile: 3.4765562057495116
50th percentile: 3.516444683074951
60th percentile: 3.5994688987731935
70th percentile: 3.6824931144714355
80th percentile: 3.774596405029297
90th percentile: 3.8757787704467774
95th percentile: 3.9263699531555174
99th percentile: 3.9668428993225096
mean time: 3.3135075092315676
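The percentile lines above are consistent with linear interpolation between sorted samples, which is the default convention of numpy.percentile. The pure-Python sketch below reproduces the second run's statistics from its five logged latencies. The helper name percentile is hypothetical, and it is an assumption that the StressChecker computes its figures this way.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


# Latencies (seconds) of the five healthy responses in the second run above.
latencies = [
    1.9334030151367188,
    3.4167234897613525,
    3.516444683074951,
    3.976961135864258,
    3.7240052223205566,
]

for q in (5, 50, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated between the two slowest requests, so a single timeout (as in the first run) dominates the upper tail and the mean.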
Pipeline stage StressChecker completed in 55.63s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.76s
Shutdown handler de-registered
function_nurir_2025-06-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3451.18s
Shutdown handler de-registered
function_nurir_2025-06-12 status is now inactive due to auto deactivation removed underperforming models
function_nurir_2025-06-12 status is now protected due to ABTestQueueItem
function_nurir_2025-06-12 status is now inactive due to ABTestQueueItem
function_nurir_2025-06-12 status is now torndown due to DeploymentManager action