2024-07-17 06:14:19,185 - INFO: Calling run.. 2024-07-17 06:14:19,186 - INFO: Environment configuration: ConfigNLPCausalClassificationEnvironment(gpus=['0'], mixed_precision=False, compile_model=False, use_deepspeed=False, deepspeed_reduce_bucket_size=10000000.0, deepspeed_stage3_prefetch_bucket_size=10000000.0, deepspeed_stage3_param_persistence_threshold=10000000.0, deepspeed_offload_optimizer=False, deepspeed_stage3_max_live_parameters=10000000.0, deepspeed_stage3_max_reuse_distance=10000000.0, find_unused_parameters=False, trust_remote_code=False, huggingface_branch='main', number_of_workers=8, seed=-1, _seed=0, _distributed=False, _distributed_inference=True, _local_rank=0, _world_size=1, _curr_step=0, _curr_val_step=0, _rank=0, _device='cuda', _cpu_comm=None, _model_card_template='text_causal_classification_model_card_template.md', _summary_card_template='text_causal_classification_experiment_summary_card_template.md') 2024-07-17 06:14:19,186 - INFO: cfg.environment._distributed set to False 2024-07-17 06:14:19,186 - INFO: Problem Type: text_causal_classification_modeling 2024-07-17 06:14:19,186 - INFO: Global random seed: 419783 2024-07-17 06:14:19,186 - INFO: Preparing the data... 2024-07-17 06:14:19,186 - INFO: Setting up automatic validation split... 2024-07-17 06:14:19,192 - INFO: The dataframe has following columns: Index(['Description', 'category', 'sub_category', 'label'], dtype='object') 2024-07-17 06:14:19,195 - INFO: Preparing train and validation data, dataset config to be used: ConfigNLPCausalClassificationDataset(dataset_class=, personalize=False, chatbot_name='OI_AI', chatbot_author='openinnovation.ai', train_dataframe='/app/train_df.csv', validation_strategy='automatic', validation_dataframe='/app/validation_df.csv', validation_size=0.0099999998, data_sample=1.0, data_sample_choice=('Train', 'Validation'), system_column='None', prompt_column=(), answer_column='category', parent_id_column='None', text_system_start='', text_prompt_start='', text_answer_separator='', limit_chained_samples=False, add_eos_token_to_system=False, add_eos_token_to_prompt=False, add_eos_token_to_answer=False, mask_prompt_labels=True, _allowed_file_extensions=('csv', 'pq', 'parquet'), num_classes=2) 2024-07-17 06:14:19,195 - INFO: Loading train dataset... 2024-07-17 06:14:19,195 - INFO: Columns found: Index(['Description', 'category', 'sub_category', 'label'], dtype='object') 2024-07-17 06:14:20,210 - INFO: Loading validation dataset... 2024-07-17 06:14:20,791 - INFO: Number of observations in train dataset: 494 2024-07-17 06:14:20,791 - INFO: Number of observations in validation dataset: 5 2024-07-17 06:14:21,246 - WARNING: PAD token id not matching between config and tokenizer. Overwriting with tokenizer id. 2024-07-17 06:14:21,246 - INFO: Setting pretraining_tp of model config to 1. 2024-07-17 06:14:21,251 - INFO: Using int4 for backbone 2024-07-17 06:14:21,251 - INFO: Loading TinyLlama/TinyLlama_v1.1. This may take a while. 2024-07-17 06:14:35,909 - INFO: Loaded TinyLlama/TinyLlama_v1.1. 2024-07-17 06:14:35,916 - INFO: Lora module names: ['q_proj', 'k_proj', 'v_proj', 'o_proj', 'gate_proj', 'up_proj', 'down_proj'] 2024-07-17 06:14:36,191 - INFO: Enough space available for saving model weights.Required space: 1003.87MB, Available space: 993953.90MB. 2024-07-17 06:14:36,200 - INFO: Optimizer AdamW has been provided with parameters {'weight_decay': 0.0, 'eps': 1e-08, 'betas': (0.8999999762, 0.9990000129), 'lr': 0.0001} 2024-07-17 06:14:36,637 - INFO: started process: 0, can_track: True, tracking_mode: TrackingMode.AFTER_EPOCH 2024-07-17 06:14:36,638 - INFO: Training Epoch: 1 / 1 2024-07-17 06:14:36,638 - INFO: train loss: 0%| | 0/247 [00:00