vall-e_libritts / libritts /log /log-train-2024-08-06-03-36-20-2
yuekai's picture
Upload folder using huggingface_hub
c96c265 verified
raw
history blame contribute delete
No virus
3.25 kB
2024-08-06 03:36:20,573 INFO [trainer.py:870] (2/8) Training started
2024-08-06 03:36:20,574 INFO [trainer.py:889] (2/8) Device: cuda:2
2024-08-06 03:36:20,574 INFO [trainer.py:890] (2/8) {'best_train_loss': inf, 'best_valid_loss': inf, 'best_train_epoch': -1, 'best_valid_epoch': -1, 'batch_idx_train': 0, 'log_interval': 100, 'reset_interval': 200, 'valid_interval': 2000, 'env_info': {'k2-version': '1.24.3', 'k2-build-type': 'Release', 'k2-with-cuda': True, 'k2-git-sha1': '279b0c87015a615b81b147251814d737a548f397', 'k2-git-date': 'Wed May 24 22:24:09 2023', 'lhotse-version': '1.26.0', 'torch-version': '2.0.1+cu118', 'torch-cuda-available': True, 'torch-cuda-version': '11.8', 'python-version': '3.10', 'icefall-git-branch': 'main', 'icefall-git-sha1': '7d2e5f4-dirty', 'icefall-git-date': 'Tue Aug 6 02:59:12 2024', 'icefall-path': '/workspace/icefall_llm', 'k2-path': '/usr/local/lib/python3.10/dist-packages/k2/__init__.py', 'lhotse-path': '/usr/local/lib/python3.10/dist-packages/lhotse/__init__.py', 'hostname': '6865771', 'IP address': '0.104.195.107'}, 'world_size': 8, 'master_port': 12354, 'tensorboard': True, 'num_epochs': 20, 'start_epoch': 1, 'start_batch': 0, 'exp_dir': PosixPath('exp/valle'), 'optimizer_name': 'ScaledAdam', 'scheduler_name': 'Eden', 'base_lr': 0.03, 'warmup_steps': 200, 'seed': 42, 'inf_check': False, 'save_every_n': 1000, 'keep_last_k': 20, 'average_period': 0, 'accumulate_grad_steps': 1, 'dtype': 'bfloat16', 'filter_min_duration': 0.5, 'filter_max_duration': 14.0, 'train_stage': 1, 'visualize': False, 'oom_check': False, 'model_name': 'valle', 'decoder_dim': 1024, 'nhead': 16, 'num_decoder_layers': 12, 'scale_factor': 1.0, 'norm_first': True, 'add_prenet': False, 'prefix_mode': 1, 'share_embedding': True, 'prepend_bos': False, 'num_quantizers': 8, 'scaling_xformers': False, 'manifest_dir': PosixPath('data/tokenized'), 'max_duration': 320, 'bucketing_sampler': True, 'num_buckets': 6, 'concatenate_cuts': False, 'duration_factor': 1.0, 'gap': 0.1, 'on_the_fly_feats': False, 'shuffle': True, 'buffer_size': 40000, 'shuffle_buffer_size': 100000, 'drop_last': False, 'return_cuts': True, 'num_workers': 8, 'enable_spec_aug': False, 'spec_aug_time_warp_factor': 80, 'input_strategy': 'PrecomputedFeatures', 'dataset': 'libritts', 'text_tokens': 'data/tokenized/unique_text_tokens.k2symbols', 'sampling_rate': 24000}
2024-08-06 03:36:20,574 INFO [trainer.py:892] (2/8) About to create model
2024-08-06 03:36:21,364 INFO [trainer.py:899] (2/8) Number of model parameters: 367386628
2024-08-06 03:36:22,194 INFO [trainer.py:914] (2/8) Using DDP
2024-08-06 03:36:24,258 INFO [datamodule.py:427] (2/8) About to get train cuts
2024-08-06 03:36:24,260 INFO [datamodule.py:434] (2/8) About to get dev cuts
2024-08-06 03:36:24,261 INFO [datamodule.py:292] (2/8) Disable SpecAugment
2024-08-06 03:36:24,261 INFO [datamodule.py:294] (2/8) About to create train dataset
2024-08-06 03:36:24,262 INFO [datamodule.py:323] (2/8) Using DynamicBucketingSampler
2024-08-06 03:36:24,875 INFO [datamodule.py:344] (2/8) About to create train dataloader
2024-08-06 03:36:24,876 INFO [datamodule.py:367] (2/8) About to create dev dataset
2024-08-06 03:36:25,205 INFO [datamodule.py:388] (2/8) About to create dev dataloader