runtime error

Exit code: 3. Reason: INFO: Started server process [1] INFO: Waiting for application startup. `torch_dtype` is deprecated! Use `dtype` instead! Version ---- DeepSeek Only Loading DeepSeek model into memory... Fetching 2 files: 0%| | 0/2 [00:00<?, ?it/s] Fetching 2 files: 50%|█████ | 1/2 [02:07<02:07, 127.07s/it] Fetching 2 files: 100%|██████████| 2/2 [02:07<00:00, 63.55s/it] The following generation flags are not valid and may be ignored: ['temperature']. Set `TRANSFORMERS_VERBOSITY=info` for more details. Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 7564.12it/s] ERROR: Traceback (most recent call last): File "/home/user/.local/lib/python3.9/site-packages/starlette/routing.py", line 694, in lifespan async with self.lifespan_context(app) as maybe_state: File "/home/user/.local/lib/python3.9/site-packages/fastapi/routing.py", line 228, in __aenter__ await self._router._startup() File "/home/user/.local/lib/python3.9/site-packages/fastapi/routing.py", line 4564, in _startup handler() File "/app/app.py", line 53, in load_models app.state.deepseek_model = AutoModelForCausalLM.from_pretrained( File "/home/user/.local/lib/python3.9/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained return model_class.from_pretrained( File "/home/user/.local/lib/python3.9/site-packages/transformers/modeling_utils.py", line 277, in _wrapper return func(*args, **kwargs) File "/home/user/.local/lib/python3.9/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/home/user/.local/lib/python3.9/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead. ERROR: Application startup failed. Exiting.

Container logs:

Fetching error logs...