Could anyone please explain how to run inference with this model? Also, can it be fine-tuned on a small GPU (14 GB) using PEFT? Any guidance would be very helpful.