๐ Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces
โข
3
I have recently created another tutorial where I show how to deploy any LLM (not VLMs at the moment) with TGI and HF Spaces!
You could check that out -- I assure you that with TGI, it will be prod ready.
Hi @lseanlon ! Thank you for your interest in this work. The premise of this blog post was to show users how to build a POC quickly (as you have mentioned), and not show production usage.
Note: It might be interesting for you to showcase some tricks to speedup the inference time. I will be very willing to see what you come up with and also add it to this article giving you due credits of course. ๐ค