Hugo Larcher's picture
5 6

Hugo Larcher

hlarcher

AI & ML interests

None yet

Recent Activity

updated a model 11 days ago
hlarcher/opt-125m-lfs
published a model 11 days ago
hlarcher/opt-125m-lfs
updated a model 11 days ago
hlarcher/opt-125m
View all activity

Organizations

Hugging Face's profile picture HuggingFaceBR4's profile picture Hugging Face Smol Cluster's profile picture Optimum Nvidia's profile picture smol-explorers's profile picture

hlarcher's activity

upvoted an article 18 days ago
view article
Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

49
upvoted an article about 1 month ago
view article
Article

Welcome to Inference Providers on the Hub 🔥

404
posted an update about 2 months ago
view post
Post
1077
We are introducing multi-backend support in Hugging Face Text Generation Inference!
With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🤗 !

Check out the details: https://huggingface.co/blog/tgi-multi-backend
upvoted an article about 2 months ago
view article
Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

70
published an article about 2 months ago
view article
Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

70
updated a Space 6 months ago