Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
205987.2
TFLOPS
3
6
Hugo Larcher
hlarcher
Follow
Mi6paulino's profile picture
3outeille's profile picture
rishiraj's profile picture
31 followers
·
14 following
hugoch
hugoch
hlarcher
hlarcher.bsky.social
AI & ML interests
None yet
Recent Activity
posted
an
update
11 days ago
We are introducing multi-backend support in Hugging Face Text Generation Inference! With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU). We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🤗 ! Check out the details: https://huggingface.co/blog/tgi-multi-backend
upvoted
an
article
11 days ago
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
updated
a dataset
13 days ago
huggingface/documentation-images
View all activity
Articles
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
11 days ago
•
60
Organizations
hlarcher
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
3 Spaces
9 months ago
Running
794
⚡️
— Zero GPU Spaces —
List of spaces using ZERO-GPU
Running
on
Zero
1.76k
👕👔👚
IDM VTON
High-fidelity Virtual Try-on
Running
on
Zero
786
🥖
Parler-TTS
High-fidelity Text-To-Speech
liked
2 Spaces
10 months ago
Running
19
🐢
Fastapi Dummy
Running
on
T4
110
🏃
APISR
liked
a model
11 months ago
google/gemma-7b
Text Generation
•
Updated
Jun 27, 2024
•
51.9k
•
•
3.11k