Yunho Cho
yhjc2692
Β·
AI & ML interests
inference optimization, world models, learning methods in robotics contexts
Recent Activity
liked
a model
18 days ago
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
reacted
to
andrewrreed's
post
with β€οΈ
19 days ago
π It's now easier than ever to switch from OpenAI to open LLMs
Hugging Face's TGI now supports an OpenAI compatible Chat Completion API
This means you can transition code that uses OpenAI client libraries (or frameworks like LangChain π¦ and LlamaIndex π¦) to run open models by changing just two lines of code π€
β Here's how:
```
from openai import OpenAI
# initialize the client but point it to TGI
client = OpenAI(
base_url="<ENDPOINT_URL>" + "/v1/", # replace with your endpoint url
api_key="<HF_API_TOKEN>", # replace with your token
)
chat_completion = client.chat.completions.create(
model="tgi",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Why is open-source software important?"},
],
stream=True,
max_tokens=500
)
# iterate and print stream
for message in chat_completion:
print(message.choices[0].delta.content, end="")
```
π Blog post β‘ https://huggingface.co/blog/tgi-messages-api
π TGI docs β‘ https://huggingface.co/docs/text-generation-inference/en/messages_api
liked
a model
19 days ago
Qwen/Qwen2-VL-7B-Instruct-AWQ
Organizations
models
None public yet
datasets
None public yet