Valeriy Selitskiy PRO

WaveCut

AI & ML interests

Looking to switch from hobby to career

Recent Activity

liked a model 34 minutes ago
deepcogito/cogito-v1-preview-llama-70B
upvoted a collection 43 minutes ago
Cogito v1 Preview
liked a Space about 6 hours ago
a-ghorbani/ai-phone-leaderboard

Organizations

Blog-explorers, Vikhr models, MLX Community, AI Art Collaboration space

WaveCut's activity

reacted to as-cle-bert's post with 🔥 2 days ago
Llama-4 is out and I couldn't resist cooking something with it... So I came up with LlamaResearcher (https://llamaresearcher.com), your deep-research AI companion! 🔎

The workflow behind LlamaResearcher is simple:
💬 You submit a query
🛡️ Your query is evaluated by a Llama 3 guard model, which deems it safe or unsafe
🧠 If your query is safe, it is routed to the Researcher Agent
⚙️ The Researcher Agent expands the query into three sub-queries, which it uses to search the web
🌐 The web is searched for each of the sub-queries
📊 The retrieved information is evaluated for relevance against your original query
✍️ The Researcher Agent produces an essay based on the information it gathered, taking care to reference its sources

The agent itself is also built with easy-to-use and intuitive blocks:
🦙 LlamaIndex provides the agentic architecture and the integrations with the language models
⚡ Groq makes Llama-4 available with its lightning-fast inference
🔎 Linkup allows the agent to deep-search the web and provides sourced answers
💪 FastAPI does the heavy lifting, wrapping everything in an elegant API interface
⏱️ Redis is used for API rate limiting
🎨 Gradio creates a simple but powerful user interface

Special mention also to Lovable, which helped me build the first draft of the landing page for LlamaResearcher! 💖

If you're curious and want to try LlamaResearcher, you can do so, completely free and without a subscription, for 30 days from now ➡️ https://llamaresearcher.com
And if you're like me, and you like getting your hands dirty with code and building stuff on your own machine, I have good news: this is all open-source, fully reproducible locally and Docker-ready 🐋
Just go to the GitHub repo: https://github.com/AstraBert/llama-4-researcher and don't forget to star it if you find it useful! ⭐

As always, have fun and feel free to leave your feedback ✨
reacted to javelinsoam's post with 🔥 2 days ago
Requests Fail with 404 on HuggingFace Inference Due to X-Forwarded-Host Header

We're encountering a 404 Not Found error from the HuggingFace Inference endpoint whenever the request includes the X-Forwarded-Host header.

The issue appears to stem from the mere presence of this header, regardless of whether its value is a private or a public domain:

X-Forwarded-Host: google.com


Without Header – Works

When this header is removed, the request succeeds.
Identical payloads and endpoints return valid responses when the header is omitted.

With Header – Fails

If included (even with a valid public domain), the request fails with:

{
  "error": "Not Found: google.com"
}



You can use the following curl command to replicate this issue:
curl "https://{your-inference-endpoint}/v1/chat/completions" \
-X POST \
-H "Authorization: Bearer <HF_TOKEN>" \
-H "Content-Type: application/json" \
-H "X-Forwarded-Host: any-domain.com" \
-d '{
  "model": "unsloth/DeepSeek-R1-GGUF",
  "messages": [{"role": "user", "content": "What is deep learning?"}],
  "max_tokens": 150
}'


Please let us know if there's a workaround or config option available to suppress this behavior.
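
Until there is an official answer, one possible client-side workaround, assuming the header is added by a proxy or gateway layer you control, is to drop X-Forwarded-Host before the request is forwarded to the endpoint. The sketch below is not a HuggingFace API: `strip_blocked_headers` and `BLOCKED_HEADERS` are hypothetical names, and the function only filters a header mapping that you would then pass to whatever HTTP client you use.

```python
from typing import Dict, Mapping

# Headers to remove before forwarding (compared case-insensitively,
# since HTTP header names are case-insensitive).
BLOCKED_HEADERS = frozenset({"x-forwarded-host"})


def strip_blocked_headers(headers: Mapping[str, str]) -> Dict[str, str]:
    """Return a copy of `headers` without any blocked header."""
    return {
        name: value
        for name, value in headers.items()
        if name.lower() not in BLOCKED_HEADERS
    }


incoming = {
    "Authorization": "Bearer <HF_TOKEN>",
    "Content-Type": "application/json",
    "X-Forwarded-Host": "any-domain.com",
}
outgoing = strip_blocked_headers(incoming)
```

Here `outgoing` keeps the Authorization and Content-Type headers from the curl example and omits X-Forwarded-Host, which matches the request shape reported above as working.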