-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 71 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 54 -
Solving math word problems with process- and outcome-based feedback
Paper • 2211.14275 • Published • 8
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
liked
a Space
about 1 hour ago
argilla/synthetic-data-generator
reacted
to
as-cle-bert's
post
with 🔥
about 1 hour ago
Hi HuggingFace community!🤗
I recently released PrAIvateSearch v2.0-beta.0 (https://github.com/AstraBert/PrAIvateSearch), my privacy-first, AI-powered, user-centered and data-safe application aimed at providing a local and open-source alternative to big AI search engines such as SearchGPT or Perplexity AI.
We have several key changes:
- New chat UI built with NextJS
- DuckDuckGo API used for web search instead of Google
- https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct as a language model served on API (by FastAPI)
- Crawl4AI crawler used for web scraping
- Optimizations in the data workflow inside the application
Read more in my blog post 👉 https://huggingface.co/blog/as-cle-bert/search-the-web-with-ai
Have fun and feel free to leave feedback about how to improve the application!✨
liked
a dataset
about 2 hours ago
princeton-nlp/SWE-bench_Multimodal
Organizations
Collections
3
models
2
datasets
None public yet