view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 β’ 207
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper β’ 2305.18290 β’ Published May 29, 2023 β’ 55
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper β’ 2502.01142 β’ Published Feb 3 β’ 24
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated Dec 19, 2024 β’ 141
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more⦠Oct 22, 2024 ⒠72
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published Jan 22 β’ 354
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper β’ 2409.19951 β’ Published Sep 30, 2024 β’ 54
view article Article Accelerating PyTorch distributed fine-tuning with Intel technologies Nov 19, 2021 β’ 1
view article Article Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1 Jan 2, 2023 β’ 3
π Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! π β’ 71 items β’ Updated 5 days ago β’ 83
Nice Gradio Chatbot UIs Collection The following Chatbot UIs or Projects have been created and are highly regarded by the community. β’ 4 items β’ Updated Dec 20, 2023 β’ 7
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 65 items β’ Updated 1 day ago β’ 559
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. β’ 16 items β’ Updated Jan 16, 2024 β’ 153