Manish Kumar Pandey

Manish-GenAI

AI & ML interests

#GraphML, #GeometricDL, #3DComputerVision, #DiffusionModels, #GANs, #Generative AI #ComputerVision,#ML ,#RL, #LLM, #MultiModal Fusion #GenerativeFlow Networks

Recent Activity

reacted to Kseniase's post with ❤️ about 7 hours ago

8 Free Sources on Reinforcement Learning With the phenomenon of DeepSeek-R1's top reasoning capabilities, we all saw the true power of RL. At its core, RL is a type of machine learning where a model/agent learns to make decisions by interacting with an environment to maximize a reward. RL learns through trial and error, receiving feedback in the form of rewards or penalties. Here's a list of free sources that will help you dive into RL and how to use it: 1. "Reinforcement Learning: An Introduction" book by Richard S. Sutton and Andrew G. Barto -> https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf 2. Hugging Face Deep Reinforcement Learning Course -> https://huggingface.co/learn/deep-rl-course/unit0/introduction You'll learn how to train agents in unique environments, using best libraries, share your results, compete in challenges, and earn a certificate. 3. OpenAI Spinning Up in Deep RL -> https://spinningup.openai.com/en/latest/index.html A comprehensive overview of RL with many useful resources 4. "Reinforcement Learning and Optimal Control" books, video lectures and course material by Dimitri P. Bertsekas from ASU -> https://web.mit.edu/dimitrib/www/RLbook.html Explores approximate Dynamic Programming (DP) and RL with key concepts and methods like rollout, tree search, and neural network training for RL and more. 5. RL Course by David Silver (Google DeepMind) -> https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PLqYmG7hTraZDM-OYHWgPeb Many recommend these video lectures as a good foundation 6. RL theory seminars -> https://sites.google.com/view/rltheoryseminars/home?authuser=0 Provides virtual seminars from different experts about RL advancements 7. "Reinforcement Learning Specialization" (a 4-course series on Coursera) -> https://www.coursera.org/learn/fundament 8. Concepts: RLHF, RLAIF, RLEF, RLCF -> https://www.turingpost.com/p/rl-f Our flashcards easily explain what are these four RL approaches with different feedback

liked a model 3 days ago

mistralai/Mistral-Small-24B-Instruct-2501

upvoted an article 3 days ago

How to deploy and fine-tune DeepSeek models on AWS

View all activity

Organizations

None yet

Manish-GenAI's activity

upvoted an article 3 days ago

Article

How to deploy and fine-tune DeepSeek models on AWS

4 days ago

• 24

upvoted an article 14 days ago

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

18 days ago

• 62

upvoted an article 16 days ago

Article

Topic 23: What is LLM Inference, it's challenges and solutions for it

•

16 days ago

• 5

upvoted a collection 17 days ago

InternLM3

Collection

6 items • Updated 17 days ago • 21

upvoted a collection 23 days ago

Centurio

Collection

Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" • 5 items • Updated 23 days ago • 4

upvoted an article 24 days ago

Article

Synthetic Data Generation with FastData and Hugging Face

•

26 days ago

• 14

upvoted a collection 24 days ago

Sa2VA model zoo

Collection

4 items • Updated 19 days ago • 28

upvoted a collection 26 days ago

Deepseek V3 (All Versions)

Collection

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 1 day ago • 29

upvoted 3 collections about 1 month ago

upvoted a paper about 1 month ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 54

upvoted 2 articles about 1 month ago

Article

🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows

•

Dec 28, 2024

• 10

Article

🌁#81: Key AI Concepts to Follow in 2025

•

Dec 23, 2024

• 24

upvoted a paper about 1 month ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 111

upvoted 4 collections about 1 month ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 35

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 31

🏆 Leaderboards & Arenas

Collection

20 items • Updated 10 days ago • 7

Open-source AI Releases - August '24

Collection

8 items • Updated Aug 22, 2024 • 5