Krishna Kaasyap

KrishnaKaasyap

AI & ML interests

Test Time Training Multimodal & Inter-Modality Transfer Learning Mechanistic Interpretability Evolutionary Model Merging Swarm Intelligence of multiple models with different architectures and different algorithms MuZero approach to general tasks

Recent Activity

liked a model about 7 hours ago
Qwen/Qwen2.5-Omni-7B
liked a model 11 days ago
CohereForAI/c4ai-command-a-03-2025
liked a model 11 days ago
Qwen/QwQ-32B
View all activity

Organizations

Blog-explorers's profile picture

KrishnaKaasyap's activity

upvoted an article 8 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

• 231