Prashanth's picture

Prashanth

prashiyn

·

AI & ML interests

None yet

Organizations

None yet

prashiyn's activity

upvoted a collection 2 days ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 9 days ago • 216

upvoted a collection 3 months ago

H2O Danube3

6 items • Updated Jul 16 • 52

upvoted a paper 3 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3 • 43

upvoted an article 3 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 169

upvoted a collection 5 months ago

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6 • 221

upvoted a paper 5 months ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 118

upvoted an article 5 months ago

Article

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

By

•

Apr 21

• 41

upvoted a paper 5 months ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19 • 41

upvoted a collection 6 months ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 88

upvoted a paper 6 months ago

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 21

upvoted 4 papers 7 months ago

Chronos: Learning the Language of Time Series

Paper • 2403.07815 • Published Mar 12 • 45

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25 • 46

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 592

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 108

upvoted a paper 8 months ago

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20 • 46

upvoted 2 papers 9 months ago

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Paper • 2401.05252 • Published Jan 10 • 45

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4 • 61

upvoted a paper 11 months ago

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 27

upvoted 2 papers about 1 year ago

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 75

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Paper • 2307.16789 • Published Jul 31, 2023 • 98