AQLM+PV (Collection) • Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression" (https://arxiv.org/abs/2405.14852) • 26 items • Updated Feb 28
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs • Paper 2502.14837 • Published Feb 20
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication • Paper 2402.18439 • Published Feb 28, 2024
OneLLM: One Framework to Align All Modalities with Language • Paper 2312.03700 • Published Dec 6, 2023
Optimum-NVIDIA: Unlock blazingly fast LLM inference in just 1 line of code • Article • Dec 5, 2023
NVLM 1.0 (Collection) • A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language and text-only tasks • 2 items • Updated about 20 hours ago