interesting - a smpanaro Collection

smpanaro 's Collections

Apple Neural Engine LLMs

quant

prune

interesting

updated Aug 2, 2024

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Paper • 2401.01325 • Published Jan 2, 2024 • 26
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 51
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19, 2024 • 60
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 22
Flash normalization: fast RMSNorm for LLMs

Paper • 2407.09577 • Published Jul 12, 2024 • 1
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

Paper • 2407.20584 • Published Jul 30, 2024