ScalingIntelligence/swe-bench-verified-codebase-content Viewer β’ Updated Jan 28 β’ 57.3k β’ 958 β’ 2
Running 62 62 LLM Embeddings Explained: A Visual and Intuitive Guide π How Language Models Turn Text into Meaning, From Traditional
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Paper β’ 2406.12644 β’ Published Jun 18, 2024 β’ 5
Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization Paper β’ 2406.15708 β’ Published Jun 22, 2024 β’ 1
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper β’ 2502.14499 β’ Published Feb 20 β’ 189
Charting and Navigating Hugging Face's Model Atlas Paper β’ 2503.10633 β’ Published 23 days ago β’ 73