CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper • 2406.05967 • Published Jun 10, 2024 • 6
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding Paper • 2406.09297 • Published Jun 13, 2024 • 6 • 2
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding Paper • 2406.09297 • Published Jun 13, 2024 • 6
zaydzuhri/the_pile_tokenized_5percent_truncated_packed_v2 Viewer • Updated Mar 5, 2024 • 2.46M • 138
zaydzuhri/the_pile_tokenized_5percent_truncated_packed Viewer • Updated Feb 9, 2024 • 2.11M • 109