Commit History

cross referencing other transformer-related implementations
bf7364f

bird-of-paradise commited on

Update class names to MultiHeadLatentAttention
2d7348d

bird-of-paradise commited on

Fix: Rename to Multi-Head Latent Attention
098730b

bird-of-paradise commited on

Update README.md: clarify this is an attention implementation, not a trained model
f628f42

bird-of-paradise commited on

Initial commit: DeepSeek Multi-Latent Attention implementation
550eb56

Yan Wei commited on