Article 🦸🏻#1: Open-endedness and AI Agents - A Path from Generative to Creative AI? By Kseniase • Dec 25, 2024 • 9
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published 12 days ago • 9
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 13 days ago • 6
Pre-Training Data Packing Collection [ACL'24] Analysing the Impact of Sequence Composition on Language Model Pre-Training. https://github.com/yuzhaouoe/pretraining-data-packing • 10 items • Updated 13 days ago
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering • 5 items • Updated 13 days ago
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published Oct 21, 2024 • 20 • 3
Analysing the Residual Stream of Language Models Under Knowledge Conflicts Paper • 2410.16090 • Published Oct 21, 2024 • 7 • 2
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations Paper • 2410.18860 • Published Oct 24, 2024 • 11
HIT-TMG/KaLM-embedding-multilingual-mini-v1 Sentence Similarity • Updated Jan 3 • 8.12k • 20
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1 Sentence Similarity • Updated 3 days ago • 32.9k • 32