Article Train 400x faster Static Embedding Models with Sentence Transformers 20 days ago • 132
Can Large Language Models Infer Causation from Correlation? Paper • 2306.05836 • Published Jun 9, 2023 • 6
TextGrad: Automatic "Differentiation" via Text Paper • 2406.07496 • Published Jun 11, 2024 • 29
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22, 2024 • 80
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 19
Model Merging Collection Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 225
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time Paper • 2203.05482 • Published Mar 10, 2022 • 6
Papers about model merging Collection referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13, 2024 • 14
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 259