DomainInsAdap/Meta-Llama-3.1-8B-Instruct-astronomy-tree-v2-all-None-5-2e-05-epoch-4 Updated 2 minutes ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-Longform-v1-5-2e-05-sota-template-epoch-2 Updated about 2 hours ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-Longform-v1-5-2e-05-sota-template-epoch-1 Updated about 6 hours ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-Longform-v1-5-2e-05-sota-template-epoch-0 Updated about 10 hours ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-astronomy-tree-v2-all-None-5-2e-05-epoch-3 Updated about 16 hours ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-astronomy-tree-v2-all-None-5-2e-05-epoch-2 Updated about 24 hours ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-astronomy-tree-v2-all-None-5-2e-05-epoch-1 Updated 1 day ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-virology-tree-v2-all-None-5-2e-05-epoch-4 Updated 3 days ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-virology-tree-v2-all-None-5-2e-05-epoch-3 Updated 4 days ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-virology-tree-v2-all-None-5-2e-05-epoch-2 Updated 4 days ago
DomainInsAdap/Meta-Llama-3.1-8B-Instruct-virology-tree-v2-all-None-5-2e-05-epoch-1 Updated 4 days ago
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published Mar 6 • 4
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers Paper • 2205.03286 • Published May 6, 2022
BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning Paper • 2211.05610 • Published Nov 10, 2022
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition Paper • 2306.02873 • Published Jun 5, 2023 • 1
Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages Paper • 2203.14139 • Published Mar 26, 2022 • 1