prithivida
committed on
Added context for lang tokens
README.md CHANGED
@@ -81,9 +81,9 @@ SPLADE BOW rep:
## How does it translate into Empirical metrics?

Our models are token sparse and yet effective. It translates to faster retrieval (User experience) and smaller index size ($). Mean retrieval time on the standard MS-MARCO small dev set and Scaled total FLOPS loss are the respective metrics are below.
-This is why Google's SparseEmbed is interesting as they also achieve SPLADE quality retrieval effectiveness with much lower FLOPs. Compared ColBERT, SPLADE and SparseEmbed match query and
+This is why Google's SparseEmbed is interesting as they also achieve SPLADE quality retrieval effectiveness with much lower FLOPs. Compared to ColBERT, SPLADE and SparseEmbed match query and
document terms with a linear complexity as ColBERT’s late interaction i.e. all query-document term pairs takes a quadratic complexity. The Challenge with SparseEmbed is it uses a hyperparameter called **Top-k to restrict number of tokens used to learn contextual dense representations.** Say 64 and 256 tokens for query and passage encoding.
-But it is unclear how well these hyperparameters are transferable to other domains or languages.
+But it is unclear how well these hyperparameters are transferable to other domains or languages (where the notion of tokens changes a lot like our mother tongue Tamil which is Agglutinative in nature).

<img src="./Metrics.png" width=800/>
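The linear-versus-quadratic matching contrast described in the changed lines is easier to see in code. The sketch below is illustrative only and is not taken from this repository or from the SPLADE, SparseEmbed, or ColBERT codebases; the function names, term weights, and tensor shapes are assumptions chosen to show how sparse lexical scoring touches only overlapping terms, while ColBERT-style late interaction scores every query-document token pair.

```python
import numpy as np

# Illustrative sketch (not the actual SPLADE/SparseEmbed/ColBERT code):
# contrasts sparse lexical scoring, which only touches overlapping vocabulary
# terms, with ColBERT-style late interaction, which compares every query
# token against every document token.

def sparse_score(query_weights: dict, doc_weights: dict) -> float:
    """SPLADE/SparseEmbed-style score: dot product over shared terms.
    Cost grows linearly with the number of overlapping terms."""
    smaller, larger = sorted((query_weights, doc_weights), key=len)
    return sum(w * larger[t] for t, w in smaller.items() if t in larger)

def late_interaction_score(query_vecs: np.ndarray, doc_vecs: np.ndarray) -> float:
    """ColBERT-style MaxSim: every query token is compared with every
    document token (|Q| x |D| similarities), then max-pooled per query token."""
    sims = query_vecs @ doc_vecs.T          # shape (|Q|, |D|): all term pairs
    return float(sims.max(axis=1).sum())    # MaxSim pooling over documents tokens

# Toy usage with made-up term weights and random token embeddings.
q_sparse = {"splade": 1.2, "retrieval": 0.8}
d_sparse = {"splade": 0.9, "index": 0.4, "retrieval": 1.1}
print(sparse_score(q_sparse, d_sparse))

rng = np.random.default_rng(0)
q_dense = rng.normal(size=(8, 128))    # e.g. 8 query tokens
d_dense = rng.normal(size=(120, 128))  # e.g. 120 document tokens
print(late_interaction_score(q_dense, d_dense))
```

The first score needs at most min(|Q|, |D|) term lookups per document, whereas the second builds a |Q| x |D| similarity matrix, which is the quadratic cost of all query-document term pairs referred to above.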