Update README.md
README.md CHANGED
@@ -15,7 +15,7 @@ Retrieval](https://arxiv.org/pdf/2104.08860.pdf) by Luo et al., and implemented i
The training process involved 150,000 videos obtained from the [WebVid Dataset](https://m-bain.github.io/webvid-dataset/), a comprehensive collection of short videos with corresponding textual descriptions sourced from the web.
-To adapt the clip model obtained during training
+To adapt the CLIP model obtained during training to the implementation of [clip-vit-base-patch32](https://huggingface.co/openai/clip-vit-base-patch32), we have made modifications to the weights.
### Use with Transformers
### Extracting Text Embeddings:
@@ -45,11 +45,10 @@ For video embedding there is an extra [notebook](https://huggingface.co/Diangle/
## Model Intended Use
-This model is intended to use for video
+This model is intended to be used for video retrieval; see, for example, this [**SPACE**](https://huggingface.co/spaces/Diangle/Clip4Clip-webvid).
-## Performance and Limitations
+## Performance
We have evaluated the performance of different models on the last 10k video clips of the WebVid database.
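The retrieval being evaluated reduces to ranking video embeddings by cosine similarity against a query text embedding and scoring the resulting ranks (e.g. Recall@K). A minimal, self-contained sketch with random stand-in vectors (no real model involved; index 42 is planted as the correct match):

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in embeddings: 100 unit-normalized "video" vectors and one "query"
# text vector, so a plain dot product equals cosine similarity.
videos = rng.normal(size=(100, 512))
videos /= np.linalg.norm(videos, axis=1, keepdims=True)
query = videos[42] + 0.01 * rng.normal(size=512)  # make clip 42 the true match
query /= np.linalg.norm(query)

sims = videos @ query                 # cosine similarity of each clip to the query
ranking = np.argsort(-sims)           # clip indices, best match first

recall_at_5 = int(42 in ranking[:5])  # Recall@5 for this single query
```

An evaluation like the one described would repeat this over every caption of the held-out clips and average the Recall@K values.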