RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction

information retrieval

embedding model

visual information retrieval


bokesyo committed on Jun 27, 2024

Commit 80c6598 · verified · 1 Parent(s): a2f5410

Create README.md

Files changed (1)

README.md +20 -0

README.md ADDED

	@@ -0,0 +1,20 @@

+---
+language:
+- en
+---
+# MiniCPM-Visual-Embedding: An OCR-free Visual Document Embedding Model Based on MiniCPM-V-2.0
+With MiniCPM-Visual-Embedding, you can build a knowledge base directly from raw PDFs, books, and documents without any OCR technique or OCR pipeline. The model takes only images as document-side inputs and produces vectors representing document pages.
+[Github Repo](https://github.com/bokesyo/minicpm-visual-embedding)
+![Memex Architecture](images/memex.png)
+# News
+- 2024-06-27: We released our first visual embedding model on Hugging Face.
+- 2024-05-08: We released our training code (full-parameter tuning with GradCache and DeepSpeed; supports large batch sizes across multiple GPUs with ZeRO stage 1) and evaluation code.
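
The README describes the model as mapping page images to vectors that represent document pages. As a rough illustration of how such vectors could back a knowledge base, the sketch below ranks pages by cosine similarity to a query vector. Everything in it is an assumption for illustration: the toy three-dimensional vectors stand in for real embeddings, the names `cosine` and `rank_pages` are hypothetical, and the model's actual loading and encoding API is not shown in this commit.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Guard against zero vectors, which have no direction.
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_pages(query_vec, page_vecs):
    """Return (page_id, score) pairs, most similar page first."""
    scored = [(page_id, cosine(query_vec, vec)) for page_id, vec in page_vecs.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Placeholder vectors (hypothetical); in practice these would come from
# the embedding model, one vector per page image.
page_vecs = {
    "page-1": [0.9, 0.1, 0.0],
    "page-2": [0.1, 0.8, 0.3],
    "page-3": [0.0, 0.2, 0.9],
}
query_vec = [0.2, 0.9, 0.2]  # placeholder query embedding

ranking = rank_pages(query_vec, page_vecs)
for page_id, score in ranking:
    print(f"{page_id}: {score:.3f}")
```

For a large collection, the linear scan would typically be replaced by an approximate nearest-neighbor index, but the ranking logic stays the same.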