ThucPD commited on
Commit
b9e2d55
Β·
verified Β·
1 Parent(s): 9958a66

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -31,7 +31,7 @@ widget:
31
  # EraX-VL-7B-V1.5
32
  ## Introduction πŸŽ‰
33
 
34
- We are excited to introduce **EraX-VL-7B-V1.5**, a robust multimodal model for **OCR (optical character recognition)** and **VQA (visual question-answering)** that excels in various languages 🌍, with a particular focus on Vietnamese πŸ‡»πŸ‡³. The `EraX-VL-2B` model stands out for its precise recognition capabilities across a range of documents πŸ“, including medical forms 🩺, invoices 🧾, bills of sale πŸ’³, quotes πŸ“„, and medical records πŸ’Š. This functionality is expected to be highly beneficial for hospitals πŸ₯, clinics πŸ’‰, insurance companies πŸ›‘οΈ, and other similar applications πŸ“‹. Built on the solid foundation of the [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct)[1], which we found to be of high quality and fluent in Vietnamese, `EraX-VL-2B` has been fine-tuned to enhance its performance. We plan to continue improving and releasing new versions for free, along with sharing performance benchmarks in the near future.
35
 
36
  One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-turn Q&A with reasonable reasoning capability!
37
 
@@ -39,10 +39,10 @@ One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-tu
39
 
40
  **EraX-VL-7B-V1.5** is a young member of our **EraX's LΓ nhGPT** collection of LLM models.
41
 
42
- - **Model type:** Multimodal Transformer with over 2B parameters
43
  - **Languages (NLP):** Primarily Vietnamese with multilingual capabilities
44
  - **License:** Apache 2.0
45
- - **Fine-tuned from:** [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct)
46
 
47
  ## Benchmarks πŸ“Š
48
 
 
31
  # EraX-VL-7B-V1.5
32
  ## Introduction πŸŽ‰
33
 
34
+ We are excited to introduce **EraX-VL-7B-V1.5**, a robust multimodal model for **OCR (optical character recognition)** and **VQA (visual question-answering)** that excels in various languages 🌍, with a particular focus on Vietnamese πŸ‡»πŸ‡³. The `EraX-VL-7B-V1.5` model stands out for its precise recognition capabilities across a range of documents πŸ“, including medical forms 🩺, invoices 🧾, bills of sale πŸ’³, quotes πŸ“„, and medical records πŸ’Š. This functionality is expected to be highly beneficial for hospitals πŸ₯, clinics πŸ’‰, insurance companies πŸ›‘οΈ, and other similar applications πŸ“‹. Built on the solid foundation of the [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)[1], which we found to be of high quality and fluent in Vietnamese, `EraX-VL-7B-V1.5` has been fine-tuned to enhance its performance. We plan to continue improving and releasing new versions for free, along with sharing performance benchmarks in the near future.
35
 
36
  One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-turn Q&A with reasonable reasoning capability!
37
 
 
39
 
40
  **EraX-VL-7B-V1.5** is a young member of our **EraX's LΓ nhGPT** collection of LLM models.
41
 
42
+ - **Model type:** Multimodal Transformer with over 7B parameters
43
  - **Languages (NLP):** Primarily Vietnamese with multilingual capabilities
44
  - **License:** Apache 2.0
45
+ - **Fine-tuned from:** [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)
46
 
47
  ## Benchmarks πŸ“Š
48