erax-ai
/

EraX-VL-7B-V1.5

@@ -31,9 +31,9 @@ widget:
 # EraX-VL-7B-V1.5
 ## Introduction 🎉
-We are excited to introduce **EraX-VL-7B-V1.5**, a robust multimodal model for **OCR (optical character recognition)** and **VQA (visual question-answering)** that excels in various languages 🌍, with a particular focus on Vietnamese 🇻🇳. The `EraX-VL-7B-V1.5` model stands out for its precise recognition capabilities across a range of documents 📝, including medical forms 🩺, invoices 🧾, bills of sale 💳, quotes 📄, and medical records 💊. This functionality is expected to be highly beneficial for hospitals 🏥, clinics 💉, insurance companies 🛡️, and other similar applications 📋. Built on the solid foundation of the [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)[1], which we found to be of high quality and fluent in Vietnamese, `EraX-VL-7B-V1.5` has been fine-tuned to enhance its performance. We plan to continue improving and releasing new versions for free, along with sharing performance benchmarks in the near future.
-One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-turn Q&A with good reasoning capability!
 ***NOTA BENE***: EraX-VL-7B-V1.5 is NOT a typical OCR-only tool likes Tesseract but is a Multimodal LLM-based model. To use it effectively, you may have to **twist your prompt carefully** depending on your tasks.
@@ -48,6 +48,8 @@ One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-tu
 ## 🏆 LeaderBoard
 <table style="width:75%;">
     <tr>
         <th align="middle" width="300">Models</th>

 # EraX-VL-7B-V1.5
 ## Introduction 🎉
+We are excited to introduce **EraX-VL-7B-V1.5**, another robust multimodal model for **OCR (optical character recognition)** and **VQA (visual question-answering)** that excels in various languages 🌍, with a particular focus on Vietnamese 🇻🇳.  This model stands out for its precise recognition capabilities across a range of documents 📝, including medical forms 🩺, invoices 🧾, bills of sale 💳, quotes 📄, and medical records 💊. This functionality is expected to be highly beneficial for hospitals 🏥, clinics 💉, insurance companies 🛡️, and other similar applications 📋. Built on the solid foundation of the [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)[1], which we found to be of high quality and fluent in Vietnamese, `EraX-VL-7B-V1.5` has been fine-tuned to enhance its performance. We plan to continue improving and releasing new versions for free, along with sharing performance benchmarks in the near future.
+One standing-out feature of **EraX-VL-7B-V1.5** is the capability to do multi-turn Q&A with impressive reasoning capability!
 ***NOTA BENE***: EraX-VL-7B-V1.5 is NOT a typical OCR-only tool likes Tesseract but is a Multimodal LLM-based model. To use it effectively, you may have to **twist your prompt carefully** depending on your tasks.
 ## 🏆 LeaderBoard
+The `EraX-VL-7B-V1.5` achieved exceptional performance compares to other equal or even 10x larger in model size. You can re-run the benchmark at anytime.
 <table style="width:75%;">
     <tr>
         <th align="middle" width="300">Models</th>