24 15 191

Stepanov

Ihor

https://knowledgator.com/

AI & ML interests

Text classification, computational biology, relations extraction, path reasoning

Recent Activity

updated a model about 14 hours ago

Ihor/OpenBioLLM-Text2Graph-8B

updated a model 1 day ago

knowledgator/gliner-multitask-large-v0.5

liked a Space 1 day ago

wjbmattingly/caracal

View all activity

Organizations

Posts 7

Post

972

🚀 Welcome the New and Improved GLiNER-Multitask! 🚀

Since the release of our beta version, GLiNER-Multitask has received many positive responses. It's been embraced in many consulting, research, and production environments. Thank you everyone for your feedback, it helped us rethink the strengths and weaknesses of the first model and we are excited to present the next iteration of this multi-task information extraction model.

💡 What’s New?
Here are the key improvements in this latest version:
🔹 Expanded Task Support: Now includes text classification and other new capabilities.
🔹 Enhanced Relation Extraction: Significantly improved accuracy and robustness.
🔹 Improved Prompt Understanding: Optimized for open-information extraction tasks.
🔹 Better Named Entity Recognition (NER): More accurate and reliable results.

🔧 How We Made It Better:
These advancements were made possible by:
🔹 Leveraging a better and more diverse dataset.
🔹 Using a larger backbone model for increased capacity.
🔹 Implementing advanced model merging techniques.
🔹 Employing self-learning strategies for continuous improvement.
🔹 Better training strategies and hyperparameters tuning.

📄 Read the Paper: https://arxiv.org/abs/2406.12925
⚙️ Try the Model: knowledgator/gliner-multitask-v1.0
💻 Test the Demo: knowledgator/GLiNER_HandyLab
📌 Explore the Repo: https://github.com/urchade/GLiNER

Post

366

🚀 Let’s transform LLMs into encoders 🚀

Auto-regressive LMs have ruled, but encoder-based architectures like GLiNER are proving to be just as powerful for information extraction while offering better efficiency and interpretability. 🔍✨

Past encoder backbones were limited by small pre-training datasets and old techniques, but with innovations like LLM2Vec, we've transformed decoders into high-performing encoders! 🔄💡

What’s New?
🔹Converted Llama & Qwen decoders to advanced encoders
🔹Improved GLiNER architecture to be able to work with rotary positional encoding
🔹New GLiNER (zero-shot NER) & GLiClass (zero-shot classification) models

🔥 Check it out:

New models: knowledgator/llm2encoder-66d1c76e3c8270397efc5b5e

GLiNER package: https://github.com/urchade/GLiNER

GLiClass package: https://github.com/Knowledgator/GLiClass

💻 Read our blog for more insights, and stay tuned for what’s next!
https://medium.com/@knowledgrator/llm2encoders-e7d90b9f5966

View all posts

Papers 1

arxiv:2406.12925

models 6

datasets

None public yet