Latest AI-Research realistic speaking avatars!
Muhammad Umair
umair894
AI & ML interests
AI Consultant | Engineer | Global Trainer |PhD Scholar |6+ Years of Experience:
Specializing in building AI-powered SaaS solutions and delivering innovative AI, generative AI, and multimodal automation use cases. I've successfully led and contributed to dozens of projects, creating impactful proofs of concept (PoCs) that drive business value. With expertise in large language models (LLMs), computer vision, NLP, automation, and AI system design, I help companies unlock the power of AI to transform ideas into actionable solutions. As a global AI trainer, I empower organizations and individuals with the tools and knowledge to harness AIβs potential. Letβs build the future of AI together!
Recent Activity
published
a Space
5 days ago
umair894/autotrain
liked
a model
6 days ago
BoyuanJiang/FitDiT
reacted
to
alibabasglab's
post
with π
7 days ago
We are thrilled to present the improved "ClearerVoice-Studio", an open-source platform designed to make speech processing easy use for everyone! Whether youβre working on speech enhancement, speech separation, speech super-resolution, or target speaker extraction, this unified platform has you covered.
** Why Choose ClearerVoice-Studio?**
- Pre-Trained Models: Includes cutting-edge pre-trained models, fine-tuned on extensive, high-quality datasets. No need to start from scratch!
- Ease of Use: Designed for seamless integration with your projects, offering a simple yet flexible interface for inference and training.
**Where to Find Us?**
- GitHub Repository: ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio)
- Try Our Demo: Hugging Face Space (https://huggingface.co/spaces/alibabasglab/ClearVoice)
**What Can You Do with ClearerVoice-Studio?**
- Enhance noisy speech recordings to achieve crystal-clear quality.
- Separate speech from complex audio mixtures with ease.
- Transform low-resolution audio into high-resolution audio. A full upscaled LJSpeech-1.1-48kHz dataset can be downloaded from https://huggingface.co/datasets/alibabasglab/LJSpeech-1.1-48kHz .
- Extract target speaker voices with precision using audio-visual models.
**Join Us in Growing ClearerVoice-Studio!**
We believe in the power of open-source collaboration. By starring our GitHub repository and sharing ClearerVoice-Studio with your network, you can help us grow this community-driven platform.
**Support us by:**
- Starring it on GitHub.
- Exploring and contributing to our codebase .
- Sharing your feedback and use cases to make the platform even better.
- Joining our community discussions to exchange ideas and innovations.
- Together, letβs push the boundaries of speech processing! Thank you for your support! :sparkling_heart:
Organizations
Collections
1
spaces
52
models
5
datasets
9
umair894/rvl_cdip_40_examples_per_class_donut
Viewer
β’
Updated
β’
80
β’
9
umair894/rvl_cdip_30_examples_per_class_donut
Viewer
β’
Updated
β’
60
β’
4
umair894/rvl_cdip_100_examples_per_class_donut_v2
Viewer
β’
Updated
β’
200
β’
3
umair894/rvl_cdip_100_examples_per_class
Viewer
β’
Updated
β’
200
β’
4
umair894/rvl_cdip_300_examples_per_class_donut_v4
Viewer
β’
Updated
β’
600
β’
3
umair894/rvl_cdip_300_examples_per_class_v3
Viewer
β’
Updated
β’
600
β’
9
umair894/rvl_cdip_300_examples_per_class_donut_v2
Viewer
β’
Updated
β’
330
β’
3
umair894/rvl_cdip_300_examples_per_class_donut
Viewer
β’
Updated
β’
330
β’
6
umair894/rvl_cdip_300_examples_per_class
Viewer
β’
Updated
β’
330
β’
21