AaRon

AARon99

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago
hexgrad/Kokoro-82M
liked a Space about 1 month ago
huggingface/open-source-ai-year-in-review-2024
liked a model about 1 month ago
PrimeIntellect/INTELLECT-1-Instruct
View all activity

Organizations

None yet

AARon99's activity

liked a Space 3 months ago
New activity in mattshumer/Reflection-Llama-3.1-70B 4 months ago

Changes made in tensors

2
#22 opened 4 months ago by
leafspark
reacted to Xenova's post with πŸ”₯ 6 months ago
view post
Post
7977
Introducing Whisper Diarization: Multilingual speech recognition with word-level timestamps and speaker segmentation, running 100% locally in your browser thanks to πŸ€— Transformers.js!

Tested on this iconic Letterman interview w/ Grace Hopper from 1983!
- Demo: Xenova/whisper-speaker-diarization
- Source code: Xenova/whisper-speaker-diarization
  • 1 reply
Β·
upvoted an article 6 months ago
view article
Article

After 500+ LoRAs made, here is the secret

By FPHam β€’
β€’ 8
reacted to Csplk's post with 🧠 6 months ago
view post
Post
1400
# Offensive Physical Security Reconnaissance Planning Automation with public facing RTSP streams and Moondream


After some late night casual hacking about on VLMs for criminal attack vector reconnaissance automaton experiments using Moondream (as usual) based image-text-text with pre defined text prompts that are tuned for extracting weakness or customer identity and monitory based theft physical red team engagement reconnaissance and vector of malicious or criminal activity Working on a space. Thanks again for such a wonderful blessing of super power image-text-to-text model with minimal computational power needed @vikhyatk

I have started actually implementing a custom little tool with both static html space sand python gradio spaces on the go which I shall share as hf spaces when done them.

---

vikhyatk/moondream2

vikhyatk/moondream2
  • 1 reply
Β·