RalFinger (RalFinger)

upvoted a collection 16 days ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 6 days ago • 245

New activity in mit-han-lab/svdquant-models about 2 months ago

How Do I Use It in ComfyUI?

7

#1 opened 3 months ago by

youknownothing

liked 2 models 2 months ago

Kijai/SUPIR_pruned

Updated Apr 6, 2024 • 93

ali-vilab/In-Context-LoRA

Text-to-Image • Updated Dec 17, 2024 • 106k • • 522

liked 3 models 3 months ago

upvoted an article 3 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Oct 22, 2024

• 50

liked a model 3 months ago

jimmycarter/LibreFLUX

Text-to-Image • Updated Oct 24, 2024 • 495 • 158

reacted to onekq's post with 🔥 3 months ago

Post

1857

I'm now working on finetuning of coding models. If you are GPU-hungry like me, you will find quantized models very helpful. But quantization for finetuning and inference are different and incompatible. So I made two collections here.

Inference (GGUF, via Ollama, CPU is enough)
onekq-ai/ollama-ready-coding-models-67118c3cfa1af2cf04a926d6

Finetuning (Bitsandbytes, QLora, GPU is needed)
onekq-ai/qlora-ready-coding-models-67118771ce001b8f4cf946b2

For quantization, the inference models are far more popular on HF than finetuning models. I use https://huggingface.co/QuantFactory to generate inference models (GGUF), and there are a few other choices.

But there hasn't been such a service for finetuning models. DIY isn't too hard though. I made a few myself and you can find the script in the model cards. If the original model is small enough, you can even do it on a free T4 (available via Google Colab).

If you know a (small) coding model worthy of quantization, please let me know and I'd love to add it to the collections.

liked a model 3 months ago

ostris/OpenFLUX.1

Text-to-Image • Updated Oct 3, 2024 • 9.43k • 616

reacted to clem's post with 👍 4 months ago

Post

3714

Very few people realize that most of the successful AI startups got successful because they were focused on open science and open-source for at least their first few years. To name but a few, OpenAI (GPT, GPT2 was open-source), Runway & Stability (stable diffusion), Cohere, Mistral and of course Hugging Face!

The reasons are not just altruistic, it's also because sharing your science and your models pushes you to build AI faster (which is key in a fast-moving domain like AI), attracts the best scientists & engineers and generates much more visibility, usage and community contributions than if you were 100% closed-source. The same applies to big tech companies as we're seeing with Meta and Google!

More startups and companies should release research & open-source AI, it's not just good for the world but also increases their probability of success!

4 replies

·

liked a model 4 months ago

VAGOsolutions/SauerkrautLM-Phi-3-medium

Text Generation • Updated Jul 12, 2024 • 5.13k • 9

reacted to davidberenstein1957's post with 🔥 4 months ago

Post

1651

🦀 Is your SQL a bit rusty? I just created theText To SQL Hub dataset explorer. To write SQL queries based on natural text input. Uses DuckDB, Llama 3.1 70B and the Hugging Face dataset-server API.

davidberenstein1957/text-to-sql-hub-datasets

reacted to cbensimon's post with ❤️ 4 months ago

Post

4403

Hello everybody,

We've rolled out a major update to ZeroGPU! All the Spaces are now running on it.

Major improvements:

1. GPU cold starts about twice as fast!
2. RAM usage reduced by two-thirds, allowing more effective resource usage, meaning more GPUs for the community!
3. ZeroGPU initializations (coldstarts) can now be tracked and displayed (use progress=gr.Progress(track_tqdm=True))
4. Improved compatibility and PyTorch integration, increasing ZeroGPU compatible spaces without requiring any modifications!

Feel free to answer in the post if you have any questions

🤗 Best regards,
Charles

liked 3 models 5 months ago

ZhenyaYang/flux_1_dev_hyper_8steps_nf4

Updated Sep 8, 2024 • 28

jphme/em_german_leo_mistral

Text Generation • Updated Oct 27, 2023 • 1.26k • 70

openai/whisper-small

Automatic Speech Recognition • Updated Feb 29, 2024 • 9.21M • • 335

reacted to macadeliccc's post with 👍 5 months ago

Post

1755

Automated web scraping with playwright is becoming easier by the day. Now, using ollama tool calling, its possible to perform very high accuracy web scraping (in some cases 100% accurate) through just asking an LLM to scrape the content for you.

This can be completed in a multistep process similar to cohere's platform. If you have tried the cohere playground with web scraping, this will feel very similar. In my experience, the Llama 3.1 version is much better due to the larger context window. Both tools are great, but the difference is the ollama + playwright version is completely controlled by you.

All you need to do is wrap your scraper in a function:

async def query_web_scraper(url: str) -> dict:
    scraper = WebScraper(headless=False)
    return await scraper.query_page_content(url)

and then make your request:

# First API call: Send the query and function description to the model
response = ollama.chat(
    model=model,
    messages=messages,
    tools=[
        {
            'type': 'function',
            'function': {
                'name': 'query_web_scraper',
                'description': 'Scrapes the content of a web page and returns the structured JSON object with titles, articles, and associated links.',
                'parameters': {
                    'type': 'object',
                    'properties': {
                        'url': {
                            'type': 'string',
                            'description': 'The URL of the web page to scrape.',
                        },
                    },
                    'required': ['url'],
                },
            },
        },
    ]
)