Simulation Environments Tests and Builds

non-profit

Recent Activity

thomwolf authored a paper 3 days ago

Towards Best Practices for Open Datasets for LLM Training

natolambert authored a paper 13 days ago

Objective Mismatch in Model-based Reinforcement Learning

natolambert authored a paper 13 days ago

Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

simulate-tests's activity

thomwolf

authored a paper 3 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 5 days ago • 41

dylanebert

posted an update 9 days ago

Post

1774

🟦 New Image-to-3D model from Stability AI

stabilityai/stable-point-aware-3d

here's how it looks, with TRELLIS for comparison

natolambert

authored 9 papers 13 days ago

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13

Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence

Paper • 2405.15802 • Published May 17, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 106

2 OLMo 2 Furious

Paper • 2501.00656 • Published 19 days ago • 15

dylanebert

posted an update about 1 month ago

Post

2170

TRELLIS is now the highest ranked open-source model in the 3D Arena Leaderboard, surpassing InstantMesh

dylanebert/3d-arena

1 reply

thomwolf

posted an update about 1 month ago

Post

4878

We are proud to announce HuggingFaceFW/fineweb-2: A sparkling update to HuggingFaceFW/fineweb with 1000s of 🗣️ languages.

We applied the same data-driven approach that led to SOTA English performance in 🍷 FineWeb to thousands of languages.

🥂 FineWeb2 has 8TB of compressed text data and outperforms other multilingual datasets in our experiments.

The dataset is released under the permissive 📜 ODC-By 1.0 license, and the 💻 code to reproduce it and our evaluations is public.

We will very soon announce a big community project, and are working on a 📝 blogpost walking you through the entire dataset creation process. Stay tuned!

In the meantime, come ask us questions in our discussion space: HuggingFaceFW/discussion

H/t @guipenedo, @hynky, @lvwerra, as well as @vsabolcec, Bettina Messmer, @negar-foroutan, and @mjaggi

2 replies

dylanebert

posted an update about 1 month ago

Post

2883

blender has AI now

dylanebert

posted an update about 1 month ago

Post

3624

🟦 New open-source Image-to-3D model from Microsoft

TRELLIS: Structured 3D Latents for Scalable and Versatile 3D Generation

it's really good! the topology isn't clean, but it's a very very good 3D reference

JeffreyXiang/TRELLIS-image-large

1 reply

thomwolf

posted an update about 1 month ago

Post

1271

Exponentially growing number of open-source AI models over the course of the past 30 months – from a few thousand to over 1 million

Interactive data viz: huggingface/open-source-ai-year-in-review-2024

thomwolf

posted an update about 2 months ago

Post

1443

Most liked and most downloaded open-source AI models from 2022 to 2024

Interactive viz: https://aiworld.eu/embed/model/model/treemap
Discussion: huggingface/open-source-ai-year-in-review-2024

dylanebert

posted an update about 2 months ago

Post

1626

Generate meshes with AI locally in Blender

📢 New open-source release

meshgen, a local Blender integration of LLaMA-Mesh, is open source and available now 🤗

get started here: https://github.com/huggingface/meshgen

natolambert

authored a paper about 2 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 58

thomwolf

posted an update about 2 months ago

Post

1688

Interesting long read from @evanmiller-anthropic on taking a better-founded statistical approach to language model evaluations:
https://www.anthropic.com/research/statistical-approach-to-model-evals

Worth a read if you're into LLM evaluations!

Cc @clefourrier

1 reply

Team members 7
