Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
65.6
TFLOPS
2
23
57
Piyush Maharana
catastropiyush
Follow
qubvel-hf's profile picture
eugrug-60's profile picture
ma7583's profile picture
12 followers
Β·
38 following
https://catastropiyush.github.io/
catastropiyush
AI & ML interests
LLMs for scientific data extraction, Solid State Hydrogen Storage,Machine Learning
Recent Activity
liked
a model
1 day ago
thellert/physbert_cased
reacted
to
burtenshaw
's
post
with π₯
4 days ago
Weβre launching a FREE and CERTIFIED course on Agents! We're thrilled to announce the launch of the Hugging Face Agents course on Learn! This interactive, certified course will guide you through building and deploying your own AI agents. Here's what you'll learn: - Understanding Agents: We'll break down the fundamentals of AI agents, showing you how they use LLMs to perceive their environment (observations), reason about it (thoughts), and take actions. Think of a smart assistant that can book appointments, answer emails, or even write code based on your instructions. - Building with Frameworks: You'll dive into popular agent frameworks like LangChain, LlamaIndex and smolagents. These tools provide the building blocks for creating complex agent behaviors. - Real-World Applications: See how agents are used in practice, from automating SQL queries to generating code and summarizing complex documents. - Certification: Earn a certification by completing the course modules, implementing a use case, and passing a benchmark assessment. This proves your skills in building and deploying AI agents. Audience This course is designed for anyone interested in the future of AI. Whether you're a developer, data scientist, or simply curious about AI, this course will equip you with the knowledge and skills to build your own intelligent agents. Enroll today and start building the next generation of AI agent applications! https://bit.ly/hf-learn-agents
reacted
to
tomaarsen
's
post
with π₯
4 days ago
ποΈ Today I'm introducing a method to train static embedding models that run 100x to 400x faster on CPU than common embedding models, while retaining 85%+ of the quality! Including 2 fully open models: training scripts, datasets, metrics. We apply our recipe to train 2 Static Embedding models that we release today! We release: 2οΈβ£ an English Retrieval model and a general-purpose Multilingual similarity model (e.g. classification, clustering, etc.), both Apache 2.0 π§ my modern training strategy: ideation -> dataset choice -> implementation -> evaluation π my training scripts, using the Sentence Transformers library π my Weights & Biases reports with losses & metrics π my list of 30 training and 13 evaluation datasets The 2 Static Embedding models have the following properties: ποΈ Extremely fast, e.g. 107500 sentences per second on a consumer CPU, compared to 270 for 'all-mpnet-base-v2' and 56 for 'gte-large-en-v1.5' 0οΈβ£ Zero active parameters: No Transformer blocks, no attention, not even a matrix multiplication. Super speed! π No maximum sequence length! Embed texts at any length (note: longer texts may embed worse) π Linear instead of exponential complexity: 2x longer text takes 2x longer, instead of 2.5x or more. πͺ Matryoshka support: allow you to truncate embeddings with minimal performance loss (e.g. 4x smaller with a 0.56% perf. decrease for English Similarity tasks) Check out the full blogpost if you'd like to 1) use these lightning-fast models or 2) learn how to train them with consumer-level hardware: https://huggingface.co/blog/static-embeddings The blogpost contains a lengthy list of possible advancements; I'm very confident that our 2 models are only the tip of the iceberg, and we may be able to get even better performance. Alternatively, check out the models: * https://huggingface.co/sentence-transformers/static-retrieval-mrl-en-v1 * https://huggingface.co/sentence-transformers/static-similarity-mrl-multilingual-v1
View all activity
Organizations
catastropiyush
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
1 day ago
thellert/physbert_cased
Feature Extraction
β’
Updated
Oct 29, 2024
β’
57
β’
1
liked
a model
4 days ago
fairchem/OMAT24
Updated
Dec 17, 2024
β’
58
liked
a model
6 days ago
Goodfire/Llama-3.3-70B-Instruct-SAE-l50
Updated
9 days ago
β’
15
β’
23
liked
a Space
12 days ago
Running
on
CPU Upgrade
7
π
Phase Diagram
liked
a model
13 days ago
m3rg-iitd/matscibert
Fill-Mask
β’
Updated
Jun 22, 2024
β’
2.83k
β’
17
liked
2 models
17 days ago
meta-llama/Llama-3.2-1B-Instruct
Text Generation
β’
Updated
Oct 24, 2024
β’
1.03M
β’
β’
706
unsloth/Meta-Llama-3.1-8B-bnb-4bit
Text Generation
β’
Updated
Nov 22, 2024
β’
265k
β’
91
liked
2 Spaces
about 1 month ago
Running
475
π
Scaling test-time compute
Running
12
ππ
FineWeb 2 - Community Leaderboard
liked
2 models
about 1 month ago
AI4Chem/ChemLLM-7B-Chat
Text Generation
β’
Updated
Sep 17, 2024
β’
363
β’
71
unsloth/QwQ-32B-Preview-unsloth-bnb-4bit
Text Generation
β’
Updated
Dec 6, 2024
β’
1.16k
β’
17
liked
2 datasets
about 2 months ago
n0w0f/MatText
Viewer
β’
Updated
Aug 13, 2024
β’
5.72M
β’
393
β’
6
nimashoghi/oc22
Viewer
β’
Updated
Aug 3, 2024
β’
9.85M
β’
112
β’
1
liked
a dataset
3 months ago
Congliu/USPTO-50k-Instruction
Viewer
β’
Updated
May 12, 2023
β’
44.6k
β’
34
β’
3
liked
a model
3 months ago
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
β’
Updated
Sep 18, 2024
β’
740k
β’
1.34k
liked
a model
5 months ago
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
β’
Updated
8 days ago
β’
1.92M
β’
374
liked
a Space
5 months ago
Running
4
π’
SLICES
CIF2SLICES, SLICES2CIF
liked
3 models
5 months ago
unsloth/llama-3-8b-bnb-4bit
Text Generation
β’
Updated
12 days ago
β’
547k
β’
191
google/gemma-2-9b
Text Generation
β’
Updated
Aug 7, 2024
β’
90.4k
β’
628
nomic-ai/nomic-embed-text-v1
Sentence Similarity
β’
Updated
Sep 26, 2024
β’
355k
β’
480
Load more