MVRL
/

File size: 760 Bytes
ee725b0
 
 
 
 
 
 
 
6061697
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
tags:
- clip
library_name: open_clip
pipeline_tag: zero-shot-image-classification
license: mit
---
# Model card for taxabind-vit-b-16

## Paper: TaxaBind: A Unified Embedding Space for Ecological Applications <br>
## Venue: WACV 2025 <br>
## Github: https://github.com/mvrl/TaxaBind

## TaxaBind

TaxaBind is a multimodal embedding space consisting of six modalities. This model contains image and text modalities in `open_clip` format. The model is used for zero-shot classification of species images using taxonomic text classes.

## Usage

```python
import open_clip

model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:MVRL/taxabind-vit-b-16')
tokenizer = open_clip.get_tokenizer('hf-hub:MVRL/taxabind-vit-b-16')
```