BGE base En v1.5 Phase 4
This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: BAAI/bge-base-en-v1.5
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
- Language: en
- License: apache-2.0
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("RishuD7/bge-base-en-v1.5-41-keys-phase-4-exp_v1")
# Run inference
sentences = [
'Telstra International HK Limited Waiver of rights 15.8 A condition of this Agreement, or a right created by it, may only be waived by a party by giving notice and the failure to exercise or any delay In exercising a right or remedy provided by this Agreement or by law does not constitute a waiver of the right or remedy. 15.9 A waiver of a breach of this Agreement does not waive any other breach. Warranties 15.10 Each party warrants to the other that entering into and performing its obligations under this Agreement does not breach any of its contractual obligations to any other person. 15.11 You warrant that you have not relied on any representations or warranties by us other than those expressly provided in this Agreement. Assignment 15.12 Neither party may assign its rights or novate any rights or obligations under this Agreement without the prior consent of the other, which will not be unreasonably withheld. Agency 15.13 You may appoint a third party to act on your behalf in relation to this Agreement only if and for so long as we consent to you doing so. We may impose conditions on the giving of our consent and may withdraw our consent at any time. 15.14 If we consent to you appointing a third party to act on your behalf in relation to this Agreement, you agree to indemnify us against all direct and indirect loss, damage, liability, costs or expenses incurred by us (including to a fourth party) resulting from the appointment of the third party to act on your behalf including, but not limited to, any intentional or negligent act or omission by the third party, whether within or outside the scope of that appointment, such as a misuse of our confidential information.\n15.15 Nothing in clauses 1 5 . 1 3 and 1 5 . 1 4 is intended to waive any rights or obligations of us or you under this Agreement or to create any rights or impose any obligations on the third party appointed to act on your behalf under this Agreement. You remain liable for all of your obligations under this Agreement, including the payment of all charges for Services, and acknowledge that we may deal with you or with the third party (or both) in relation to matters arising under this Agreement. Force majeure 15.16 Neither party is liable for not performing an obligation in whole or in part, or for not performing it on time (except an obligation to pay money), because of a Force Majeure Event. 15.17 If a Force Majeure Event occurs, that party must: (a) give the other party notice of the event promptly and an estimate of the non-performance and delay; (b) take all reasonable steps to overcome the effects of the event (but this does not require the settlement of industrial disputes or other claims on unreasonable terms); and (c) resume compliance as soon as practicable after the event no longer affects that party. Compliance with laws 15.18 In carrying out its obligations under this Agreement, each party must comply with any relevant statutes, regulations and by-laws which apply in Hong Kong and the requirements of any government body with authority in Hong Kong. Third party rights 15.19 A person who is not a party to this Agreement has no rights under any applicable legislation to enforce any term or condition in this Agreement..\n Telstra International HK Limited\n.\n I\n16 Definitions\n.\n In this Agreement, unless otherwise indicated:\n.\n Act - means the Telecommunications Act 1997 (Cth).\n.\n Availability· means the number of minutes in a month during which a Global Service is not\n.\n Unavailable.\n.\n Business Day - means any day other than a Saturday, Sunday or recognised public holiday in Hong\n.\n Kong.\n',
'Assignment',
'Counterparty Name',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Evaluation
Metrics
Information Retrieval
- Dataset:
dim_768
- Evaluated with
InformationRetrievalEvaluator
Metric | Value |
---|---|
cosine_accuracy@1 | 0.0063 |
cosine_accuracy@3 | 0.0222 |
cosine_accuracy@5 | 0.0338 |
cosine_accuracy@10 | 0.0622 |
cosine_precision@1 | 0.0063 |
cosine_precision@3 | 0.0074 |
cosine_precision@5 | 0.0068 |
cosine_precision@10 | 0.0062 |
cosine_recall@1 | 0.0063 |
cosine_recall@3 | 0.0222 |
cosine_recall@5 | 0.0338 |
cosine_recall@10 | 0.0622 |
cosine_ndcg@10 | 0.0289 |
cosine_mrr@10 | 0.0189 |
cosine_map@100 | 0.0327 |
Training Details
Training Dataset
Unnamed Dataset
- Size: 5,086 training samples
- Columns:
positive
andanchor
- Approximate statistics based on the first 1000 samples:
positive anchor type string string details - min: 123 tokens
- mean: 352.74 tokens
- max: 512 tokens
- min: 3 tokens
- mean: 5.52 tokens
- max: 11 tokens
- Samples:
positive anchor In no event shall CBRE, Client, or their respective affiliates incur liability under this agreement or otherwise relating to the Services beyond the insurance proceeds available with respect to the particular matter under the Insurance Policies required to be carried by CBRE AND Client under Article 6 above including, if applicable, proceeds of self-insurance. Each party shall and shall cause its affiliates to look solely to such insurance proceeds (and any such proceeds paid through self-insurance) to satisfy its claims against the released parties and agrees that it shall have no right of recovery beyond such proceeds; provided, however, that if insurance proceeds under such policies are not paid because a party has failed to maintain such policies, comply with policy requirements or, in the case of self-insurance, unreasonably denied a claim, such party shall be liable for the amounts that otherwise would have been payable under such policies had such party maintained such policies,...
Absolute Maximum Amount of Liability
4. Rent.
4.01 From and after the Commencement Date, Tenant shall pay Landlord, without any
setoff or deduction, unless expressly set forth in this Lease, all Base Rent and Additional Rent
due for the Term (collectively referred to as "Rent"). "Additional Rent" means all sums
(exclusive of Base Rent) that Tenant is required to pay Landlord under this Lease. Tenant shall
pay and be liable for all rental, sales and use taxes (but excluding income taxes), if any,
imposed upon or measured by Rent. Base Rent and recurring monthly charges of Additional
Rent shall be due and payable in advance on the first day of each calendar month without
notice or demand, provided that the installment of Base Rent attributable to the first (1st) full
calendar month of the Term following the Abatement Period shall be due concurrently with the
execution of this Lease by Tenant. All other items of Rent shall be due and payable on or
before thirty (30) days after billing by Landlord. Rent shall be made payable...Late Payment Charges
Term This Agreement shall come into force and shall last unlimited from such date. Either Party may however terminate this Agreement at any time by giving upon thirty (30) days' written notice to the other Party. The Receiving Party's obligations contained in this Agreement to keep confidential and restrict use of the Disclosing Party's Confidential Information shall sur- vive for a period of five (5) years from the date of its termination for any reason whatsoever. lX. Contractual penalty
For the purposes of this Non-Disclosure Agreement, " Confidential Information" includes all technical and/or commercial and/or financial information in the field designated in section 1., which a contracting Party (hereinafter referred to as the "EQ€i1gPedy") makes, or has made, accessible to the other contracting Party (hereinafter referred to as the ".&eiyi!g Partv") in oral, written, tangible or other form (e.9. disk, data carrier) directly or indirectly, in- cluding but not limited to, drawings, ...Termination for Convenience
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy
: epochper_device_train_batch_size
: 32per_device_eval_batch_size
: 16gradient_accumulation_steps
: 16learning_rate
: 2e-05num_train_epochs
: 30lr_scheduler_type
: cosinewarmup_ratio
: 0.1tf32
: Falseload_best_model_at_end
: Trueoptim
: adamw_torch_fusedbatch_sampler
: no_duplicates
All Hyperparameters
Click to expand
overwrite_output_dir
: Falsedo_predict
: Falseeval_strategy
: epochprediction_loss_only
: Trueper_device_train_batch_size
: 32per_device_eval_batch_size
: 16per_gpu_train_batch_size
: Noneper_gpu_eval_batch_size
: Nonegradient_accumulation_steps
: 16eval_accumulation_steps
: Nonelearning_rate
: 2e-05weight_decay
: 0.0adam_beta1
: 0.9adam_beta2
: 0.999adam_epsilon
: 1e-08max_grad_norm
: 1.0num_train_epochs
: 30max_steps
: -1lr_scheduler_type
: cosinelr_scheduler_kwargs
: {}warmup_ratio
: 0.1warmup_steps
: 0log_level
: passivelog_level_replica
: warninglog_on_each_node
: Truelogging_nan_inf_filter
: Truesave_safetensors
: Truesave_on_each_node
: Falsesave_only_model
: Falserestore_callback_states_from_checkpoint
: Falseno_cuda
: Falseuse_cpu
: Falseuse_mps_device
: Falseseed
: 42data_seed
: Nonejit_mode_eval
: Falseuse_ipex
: Falsebf16
: Falsefp16
: Falsefp16_opt_level
: O1half_precision_backend
: autobf16_full_eval
: Falsefp16_full_eval
: Falsetf32
: Falselocal_rank
: 0ddp_backend
: Nonetpu_num_cores
: Nonetpu_metrics_debug
: Falsedebug
: []dataloader_drop_last
: Falsedataloader_num_workers
: 0dataloader_prefetch_factor
: Nonepast_index
: -1disable_tqdm
: Falseremove_unused_columns
: Truelabel_names
: Noneload_best_model_at_end
: Trueignore_data_skip
: Falsefsdp
: []fsdp_min_num_params
: 0fsdp_config
: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap
: Noneaccelerator_config
: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed
: Nonelabel_smoothing_factor
: 0.0optim
: adamw_torch_fusedoptim_args
: Noneadafactor
: Falsegroup_by_length
: Falselength_column_name
: lengthddp_find_unused_parameters
: Noneddp_bucket_cap_mb
: Noneddp_broadcast_buffers
: Falsedataloader_pin_memory
: Truedataloader_persistent_workers
: Falseskip_memory_metrics
: Trueuse_legacy_prediction_loop
: Falsepush_to_hub
: Falseresume_from_checkpoint
: Nonehub_model_id
: Nonehub_strategy
: every_savehub_private_repo
: Falsehub_always_push
: Falsegradient_checkpointing
: Falsegradient_checkpointing_kwargs
: Noneinclude_inputs_for_metrics
: Falseeval_do_concat_batches
: Truefp16_backend
: autopush_to_hub_model_id
: Nonepush_to_hub_organization
: Nonemp_parameters
:auto_find_batch_size
: Falsefull_determinism
: Falsetorchdynamo
: Noneray_scope
: lastddp_timeout
: 1800torch_compile
: Falsetorch_compile_backend
: Nonetorch_compile_mode
: Nonedispatch_batches
: Nonesplit_batches
: Noneinclude_tokens_per_second
: Falseinclude_num_input_tokens_seen
: Falseneftune_noise_alpha
: Noneoptim_target_modules
: Nonebatch_eval_metrics
: Falseprompts
: Nonebatch_sampler
: no_duplicatesmulti_dataset_batch_sampler
: proportional
Training Logs
Epoch | Step | Training Loss | dim_768_cosine_ndcg@10 |
---|---|---|---|
1.0063 | 10 | 2.1725 | - |
2.0126 | 20 | 0.3731 | - |
3.0189 | 30 | 0.0406 | - |
4.0252 | 40 | 0.0 | - |
4.8302 | 48 | - | 0.0230 |
1.0063 | 10 | 1.6101 | - |
2.0126 | 20 | 0.055 | - |
3.0189 | 30 | 0.0016 | - |
4.0252 | 40 | 0.0 | - |
4.8302 | 48 | - | 0.0242 |
1.1761 | 50 | 0.2904 | - |
2.1824 | 60 | 0.8168 | - |
3.1887 | 70 | 0.0161 | - |
4.1950 | 80 | 0.004 | - |
5.2013 | 90 | 0.0 | - |
5.8050 | 96 | - | 0.0280 |
2.3522 | 100 | 0.4037 | - |
3.3585 | 110 | 0.3704 | - |
4.3648 | 120 | 0.0017 | - |
5.3711 | 130 | 0.0003 | - |
6.3774 | 140 | 0.0 | - |
6.7799 | 144 | - | 0.0293 |
3.5283 | 150 | 0.4189 | - |
4.5346 | 160 | 0.1746 | - |
5.5409 | 170 | 0.0004 | - |
6.5472 | 180 | 0.0 | - |
7.5535 | 190 | 0.0 | - |
7.8553 | 193 | - | 0.0278 |
4.7044 | 200 | 0.421 | - |
5.7107 | 210 | 0.0661 | - |
6.7170 | 220 | 0.0001 | - |
7.7233 | 230 | 0.0 | - |
8.7296 | 240 | 0.0 | - |
8.8302 | 241 | - | 0.0287 |
5.8805 | 250 | 0.41 | - |
6.8868 | 260 | 0.0111 | - |
7.8931 | 270 | 0.0001 | 0.0289 |
- The bold row denotes the saved checkpoint.
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.0
- Transformers: 4.41.2
- PyTorch: 2.1.2+cu121
- Accelerate: 1.1.1
- Datasets: 2.19.1
- Tokenizers: 0.19.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
- Downloads last month
- 2
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for RishuD7/bge-base-en-v1.5-41-keys-phase-4-exp_v1
Base model
BAAI/bge-base-en-v1.5Evaluation results
- Cosine Accuracy@1 on dim 768self-reported0.006
- Cosine Accuracy@3 on dim 768self-reported0.022
- Cosine Accuracy@5 on dim 768self-reported0.034
- Cosine Accuracy@10 on dim 768self-reported0.062
- Cosine Precision@1 on dim 768self-reported0.006
- Cosine Precision@3 on dim 768self-reported0.007
- Cosine Precision@5 on dim 768self-reported0.007
- Cosine Precision@10 on dim 768self-reported0.006
- Cosine Recall@1 on dim 768self-reported0.006
- Cosine Recall@3 on dim 768self-reported0.022