PiC/phrase_similarity
Viewer • Updated • 10k • 224 • 7
How to use Deehan1866/finetuned-bert-large with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("Deehan1866/finetuned-bert-large")
sentences = [
"The valve will open 100% when the set point is reached and will remain open until a certain blow down factor is reached.",
"Having raised $17,000,000 in a standard matter, one of the first speculative IPOs, Tucker needed more money to continue development of the car.",
"The valve will open 100% when the tennis scoring protocol is reached and will remain open until a certain blow down factor is reached.",
"But the government of PML (N) gave it the complete exponential of a Tehsil."
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This is a sentence-transformers model finetuned from google-bert/bert-large-uncased on the PiC/phrase_similarity dataset. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("Deehan1866/finetuned-bert-large")
# Run inference
sentences = [
'He also played with the Turkish 2nd Division team Pertevniyal, which was at the time the farm team of Efes, via a dual license.',
'He also played with the Turkish 2nd Division team Pertevniyal, which was at the time the farm team of Efes, via a two-part authorization.',
'Storage/centre tracks are found in the vicinity of the following stations:\nOther song highlights.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
quora-duplicates-devBinaryClassificationEvaluator| Metric | Value |
|---|---|
| cosine_accuracy | 0.671 |
| cosine_accuracy_threshold | 0.9711 |
| cosine_f1 | 0.7194 |
| cosine_f1_threshold | 0.8364 |
| cosine_precision | 0.5843 |
| cosine_recall | 0.936 |
| cosine_ap | 0.7181 |
| dot_accuracy | 0.573 |
| dot_accuracy_threshold | 286.6644 |
| dot_f1 | 0.6908 |
| dot_f1_threshold | 235.8573 |
| dot_precision | 0.5352 |
| dot_recall | 0.974 |
| dot_ap | 0.5457 |
| manhattan_accuracy | 0.678 |
| manhattan_accuracy_threshold | 98.6069 |
| manhattan_f1 | 0.7198 |
| manhattan_f1_threshold | 300.5829 |
| manhattan_precision | 0.5802 |
| manhattan_recall | 0.948 |
| manhattan_ap | 0.7271 |
| euclidean_accuracy | 0.679 |
| euclidean_accuracy_threshold | 3.8845 |
| euclidean_f1 | 0.7192 |
| euclidean_f1_threshold | 12.026 |
| euclidean_precision | 0.5786 |
| euclidean_recall | 0.95 |
| euclidean_ap | 0.7272 |
| max_accuracy | 0.679 |
| max_accuracy_threshold | 286.6644 |
| max_f1 | 0.7198 |
| max_f1_threshold | 300.5829 |
| max_precision | 0.5843 |
| max_recall | 0.974 |
| max_ap | 0.7272 |
sentence1, sentence2, and label| sentence1 | sentence2 | label | |
|---|---|---|---|
| type | string | string | int |
| details |
|
|
|
| sentence1 | sentence2 | label |
|---|---|---|
newly formed camp is released from the membrane and diffuses across the intracellular space where it serves to activate pka. |
recently made encampment is released from the membrane and diffuses across the intracellular space where it serves to activate pka. |
0 |
According to one data, in 1910, on others – in 1915, the mansion became Natalya Dmitriyevna Shchuchkina's property. |
According to a particular statistic, in 1910, on others – in 1915, the mansion became Natalya Dmitriyevna Shchuchkina's property. |
1 |
Note that Fact 1 does not assume any particular structure on the set formula_65. |
Note that Fact 1 does not assume any specific edifice on the set formula_65. |
0 |
SoftmaxLosssentence1, sentence2, and label| sentence1 | sentence2 | label | |
|---|---|---|---|
| type | string | string | int |
| details |
|
|
|
| sentence1 | sentence2 | label |
|---|---|---|
after theo's apparent death, she decides to leave first colony and ends up traveling with the apostles. |
after theo's apparent death, she decides to leave original settlement and ends up traveling with the apostles. |
0 |
The guard assigned to Vivian leaves her to prevent the robbery, allowing her to connect to the bank's network. |
The guard assigned to Vivian leaves her to prevent the robbery, allowing her to connect to the bank's locations. |
0 |
Two days later Louis XVI banished Necker by a "lettre de cachet" for his very public exchange of pamphlets. |
Two days later Louis XVI banished Necker by a "lettre de cachet" for his very free forum of pamphlets. |
0 |
SoftmaxLosseval_strategy: stepsper_device_train_batch_size: 16per_device_eval_batch_size: 16learning_rate: 2e-05num_train_epochs: 7warmup_ratio: 0.1load_best_model_at_end: Trueoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 16per_device_eval_batch_size: 16per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonelearning_rate: 2e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 7max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Trueignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Falsehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseeval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Nonedispatch_batches: Nonesplit_batches: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falsebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportional| Epoch | Step | Training Loss | loss | quora-duplicates-dev_max_ap |
|---|---|---|---|---|
| 0 | 0 | - | - | 0.6622 |
| 0.2283 | 100 | - | 0.6955 | 0.6512 |
| 0.4566 | 200 | - | 0.6922 | 0.6678 |
| 0.6849 | 300 | - | 0.6392 | 0.7288 |
| 0.9132 | 400 | - | 0.6328 | 0.7164 |
| 1.1416 | 500 | 0.6444 | 0.6133 | 0.7340 |
| 1.3699 | 600 | - | 0.6031 | 0.7279 |
| 1.5982 | 700 | - | 0.5974 | 0.7272 |
| 1.8265 | 800 | - | 0.6068 | 0.7355 |
| 2.0548 | 900 | - | 0.8700 | 0.7088 |
| 2.2831 | 1000 | 0.4496 | 0.9311 | 0.7272 |
| 2.5114 | 1100 | - | 0.8534 | 0.7248 |
| 2.7397 | 1200 | - | 0.7985 | 0.7132 |
| 2.9680 | 1300 | - | 0.8680 | 0.7043 |
| 3.1963 | 1400 | - | 1.1919 | 0.7091 |
| 3.4247 | 1500 | 0.2152 | 1.4764 | 0.7106 |
| 3.6530 | 1600 | - | 1.3088 | 0.7155 |
| 3.8813 | 1700 | - | 1.4484 | 0.7135 |
| 4.1096 | 1800 | - | 1.5510 | 0.7112 |
| 4.3379 | 1900 | - | 1.5921 | 0.7160 |
| 4.5662 | 2000 | 0.1126 | 1.5381 | 0.7155 |
| 4.7945 | 2100 | - | 1.5480 | 0.7236 |
| 5.0228 | 2200 | - | 1.5668 | 0.7163 |
| 5.2511 | 2300 | - | 1.7076 | 0.7252 |
| 5.4795 | 2400 | - | 1.8512 | 0.7228 |
| 5.7078 | 2500 | 0.0653 | 1.9132 | 0.7209 |
| 5.9361 | 2600 | - | 1.8488 | 0.7256 |
| 6.1644 | 2700 | - | 1.9057 | 0.7235 |
| 6.3927 | 2800 | - | 1.9437 | 0.7262 |
| 6.6210 | 2900 | - | 1.9948 | 0.7273 |
| 6.8493 | 3000 | 0.0289 | 2.0333 | 0.7288 |
| 7.0 | 3066 | - | - | 0.7272 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
Base model
google-bert/bert-large-uncased