Matryoshka Representation Learning
Paper
• 2205.13147 • Published
• 25
This is a sentence-transformers model trained on the cornstack_python, cornstack_python_pairs, codesearchnet, codesearchnet_pairs and solyanka_qa datasets. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 8192, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("fyaronskiy/code_retriever_ru_en")
# Run inference
sentences = [
"Method which returns a dictionary of field statistics received from the\n input source.\n\n Returns:\n\n fieldStats: dict of dicts where the first level is the field name and\n the second level is the statistic. ie. fieldStats['pounds']['min']",
'def _getFieldStats(self):\n """\n Method which returns a dictionary of field statistics received from the\n input source.\n\n Returns:\n\n fieldStats: dict of dicts where the first level is the field name and\n the second level is the statistic. ie. fieldStats[\'pounds\'][\'min\']\n\n """\n\n fieldStats = dict()\n fieldNames = self._inputSource.getFieldNames()\n for field in fieldNames:\n curStats = dict()\n curStats[\'min\'] = self._inputSource.getFieldMin(field)\n curStats[\'max\'] = self._inputSource.getFieldMax(field)\n fieldStats[field] = curStats\n return fieldStats',
'def customize(func):\n """\n Decorator to set plotting context and axes style during function call.\n """\n @wraps(func)\n def call_w_context(*args, **kwargs):\n set_context = kwargs.pop(\'set_context\', True)\n if set_context:\n with plotting_context(), axes_style():\n return func(*args, **kwargs)\n else:\n return func(*args, **kwargs)\n return call_w_context',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.9048, 0.0377],
# [0.9048, 1.0000, 0.0953],
# [0.0377, 0.0953, 1.0000]])
InformationRetrievalEvaluator| Metric | Value |
|---|---|
| cosine_accuracy@1 | 0.8684 |
| cosine_accuracy@3 | 0.9439 |
| cosine_accuracy@5 | 0.9566 |
| cosine_accuracy@10 | 0.9668 |
| cosine_precision@1 | 0.8684 |
| cosine_precision@3 | 0.3146 |
| cosine_precision@5 | 0.1913 |
| cosine_precision@10 | 0.0967 |
| cosine_recall@1 | 0.8684 |
| cosine_recall@3 | 0.9439 |
| cosine_recall@5 | 0.9566 |
| cosine_recall@10 | 0.9668 |
| cosine_ndcg@10 | 0.9224 |
| cosine_mrr@10 | 0.9076 |
| cosine_map@100 | 0.9083 |
InformationRetrievalEvaluator| Metric | Value |
|---|---|
| cosine_accuracy@1 | 0.8742 |
| cosine_accuracy@3 | 0.9425 |
| cosine_accuracy@5 | 0.9549 |
| cosine_accuracy@10 | 0.9644 |
| cosine_precision@1 | 0.8742 |
| cosine_precision@3 | 0.3142 |
| cosine_precision@5 | 0.191 |
| cosine_precision@10 | 0.0964 |
| cosine_recall@1 | 0.8742 |
| cosine_recall@3 | 0.9425 |
| cosine_recall@5 | 0.9549 |
| cosine_recall@10 | 0.9644 |
| cosine_ndcg@10 | 0.9234 |
| cosine_mrr@10 | 0.9098 |
| cosine_map@100 | 0.9105 |
ru_query, document, negative_0, negative_1, negative_2, negative_3, negative_4, negative_5, negative_6, negative_7, negative_8, negative_9, negative_10, negative_11, negative_12, negative_13, negative_14, and negative_15| ru_query | document | negative_0 | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 | negative_6 | negative_7 | negative_8 | negative_9 | negative_10 | negative_11 | negative_12 | negative_13 | negative_14 | negative_15 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| type | string | string | string | string | string | string | string | string | string | string | string | string | string | string | string | string | string | string |
| details |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ru_query | document | negative_0 | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 | negative_6 | negative_7 | negative_8 | negative_9 | negative_10 | negative_11 | negative_12 | negative_13 | negative_14 | negative_15 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
установите значение business_id сообщения данных в конкретное значение |
def step_impl_the_ru_is_set_to(context, business_id): |
def business_id(self, business_id): |
def business_phone(self, business_phone): |
def business_phone_number(self, business_phone_number): |
def bus_ob_id(self, bus_ob_id): |
def bus_ob_id(self, bus_ob_id): |
def _set_id(self, value): |
def business_email(self, business_email): |
def mailing_id(self, val: str): |
def message_id(self, val: str): |
def business_model(self, business_model): |
def business_account(self, business_account): |
def update_business(current_user, businessId): |
def set_company_id_value(self, company_id_value): |
def id(self, value): |
def set_bribe(self, bribe_amount): |
|
self.bribe = bribe_amount |
def business_owner(self, business_owner): |
||||||||||||||||
Установить состояние правил sid |
def set_state_sid_request(ruleset_name, sid): |
def sid(self, sid): |
def set_state(self,s): |
def set_state(self, state: int): |
def setstate(self, state): |
def set_rule(self, rule): |
def _set_state(self, state): |
def set_state( self ): |
def set_ident(self, new_ident: int): |
def set_state(self, state): |
def setstate(self, state): |
def setstate(self, state): |
def state_id(self, state_id): |
def set_state(self, state: int): |
def set_domain_sid(self, sid): |
def set_state(self,state): |
def set_srid(self, srid: ir.IntegerValue) -> GeoSpatialValue: |
Отправить события sid в ruleset |
def post_sid_events(ruleset_name, sid): |
def post_events(ruleset_name): |
def set_state_sid_request(ruleset_name, sid): |
def sid(self, sid): |
def post(self, request, *args, **kwargs): |
def informed_consent_on_post_save(sender, instance, raw, created, **kwargs): |
def post_event(self, event): |
||||||||||
from evennia.scripts.models import ScriptDB |
|||||||||||||||||
if event.public_event: |
|||||||||||||||||
event_manager = ScriptDB.objects.get(db_key="Event Manager") |
|||||||||||||||||
event_manager.post_event(event, self.owner.player, event.display()) |
def post(self, event, *args, **kwargs): |
def post(self, request): |
def register_to_event(request): |
def setFilterOnRule(request): |
def store_event(self, violations): |
def test_post_event_on_schedule_page(self): |
def _push(self, server): |
def post(self, slug = None, eid = None): |
def events(self): |
def post(self): |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
en_query, ru_query, and label| en_query | ru_query | label | |
|---|---|---|---|
| type | string | string | float |
| details |
|
|
|
| en_query | ru_query | label |
|---|---|---|
set the message data business_id to a specific value |
установите значение business_id сообщения данных в конкретное значение |
1.0 |
Set ruleset state sid |
Установить состояние правил sid |
1.0 |
Post sid events to the ruleset |
Отправить события sid в ruleset |
1.0 |
MatryoshkaLoss with these parameters:{
"loss": "CoSENTLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
ru_func_documentation_string and func_code_string| ru_func_documentation_string | func_code_string | |
|---|---|---|
| type | string | string |
| details |
|
|
| ru_func_documentation_string | func_code_string |
|---|---|
Мультипроцессинг-целевой объект для устройства очереди zmq |
def zmq_device(self): |
Чисто завершите работу сокета роутера |
def close(self): |
До форка нам нужно создать устройство zmq роутера |
def pre_fork(self, process_manager): |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
en_func_documentation_string, ru_func_documentation_string, and label| en_func_documentation_string | ru_func_documentation_string | label | |
|---|---|---|---|
| type | string | string | float |
| details |
|
|
|
| en_func_documentation_string | ru_func_documentation_string | label |
|---|---|---|
Multiprocessing target for the zmq queue device |
Мультипроцессинг-целевой объект для устройства очереди zmq |
1.0 |
Cleanly shutdown the router socket |
Чисто завершите работу сокета роутера |
1.0 |
Pre-fork we need to create the zmq router device |
До форка нам нужно создать устройство zmq роутера |
1.0 |
MatryoshkaLoss with these parameters:{
"loss": "CoSENTLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
anchor and positive| anchor | positive | |
|---|---|---|
| type | string | string |
| details |
|
|
| anchor | positive |
|---|---|
Как происходит взаимодействие нескольких языков программирования? Понятно, что большинство (если не все) крупные энтерпрайз сервисы, приложения и тд. (не только веб) написаны с использованием не одного языка программирования, а нескольких. И эти составные части, написанные на разных языках, как-то взаимодействуют между собой (фронт, бизнес-логика, еще что-то). |
Несколько языков могут сосуществовать как в рамках одного процесса, так и в рамках нескольких. |
Слэши и ковычки после использования stringify Есть подобный скрипт: |
Может сразу сделать валидный JSON |
Оптимизация поиска числа в списке Есть функция. Она принимает число от 1 до 9 (мы ищем, есть ли оно в списке), и список, в котором мы его ищем) |
> |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
ru_func_documentation_string and func_code_string| ru_func_documentation_string | func_code_string | |
|---|---|---|
| type | string | string |
| details |
|
|
| ru_func_documentation_string | func_code_string |
|---|---|
Обучить модель deepq. |
def learn(env, |
Сохранить модель в pickle, расположенный по пути |
def save_act(self, path=None): |
CNN из статьи Nature. |
def nature_cnn(unscaled_images, **conv_kwargs): |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
en_func_documentation_string and func_code_string| en_func_documentation_string | func_code_string | |
|---|---|---|
| type | string | string |
| details |
|
|
| en_func_documentation_string | func_code_string |
|---|---|
Train a deepq model. |
def learn(env, |
Save model to a pickle located at |
def save_act(self, path=None): |
CNN from Nature paper. |
def nature_cnn(unscaled_images, **conv_kwargs): |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
en_func_documentation_string, ru_func_documentation_string, and label| en_func_documentation_string | ru_func_documentation_string | label | |
|---|---|---|---|
| type | string | string | float |
| details |
|
|
|
| en_func_documentation_string | ru_func_documentation_string | label |
|---|---|---|
Train a deepq model. |
Обучить модель deepq. |
1.0 |
Save model to a pickle located at |
Сохранить модель в pickle, расположенный по пути |
1.0 |
CNN from Nature paper. |
CNN из статьи Nature. |
1.0 |
MatryoshkaLoss with these parameters:{
"loss": "CoSENTLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
anchor and positive| anchor | positive | |
|---|---|---|
| type | string | string |
| details |
|
|
| anchor | positive |
|---|---|
Atom IDE произвольное изменение строк Пользуюсь Atom IDE, установлены плагины для GIT'а, использую тему Material theme (может быть кому то это что то даст), в общем проблема такая, что в php файлах при сохранении файла, даже если я изменил всего один символ, он добавляет изменения очень странные,берет 2-3 строки (хз как выбирает) и удаляет их, а потом вставялет их же, без каких то либо изменений. При этом GIT фиксирует это изменение... |
Проблема заключалась в том, что все IDE испльзуют свой символ перехода на следующую строку, если в команде разработчики используют разные IDE, у которых разный перенос строки, то при сохранении файла чужие переносы строк будут заменяться на свои :) |
print() с частью текста и форматированием как переменная Python3 Есть повторяющаяся функция |
[code] |
Не получается перегрузить оператор присваивания в шаблонном классе Нужно перегрузить оператор присваивания в шаблонном классе, не могу понять, почему не работает стандартный синтаксис, при реализации выдает эту ошибку (/home/anton/Programming/tree/tree.h:96: ошибка: overloaded 'operator=' must be a binary operator (has 1 parameter)). Объявление и реализация в одном .h файле. |
Ну надо указать, какому классу он принадлежит... А так вы пытались реализовать унарный оператор |
MatryoshkaLoss with these parameters:{
"loss": "CachedMultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
eval_strategy: stepsper_device_train_batch_size: 4per_device_eval_batch_size: 16gradient_accumulation_steps: 32learning_rate: 2e-05num_train_epochs: 2warmup_ratio: 0.1bf16: Trueresume_from_checkpoint: ../models/RuModernBERT-base_bs128_lr_2e-05_2nd_epoch/checkpoint-27400auto_find_batch_size: Truebatch_sampler: no_duplicatesoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 4per_device_eval_batch_size: 16per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 32eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 2e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 2max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Truefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: ../models/RuModernBERT-base_bs128_lr_2e-05_2nd_epoch/checkpoint-27400hub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Truefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: no_duplicatesmulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss | codesearchnet loss | codesearchnet en loss | codesearchnet pairs loss | solyanka qa loss | cosine_ndcg@10 |
|---|---|---|---|---|---|---|---|
| 0.4899 | 27600 | 50.3428 | - | - | - | - | - |
| 0.4934 | 27800 | 50.6395 | - | - | - | - | - |
| 0.4970 | 28000 | 49.216 | 0.1459 | 0.1558 | 0.0 | 0.2914 | 0.9234 |
| 0.5005 | 28200 | 49.4888 | - | - | - | - | - |
| 0.5041 | 28400 | 49.2362 | - | - | - | - | - |
| 0.5076 | 28600 | 50.2319 | - | - | - | - | - |
| 0.5112 | 28800 | 48.3359 | - | - | - | - | - |
| 0.5147 | 29000 | 49.5835 | - | - | - | - | - |
| 0.5183 | 29200 | 50.2161 | - | - | - | - | - |
| 0.5218 | 29400 | 49.6727 | - | - | - | - | - |
| 0.5254 | 29600 | 48.9899 | - | - | - | - | - |
| 0.5289 | 29800 | 48.823 | - | - | - | - | - |
| 0.5325 | 30000 | 49.1113 | 0.1460 | 0.1557 | 0.0 | 0.2914 | 0.9234 |
| 0.5360 | 30200 | 48.4824 | - | - | - | - | - |
| 0.5396 | 30400 | 48.8933 | - | - | - | - | - |
| 0.5431 | 30600 | 49.2607 | - | - | - | - | - |
| 0.5467 | 30800 | 49.1046 | - | - | - | - | - |
| 0.5502 | 31000 | 48.6113 | - | - | - | - | - |
| 0.5538 | 31200 | 49.6846 | - | - | - | - | - |
| 0.5573 | 31400 | 50.1739 | - | - | - | - | - |
| 0.5609 | 31600 | 49.4535 | - | - | - | - | - |
| 0.5644 | 31800 | 48.6802 | - | - | - | - | - |
| 0.5680 | 32000 | 49.9103 | 0.1458 | 0.1554 | 0.0 | 0.2909 | 0.9234 |
| 0.5715 | 32200 | 50.5902 | - | - | - | - | - |
| 0.5751 | 32400 | 49.3202 | - | - | - | - | - |
| 0.5786 | 32600 | 49.0049 | - | - | - | - | - |
| 0.5822 | 32800 | 49.7297 | - | - | - | - | - |
| 0.5857 | 33000 | 49.4004 | - | - | - | - | - |
| 0.5893 | 33200 | 48.5135 | - | - | - | - | - |
| 0.5928 | 33400 | 48.2331 | - | - | - | - | - |
| 0.5964 | 33600 | 50.3588 | - | - | - | - | - |
| 0.5999 | 33800 | 48.3158 | - | - | - | - | - |
| 0.6035 | 34000 | 49.6962 | 0.1456 | 0.1553 | 0.0 | 0.2908 | 0.9234 |
| 0.6070 | 34200 | 49.8121 | - | - | - | - | - |
| 0.6106 | 34400 | 50.8481 | - | - | - | - | - |
| 0.6141 | 34600 | 50.0363 | - | - | - | - | - |
| 0.6177 | 34800 | 49.5676 | - | - | - | - | - |
| 0.6212 | 35000 | 47.5664 | - | - | - | - | - |
| 0.6248 | 35200 | 48.5752 | - | - | - | - | - |
| 0.6283 | 35400 | 49.4174 | - | - | - | - | - |
| 0.6319 | 35600 | 48.8215 | - | - | - | - | - |
| 0.6354 | 35800 | 49.9745 | - | - | - | - | - |
| 0.6390 | 36000 | 47.8552 | 0.1456 | 0.1551 | 0.0 | 0.2906 | 0.9234 |
| 0.6425 | 36200 | 50.2583 | - | - | - | - | - |
| 0.6461 | 36400 | 48.5441 | - | - | - | - | - |
| 0.6496 | 36600 | 48.7192 | - | - | - | - | - |
| 0.6532 | 36800 | 49.947 | - | - | - | - | - |
| 0.6567 | 37000 | 48.6255 | - | - | - | - | - |
| 0.6603 | 37200 | 48.0433 | - | - | - | - | - |
| 0.6638 | 37400 | 49.5333 | - | - | - | - | - |
| 0.6674 | 37600 | 48.8394 | - | - | - | - | - |
| 0.6709 | 37800 | 48.6463 | - | - | - | - | - |
| 0.6745 | 38000 | 49.3688 | 0.1456 | 0.1551 | 0.0 | 0.2913 | 0.9234 |
| 0.6780 | 38200 | 49.4758 | - | - | - | - | - |
| 0.6816 | 38400 | 50.0071 | - | - | - | - | - |
| 0.6851 | 38600 | 49.9054 | - | - | - | - | - |
| 0.6887 | 38800 | 49.9274 | - | - | - | - | - |
| 0.6922 | 39000 | 47.5942 | - | - | - | - | - |
| 0.6958 | 39200 | 49.409 | - | - | - | - | - |
| 0.6993 | 39400 | 49.6438 | - | - | - | - | - |
| 0.7029 | 39600 | 49.4253 | - | - | - | - | - |
| 0.7064 | 39800 | 49.1187 | - | - | - | - | - |
| 0.7100 | 40000 | 49.2283 | 0.1455 | 0.1551 | 0.0 | 0.2910 | 0.9235 |
| 0.7135 | 40200 | 51.0079 | - | - | - | - | - |
| 0.7171 | 40400 | 48.4275 | - | - | - | - | - |
| 0.7206 | 40600 | 48.6685 | - | - | - | - | - |
| 0.7242 | 40800 | 48.7769 | - | - | - | - | - |
| 0.7277 | 41000 | 49.712 | - | - | - | - | - |
| 0.7312 | 41200 | 49.0523 | - | - | - | - | - |
| 0.7348 | 41400 | 49.6381 | - | - | - | - | - |
| 0.7383 | 41600 | 49.7758 | - | - | - | - | - |
| 0.7419 | 41800 | 51.02 | - | - | - | - | - |
| 0.7454 | 42000 | 49.738 | 0.1454 | 0.1550 | 0.0 | 0.2914 | 0.9235 |
| 0.7490 | 42200 | 48.4278 | - | - | - | - | - |
| 0.7525 | 42400 | 48.2776 | - | - | - | - | - |
| 0.7561 | 42600 | 50.1085 | - | - | - | - | - |
| 0.7596 | 42800 | 49.6109 | - | - | - | - | - |
| 0.7632 | 43000 | 50.1112 | - | - | - | - | - |
| 0.7667 | 43200 | 48.3173 | - | - | - | - | - |
| 0.7703 | 43400 | 49.4717 | - | - | - | - | - |
| 0.7738 | 43600 | 50.4256 | - | - | - | - | - |
| 0.7774 | 43800 | 51.3672 | - | - | - | - | - |
| 0.7809 | 44000 | 49.5019 | 0.1455 | 0.1550 | 0.0 | 0.2913 | 0.9234 |
| 0.7845 | 44200 | 49.9114 | - | - | - | - | - |
| 0.7880 | 44400 | 48.8164 | - | - | - | - | - |
| 0.7916 | 44600 | 48.4947 | - | - | - | - | - |
| 0.7951 | 44800 | 48.6371 | - | - | - | - | - |
| 0.7987 | 45000 | 49.3439 | - | - | - | - | - |
| 0.8022 | 45200 | 48.8964 | - | - | - | - | - |
| 0.8058 | 45400 | 48.7946 | - | - | - | - | - |
| 0.8093 | 45600 | 48.6259 | - | - | - | - | - |
| 0.8129 | 45800 | 49.4873 | - | - | - | - | - |
| 0.8164 | 46000 | 49.6979 | 0.1454 | 0.1550 | 0.0 | 0.2914 | 0.9234 |
| 0.8200 | 46200 | 48.246 | - | - | - | - | - |
| 0.8235 | 46400 | 49.1022 | - | - | - | - | - |
| 0.8271 | 46600 | 49.18 | - | - | - | - | - |
| 0.8306 | 46800 | 48.8027 | - | - | - | - | - |
| 0.8342 | 47000 | 48.7197 | - | - | - | - | - |
| 0.8377 | 47200 | 47.9643 | - | - | - | - | - |
| 0.8413 | 47400 | 50.829 | - | - | - | - | - |
| 0.8448 | 47600 | 50.3984 | - | - | - | - | - |
| 0.8484 | 47800 | 48.848 | - | - | - | - | - |
| 0.8519 | 48000 | 50.6701 | 0.1453 | 0.1548 | 0.0 | 0.2908 | 0.9235 |
| 0.8555 | 48200 | 49.9972 | - | - | - | - | - |
| 0.8590 | 48400 | 48.1245 | - | - | - | - | - |
| 0.8626 | 48600 | 49.4942 | - | - | - | - | - |
| 0.8661 | 48800 | 48.1227 | - | - | - | - | - |
| 0.8697 | 49000 | 48.9811 | - | - | - | - | - |
| 0.8732 | 49200 | 49.4753 | - | - | - | - | - |
| 0.8768 | 49400 | 49.2714 | - | - | - | - | - |
| 0.8803 | 49600 | 49.166 | - | - | - | - | - |
| 0.8839 | 49800 | 49.0925 | - | - | - | - | - |
| 0.8874 | 50000 | 48.4746 | 0.1453 | 0.1549 | 0.0 | 0.2910 | 0.9234 |
| 0.8910 | 50200 | 49.0912 | - | - | - | - | - |
| 0.8945 | 50400 | 49.6571 | - | - | - | - | - |
| 0.8981 | 50600 | 50.9175 | - | - | - | - | - |
| 0.9016 | 50800 | 51.2218 | - | - | - | - | - |
| 0.9052 | 51000 | 47.8553 | - | - | - | - | - |
| 0.9087 | 51200 | 48.6819 | - | - | - | - | - |
| 0.9123 | 51400 | 49.6197 | - | - | - | - | - |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
@misc{gao2021scaling,
title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
year={2021},
eprint={2101.06983},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
@article{10531646,
author={Huang, Xiang and Peng, Hao and Zou, Dongcheng and Liu, Zhiwei and Li, Jianxin and Liu, Kay and Wu, Jia and Su, Jianlin and Yu, Philip S.},
journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
title={CoSENT: Consistent Sentence Embedding via Similarity Ranking},
year={2024},
doi={10.1109/TASLP.2024.3402087}
}