
model: Add eager-embed embedding model #3602

Merged

Samoed merged 10 commits into embeddings-benchmark:main from eagerworks:feat/eagerembed on Nov 26, 2025

Conversation

@jpbalarini (Contributor) commented Nov 22, 2025

Add inference code for the eager-embed embedding model.
eager-embed-v1 is a multimodal dense embedding model with a 2560-dimensional embedding space, based on Qwen3-VL and fine-tuned on multiple public datasets.

More info here:
https://huggingface.co/eagerworks/eager-embed-v1
https://github.com/eagerworks/eager-embed

Checklist:

  • I have filled out the ModelMeta object to the extent possible
  • I have ensured that my model can be loaded using
    • mteb.get_model(model_name, revision) and
    • mteb.get_model_meta(model_name, revision)
  • I have tested the implementation works on a representative set of tasks.
  • The model is public, i.e. it is available either as an API or the weights are publicly available to download
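
For reference, a minimal sketch of the loading checks from the checklist above, assuming the ModelMeta added in this PR registers the model under its Hugging Face repo name (the exact registered name is an assumption, not confirmed by the PR text):

import mteb

# Both calls below are the checks named in the checklist; the model name
# "eagerworks/eager-embed-v1" is assumed from the linked Hugging Face repo.
model = mteb.get_model("eagerworks/eager-embed-v1")
meta = mteb.get_model_meta("eagerworks/eager-embed-v1")
print(meta.name, meta.revision)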

@KennethEnevoldsen (Contributor) left a comment

A few minor comments - otherwise the submission looks good

@KennethEnevoldsen (Contributor) left a comment

A few minor comments, otherwise this looks good

@Samoed (Member) commented

Also, it would probably be better to integrate your model with Sentence Transformers.

@jpbalarini (Contributor, Author) commented

Do you mean loading the model from sentence transformers instead of from transformers? What do I need to change?

@Samoed (Member) commented Nov 24, 2025

Yes. This can be complicated. You can see how this was done for other models, for example (a usage sketch follows the list below):

  1. mmE5: PR 1, PR 2
  2. Jasper https://huggingface.co/NovaSearch/jasper_en_vision_language_v1/tree/main
  3. gme-Qwen2 https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-2B-Instruct/discussions/9
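
For illustration, once the Hugging Face repo ships a custom Sentence Transformers module (as the Jasper and gme-Qwen2 repos above do), usage could look roughly like the sketch below. This is a hedged example, not the model's documented API:

from sentence_transformers import SentenceTransformer

# Sketch only: assumes eager-embed-v1 publishes custom Sentence Transformers
# modules on the Hub, so trust_remote_code is required to load them.
model = SentenceTransformer("eagerworks/eager-embed-v1", trust_remote_code=True)
embeddings = model.encode(["Query: what does a multimodal embedding model do?"])
print(embeddings.shape)  # expected (1, 2560) if the 2560-dim output is kept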

@jpbalarini (Contributor, Author) commented

@KennethEnevoldsen @Samoed Thanks for your comments; the code is much cleaner now. I implemented most of them and left some questions. Thanks!

@Samoed added the "new model" label (Questions related to adding a new model to the benchmark) on Nov 24, 2025
@KennethEnevoldsen (Contributor) left a comment

I think this is good to merge - @Samoed do you have any remaining issues?

@Samoed (Member) commented Nov 25, 2025

@jpbalarini Did you try to encode images and texts together, without separating them by image/text modality?

@jpbalarini (Contributor, Author) commented Nov 25, 2025

@Samoed I did, but I was getting batch-related errors when running some tasks (specifically Vidore2ESGReportsHLRetrieval). I rolled back the combined-encoding changes just to check whether I would hit the same issue without them, and it's the same:

Traceback (most recent call last):
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/evaluate_mteb.py", line 73, in <module>
    evaluate_mteb_with_custom_model()
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/evaluate_mteb.py", line 54, in evaluate_mteb_with_custom_model
    results = mteb.evaluate(model=model, tasks=tasks, encode_kwargs={"batch_size": 8})
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/evaluate.py", line 377, in evaluate
    _res = evaluate(
           ^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/evaluate.py", line 473, in evaluate
    result = _evaluate_task(
             ^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/evaluate.py", line 168, in _evaluate_task
    task_results[split] = task.evaluate(
                          ^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/abstasks/retrieval.py", line 310, in evaluate
    return super().evaluate(
           ^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/abstasks/abstask.py", line 183, in evaluate
    scores[hf_subset] = self._evaluate_subset(
                        ^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/abstasks/retrieval.py", line 372, in _evaluate_subset
    results = retriever(
              ^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/_evaluators/retrieval_evaluator.py", line 62, in __call__
    return search_model.search(
           ^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/mteb/models/search_wrappers.py", line 96, in search
    query_embeddings = self.model.encode(
                       ^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/mteb_wrapper.py", line 88, in encode
    text_embeddings = self.get_text_embeddings(inputs, prompt_type=prompt_type, **kwargs)
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/mteb_wrapper.py", line 182, in get_text_embeddings
    for batch in tqdm(inputs, desc="Encoding texts"):
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/tqdm/std.py", line 1181, in __iter__
    for obj in iterable:
               ^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/dataloader.py", line 701, in __next__
    data = self._next_data()
           ^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/dataloader.py", line 757, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/_utils/fetch.py", line 55, in fetch
    return self.collate_fn(data)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/_utils/collate.py", line 398, in default_collate
    return collate(batch, collate_fn_map=default_collate_fn_map)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/_utils/collate.py", line 172, in collate
    key: collate(
         ^^^^^^^^
  File "/mnt/data/QWEN_EMBEDDINGS/eager-embed-v1/.venv/lib/python3.12/site-packages/torch/utils/data/_utils/collate.py", line 207, in collate
    raise RuntimeError("each element in list of batch should be of equal size")
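
For context, this RuntimeError is raised by PyTorch's default collate function when dict fields within a batch contain lists of different lengths. A minimal, illustrative repro (not the PR's code):

from torch.utils.data import default_collate

# Two dict samples whose "image" lists differ in length; default_collate
# raises the same RuntimeError seen in the traceback above.
default_collate([
    {"image": [1, 2]},
    {"image": [1, 2, 3]},
])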

And then I remembered why I added the modality separation in the first place (it was because of this bug): #3602 (comment)

Just in case, here's my attempt at the unified embeddings method (I get the same error as above when running the benchmark):

def encode(
        self,
        inputs: DataLoader[BatchedInput],
        *,
        task_metadata: TaskMetadata,
        hf_split: str,
        hf_subset: str,
        prompt_type: PromptType | None = None,
        **kwargs: Any,
    ) -> Array:
        """Encode inputs (text and/or images) into embeddings."""
        from qwen_vl_utils import process_vision_info

        all_embeddings: list[torch.Tensor] = []

        with torch.no_grad():
            for batch in tqdm(inputs, desc="Encoding"):
                batch_texts = batch.get("text", [])
                batch_images = batch.get("image", [])
                
                messages = []
                for i in range(max(len(batch_texts), len(batch_images))):
                    text_content = batch_texts[i] if batch_texts else ""
                    image_content = batch_images[i] if batch_images else None
                    
                    query_prefix = ('Query: ' if prompt_type == PromptType.query else '')
                    
                    content = [
                        {
                            'type': 'text',
                            'text': f'{query_prefix}{text_content}'
                        }
                    ]
                    
                    if image_content is not None:
                        content.append({
                            'type': 'image',
                            'image': image_content,
                            'resized_height': self.image_size,
                            'resized_width': self.image_size
                        })

                    messages.append([{
                        'role': 'user',
                        'content': content
                    }])

                # Prepare inputs
                texts = [
                    self.processor.apply_chat_template(
                        msg, tokenize=False, add_generation_prompt=False
                    ) + "<|endoftext|>"
                    for msg in messages
                ]

                image_inputs = None
                video_inputs = None
                if batch_images:
                    image_inputs, video_inputs = process_vision_info(messages)

                model_inputs = self.processor(
                    text=texts,
                    images=image_inputs,
                    videos=video_inputs,
                    padding='longest',
                    return_tensors='pt'
                ).to(self.device)

                # Get embeddings
                output = self.mdl(**model_inputs, return_dict=True, output_hidden_states=True)
                embeddings = self.get_embedding(output.hidden_states[-1])
                embeddings = embeddings.cpu().to(torch.float32)
                embeddings = torch.nn.functional.normalize(embeddings, p=2, dim=-1)
                
                all_embeddings.append(embeddings)

        # Concatenate all embeddings
        return torch.cat(all_embeddings, dim=0)

I assume I must be doing something wrong with how I handle the tensors, but I've been debugging this for several hours with no luck so far.

@Samoed mentioned this pull request Nov 25, 2025
@Samoed (Member) commented Nov 25, 2025

@jpbalarini I've added a fix #3618. Thank you for reporting!

@jpbalarini (Contributor, Author) commented

@jpbalarini I've added a fix #3618. Thank you for reporting!

You're welcome! Let me add the latest changes and rerun the benchmark to verify that everything works as expected.

@Samoed (Member) commented Nov 26, 2025

@jpbalarini Is this the final version? Have you submitted all the results with texts and images processed together?

@jpbalarini (Contributor, Author) commented

@jpbalarini Is this the final version? Have you submitted all the results with texts and images processed together?

Yes @Samoed, I updated the new results here (and I added ViDoRe v3 too).
It should be ready to merge! Thanks for all the comments.

@Samoed (Member) commented Nov 26, 2025

Great work!

@Samoed enabled auto-merge (squash) on November 26, 2025, 20:16
@Samoed merged commit 7e2fa98 into embeddings-benchmark:main on Nov 26, 2025
11 checks passed