Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: support Data-in Data-out #35856

Open
1 task done
zhengbuqian opened this issue Aug 30, 2024 · 1 comment
Open
1 task done

[Feature]: support Data-in Data-out #35856

zhengbuqian opened this issue Aug 30, 2024 · 1 comment
Assignees
Labels
kind/feature Issues related to feature request from users

Comments

@zhengbuqian
Copy link
Collaborator

Is there an existing issue for this?

  • I have searched the existing issues

Is your feature request related to a problem? Please describe.

Data-in Data-out is a code name, the official feature name is yet to be formalized.

In order to improve the usability of Milvus and make vector based unstructured data(texts, images, audios, etc) similarity search more approachable to non-techs, we want to allow the users to specify the type of embedding models they want to use, and insert raw data directly without worrying about how they should embed the data. At search time, the users can also just provide the raw data as the query instead of providing query embedding.

In the very first phase, only BM25 based Doc-in Doc-out(or Text-in Text-out) will be supported, this is tracked in #35853.

At a later phase, users may be able to bring any type of embedding models/services at their choice and search on any types of data beyond just texts.

More details and API specs will be shared shortly.

Describe the solution you'd like.

No response

Describe an alternate solution.

No response

Anything else? (Additional Context)

No response

@junjiejiangjjj
Copy link

junjiejiangjjj commented Sep 9, 2024

Data-in Data-out

Architecture

Milvus converts raw data into vectors in the proxy service.

image

usage

  • Create a collection with embedding model config.
  • Input the original data, milvus uses the model to calculate the vector.
  • Supports searching using raw data.

Support models

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Issues related to feature request from users
Projects
None yet
Development

No branches or pull requests

3 participants