Add vector service concept, and resource endpoint to handle vector search #33

sd2k · 2023-09-07T10:25:17Z

This cannibalizes some of #23 but focusses on the read path: adding a resource endpoint to the plugin to search a configured vector store using the configured embedding engine.

It adds:

- a `store` package for interacting with various vector stores
  - the interfaces are split into `ReadVectorStore` and `WriteVectorStore` so we don't need to implement everything right now (we could remove the write interface for now since it's not being used yet)
  - there's an implementation for the pending Grafana vectorapi here too; I can copy a qdrant one from a draft Grafana PR later on as another example
- an `embed` package for interacting with embedding engines
  - there's an implementation here for OpenAI embeddings (which vectorapi is also compatible with); more can be added as and when they're needed
- a `vector` package which wraps and combines objects from these two packages to provide a user-friendly service for searching for related text, handling the mapping from collection -> model and the embedding process
- a resource endpoint, `/vector/search`, which frontend clients can use to search for related objects in various collections
- another docker-compose service for the vectorapi to be deployed, and extra provisioning for the app including more config to talk to the vectorapi service. This will probably need to be removed for now though since vectorapi isn't OSS yet.
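To make the layering concrete, here is a rough sketch of how the three packages could fit together. Only the names `ReadVectorStore` and `Embedder` come from this PR; the method signatures, the `Service` fields, the `runDemo` helper, the fakes, and the example model name are assumptions made for illustration, not the code in this branch.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// SearchResult is a single match: the stored payload plus a similarity score.
type SearchResult struct {
	Payload map[string]any
	Score   float64
}

// ReadVectorStore is the read half of the store interface.
// The method signature is an assumption made for this sketch.
type ReadVectorStore interface {
	Search(ctx context.Context, collection string, vector []float32, limit uint64) ([]SearchResult, error)
}

// Embedder turns text into a vector using the named model.
// The method signature is an assumption made for this sketch.
type Embedder interface {
	Embed(ctx context.Context, model, text string) ([]float32, error)
}

// ErrUnknownCollection is returned when no model is configured for a collection.
var ErrUnknownCollection = errors.New("unknown collection")

// Service wires an Embedder and a ReadVectorStore together and owns the
// collection -> model mapping.
type Service struct {
	embedder Embedder
	store    ReadVectorStore
	models   map[string]string
}

// Search embeds text with the collection's model, then queries the store.
func (s *Service) Search(ctx context.Context, collection, text string, limit uint64) ([]SearchResult, error) {
	model, ok := s.models[collection]
	if !ok {
		return nil, fmt.Errorf("%w: %q", ErrUnknownCollection, collection)
	}
	vector, err := s.embedder.Embed(ctx, model, text)
	if err != nil {
		return nil, fmt.Errorf("embed query: %w", err)
	}
	return s.store.Search(ctx, collection, vector, limit)
}

// fakeEmbedder and fakeStore are stand-ins so the sketch runs without any
// external service.
type fakeEmbedder struct{}

func (fakeEmbedder) Embed(_ context.Context, _, text string) ([]float32, error) {
	return []float32{float32(len(text))}, nil
}

type fakeStore struct{}

func (fakeStore) Search(_ context.Context, _ string, _ []float32, _ uint64) ([]SearchResult, error) {
	return []SearchResult{{Payload: map[string]any{"title": "Mimir resource usage"}, Score: 0.79}}, nil
}

// runDemo builds a Service from the fakes and runs one search.
func runDemo(collection string) ([]SearchResult, error) {
	svc := &Service{
		embedder: fakeEmbedder{},
		store:    fakeStore{},
		// Example model name only; the real value comes from config.
		models: map[string]string{"grafana:core:dashboards": "text-embedding-ada-002"},
	}
	return svc.Search(context.Background(), collection, "Mimir usage", 3)
}

func main() {
	results, err := runDemo("grafana:core:dashboards")
	fmt.Println(results, err)

	_, err = runDemo("no:such:collection")
	fmt.Println(errors.Is(err, ErrUnknownCollection))
}
```

Keeping the store and embedder behind small interfaces is what lets the service be exercised with fakes like these, and lets a qdrant or vectorapi store be swapped in without touching the search logic.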

Some decisions made here:

- collection details are specified in config; specifically, this means each collection has exactly one associated model and dimension
  - this is specified in config rather than anywhere else right now because the vector service is responsible for embedding the query using the same model and dimension as the vectors in the collection
  - in theory this could go somewhere else, but we can't make many assumptions about what each vector DB implementation lets you store, so config is a fairly safe common denominator
  - ideally this would be shared with whatever sync service we end up using, so that they both use the same embedding model for each collection, and the same collection names for each type of object
- we have interfaces for `vector.Store` and `embed.Embedder` so we can have various concrete implementations, such as Qdrant/Milvus/Weaviate for the store or OpenAI/custom APIs for the embedder. This PR only adds a single one for each though, and we may end up abstracting over stores in vectorapi instead
- there's a collection in the provisioned config named `grafana:core:dashboards`. The naming convention here is completely open to bikeshedding. The idea is to scope collections similarly to the way Grafana Live channels are scoped (the `:` can be replaced with `/` for consistency)
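As an illustration of the "one model and dimension per collection" decision, the sketch below indexes a list of collection entries by name and checks a query vector against the configured dimension. The JSON field names (`name`, `model`, `dimension`), the helper names, and the example model and dimension values are all hypothetical; the real provisioning file is YAML and its shape may differ.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// CollectionConfig describes one collection. Field names are hypothetical.
type CollectionConfig struct {
	Name      string `json:"name"`
	Model     string `json:"model"`
	Dimension int    `json:"dimension"`
}

// indexCollections parses a JSON list of collections and indexes it by name,
// rejecting incomplete or duplicated entries.
func indexCollections(raw []byte) (map[string]CollectionConfig, error) {
	var list []CollectionConfig
	if err := json.Unmarshal(raw, &list); err != nil {
		return nil, fmt.Errorf("parse collections: %w", err)
	}
	byName := make(map[string]CollectionConfig, len(list))
	for _, c := range list {
		if c.Name == "" || c.Model == "" || c.Dimension <= 0 {
			return nil, fmt.Errorf("collection %q: name, model and a positive dimension are required", c.Name)
		}
		if _, dup := byName[c.Name]; dup {
			return nil, fmt.Errorf("collection %q is configured twice", c.Name)
		}
		byName[c.Name] = c
	}
	return byName, nil
}

// checkQueryVector reports an error when an embedded query does not have the
// dimension configured for the collection it is about to be searched against.
func checkQueryVector(c CollectionConfig, vector []float32) error {
	if len(vector) != c.Dimension {
		return fmt.Errorf("collection %q expects dimension %d, got %d", c.Name, c.Dimension, len(vector))
	}
	return nil
}

func main() {
	raw := []byte(`[{"name": "grafana:core:dashboards", "model": "text-embedding-ada-002", "dimension": 1536}]`)
	collections, err := indexCollections(raw)
	if err != nil {
		fmt.Println("config error:", err)
		return
	}
	c := collections["grafana:core:dashboards"]
	fmt.Println(checkQueryVector(c, make([]float32, 1536))) // matching length
	fmt.Println(checkQueryVector(c, make([]float32, 768)))  // mismatched length
}
```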

Testing this is a little tricky right now because we don't have open images for vectorapi, or a process for getting data into the vectorapi store. One way to do so is:

- build vectorapi using `make`
- run `docker-compose up` in this repository
- run `poetry run python -m src.dashboards.index --directory ./src/dashboards/output` from the `vector-playground` directory of the `llm-experiment-lab` repository, on the `vectordb-playground-vectorapi` branch. This will send a bunch of dashboards into the `grafana:core:dashboards` collection, ready for searching
- call the resource endpoint with `curl http://localhost:3000/api/plugins/grafana-llm-app/resources/vector/search -d '{"collection": "grafana:core:dashboards", "text": "Mimir usage", "limit": 3}'`
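The same call can be made from Go. This is a minimal sketch: the path and request fields are taken from the curl command above and the `results`/`payload`/`score` response fields from the example output in this PR, while the type names, the `search` helper, and the stub transport (used so the example runs without a Grafana instance) are assumptions. Authentication is not covered.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"strings"
)

const searchPath = "/api/plugins/grafana-llm-app/resources/vector/search"

type searchRequest struct {
	Collection string `json:"collection"`
	Text       string `json:"text"`
	Limit      int    `json:"limit"`
}

type searchResult struct {
	Payload map[string]any `json:"payload"`
	Score   float64        `json:"score"`
}

type searchResponse struct {
	Results []searchResult `json:"results"`
}

// search POSTs the request as JSON and decodes the response body.
func search(client *http.Client, baseURL string, req searchRequest) (searchResponse, error) {
	var out searchResponse
	body, err := json.Marshal(req)
	if err != nil {
		return out, err
	}
	resp, err := client.Post(baseURL+searchPath, "application/json", bytes.NewReader(body))
	if err != nil {
		return out, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		msg, _ := io.ReadAll(io.LimitReader(resp.Body, 1024))
		return out, fmt.Errorf("vector search: status %d: %s", resp.StatusCode, msg)
	}
	err = json.NewDecoder(resp.Body).Decode(&out)
	return out, err
}

// stubTransport answers requests in-process so the sketch needs no server.
type stubTransport struct{}

func (stubTransport) RoundTrip(r *http.Request) (*http.Response, error) {
	if r.Body != nil {
		r.Body.Close()
	}
	status, body := http.StatusOK, `{"results":[{"payload":{"title":"Mimir resource usage"},"score":0.79}]}`
	if r.Method != http.MethodPost || r.URL.Path != searchPath {
		status, body = http.StatusNotFound, "not found"
	}
	return &http.Response{
		StatusCode: status,
		Header:     make(http.Header),
		Body:       io.NopCloser(strings.NewReader(body)),
		Request:    r,
	}, nil
}

// searchAgainstStub runs search with the stub transport instead of a network call.
func searchAgainstStub(req searchRequest) (searchResponse, error) {
	client := &http.Client{Transport: stubTransport{}}
	return search(client, "http://localhost:3000", req)
}

func main() {
	resp, err := searchAgainstStub(searchRequest{
		Collection: "grafana:core:dashboards",
		Text:       "Mimir usage",
		Limit:      3,
	})
	fmt.Println(resp.Results, err)
}
```

To try it against a local instance, pass `http.DefaultClient` to `search` instead of the stubbed client.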

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

PR #33 can't really be merged until we open-source and package up vectorapi, so this commit adds a qdrant implementation and modifies the provisioning file to use it (and OpenAI) by default, rather than vectorapi.

src/plugin.json

provisioning/plugins/grafana-llm-app.yaml

Otherwise the payloads are JSON marshaled like this:

```json
{"results":[{"payload":{"description":{"Kind":{"NullValue":0}},"panels":{"Kind":{"ListValue":{"values":[{"Kind":{"StructValue":{"fields":{"description":{"Kind":{"StringValue":"CPU usage of Mimir"}},"title":{"Kind":{"StringValue":"CPU usage"}}}}}}]}}},"title":{"Kind":{"StringValue":"Mimir resource usage"}}},"score":0.7932371497154236}]}
```

instead of what we want (after this PR):

```json
{"results":[{"payload":{"description":null,"panels":[{"description":"CPU usage of Mimir","title":"CPU usage"}],"title":"Mimir resource usage"},"score":0.7932371497154236}]}
```
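For illustration only, the transformation between those two shapes can be sketched on the decoded JSON itself. This is not how the commit does it (the real fix presumably works on the store client's Go types); the helper names are hypothetical, and treating every scalar kind other than `NullValue` as a pass-through is an assumption based on the wrapper names visible in the example.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// unwrap converts one {"Kind": {"<Type>Value": ...}} wrapper into a plain
// JSON value, recursing into lists and structs. Anything that is not a
// single-entry "Kind" wrapper is returned unchanged.
func unwrap(v any) any {
	m, ok := v.(map[string]any)
	if !ok {
		return v
	}
	kind, ok := m["Kind"].(map[string]any)
	if !ok || len(kind) != 1 {
		return v
	}
	for name, inner := range kind {
		switch name {
		case "NullValue":
			return nil
		case "ListValue":
			lv, _ := inner.(map[string]any)
			values, _ := lv["values"].([]any)
			out := make([]any, len(values))
			for i, item := range values {
				out[i] = unwrap(item)
			}
			return out
		case "StructValue":
			sv, _ := inner.(map[string]any)
			fields, _ := sv["fields"].(map[string]any)
			return unwrapFields(fields)
		default:
			// Assumed scalar kinds (for example StringValue): pass through.
			return inner
		}
	}
	return v
}

// unwrapFields unwraps every value in a map of wrapped fields.
func unwrapFields(fields map[string]any) map[string]any {
	out := make(map[string]any, len(fields))
	for key, value := range fields {
		out[key] = unwrap(value)
	}
	return out
}

// flattenPayload takes a wrapped payload object and returns plain JSON.
func flattenPayload(raw []byte) ([]byte, error) {
	var payload map[string]any
	if err := json.Unmarshal(raw, &payload); err != nil {
		return nil, err
	}
	return json.Marshal(unwrapFields(payload))
}

func main() {
	wrapped := `{"description":{"Kind":{"NullValue":0}},"panels":{"Kind":{"ListValue":{"values":[{"Kind":{"StructValue":{"fields":{"description":{"Kind":{"StringValue":"CPU usage of Mimir"}},"title":{"Kind":{"StringValue":"CPU usage"}}}}}}]}}},"title":{"Kind":{"StringValue":"Mimir resource usage"}}}`
	plain, err := flattenPayload([]byte(wrapped))
	fmt.Println(string(plain), err)
}
```

One limitation of working on decoded JSON: a genuine payload object whose only key is `Kind` would be mistaken for a wrapper, which is one reason to convert from the typed values instead.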

Add qdrant read vector store implementation

csmarchbanks

Let's get this going and iterate on it as we get more data into vector dbs!

Qdrant doesn't allow ':' in collection names, so replace them with '.'. In future we might want to follow their [recommendations] and only use one collection, but this will work for now.

[recommendations]: https://qdrant.tech/documentation/faq/qdrant-fundamentals/#how-many-collections-can-i-create
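The replacement itself is a one-liner; a sketch, with a hypothetical helper name:

```go
package main

import (
	"fmt"
	"strings"
)

// qdrantCollectionName maps a plugin collection name to one Qdrant accepts
// by replacing every ':' with '.'.
func qdrantCollectionName(name string) string {
	return strings.ReplaceAll(name, ":", ".")
}

func main() {
	fmt.Println(qdrantCollectionName("grafana:core:dashboards"))
}
```

Note that the mapping is not reversible: `a:b` and `a.b` end up as the same Qdrant collection, so collection names that already contain '.' could collide.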

pkg/plugin/vector/store/store.go

Also tidy up some settings which were getting a bit messy, and fix secret handling (they shouldn't be in JSON data)

yoziru

alright lgtm, provisioning needs some better docs or maybe less nesting but the core concepts work well 👍

This was moved around as part of #33, should be fixed by this commit.

This adds convenience functions to access the endpoints added in #33.

sd2k requested a review from yoziru September 7, 2023 10:26

sd2k added 2 commits September 7, 2023 12:12

Add vector service concept, and resource endpoint to handle vector search

725fe1a

This cannibalizes some of #23 but focusses on the read path: searching a configured vector store using the configured embedding engine.

Add new words to spellcheck config

bb0e956

sd2k force-pushed the vector-service branch from c3e56fe to bb0e956 Compare September 7, 2023 11:12

sd2k added 5 commits September 7, 2023 12:23

Fix lint errors, remember to close response bodies

b8aa28b

Add even more words to spell check config

ed6b840

Be more robust to embedding/vector stores not being configured

12d2031

Add vectorapi to docker-compose/provisioning

e40b1e9

Add better logging, handle nil values better, add collection config

d1b90e3

sd2k force-pushed the vector-service branch from dff0a30 to d1b90e3 Compare September 7, 2023 13:14

yoziru and others added 6 commits September 7, 2023 16:05

fixup model

04e13d0

vectorapi: add /v1 to route

63ebd64

docker: cache .sentence-transformers volume

1d73141

Load 'payload' from vectorapi correctly

73a886b

Merge branch 'main' into vector-service

496d342

Fix import paths

ccbb1fb

sd2k marked this pull request as ready for review September 8, 2023 09:42

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 8, 2023

Add llms.vector module for vector search using grafana-llm-app

e078895

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

sd2k mentioned this pull request Sep 8, 2023

Add llms.vector module for vector search using grafana-llm-app grafana/grafana-experimental#76

Merged

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 8, 2023

Add llms.vector module for vector search using grafana-llm-app

f46fd1f

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 8, 2023

Add llms.vector module for vector search using grafana-llm-app

1d9f1eb

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

Fix handling of OpenAI API key

bda7f80

sd2k mentioned this pull request Sep 12, 2023

Add qdrant read vector store implementation #43

Merged

Short-circuit before calculating embedding if collection doesn't exist in store

190635f

sd2k force-pushed the vector-service branch from c7995ca to 190635f Compare September 12, 2023 07:32

Use pointer receiver for vectorService.Search

727be1b

yoziru reviewed Sep 12, 2023

src/plugin.json

yoziru mentioned this pull request Sep 12, 2023

CollectionPointResult: update response model grafana/vectorapi#8

Merged

sd2k added 3 commits September 12, 2023 10:06

Remove unused externalServiceRegistration from plugin.json

dd3765f

Remove unused Grafana feature toggle in docker-compose

ec3f88a

Add qdrant read vector store implementation

fa30317

PR #33 can't really be merged until we open-source and package up vectorapi, so this commit adds a qdrant implementation and modifies the provisioning file to use it (and OpenAI) by default, rather than vectorapi.

sd2k requested a review from csmarchbanks September 12, 2023 15:39

sd2k commented Sep 12, 2023

provisioning/plugins/grafana-llm-app.yaml

sd2k added 3 commits September 13, 2023 09:19

Handle TLS and auth in qdrant store

b81083f

Merge pull request #43 from grafana/vector-service-qdrant

977f0b0

Add qdrant read vector store implementation

csmarchbanks approved these changes Sep 13, 2023

yoziru reviewed Sep 14, 2023

pkg/plugin/vector/store/store.go

yoziru reviewed Sep 14, 2023

pkg/plugin/vector/store/store.go

sd2k added 2 commits September 14, 2023 09:27

Simplify model handling; use a single model for the whole vector service

8e6985b

Also tidy up some settings which were getting a bit messy, and fix secret handling (they shouldn't be in JSON data)

Add 'enabled' setting to vector settings to globally enable/disable vector services

bbfa61c

yoziru approved these changes Sep 14, 2023

sd2k merged commit b605e7b into main Sep 14, 2023
3 checks passed

sd2k deleted the vector-service branch September 14, 2023 09:34

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 14, 2023

Add llms.vector module for vector search using grafana-llm-app

9550e57

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

sd2k added a commit that referenced this pull request Sep 14, 2023

Centralize and fix OpenAI key handling

30184ce

This was moved around as part of #33, should be fixed by this commit.

sd2k mentioned this pull request Sep 14, 2023

Centralize and fix OpenAI key handling #45

Merged

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 14, 2023

Add llms.vector module for vector search using grafana-llm-app

48d1007

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

sd2k added a commit to grafana/grafana-experimental that referenced this pull request Sep 14, 2023

Add llms.vector module for vector search using grafana-llm-app

288d806

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.

SandersAaronD pushed a commit that referenced this pull request Oct 27, 2023

Add llms.vector module for vector search using grafana-llm-app

a9668e1

This adds convenience functions to access the endpoints added in #33.

SandersAaronD pushed a commit that referenced this pull request Oct 30, 2023

Add llms.vector module for vector search using grafana-llm-app

92c8ebb

This adds convenience functions to access the endpoints added in #33.

SandersAaronD pushed a commit that referenced this pull request Oct 31, 2023

Add llms.vector module for vector search using grafana-llm-app

d871d30

This adds convenience functions to access the endpoints added in #33.

SandersAaronD pushed a commit to grafana/grafana-experimental that referenced this pull request Jan 10, 2024

Add llms.vector module for vector search using grafana-llm-app

2c72ebe

This adds convenience functions to access the endpoints added in grafana/grafana-llm-app#33.
