diff --git a/v2.3.x/site/en/reference/index.md b/v2.3.x/site/en/reference/index.md index fac9a9122..344e95160 100644 --- a/v2.3.x/site/en/reference/index.md +++ b/v2.3.x/site/en/reference/index.md @@ -191,7 +191,9 @@ Similar to IVF_FLAT, GPU_IVF_FLAT also divides vector data into `nlist` cluster By adjusting `nprobe`, an ideal balance between accuracy and speed can be found for a given scenario. Results from the [IVF_FLAT performance test](https://zilliz.com/blog/Accelerating-Similarity-Search-on-Really-Big-Data-with-Vector-Indexing) demonstrate that query time increases sharply as both the number of target input vectors (`nq`), and the number of clusters to search (`nprobe`), increase. -IVF_FLAT is the most basic IVF index, and the encoded data stored in each unit is consistent with the original data. +GPU_IVF_FLAT is the most basic IVF index, and the encoded data stored in each unit is consistent with the original data. + +When conducting searches, note that you can set the top-K up to 256 for any search against a GPU_IVF_FLAT-indexed collection. - Index building parameters @@ -205,6 +207,12 @@ IVF_FLAT is the most basic IVF index, and the encoded data stored in each unit i | --------- | ------------------------ | --------------- | ------------- | | `nprobe` | Number of units to query | [1, nlist] | 8 | +- Limits on search + + | Parameter | Range | + | --------- | ------ | + | `top-K` | <= 256 | + ### IVF_SQ8 IVF_FLAT does not perform any compression, so the index files it produces are roughly the same size as the original, raw non-indexed vector data. For example, if the original 1B SIFT dataset is 476 GB, its IVF_FLAT index files will be slightly smaller (~470 GB). Loading all the index files into memory will consume 470 GB of storage.