From 266c9d2f93fc639561bb7cd06253f5fae83e9718 Mon Sep 17 00:00:00 2001 From: PahudPlus Date: Wed, 8 Jul 2020 19:37:38 +0800 Subject: [PATCH] Updated title. Minor --- site/en/quick_start/milvus_operation.md | 30 +++++++++++----------- site/zh-CN/quick_start/milvus_operation.md | 2 +- 2 files changed, 16 insertions(+), 16 deletions(-) diff --git a/site/en/quick_start/milvus_operation.md b/site/en/quick_start/milvus_operation.md index d28eb114e..85ad2dfae 100644 --- a/site/en/quick_start/milvus_operation.md +++ b/site/en/quick_start/milvus_operation.md @@ -4,7 +4,7 @@ title: Learn Milvus Operations sidebar_label: Learn Milvus Operations --- -# Learn Milvus Operations +# Milvus Hello World This page walks you through some basic Milvus operations using the [Python client](https://github.com/milvus-io/pymilvus). Refer to [Python API documentation](https://github.com/milvus-io/pymilvus) for detailed information. @@ -108,13 +108,13 @@ Currently, a collection only supports one index type, and switching the index ty > Note: For different index types, the required parameters for index building also differ. You **must** specify values for all index parameters. - | Index type | Index parameters | Example | Value range | - | --------------------- | ------------ | ----------------------------------------------------------------------- | -------------------- | - | `IVFLAT` / `SQ8` / `SQ8H`| `nlist`:Number of clusters from the vector data file when Milvus performs clustering operation for index creation. The index file records the results of the clustering operation, including index type, central vector of each cluster, and the vectors in each cluster, for later search operations. | `{nlist: 16384}` | `nlist`:[1, 999999] | - | `IVFPQ` | `nlist`:Number of clusters from the vector data file when Milvus performs clustering operation for index creation. 
The index file records the results of the clustering operation, including index type, central vector of each cluster, and the vectors in each cluster, for later search operations.

`m`:Compression ratio during index creation. The smaller `m` is, the higher the compression ratio. | `{nlist: 16384, m: 12}` | `nlist`:[1, 999999]

`m`: one of {96, 64, 56, 48, 40, 32, 28, 24, 20, 16, 12, 8, 4, 3, 2, 1} | - | `NSG` | `search_length`:The higher the value, the more nodes are searched in the graph, the higher the recall rate, but the slower the search speed. It is recommended that `search_length` is smaller than `candidate_pool` and in range [40, 80]。

`out_degree`:The higher the value, the higher the memory usage, and the better search performance.

`candidate_pool`:Affects index quality and is suggested to be in range [200, 500].

`knng`:Affects index quality and is suggested to be `out_degree` + 20. | `{search_length: 45, out_degree:50, candidate_pool_size:300, knng:100}` | `search_length range`: [10, 300]

`out_degree`: [5, 300]

`candidate_pool_size`: [50, 1000]

`knng`: [5, 300] | - | `HNSW` | `M`:Affects index build time and index quality. The higher the value, the longer it costs to build an index, the higher the index quality, and the higher the memory usage.

`efConstruction`:Affects index build time and index quality. The higher the value, the longer it costs to build and index, the higher the index quality, and the higher the memory usage. | `{M: 16, efConstruction:500}` | `M` :[5, 48]

`efConstruction` :[100, 500] | - | `ANNOY` | `n_trees`: Affects the build time and the index size. A larger value will give more accurate results, but larger indexes. | `{"n_trees": 8}` | [1, 1024] | + | Index type | Index parameters | Example | Value range | + | --------------------- | ------------ | ----------------------------------------------------------------------- | -------------------- | + | `IVFLAT` / `SQ8` / `SQ8H`| `nlist`:Number of clusters from the vector data file when Milvus performs clustering operation for index creation. The index file records the results of the clustering operation, including index type, central vector of each cluster, and the vectors in each cluster, for later search operations. | `{nlist: 16384}` | `nlist`:[1, 999999] | + | `IVFPQ` | `nlist`:Number of clusters from the vector data file when Milvus performs clustering operation for index creation. The index file records the results of the clustering operation, including index type, central vector of each cluster, and the vectors in each cluster, for later search operations.

`m`:Compression ratio during index creation. The smaller `m` is, the higher the compression ratio. | `{nlist: 16384, m: 12}` | `nlist`:[1, 999999]

`m`: one of {96, 64, 56, 48, 40, 32, 28, 24, 20, 16, 12, 8, 4, 3, 2, 1} | + | `NSG` | `search_length`:The higher the value, the more nodes are searched in the graph, the higher the recall rate, but the slower the search speed. It is recommended that `search_length` be smaller than `candidate_pool_size` and in the range [40, 80].

`out_degree`:The higher the value, the higher the memory usage, and the better the search performance.

`candidate_pool_size`:Affects index quality; the suggested range is [200, 500].

`knng`:Affects index quality and is suggested to be `out_degree` + 20. | `{search_length: 45, out_degree:50, candidate_pool_size:300, knng:100}` | `search_length`: [10, 300]

`out_degree`: [5, 300]

`candidate_pool_size`: [50, 1000]

`knng`: [5, 300] | + | `HNSW` | `M`:Affects index build time and index quality. The higher the value, the longer it takes to build an index, the higher the index quality, and the higher the memory usage.

`efConstruction`:Affects index build time and index quality. The higher the value, the longer it takes to build an index, the higher the index quality, and the higher the memory usage. | `{M: 16, efConstruction:500}` | `M` :[5, 48]

`efConstruction` :[100, 500] | + | `ANNOY` | `n_trees`: Affects the build time and the index size. A larger value will give more accurate results, but larger indexes. | `{"n_trees": 8}` | [1, 1024] | Refer to [Milvus Indexes](index.md) for more information。 @@ -211,13 +211,13 @@ A segment is a data file that Milvus automatically creates by merging inserted v > Note: For different index types, search parameters also differ. You **must** assign values to all search parameters. - | Index type | Search parameters | Example | Value range | - | --------------------- | ------------ | ----------------------------------------------------------------------- | -------------------- | - | `FLAT` | - | | - | - | `IVFLAT`/`SQ8`/`SQ8H`/`IVFPQ` | `nprobe`:Number of classes of vectors to search. `nprobe` affects search precision. The higher the value, the higher the precision, but the lower the search speed. | `{nprobe: 32}`| [1, `nlist`] | - | `NSG` | `search_length`:The higher the value, the more number of nodes are searched in the graph and the higher the recall rate, but the lower the search speed. | `{search_length:100}`| [10, 300] | - | `HNSW` | `ef`:The higher the value, the more data is searched in the index and the higher the recall rate, but the lower the search speed.| `{ef: 64}`| [`topk`, 4096] | - | `ANNOY` | `search_k`: Affects the search performance. A larger value will give more accurate results, but will take longer time to return.
-1 indicates the default value which is 5% of the total data amount. | `{"search_k": -1}` | {-1} ∪ [topk, ∞) | + | Index type | Search parameters | Example | Value range | + | --------------------- | ------------ | ----------------------------------------------------------------------- | -------------------- | + | `FLAT` | - | | - | + | `IVFLAT`/`SQ8`/`SQ8H`/`IVFPQ` | `nprobe`:Number of classes of vectors to search. `nprobe` affects search precision. The higher the value, the higher the precision, but the lower the search speed. | `{nprobe: 32}`| [1, `nlist`] | + | `NSG` | `search_length`:The higher the value, the more nodes are searched in the graph and the higher the recall rate, but the lower the search speed. | `{search_length:100}`| [10, 300] | + | `HNSW` | `ef`:The higher the value, the more data is searched in the index and the higher the recall rate, but the lower the search speed.| `{ef: 64}`| [`topk`, 4096] | + | `ANNOY` | `search_k`: Affects the search performance. A larger value will give more accurate results, but will take longer to return.
-1 indicates the default value which is 5% of the total data amount. | `{"search_k": -1}` | {-1} ∪ [topk, ∞) | > Note: `top_k` stands for the number of vectors that are the most similar to the target vector. `top_k` is defined during search. The value range of `top_k` is `(0, 2048]`. diff --git a/site/zh-CN/quick_start/milvus_operation.md b/site/zh-CN/quick_start/milvus_operation.md index d76f3a134..21d222870 100644 --- a/site/zh-CN/quick_start/milvus_operation.md +++ b/site/zh-CN/quick_start/milvus_operation.md @@ -4,7 +4,7 @@ title: Learn Milvus Operations sidebar_label: Learn Milvus Operations --- -# Milvus 基本操作 +# Milvus Hello World 该页面将向你展示如何使用 [Python SDK](https://github.com/milvus-io/pymilvus) 运行 Milvus 基本操作。关于详细的 API 参考信息,请参考 [Python API 文档](https://github.com/milvus-io/pymilvus)。