milvus入門使用

bonelee發表於2024-06-13

原文網址 : https://www.cnblogs.com/bonelee/p/18246704

插入資料後的效果：

程式碼如下：

import configparser
from pymilvus import connections, Collection, DataType, FieldSchema, CollectionSchema
import numpy as np

def create_collection():
    # Define the schema
    fields = [
        FieldSchema(name="sentence_id", dtype=DataType.INT64, is_primary=True, auto_id=True),
        FieldSchema(name="sentence", dtype=DataType.VARCHAR, max_length=512),
        FieldSchema(name="embedding", dtype=DataType.FLOAT_VECTOR, dim=128)
    ]
    schema = CollectionSchema(fields, description="Sentence collection")

    # Create the collection
    collection = Collection(name="sentence_collection", schema=schema)
    return collection

def insert_data(collection):
    sentences = [
        "這是第一句。",
        "這是第二句。",
        "這是第三句。"
    ]
    
    embeddings = np.random.rand(len(sentences), 128).tolist()  # Generate 128-dimensional vectors
    
    entities = [
        sentences,
        embeddings
    ]

    insert_result = collection.insert(entities)
    print(f"Inserted {len(insert_result.primary_keys)} records into collection.")

def create_index(collection):
    index_params = {
        "index_type": "IVF_FLAT",
        "params": {"nlist": 128},
        "metric_type": "L2"
    }
    collection.create_index(field_name="embedding", index_params=index_params)
    print("Index created.")

def search_data(collection, query_sentence):
    query_embedding = np.random.rand(1, 128).tolist()  # Generate a vector for the query sentence

    search_params = {"metric_type": "L2", "params": {"nprobe": 10}}
    
    results = collection.search(
        data=query_embedding,
        anns_field="embedding",
        param=search_params,
        limit=3,
        expr=None,
        output_fields=["sentence"]
    )
    
    for hits in results:
        for hit in hits:
            print(f"Match found: {hit.id} with distance: {hit.distance}, sentence: {hit.entity.get('sentence')}")

if __name__ == '__main__':
    # Connect to Milvus
    cfp = configparser.RawConfigParser()
    cfp.read('config.ini')
    milvus_uri = cfp.get('example', 'uri')
    token = cfp.get('example', 'token')
    connections.connect("default",
                        uri=milvus_uri,
                        token=token)
    print(f"Connecting to DB: {milvus_uri}")
    
    # Create collection
    collection = create_collection()

    # Insert data
    insert_data(collection)
    
    # Create index
    create_index(collection)
    
    # Load the collection into memory
    collection.load()
    
    # Search data
    search_data(collection, "這是一個查詢句子。")

執行效果：

python hello_zilliz_vectordb.py
Connecting to DB: https://in03-ca69f49bb65709f.api.gcp-us-west1.zillizcloud.com
Inserted 3 records into collection.
Index created.
Match found: 450140263656791260 with distance: 19.557846069335938, sentence: 這是第二句。
Match found: 450140263656791261 with distance: 20.327802658081055, sentence: 這是第三句。
Match found: 450140263656791259 with distance: 20.40052032470703, sentence: 這是第一句。

注意事項：

向量轉換：上面的程式碼使用了隨機向量來模擬句子向量。在實際應用中，您需要使用 NLP 模型（例如中文 BERT）來將中文句子轉換為向量。
字元編碼：確保在讀取和處理中文文字時使用正確的字元編碼（通常是 UTF-8）。

Milvus向量資料庫入門實踐
2024-05-21
資料庫
milvus-migration安裝使用
2024-11-13
Android入門教程 | RecyclerView使用入門
2021-11-07
AndroidView
influxdb使用入門
2018-11-07
UX
IPFS 使用入門
2019-01-10
lucene入門使用
2018-12-23
PromiseKit 入門使用
2019-03-01
Promise
Numpy使用入門
2018-06-26
PyAutoGUI使用入門
2024-05-20
GUI
Jetty使用入門
2024-03-04
Jetty
plantuml使用入門
2024-04-11
CentOS使用入門
2024-05-02
CentOS
appuploader 入門使用
2023-05-03
APP
Redis 入門使用
2024-11-10
Redis
valgrind使用入門
2024-08-12
JNA使用入門
2024-08-09
postman 使用入門
2024-06-03
Postman
Docker使用入門
2021-07-07
Docker
Mysql - 使用入門
2021-04-28
MySql
python 使用 imagehash 庫生成 ahash，並存入 milvus
2022-11-24
Python
java操作milvus
2024-08-19
Java
milvus基礎
2024-08-21
milvus日常管理
2024-08-09
AWS Lambda 使用入門
2019-03-04
01 | Bootstrap使用入門
2019-08-08
boot
mongoose的入門使用
2019-09-02
Go
PostgreSQL JSONB 使用入門
2019-05-30
SQLJSON
React入門到使用
2019-03-23
React
Redux入門到使用
2019-03-23
Redux
arthas的使用入門
2024-08-14
Metricbeat入門與使用
2024-10-15
Android WorkManager使用入門
2021-08-19
Android
github ations 入門使用
2022-04-15
Github
promise入門基本使用
2021-09-22
Promise
React Hook 入門使用
2020-12-27
ReactHook
Elasticsearch（windows）使用入門
2020-12-17
ElasticsearchWindows
milvus日常管理new
2024-11-08
Milvus 2.0 正式 GA
2022-02-07

milvus入門使用

注意事項：

相關文章