nv_ingest_client.util.vdb package#

Submodules#

nv_ingest_client.util.vdb.adt_vdb module#

class nv_ingest_client.util.vdb.adt_vdb.VDB(**kwargs)[source]#

Bases: ABC

abstract create_index(**kwargs)[source]#

abstract reindex(records: list, **kwargs)[source]#

abstract retrieval(queries: list, **kwargs)[source]#

abstract run(records)[source]#

abstract write_to_index(records: list, **kwargs)[source]#

nv_ingest_client.util.vdb.milvus module#

class nv_ingest_client.util.vdb.milvus.Milvus(

collection_name: str | Dict = 'nv_ingest_collection',

milvus_uri: str = 'http://localhost:19530',

sparse: bool = False,

recreate: bool = True,

gpu_index: bool = True,

gpu_search: bool = True,

dense_dim: int = 2048,

minio_endpoint: str = 'localhost:9000',

enable_text: bool = True,

enable_charts: bool = True,

enable_tables: bool = True,

enable_images: bool = True,

enable_infographics: bool = True,

bm25_save_path: str = 'bm25_model.json',

compute_bm25_stats: bool = True,

access_key: str = 'minioadmin',

secret_key: str = 'minioadmin',

bucket_name: str = 'a-bucket',

meta_dataframe: str | DataFrame | None = None,

meta_source_field: str | None = None,

meta_fields: list[str] | None = None,

stream: bool = False,

**kwargs,

)[source]#

Bases: VDB

create_index(**kwargs)[source]#

get_connection_params()[source]#

get_write_params()[source]#

reindex(**kwargs)[source]#

retrieval(queries, **kwargs)[source]#

run(records)[source]#

write_to_index(records, **kwargs)[source]#

nv_ingest_client.util.vdb.milvus.add_metadata( element, meta_dataframe, meta_source_field, meta_data_fields, )[source]#

nv_ingest_client.util.vdb.milvus.bulk_insert_milvus( collection_name: str, writer: RemoteBulkWriter, milvus_uri: str = 'http://localhost:19530', )[source]#

This function initialize the bulk ingest of all minio uploaded records, and checks for milvus task completion. Once the function is complete all records have been uploaded to the milvus collection.

Parameters:

collection_name (str) – Name of the milvus collection.
writer (RemoteBulkWriter) – The Milvus Remote BulkWriter instance that was created with necessary params to access the minio instance corresponding to milvus.
milvus_uri (str,) – Milvus address with http(s) preffix and port. Can also be a file path, to activate milvus-lite.

nv_ingest_client.util.vdb.milvus.cleanup_records( records, enable_text: bool = True, enable_charts: bool = True, enable_tables: bool = True, enable_images: bool = True, enable_infographics: bool = True, enable_audio: bool = True, meta_dataframe: ~pandas.core.frame.DataFrame | None = None, meta_source_field: str | None = None, meta_fields: list[str] | None = None, record_func=<function _record_dict>, sparse_model=None, )[source]#

nv_ingest_client.util.vdb.milvus.create_bm25_model( records, enable_text: bool = True, enable_charts: bool = True, enable_tables: bool = True, enable_images: bool = True, enable_infographics: bool = True, enable_audio: bool = True, ) → BM25EmbeddingFunction[source]#

This function takes the input records and creates a corpus, factoring in filters (i.e. texts, charts, tables) and fits a BM25 model with that information. If the user sets the log level to info, any time a record fails ingestion, it will be reported to the user.

Parameters:

records (List) – List of chunks with attached metadata
enable_text (bool, optional) – When true, ensure all text type records are used.
enable_charts (bool, optional) – When true, ensure all chart type records are used.
enable_tables (bool, optional) – When true, ensure all table type records are used.
enable_images (bool, optional) – When true, ensure all image type records are used.
enable_infographics (bool, optional) – When true, ensure all infographic type records are used.
enable_audio (bool, optional) – When true, ensure all audio transcript type records are used.

Returns:

Returns the model fitted to the selected corpus.

Return type:

BM25EmbeddingFunction

nv_ingest_client.util.vdb.milvus.create_collection( client: MilvusClient, collection_name: str, schema: CollectionSchema, index_params: IndexParams | None = None, recreate=True, )[source]#

Creates a milvus collection with the supplied name and schema. Within that collection, this function ensures that the desired indexes are created based on the IndexParams supplied.

Parameters:

client (MilvusClient) – Client connected to mivlus instance.
collection_name (str) – Name of the collection to be created.
schema (CollectionSchema,) – Schema that identifies the fields of data that will be available in the collection.
index_params (IndexParams, optional) – The parameters used to create the index(es) for the associated collection fields.
recreate (bool, optional) – If true, and the collection is detected, it will be dropped before being created again with the provided information (schema, index_params).

nv_ingest_client.util.vdb.milvus.create_meta_collection( schema: CollectionSchema, milvus_uri: str = 'http://localhost:19530', collection_name: str = 'meta', recreate=False, )[source]#

nv_ingest_client.util.vdb.milvus.create_nvingest_collection( collection_name: str, milvus_uri: str = 'http://localhost:19530', sparse: bool = False, recreate: bool = True, gpu_index: bool = True, gpu_search: bool = True, dense_dim: int = 2048, recreate_meta: bool = False, ) → CollectionSchema[source]#

Creates a milvus collection with an nv-ingest compatible schema under the target name.

Parameters:

collection_name (str) – Name of the collection to be created.
milvus_uri (str,) – Milvus address with http(s) preffix and port. Can also be a file path, to activate milvus-lite.
sparse (bool, optional) – When set to true, this adds a Sparse index to the IndexParams, usually activated for hybrid search.
recreate (bool, optional) – If true, and the collection is detected, it will be dropped before being created again with the provided information (schema, index_params).
gpu_cagra (bool, optional) – If true, creates a GPU_CAGRA index for dense embeddings.
dense_dim (int, optional) – Sets the dimension size for the dense embedding in the milvus schema.

Returns:

Returns a milvus collection schema, that represents the fields in the created collection.

Return type:

CollectionSchema

nv_ingest_client.util.vdb.milvus.create_nvingest_index_params( sparse: bool = False, gpu_index: bool = True, gpu_search: bool = True, local_index: bool = True, ) → IndexParams[source]#

Creates index params necessary to create an index for a collection. At a minimum, this function will create a dense embedding index but can also create a sparse embedding index (BM25) for hybrid search.

Parameters:

sparse (bool, optional) – When set to true, this adds a Sparse index to the IndexParams, usually activated for hybrid search.
gpu_index (bool, optional) – When set to true, creates an index on the GPU. The index is GPU_CAGRA.
gpu_search (bool, optional) – When set to true, if using a gpu index, the search will be conducted using the GPU. Otherwise the search will be conducted on the CPU (index will be turned into HNSW).

Returns:

Returns index params setup for a dense embedding index and if specified, a sparse embedding index.

Return type:

IndexParams

nv_ingest_client.util.vdb.milvus.create_nvingest_meta_schema()[source]#

nv_ingest_client.util.vdb.milvus.create_nvingest_schema( dense_dim: int = 1024, sparse: bool = False, local_index: bool = False, ) → CollectionSchema[source]#

Creates a schema for the nv-ingest produced data. This is currently setup to follow the default expected schema fields in nv-ingest. You can see more about the declared fields in the nv_ingest.schemas.vdb_task_sink_schema.build_default_milvus_config function. This schema should have the fields declared in that function, at a minimum. To ensure proper data propagation to milvus.

Parameters:

dense_dim (int, optional) – The size of the embedding dimension.
sparse (bool, optional) – When set to true, this adds a Sparse field to the schema, usually activated for hybrid search.

Returns:

Returns a milvus collection schema, with the minimum required nv-ingest fields and extra fields (sparse), if specified by the user.

Return type:

CollectionSchema

nv_ingest_client.util.vdb.milvus.dense_retrieval( queries, collection_name: str, client: MilvusClient, dense_model, top_k: int, dense_field: str = 'vector', output_fields: List[str] = ['text'], _filter: str = '', )[source]#

This function takes the input queries and conducts a dense embedding search against the dense vector and return the top_k nearest records in the collection.

Parameters:

queries (List) – List of queries
collection (Collection) – Milvus Collection to search against
client (MilvusClient) – Client connected to mivlus instance.
dense_model (NVIDIAEmbedding) – Dense model to generate dense embeddings for queries.
top_k (int) – Number of search results to return per query.
dense_field (str) – The name of the anns_field that holds the dense embedding vector the collection.

Returns:

Nested list of top_k results per query.

Return type:

List

nv_ingest_client.util.vdb.milvus.embed_index_collection(

data,

collection_name,

batch_size: int = 256,

embedding_endpoint: str | None = None,

model_name: str | None = None,

nvidia_api_key: str | None = None,

milvus_uri: str = 'http://localhost:19530',

sparse: bool = False,

recreate: bool = True,

gpu_index: bool = True,

gpu_search: bool = True,

dense_dim: int = 2048,

minio_endpoint: str = 'localhost:9000',

enable_text: bool = True,

enable_charts: bool = True,

enable_tables: bool = True,

enable_images: bool = True,

enable_infographics: bool = True,

bm25_save_path: str = 'bm25_model.json',

compute_bm25_stats: bool = True,

access_key: str = 'minioadmin',

secret_key: str = 'minioadmin',

bucket_name: str = 'a-bucket',

meta_dataframe: str | DataFrame | None = None,

meta_source_field: str | None = None,

meta_fields: list[str] | None = None,

intput_type: str = 'passage',

truncate: str = 'END',

**kwargs,

)[source]#

This function takes the input data and creates a collection in Milvus, it will embed the records using the NVIDIA embedding model and store them in the collection. After embedding the records, it will run the same ingestion process as the vdb_upload stage in the Ingestor pipeline.

Parameters:

data (Union[str, List]) – The data to be ingested. Can be a list of records or a file path.
collection_name (Union[str, Dict], optional) – The name of the Milvus collection or a dictionary containing collection configuration. Defaults to “nv_ingest_collection”.
embedding_endpoint (str, optional) – The endpoint for the NVIDIA embedding service. Defaults to None.
model_name (str, optional) – The name of the embedding model. Defaults to None.
nvidia_api_key (str, optional) – The API key for NVIDIA services. Defaults to None.
milvus_uri (str, optional) – The URI of the Milvus server. Defaults to “http://localhost:19530”.
sparse (bool, optional) – Whether to use sparse indexing. Defaults to False.
recreate (bool, optional) – Whether to recreate the collection if it already exists. Defaults to True.
gpu_index (bool, optional) – Whether to use GPU for indexing. Defaults to True.
gpu_search (bool, optional) – Whether to use GPU for search operations. Defaults to True.
dense_dim (int, optional) – The dimensionality of dense vectors. Defaults to 2048.
minio_endpoint (str, optional) – The endpoint for the MinIO server. Defaults to “localhost:9000”.
enable_text (bool, optional) – Whether to enable text data ingestion. Defaults to True.
enable_charts (bool, optional) – Whether to enable chart data ingestion. Defaults to True.
enable_tables (bool, optional) – Whether to enable table data ingestion. Defaults to True.
enable_images (bool, optional) – Whether to enable image data ingestion. Defaults to True.
enable_infographics (bool, optional) – Whether to enable infographic data ingestion. Defaults to True.
bm25_save_path (str, optional) – The file path to save the BM25 model. Defaults to “bm25_model.json”.
compute_bm25_stats (bool, optional) – Whether to compute BM25 statistics. Defaults to True.
access_key (str, optional) – The access key for MinIO authentication. Defaults to “minioadmin”.
secret_key (str, optional) – The secret key for MinIO authentication. Defaults to “minioadmin”.
bucket_name (str, optional) – The name of the MinIO bucket. Defaults to “a-bucket”.
meta_dataframe (Union[str, pd.DataFrame], optional) – A metadata DataFrame or the path to a CSV file containing metadata. Defaults to None.
meta_source_field (str, optional) – The field in the metadata that serves as the source identifier. Defaults to None.
meta_fields (list[str], optional) – A list of metadata fields to include. Defaults to None.
**kwargs – Additional keyword arguments for customization.

nv_ingest_client.util.vdb.milvus.get_embeddings(full_records, embedder, batch_size=256)[source]#: This function takes the input records and creates a list of embeddings. The default batch size is 256, but this can be adjusted based on the available resources, to a maximum of 259. This is set by the NVIDIA embedding microservice.

nv_ingest_client.util.vdb.milvus.grab_meta_collection_info( collection_name: str, meta_collection_name: str = 'meta', timestamp: str | None = None, embedding_model: str | None = None, embedding_dim: int | None = None, milvus_uri: str = 'http://localhost:19530', )[source]#

nv_ingest_client.util.vdb.milvus.hybrid_retrieval( queries, collection_name: str, client: MilvusClient, dense_model, sparse_model, top_k: int, dense_field: str = 'vector', sparse_field: str = 'sparse', output_fields: List[str] = ['text'], gpu_search: bool = True, local_index: bool = False, _filter: str = '', )[source]#

This function takes the input queries and conducts a hybrid embedding search against the dense and sparse vectors, returning the top_k nearest records in the collection.

Parameters:

queries (List) – List of queries
collection (Collection) – Milvus Collection to search against
client (MilvusClient) – Client connected to mivlus instance.
dense_model (NVIDIAEmbedding) – Dense model to generate dense embeddings for queries.
sparse_model (model,) – Sparse model used to generate sparse embedding in the form of scipy.sparse.csr_array
top_k (int) – Number of search results to return per query.
dense_field (str) – The name of the anns_field that holds the dense embedding vector the collection.
sparse_field (str) – The name of the anns_field that holds the sparse embedding vector the collection.

Returns:

Nested list of top_k results per query.

Return type:

List

nv_ingest_client.util.vdb.milvus.log_new_meta_collection( collection_name: str, fields: List[str], milvus_uri: str = 'http://localhost:19530', creation_timestamp: str | None = None, dense_index: str | None = None, dense_dim: int | None = None, sparse_index: str | None = None, embedding_model: str | None = None, sparse_model: str | None = None, meta_collection_name: str = 'meta', recreate: bool = False, )[source]#

nv_ingest_client.util.vdb.milvus.nv_rerank( query, candidates, reranker_endpoint: str | None = None, model_name: str | None = None, nvidia_api_key: str | None = None, truncate: str = 'END', max_batch_size: int = 64, topk: int = 5, )[source]#

This function allows a user to rerank a set of candidates using the nvidia reranker nim.

Parameters:

query (str) – Query the candidates are supposed to answer.
candidates (list) – List of the candidates to rerank.
reranker_endpoint (str) – The endpoint to the nvidia reranker
model_name (str) – The name of the model host in the nvidia reranker
nvidia_api_key (str,) – The nvidia reranker api key, necessary when using non-local asset
truncate (str [END, NONE]) – Truncate the incoming texts if length is longer than the model allows.
max_batch_size (int) – Max size for the number of candidates to rerank.
topk (int,) – The number of candidates to return after reranking.

Returns:

Dictionary with top_k reranked candidates.

Return type:

Dict

nv_ingest_client.util.vdb.milvus.nvingest_retrieval(

queries,

collection_name: str | None = None,

vdb_op: VDB | None = None,

milvus_uri: str = 'http://localhost:19530',

top_k: int = 5,

hybrid: bool = False,

dense_field: str = 'vector',

sparse_field: str = 'sparse',

embedding_endpoint=None,

sparse_model_filepath: str = 'bm25_model.json',

model_name: str | None = None,

output_fields: List[str] = ['text', 'source', 'content_metadata'],

gpu_search: bool = True,

nv_ranker: bool = False,

nv_ranker_endpoint: str | None = None,

nv_ranker_model_name: str | None = None,

nv_ranker_nvidia_api_key: str | None = None,

nv_ranker_truncate: str = 'END',

nv_ranker_top_k: int = 50,

nv_ranker_max_batch_size: int = 64,

_filter: str = '',

**kwargs,

)[source]#

This function takes the input queries and conducts a hybrid/dense embedding search against the vectors, returning the top_k nearest records in the collection.

Parameters:

queries (List) – List of queries
collection (Collection) – Milvus Collection to search against
milvus_uri (str,) – Milvus address with http(s) preffix and port. Can also be a file path, to activate milvus-lite.
top_k (int) – Number of search results to return per query.
hybrid (bool, optional) – If True, will calculate distances for both dense and sparse embeddings.
dense_field (str, optional) – The name of the anns_field that holds the dense embedding vector the collection.
sparse_field (str, optional) – The name of the anns_field that holds the sparse embedding vector the collection.
embedding_endpoint (str, optional) – Number of search results to return per query.
sparse_model_filepath (str, optional) – The path where the sparse model has been loaded.
model_name (str, optional) – The name of the dense embedding model available in the NIM embedding endpoint.
nv_ranker (bool) – Set to True to use the nvidia reranker.
nv_ranker_endpoint (str) – The endpoint to the nvidia reranker
nv_ranker_model_name (str) – The name of the model host in the nvidia reranker
nv_ranker_nvidia_api_key (str,) – The nvidia reranker api key, necessary when using non-local asset
truncate (str [END, NONE]) – Truncate the incoming texts if length is longer than the model allows.
nv_ranker_max_batch_size (int) – Max size for the number of candidates to rerank.
nv_ranker_top_k (int,) – The number of candidates to return after reranking.

Returns:

Nested list of top_k results per query.

Return type:

List

nv_ingest_client.util.vdb.milvus.pandas_file_reader(input_file: str)[source]#

nv_ingest_client.util.vdb.milvus.pull_all_milvus( collection_name: str, milvus_uri: str = 'http://localhost:19530', write_dir: str | None = None, batch_size: int = 1000, )[source]#

This function takes the input collection name and pulls all the records from the collection. It will either return the records as a list of dictionaries or write them to a specified directory in JSON format. :param collection_name: Milvus Collection to query against :type collection_name: str :param milvus_uri: Milvus address with http(s) preffix and port. Can also be a file path, to activate

milvus-lite.

Parameters:

write_dir (str, optional) – Directory to write the records to. If None, the records will be returned as a list.
batch_size (int, optional) – The number of records to pull in each batch. Defaults to 1000.

Returns:

List of records/files with records from the collection.

Return type:

List

nv_ingest_client.util.vdb.milvus.reconstruct_pages(anchor_record, records_list, page_signum: int = 0)[source]#

This function allows a user reconstruct the pages for a retrieved chunk.

Parameters:

anchor_record (dict) – Query the candidates are supposed to answer.
records_list (list) – List of the candidates to rerank.
page_signum (int) – The endpoint to the nvidia reranker

Returns:

Full page(s) corresponding to anchor record.

Return type:

String

nv_ingest_client.util.vdb.milvus.recreate_elements(data)[source]#

This function takes the input data and creates a list of elements with the necessary metadata for ingestion.

Parameters:: data (List) – List of chunks with attached metadata
Returns:: List of elements with metadata.
Return type:: List

nv_ingest_client.util.vdb.milvus.reindex_collection(

vdb_op: VDB | None = None,

collection_name: str | None = None,

new_collection_name: str | None = None,

write_dir: str | None = None,

embedding_endpoint: str | None = None,

model_name: str | None = None,

nvidia_api_key: str | None = None,

milvus_uri: str = 'http://localhost:19530',

sparse: bool = False,

recreate: bool = True,

gpu_index: bool = True,

gpu_search: bool = True,

dense_dim: int = 2048,

minio_endpoint: str = 'localhost:9000',

enable_text: bool = True,

enable_charts: bool = True,

enable_tables: bool = True,

enable_images: bool = True,

enable_infographics: bool = True,

bm25_save_path: str = 'bm25_model.json',

compute_bm25_stats: bool = True,

access_key: str = 'minioadmin',

secret_key: str = 'minioadmin',

bucket_name: str = 'a-bucket',

meta_dataframe: str | DataFrame | None = None,

meta_source_field: str | None = None,

meta_fields: list[str] | None = None,

embed_batch_size: int = 256,

query_batch_size: int = 1000,

input_type: str = 'passage',

truncate: str = 'END',

**kwargs,

)[source]#

This function will reindex a collection in Milvus. It will pull all the records from the current collection, embed them using the NVIDIA embedding model, and store them in a new collection. After embedding the records, it will run the same ingestion process as the vdb_upload stage in the Ingestor pipeline. This function will get embedding_endpoint, model_name and nvidia_api_key defaults from the environment variables set in the environment if not explicitly set in the function call.

Parameters:

(str) (collection_name)
(str (meta_source_field)
optional) (The batch size for querying. Defaults to 1000.)
(str
optional)
(str
optional)
(str
optional)
(str
optional)
(str
optional)
(bool (compute_bm25_stats)
optional)
(bool
optional)
(bool
optional)
(bool
optional)
(int (query_batch_size)
optional)
(str
optional)
(bool
optional)
(bool
optional)
(bool
optional)
(bool
optional)
(bool
optional)
(str
optional)
(bool
optional)
(str
optional)
(str
optional)
(str
optional)
(Union[str (meta_dataframe) – containing metadata. Defaults to None.
pd.DataFrame] (A metadata DataFrame or the path to a CSV file) – containing metadata. Defaults to None.
optional) – containing metadata. Defaults to None.
(str – Defaults to None.
optional) – Defaults to None.
(list[str] (meta_fields)
optional)
(int
optional)
(int
optional)
**kwargs (Additional keyword arguments for customization.)

nv_ingest_client.util.vdb.milvus.remove_records( source_name: str, collection_name: str, milvus_uri: str = 'http://localhost:19530', )[source]#

This function allows a user to remove chunks associated with an ingested file. Supply the full path of the file you would like to remove and this function will remove all the chunks associated with that file in the target collection.

Parameters:

source_name (str) – The full file path of the file you would like to remove from the collection.
collection_name (str) – Milvus Collection to query against
milvus_uri (str,) – Milvus address with http(s) preffix and port. Can also be a file path, to activate milvus-lite.

Returns:

Dictionary with one key, delete_cnt. The value represents the number of entities removed.

Return type:

Dict

nv_ingest_client.util.vdb.milvus.stream_insert_milvus( records, client: MilvusClient, collection_name: str, )[source]#

Parameters:

records (List) – List of chunks with attached metadata
client (MilvusClient) – Milvus client instance
collection_name (str) – Milvus Collection to search against

nv_ingest_client.util.vdb.milvus.verify_embedding(element)[source]#

nv_ingest_client.util.vdb.milvus.write_meta_collection( collection_name: str, fields: List[str], milvus_uri: str = 'http://localhost:19530', creation_timestamp: str | None = None, dense_index: str | None = None, dense_dim: int | None = None, sparse_index: str | None = None, embedding_model: str | None = None, sparse_model: str | None = None, meta_collection_name: str = 'meta', )[source]#

nv_ingest_client.util.vdb.milvus.write_records_minio( records, writer: RemoteBulkWriter, ) → RemoteBulkWriter[source]#

Writes the supplied records to milvus using the supplied writer. If a sparse model is supplied, it will be used to generate sparse embeddings to allow for hybrid search. Will filter records based on type, depending on what types are enabled via the boolean parameters. If the user sets the log level to info, any time a record fails ingestion, it will be reported to the user.

Parameters:

records (List) – List of chunks with attached metadata
writer (RemoteBulkWriter) – The Milvus Remote BulkWriter instance that was created with necessary params to access the minio instance corresponding to milvus.
sparse_model (model,) – Sparse model used to generate sparse embedding in the form of scipy.sparse.csr_array
enable_text (bool, optional) – When true, ensure all text type records are used.
enable_charts (bool, optional) – When true, ensure all chart type records are used.
enable_tables (bool, optional) – When true, ensure all table type records are used.
enable_images (bool, optional) – When true, ensure all image type records are used.
enable_infographics (bool, optional) – When true, ensure all infographic type records are used.
enable_audio (bool, optional) – When true, ensure all audio transcript type records are used.
record_func (function, optional) – This function will be used to parse the records for necessary information.

Returns:

Returns the writer supplied, with information related to minio records upload.

Return type:

RemoteBulkWriter

nv_ingest_client.util.vdb.milvus.write_to_nvingest_collection(

records,

collection_name: str,

milvus_uri: str = 'http://localhost:19530',

minio_endpoint: str = 'localhost:9000',

sparse: bool = True,

enable_text: bool = True,

enable_charts: bool = True,

enable_tables: bool = True,

enable_images: bool = True,

enable_infographics: bool = True,

bm25_save_path: str = 'bm25_model.json',

compute_bm25_stats: bool = True,

access_key: str = 'minioadmin',

secret_key: str = 'minioadmin',

bucket_name: str = 'a-bucket',

threshold: int = 1000,

meta_dataframe=None,

meta_source_field=None,

meta_fields=None,

stream: bool = False,

**kwargs,

)[source]#

This function takes the input records and creates a corpus, factoring in filters (i.e. texts, charts, tables) and fits a BM25 model with that information.

Parameters:

records (List) – List of chunks with attached metadata
collection_name (str) – Milvus Collection to search against
milvus_uri (str,) – Milvus address with http(s) preffix and port. Can also be a file path, to activate milvus-lite.
minio_endpoint (str,) – Endpoint for the minio instance attached to your milvus.
enable_text (bool, optional) – When true, ensure all text type records are used.
enable_charts (bool, optional) – When true, ensure all chart type records are used.
enable_tables (bool, optional) – When true, ensure all table type records are used.
enable_images (bool, optional) – When true, ensure all image type records are used.
enable_infographics (bool, optional) – When true, ensure all infographic type records are used.
sparse (bool, optional) – When true, incorporates sparse embedding representations for records.
bm25_save_path (str, optional) – The desired filepath for the sparse model if sparse is True.
access_key (str, optional) – Minio access key.
secret_key (str, optional) – Minio secret key.
bucket_name (str, optional) – Minio bucket name.
stream (bool, optional) – When true, the records will be inserted into milvus using the stream insert method.

Module contents#

class nv_ingest_client.util.vdb.VDB(**kwargs)[source]#

Bases: ABC

abstract create_index(**kwargs)[source]#

abstract reindex(records: list, **kwargs)[source]#

abstract retrieval(queries: list, **kwargs)[source]#

abstract run(records)[source]#

abstract write_to_index(records: list, **kwargs)[source]#