Skip to content

Multimodal Vector Search

Multimodal Vector Search

Unified vector search across text, images, audio, and video.

Overview

HeliosDB Multimodal Vector Search provides:

  • Native support for multiple modalities (text, image, audio, video)
  • CLIP-based cross-modal search
  • Unified embedding space for semantic similarity
  • High-performance ANN (Approximate Nearest Neighbor) indexes

Quick Start

-- Create table with vector column
CREATE TABLE documents (
id SERIAL PRIMARY KEY,
content TEXT,
image BYTEA,
embedding vector(1536)
);
-- Create vector index
CREATE INDEX idx_docs_embedding ON documents
USING ivfflat (embedding vector_cosine_ops)
WITH (lists = 100);
-- Search by similarity
SELECT id, content
FROM documents
ORDER BY embedding <-> to_vector('search query embedding')
LIMIT 10;

Key Features

FeatureDescription
8 ModalitiesText, image, audio, video, code, tables, graphs, time-series
CLIP IntegrationCross-modal search (search images with text)
ANN IndexesIVFFlat, HNSW for billion-scale search
Hybrid SearchCombine vector + keyword + graph search
GPU AccelerationCUDA-accelerated similarity computation

Documentation

DocumentDescription
MULTIMODAL_SEARCH_ARCHITECTURE.mdSystem architecture
CLIP_INTEGRATION_APPROACH.mdCLIP model integration
  • GraphRAG: /docs/features/graphrag/
  • Full-Text Search: /docs/guides/user/FULL_TEXT_SEARCH_TUNING_GUIDE.md
  • Vector Examples: /docs/guides/user/VECTOR_INSERT_EXAMPLES.md

Status: Production Ready Version: v7.0