🚀 Vai is available. You're on .
Voyage AI Playground v1.0.0
Desktop workspace

Build, test, and ship with Voyage AI

Use one focused desktop app to generate embeddings, compare models, rerank results, and run production-friendly workflows without bouncing between docs, scripts, and browser tabs.

New Demo gallery Watch tiny terminal demos, copy the commands, and jump from the desktop app into the full public gallery. Explore the gallery
Workspace guide

Your pixel robot lives in the corner now, adding personality without turning the screen into another promo panel.

Workspace status

Loading workspace status...

Continue where you left off

Open a tool and your recent session activity will appear here.

Latest update

All releases →
Loading the latest release notes...

Workflow spotlight

Loading featured workflows...
Input Text
Model
Input Type
Dimensions
Output Type
Result
Vector Heatmap
Text A
Text B
Model
Dimensions
Cosine Similarity
Image
🖼️
Drop an image here or click to browse
PNG, JPEG, WebP, GIF, max 20 MB. Paste from clipboard (⌘V)
Preview
Video
🎬
Drop a video here or click to browse
MP4, WebM, MOV, max 20 MB
Text
Model
Dimensions
Cosine Similarity
Embedding Latency Benchmark
Models
Rounds
Results
Model Ranking Comparison
Model A
Model B
Mode
Top K
Ranking Comparison
Why Voyage AI? — Competitive Comparison

See how Voyage AI compares to other embedding providers on quality, cost, and features.

🏆 MTEB Retrieval Benchmark (nDCG@10)

Industry-standard benchmark for retrieval quality. Higher is better.

Voyage-3
67.4
Voyage-3-lite
64.3
OpenAI text-3-large
64.6
OpenAI text-3-small
62.3
Cohere embed-v3
64.1

Source: MTEB Leaderboard (Retrieval Average, Jan 2025)

Cost per Million Tokens

Voyage AI offers significant cost savings, especially with asymmetric retrieval strategies.

Provider / Model Price vs Voyage-3
Voyage-3-lite $0.02 83% cheaper
Voyage-3 $0.06 baseline
OpenAI text-3-large $0.13 2.2x more
OpenAI text-3-small $0.02 same
Cohere embed-v3 $0.10 1.7x more
💡 Pro Tip: Asymmetric Retrieval

Embed your document corpus with voyage-4-lite ($0.02/M) and query with voyage-4 ($0.06/M). Because all Voyage 4 models share the same embedding space, you get top-tier retrieval quality at a fraction of the cost.

🚀 Voyage AI Advantages
🔗
Shared Embedding Space
All Voyage 4 models produce compatible embeddings. Mix voyage-4-large for queries with voyage-4-lite for documents, and they work together seamlessly.
🏢
Domain-Specific Models
Specialized models for code (voyage-code-3), finance, law, and multilingual content that outperform general-purpose alternatives.
Two-Stage Retrieval
Combine embedding search with rerank-2 to dramatically boost precision. The cross-encoder re-scores candidates for better relevance.
🖼️
Multimodal Embeddings
voyage-multimodal-3 embeds both text and images in the same space, enabling cross-modal search (find images with text queries and vice versa).
Ready to try Voyage AI?

Get a free API key and start building in minutes.

Get API Key → or Read the Docs
Quantization Benchmark

Compare how different output data types (float, int8, binary) affect storage size and ranking quality. Embeds the same corpus with each dtype and measures the tradeoff.

Model
Dimensions
Data Types
Query
Corpus (one document per line)
📦 Storage per Vector
⏱ API Latency
🎯 Ranking Quality vs Float Baseline
RAG Cost Calculator
Tokens / query 500
Queries / day 1,000
Model Type $/1M tokens Daily Cost Monthly Cost Relative
Benchmark History
No benchmarks recorded yet. Run a latency benchmark to start tracking.
📚

Your Knowledge Base

Start with existing data or create fresh

or

Start with a fresh knowledge base

Knowledge Base

KB Info --
Documents 0
Chunks 0
Size 0 KB
Files 0
  • No documents ingested
Upload
Files Paste URL

Drop files here

.txt, .md, .pdf — max 10MB

Instructions

Custom instructions appended to the system prompt

Configuration
LOCAL

Controls how conversation history is managed within the token budget.

No provider · No database
Chat with your knowledge base. Click Configure above to adjust your LLM provider and database.
Ready
Memory:
0%
Loading...
Vai
Select a workflow from the library
Workflow Store @vaicli
Loading catalog...
Michael Lynn
Michael Lynn
Founder, Advisor, Engineer, Developer Advocate, MongoDB
About This Project
voyageai-cli (vai) is a community-built command-line tool for working with Voyage AI embeddings, reranking, and MongoDB Atlas Vector Search. It was created to make it easier for developers to explore, benchmark, and integrate Voyage AI models into their applications — right from the terminal or this playground.
About Michael
Michael Lynn is a Principal Staff Developer Advocate at MongoDB with 25+ years in enterprise infrastructure and over a decade at MongoDB. He focuses on strategic developer relations, creating educational content around Vector Search, AI enablement, and developer tooling. He builds tools like this to help developers get hands-on with new technology faster.
What You Can Do Here
⚡ Embed — Generate vector embeddings for any text
⚖️ Compare — Measure similarity with cosine, dot product & euclidean distance
🔍 Rerank — Semantic search with optional reranking
🔮 Multimodal — Compare images and text in the same vector space with voyage-multimodal-3.5
🛠️ Generate — Generate code snippets and scaffold full projects with templates
💬 Chat — RAG-powered chat with your documents, configurable system prompts
🔄 Workflows — Multi-step agent workflows with thinking panels and tool orchestration
⏱ Benchmark — Compare model latency, ranking quality, and costs
📚 Explore — 23 interactive concepts covering embeddings, vector search, multimodal, RAG, and more
Supported Models
Text Embeddings — voyage-4-large, voyage-4, voyage-4-lite, voyage-code-3, voyage-finance-2, voyage-law-2
Multimodal — voyage-multimodal-3.5 (text + images + video in one vector space)
Reranking — rerank-2.5, rerank-2.5-lite

All models are accessible via the Voyage AI API or through MongoDB Atlas Vector Search.
⚠️ Community Tool Disclaimer
This tool is not an official product of MongoDB, Inc. or Voyage AI. It is independently built and maintained by Michael Lynn as a community resource. It is not supported, endorsed, or guaranteed by either company. Use at your own discretion. For official documentation, visit mongodb.com/docs/voyageai.
What's New
Recent — Local inference playground expansion

The Nano tab now goes far beyond setup checks: you can compare texts with an NxN similarity heatmap, inspect embedding behavior across 256/512/1024/2048 dimensions, and run cross-bridge comparisons between local voyage-4-nano embeddings and cloud API results.

Recent — Explore and About page refresh

The Explore experience has been polished with cleaner cards, updated visual styling, and Avi as the lightweight guide in the header and explainer modal. The About page has also been refreshed to better reflect the current product direction.

Recent — Better bug reporting and diagnostics

Bug reports from both the CLI and Electron app now include richer environment context, validated contact details, and cleaner GitHub issue handoff. The desktop bridge exposes platform and runtime metadata directly to improve issue triage.

Recent — Expanded explainers for local inference

The learning content now includes a dedicated local-inference explainer covering the Python bridge, shared embedding space, and when to use local versus hosted Voyage models, with new aliases that make the topic easier to discover.

Earlier — Generate, scaffold, and workflow setup

Added vai generate and vai scaffold for production-ready RAG code and starter apps, plus related maintenance workflows like purge, refresh, and evaluation comparisons.

Earlier — Desktop app, multimodal, and auto-update

Introduced the signed and notarized macOS desktop app with auto-update support, secure keychain-backed credentials, a settings cog, the Multimodal tab, telemetry controls, and the hidden Vector Space Invaders easter egg.

Earlier — Benchmarking, evaluation, and retrieval tooling

Built out the benchmark, estimate, eval, query, pipeline, chunk, and explain workflows that power the broader CLI and playground experience for retrieval, ranking, and cost analysis.

Made with ☕ and curiosity · Source on GitHub · Releases

Generate Code Snippet

Voyage 4 Family
Shared Embedding Space
voyage-4-large, voyage-4, and voyage-4-lite produce compatible embeddings in the same vector space. Index documents with the large model for best quality, then query with the lite model at a fraction of the cost. No re-indexing required.
Major Announcement
voyage-4-nano is open-weight and local-capable
The Voyage 4 family now includes voyage-4-nano, an open-weight model for local development and prototyping. It lives in the same shared embedding space as voyage-4-large, voyage-4, and voyage-4-lite, so you can prototype locally and move into asymmetric retrieval workflows without rebuilding your index.
Run locally and compare local vs cloud vectors, and use it as a zero-query-cost option in hybrid retrieval workflows.
Read Announcement
RTEB Retrieval Benchmark (NDCG@10)

Industry-standard benchmark for retrieval quality across 29 datasets. Higher is better.

How This Demo Works

Voyage AI's shared embedding space means documents embedded with voyage-4-large can be queried with voyage-4-lite at a fraction of the cost — with nearly identical retrieval results. This demo proves it.

Step 1
Prepare data — Embed 65 sample documents with voyage-4-large and store in MongoDB Atlas
Step 2
Compare models — Run the same queries with voyage-4-large and voyage-4-lite, measure retrieval overlap
Step 3
See savings — Project annual cost savings at your scale
Heads up: This demo uses your Voyage AI API key to embed documents and run queries. The sample data requires approximately ~50K tokens for embedding and ~1.5K tokens per analysis run. All data is stored in your MongoDB Atlas cluster under vai_demo.cost_optimizer_demo.

⚙️ Switching to Agent Mode

Pipeline Mode

  • Fixed RAG: retrieve → generate
  • Always searches your KB first
  • Fast, predictable responses
  • Best for direct Q&A from your docs

Agent Mode

  • LLM decides which tools to call
  • Can search, compare, embed, explore
  • Uses vai_query, vai_search, etc.
  • Best for complex, multi-step questions
In Agent mode, the LLM chooses how to answer — it may call vai_query to search your KB, vai_collections to discover data, or answer from its own knowledge. You'll see a Thinking panel showing each tool call in real time. Responses may take longer but can be more thorough.
Setup Status
Checking nano bridge status...

General

API access and connection settings

⚠️ No API key configured — add your key below to get started.
API Key Encrypted via OS keychain · Get a key
API Base URL Override the default endpoint (leave empty for default)
Request Timeout Max seconds to wait for API responses
Default Tab Which tab to open when the app starts

Database

MongoDB Atlas connection and default targets

MongoDB URI Connection string for MongoDB Atlas · Atlas Console
Default Database Used by workflows and commands when no database is specified
Default Collection Used by workflows and commands when no collection is specified

Appearance

Theme and visualization options

Theme Controls the color scheme of the interface
Vector Heatmap Colors Color palette for embedding visualizations

Models

Default embedding model preferences

Default Embedding Model Pre-selected model for Embed and Compare tabs
Default Input Type Pre-selected input type for embedding requests

Chat

Chat configuration has moved to the Knowledge Base panel in the Chat tab.

All chat settings (LLM provider, model, database, instructions) are now accessible directly from the KB panel sidebar when using Chat.

Benchmark

Benchmark iteration and display defaults

Iterations per Model Number of runs when benchmarking latency
Show Detailed Timings Display p50/p95/p99 in benchmark results

Data & Privacy

Caching, analytics, and data management

Persist Embeddings Locally Cache embedding results in browser storage to avoid re-fetching
Clear Cached Data Remove all locally stored embeddings and preferences
Anonymous Usage Analytics Help improve Vai by sharing anonymous usage data (version, platform, features used). No API keys or personal data.

MCP Integrations

Install the vai MCP server into your AI tools and editors

AI Tools

Health Check

Validate your vai setup and connectivity

Diagnostics
Click "Run Health Check" to validate your setup.
✓ Saved

📐 How the Cost Calculator Works

Voyage AI charges per million tokens processed. A token is roughly ¾ of a word. The calculator estimates your total embedding cost based on how many documents you embed and how many queries you run over time.

💡 Simple Mode

Compares the per-model query cost for a given volume. Useful for quick "which model is cheapest?" checks.

Daily cost =
tokens_per_query × queries_per_day ÷ 1,000,000 × price_per_M_tokens

Monthly cost = daily cost × 30

📊 RAG Planner Mode

Models the full cost of a Retrieval-Augmented Generation (RAG) pipeline, separating the one-time document ingestion cost from the recurring query cost.

Document embedding (one-time):
doc_cost = num_docs × tokens_per_doc ÷ 1,000,000 × doc_model_price

Query embedding (monthly):
query_cost/mo = queries_per_month × tokens_per_query ÷ 1,000,000 × query_model_price

Projected total:
total = doc_cost + (query_cost/mo × months)

⚖️ Three Strategies Compared

🔗 Shared Embedding Space

The Voyage 4 family (voyage-4-large, voyage-4, voyage-4-lite, voyage-4-nano) all produce vectors in the same geometric space. A document embedded with voyage-4-large can be searched with a query embedded by voyage-4-lite — cosine similarity still works correctly. This is what makes asymmetric strategies possible.

Example: 100K docs × 500 tok = 50M doc tokens
1M queries/mo × 30 tok = 30M query tokens/mo

Symmetric (voyage-4-large @ $0.18/1M):
  Docs: $9.00 + Queries: $5.40/mo × 12 = $73.80

Asymmetric (large docs + lite queries @ $0.05/1M):
  Docs: $9.00 + Queries: $1.50/mo × 12 = $27.00

  Savings: 63% — same document quality, cheaper queries.

📋 Per-Model Table

The bottom table shows what it would cost to use each model symmetrically (same model for docs and queries). The relative bar shows cost compared to the most expensive option. Use this to understand the price spread across the full model lineup.

🎯 Key Assumptions

CLI equivalent: vai estimate --docs 100K --queries 1M --doc-model voyage-4-large --query-model voyage-4-lite

💰 Cost Dashboard

TimeOperationModelTokensCostIf v4-largeIf OpenAI
💰 Session: $0.000000 0 tokens 0 ops LLM: $0.000000 (0in/0out) If symmetric: $0.000000 If OpenAI: $0.000000

Export

Workflow Canvas
Format
Press ⌘K to open • Type to search • ↑↓ to navigate • ⏎ to execute