AI Search Platform

Vespa lets you query, organize, and make inferences in vectors, tensors, text and structured data. Scale to billions of constantly changing data items, thousands of queries per second with latencies below 100 milliseconds.

Use cases

+ Search

Vespa is the world’s leading open text search engine and the world’s most capable vector database. In combination with Vespa’s integrated distributed machine-learned model inference for relevance this lets you create search applications with a quality you simply cannot achieve in any other way.

Search
+ Generative AI (RAG)

GenAI applications are only as good as the data we surface for them to work with; they need great search relevance. This takes much more than vector similarity—hybrid search, relevance models, and multi-vector representations. Vespa is the only platform which lets you deploy such techniques with no limitations and at any scale.

Generative AI
+ Recommendation and personalization

Recommendation, personalization and ad targeting systems combine retrieval of eligible content with machine-learned model evaluation to select the best data items. Vespa lets you easily build applications that do this at any scale and complexity.

Recommendation
+ Semi-structured navigation

Applications like e-commerce use a combination of structured data and text+images, and need to combine search and recommendation seamlessly with structured navigation. Vespa provides all the features required to do this with great performance, at any scale.

Navigation
+ Personal/private search

In applications working with personal data, any query will only access a small fraction of the total data, and building indexes would be wasteful – especially with vectors. Vespa provides a special mode – streaming search – which delivers all the industry-leading features of Vespa for personal/private search 20x cheaper than with indexing.

Personal search

All you need to build data-driven applications

Vector, text and structured search

Distributed machine-learned ranking

Unbeatable performance

Infinite automated scalability

Continous deployment & upgrades

Fully managed, with strong security

Features

Proven at scale

The most innovative teams in the world build on Vespa

Case studies

“As a reliable and scalable solution, Vespa has been instrumental in enabling Search at Spotify. We look forward to continuing our work with the Vespa team, and enabling innovation that will enhance the experience for Spotify listeners.”

Daniel Doro,

Director of Engineering, Search

Spotify case study

“Vespa is a battle-tested platform that allows us to integrate keyword and vector search seamlessly. It forms a key part of our AI research solution, guaranteeing both precision and rapidity in streamlining research processes. We highly recommend Vespa for its reliability and efficiency.”

Jungwon Byun

COO & Cofounder

Elicit case study

“Vespa has been a critical component to Yahoo’s AI and machine learning capabilities across all of our properties for many years”

Jim Lanzone,

CEO

Yahoo case study

“Our team successfully implemented the entire recommendation process of one algorithm with Vespa, matching the latency requirements (provide recommendations under 100ms) and scalability needs.”

Ricardo Rossi Tegão

Machine Learning Engineer

Farfetch case study

Blog

Scaling a Vespa Application: Feeding Fast and Furiously

A tutorial on how to scale the resources in a Vespa application to increase feed throughput. Using the metrics dashboard for informed and optimised scaling.

April 28, 2026

Blog

The Vespa Cloud Metrics Dashboard

A guide to the Vespa Cloud metrics dashboard — how to move from symptom to bottleneck to action, and what's new in the latest revision.

April 24, 2026

Blog

Using Large ONNX Models with External Data in Vespa Embedders

Many ONNX models exceed the 2GB protobuf limit and store weights in external data files. Vespa now supports these models for embedders.

March 27, 2026

Built for developers

Developer site

Sample apps

Get started from any of our production ready sample apps

Browse sample apps

Ask our docs

Browse, search, or talk to our documentation

search.vespa.ai

Vespa Slack

Join hundreds of Vespa app developers on our Slack

Vespa Slack

We
Make AI
Work

Highlights

Vespa.ai Live in London

AI Quick Fix For eCommerce Webinars

Vespa 2026 Q1 Product Update

GigaOm Radar for Vector Databases V3

Introduction to Vespa.ai

Vespa.ai fuels trusted answers at scale for Perplexity

Vespa architecture in 3 minutes

Watch the episodes

+ Search

+ Generative AI (RAG)

+ Recommendation and personalization

+ Semi-structured navigation

+ Personal/private search

All you need to build data-driven applications

Proven at scale