Perplexity Partners with Vespa.ai to Bring its Search Function In-House

By building on the Vespa platform, Perplexity delivers leading AI search to millions of users.

April 15, 2025

TRONDHEIM, Norway – April 15th, 2025 – Vespa.ai Health & Life Sciences, a platform for building and deploying large-scale AI applications, today announced its partnership with Perplexity…. The move will significantly enhance the speed, accuracy, and relevance of search results at a scale only made possible on Vespa’s platform.

“The recipe: 1. Solve Search. 2. Use it to solve everything else,” said Aravind Srinivas, CEO of Perplexity.

Perplexity’s innovative approach of direct, sourced answers to search queries relies on a massive and scalable Retrieval-Augmented Generation (RAG) architecture that can efficiently retrieve and process vast amounts of information from the web, internal databases and users’ personal files. By building on Vespa’s platform, Perplexity delivers accurate, near-real-time responses to more than 15 million monthly users, handling more than 100 million queries each week.

Perplexity has used Vespa.ai’s managed platform to efficiently scale its RAG architecture, ensuring low-latency, real-time retrieval of relevant information from massive datasets. Vespa.ai provides Perplexity with the flexibility, speed, and reliability needed to deliver best-in-class conversational experiences to millions of users worldwide.

“We’re continually expanding Vespa’s capabilities to provide the flexibility, speed, and reliability necessary to deliver best-in-class conversational experiences to millions of users worldwide, and the project with Perplexity has allowed us to show just what the platform is capable of,” said Jon Bratseth, CEO of Vespa.ai. “We’re thrilled to partner with Perplexity to power their search capabilities.”

The new project, which developers from both companies have been working on, is centered on Vespa’s capabilities to provide:

High-Performance Vector and Text Search: Vespa’s optimized vector search combined with advanced text search and relevance capabilities enable Perplexity to retrieve relevant information from large datasets with exceptional speed and accuracy.
Scalability and Reliability: Vespa.ai’s distributed architecture ensures seamless scalability and high availability, allowing Perplexity to handle increasing user demand without compromising performance.
Machine-learned ranking: Vespa.ai’s native distributed machine-learned ranking inference allows Perplexity to combine many signals to deliver state of the art relevance at scale.
Cost Efficiency: Vespa’s efficient resource utilization helps Perplexity optimize its infrastructure costs while maintaining high performance.

The new project highlights Vespa.ai’s ability to power complex AI workflows, enabling Perplexity to achieve state-of-the-art relevance at large scale with unparalleled cost efficiency.

This partnership underscores the growing importance of RAG in AI-powered applications and the critical role of high-performance hybrid text and vector search engines in enabling these applications. Vespa.ai’s robust and scalable platform empowers innovative companies like Perplexity to deliver exceptional search experiences and redefine how users access information.

About Vespa.ai
Vespa.ai is a powerful platform for developing real-time search-based AI applications. Once built, these applications are deployed through Vespa’s large-scale, distributed architecture, which efficiently manages data, inference, and logic for applications handling large datasets and high concurrent query rates. Vespa delivers all the building blocks of an AI application, including vector database, hybrid search, retrieval augmented generation (RAG), natural language processing (NLP), machine learning, and support for large language models (LLM) and vision language models (VLM). It is available as a managed service and open source.

About Perplexity
Perplexity is a conversational AI answering engine that collects answers from trusted sources in real-time and answers user questions with inline citations. Founded in 2022 by former members of OpenAI, Meta, Quora, Bing, and Databricks, Perplexity is the first AI-driven search engine to provide real-time, conversational answers to user questions. With a mission to bridge the gap between traditional search engines and AI-driven interfaces to provide the best answers for curious people, it answers over 100 million questions worldwide every week. Perplexity is available online at perplexity.com and on iOS, Mac, and Android. Learn more about Perplexity Enterprise Pro here.