Vespa at Events

SW2 Conference

May 14th, 2024: Broomfield, Colorado

SW2 is a conference devoted to the next wave of software development. It is where a welcoming community of experts will gather to learn about what’s next, years before those topics will appear at other conferences.

Jon Bratseth, CEO: Building Something Real with Retrieval Augmented Generation (RAG)

Generative AI is transformative, but in most cases it needs to be combined with your data to deliver something truly useful - this is the topic of retrieval augmented generation (RAG). This talk will discuss how to make RAG applications that delivers quality results in production, at scale. Some topics:

  • Why RAG?
  • RAG is about relevance.
  • Do you even need vector embeddings?
  • Do you even need vector indexes?
  • Using personal data effectively.
  • Quality and cost tradeoffs.
  • Letting your LLM train your relevance model.

Read more


AICamp

May 15th, 2024: Boston, MA

Kristian Aune, Head of Customer Success: Improving the Usefulness of LLMs with RAG

LLMs like GPT can give useful answers to many questions, but there are also well-known issues with their output: The responses may be outdated, inaccurate, or outright hallucinations, and it’s hard to know when you can trust them. And they don’t know anything about you or your organization's private data (we hope). RAG can help reduce the problems with “hallucinated” answers, and make the responses more up-to-date, accurate, and personalized - by injecting related knowledge, including non-public data.

In this talk, we’ll go through what RAG means and demo some ways you can implement it, including vector search, multi-vector, filtering, ranking and hybrid search.

Read more


AICamp

May 16th, 2024: New York City

Kristian Aune, Head of Customer Success: Improving the Usefulness of LLMs with RAG

LLMs like GPT can give useful answers to many questions, but there are also well-known issues with their output: The responses may be outdated, inaccurate, or outright hallucinations, and it’s hard to know when you can trust them. And they don’t know anything about you or your organization's private data (we hope). RAG can help reduce the problems with “hallucinated” answers, and make the responses more up-to-date, accurate, and personalized - by injecting related knowledge, including non-public data.

In this talk, we’ll go through what RAG means and demo some ways you can implement it, including vector search, multi-vector, filtering, ranking and hybrid search.

Read more


infoshare '24

May 22nd, 2024: Gdańsk, Poland

Jon Bratseth, CEO: Building something real with AI and data

Generative AI is transformative, but in most cases it needs to be combined with your data to deliver something truly useful - this is the topic of retrieval augmented generation (RAG). This talk will discuss how to make RAG applications that delivers quality results in production, at scale. Some topics:

  • Why RAG?
  • RAG is about relevance.
  • Do you even need vector embeddings?
  • Do you even need vector indexes?
  • Using personal data effectively.
  • Quality and cost tradeoffs.
  • Letting your LLM train your relevance model.

Read more


ML Con

Jun 25th, 2024: Munich, Germany

Jon Bratseth, CEO: Building something real with RAG

Retrieval Augmented Generation (RAG) is about combining large language models with your data. This talk will discuss how to make RAG applications that delivers quality results that makes business sense. Covering the topics:

  • Why RAG?
  • RAG is about relevance.
  • Do you even need vector embeddings?
  • Do you even need vector indexes?
  • Using personal data effectively.
  • Quality and cost tradeoffs.
  • Let your LLM train your relevance model.

Read more