Skip to content

Latest commit

 

History

History
130 lines (104 loc) · 12.6 KB

_index.md

File metadata and controls

130 lines (104 loc) · 12.6 KB
Title alwaysopen categories description linkTitle hideListLinks weight
Redis for AI documentation
false
docs
operate
rs
rc
An overview of Redis for AI documentation
Redis for AI
true
40

Redis stores and indexes vector embeddings that semantically represent unstructured data including text passages, images, videos, or audio. Store vectors and the associated metadata within [hashes]({{< relref "/develop/data-types/hashes" >}}) or [JSON]({{< relref "/develop/data-types/json" >}}) documents for [indexing]({{< relref "/develop/interact/search-and-query/indexing" >}}) and [querying]({{< relref "/develop/interact/search-and-query/query" >}}). Use the [vector sets]({{< relref "/develop/data-types/vector-sets" >}}) preview data type to add items to a set, and retrieve a subset of items that are the most similar to a specified vector.

Vector RAG RedisVL
{{AI Redis icon.}}[Redis vector database quick start guide]({{< relref "/develop/ai/vector-database" >}}) {{AI Redis icon.}} [Retrieval-Augmented Generation quick start guide]({{< relref "/develop/ai/rag" >}}) {{AI Redis icon.}}[Redis vector Python client library documentation]({{< relref "/integrate/redisvl/" >}})

Overview

This page organized into a few sections depending on what you’re trying to do:

  • How to's - The comprehensive reference section for every feature, API, and setting. It’s your source for detailed, technical information to support any level of development.
  • Concepts - Explanations of foundational ideas and core principles to help you understand the reason behind the product’s features and design.
  • Quickstarts - Short, focused guides to get you started with key features or workflows in minutes.
  • Tutorials - In-depth walkthroughs that dive deeper into specific use cases or processes. These step-by-step guides help you master essential tasks and workflows.
  • Integrations - Guides and resources to help you connect and use the product with popular tools, frameworks, or platforms.
  • Benchmarks - Performance comparisons and metrics to demonstrate how the product performs under various scenarios. This helps you understand its efficiency and capabilities.
  • Best practices - Recommendations and guidelines for maximizing effectiveness and avoiding common pitfalls. This section equips you to use the product effectively and efficiently.

How to's

  1. [Create a vector index]({{< relref "develop/ai/vector-fields" >}}): Redis maintains a secondary index over your data with a defined schema (including vector fields and metadata). Redis supports [FLAT]({{< relref "develop/ai/vector-fields#flat-index" >}}) and [HNSW]({{< relref "develop/ai/vector-fields#hnsw-index" >}}) vector index types.

Learn how to index and query vector embeddings

  • [redis-py (Python)]({{< relref "/develop/clients/redis-py/vecsearch" >}})
  • [NRedisStack (C#/.NET)]({{< relref "/develop/clients/dotnet/vecsearch" >}})
  • [node-redis (JavaScript)]({{< relref "/develop/clients/nodejs/vecsearch" >}})
  • [Jedis (Java)]({{< relref "/develop/clients/jedis/vecsearch" >}})
  • [go-redis (Go)]({{< relref "/develop/clients/go/vecsearch" >}})

Concepts

Learn to perform vector search and use gateways and semantic caching in your AI/ML projects.

Search LLM memory Semantic caching Semantic routing AI Gateways
{{AI Redis icon.}}[Vector search guide]({{< relref "/develop/ai/vector-search" >}}) {{LLM memory icon.}}Store memory for LLMs {{AI Redis icon.}}Semantic caching for faster, smarter LLM apps {{Semantic routing icon.}}Semantic routing chooses the best tool {{AI Redis icon.}}Deploy an enhanced gateway with Redis

Quickstarts

Quickstarts or recipes are useful when you are trying to build specific functionality. For example, you might want to do RAG with LangChain or set up LLM memory for you AI agent. Get started with the following Redis Python notebooks.

Hybrid and vector search

Vector search retrieves results based on the similarity of high-dimensional numerical embeddings, while hybrid search combines this with traditional keyword or metadata-based filtering for more comprehensive results.

RAG

Retrieval Augmented Generation (aka RAG) is a technique to enhance the ability of an LLM to respond to user queries. The retrieval part of RAG is supported by a vector database, which can return semantically relevant results to a user’s query, serving as contextual information to augment the generative capabilities of an LLM.

Agents

AI agents can act autonomously to plan and execute tasks for the user.

LLM memory

LLMs are stateless. To maintain context within a conversation chat sessions must be stored and resent to the LLM. Redis manages the storage and retrieval of chat sessions to maintain context and conversational relevance.

Semantic caching

An estimated 31% of LLM queries are potentially redundant. Redis enables semantic caching to help cut down on LLM costs quickly.

Computer vision

Build a facial recognition system using the Facenet embedding model and RedisVL.

Recommendation systems

Tutorials

Need a deeper-dive through different use cases and topics?

RAG

  • Agentic RAG - A tutorial focused on agentic RAG with LlamaIndex and Amazon Bedrock
  • RAG on Vertex AI - A RAG tutorial featuring Redis with Vertex AI
  • RAG workbench - A development playground for exploring RAG techniques with Redis

Recommendation system

Ecosystem integrations

Benchmarks

See how we stack up against the competition.

Best practices

See how leaders in the industry are building their RAG apps.