Add LangCache API integration for semantic caching #321

abrookins · 2025-04-17T00:02:33Z

Description

This PR adds integration with the LangCache SDK to provide enhanced LLM response
caching capabilities in RedisVL. The new LangCache class extends our existing
LLM caching functionality to provide a RedisVL-compatible interface that is backed by
the LangCache API for semantic caching.

Features

New LangCache class that implements the BaseLLMCache interface
Semantic similarity search for finding relevant cached responses
Configurable distance thresholds for controlling cache hit precision
Optional TTL support for cache entries
Support for entry scopes to organize and manage cache entries
Full async support for all operations
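
To make the distance-threshold feature concrete, here is a minimal, self-contained sketch of how a threshold can gate cache hits. It is a toy in-memory stand-in, not the LangCache class or the LangCache API: `ToySemanticCache`, `cosine_distance`, and the default threshold of 0.1 are all hypothetical names and values chosen for illustration.

```python
import math
from typing import List, Optional, Tuple


def cosine_distance(a: List[float], b: List[float]) -> float:
    """Cosine distance in [0, 2]: 0 means same direction, 2 means opposite."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


class ToySemanticCache:
    """Hypothetical in-memory cache; illustrates threshold gating only."""

    def __init__(self, distance_threshold: float = 0.1) -> None:
        self.distance_threshold = distance_threshold
        self._entries: List[Tuple[List[float], str]] = []

    def store(self, vector: List[float], response: str) -> None:
        self._entries.append((list(vector), response))

    def check(
        self, vector: List[float], distance_threshold: Optional[float] = None
    ) -> Optional[str]:
        # Use the per-call threshold if given, else the instance default.
        threshold = (
            self.distance_threshold
            if distance_threshold is None
            else distance_threshold
        )
        best: Optional[Tuple[float, str]] = None
        for cached_vector, response in self._entries:
            distance = cosine_distance(vector, cached_vector)
            # Only entries within the threshold count as hits; keep the closest.
            if distance <= threshold and (best is None or distance < best[0]):
                best = (distance, response)
        return best[1] if best is not None else None


cache = ToySemanticCache(distance_threshold=0.1)
cache.store([1.0, 0.0], "cached answer")
hit = cache.check([1.0, 0.0])    # identical vector, within threshold
miss = cache.check([0.0, 1.0])   # orthogonal vector, outside threshold
```

A tighter threshold trades recall for precision: fewer prompts hit the cache, but the hits are closer matches.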

Implementation Details

Integrates with the LangCache SDK for core functionality
Maintains compatibility with existing RedisVL patterns and interfaces
Includes comprehensive test coverage for all features
Supports both Redis client and URL-based initialization

Dependencies

Vendors the langcache package in the repository (temporary, until SDK is on PyPI)

Gaps / Open Questions

No integration tests with the live API, which I may leave as a TODO
This is available as a top-level import: should it be in an experimental module or otherwise indicate that it's experimental?

abrookins · 2025-04-23T17:37:10Z

tests/unit/test_utils.py

@@ -160,7 +160,7 @@ def test_empty_list_to_bytes():
    assert array_to_buffer(array, dtype="float32") == expected


-@pytest.mark.parametrize("dtype", ["float64", "float32", "float16", "bfloat16"])
+@pytest.mark.parametrize("dtype", ["float64", "float32", "float16"])


Will restore

tylerhutcherson · 2025-04-23T18:21:28Z

tests/conftest.py

+    # In xdist, the config has "workerid" in workerinput
+    workerinput = getattr(request.config, "workerinput", {})
+    worker_id = workerinput.get("workerid", "master")
+
    # construct a search index from the schema
    index = AsyncSearchIndex.from_dict(
        {
            "index": {
                "name": "user_index",


We should update both the index name AND the prefix to do proper isolation here
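
A sketch of what this suggestion could look like, assuming the schema dict accepts both `name` and `prefix` under `index`. The helper name `worker_scoped_index_config` and the single placeholder field are hypothetical; the real fixtures in this PR define their own fields.

```python
from typing import Any, Dict


def worker_scoped_index_config(base_name: str, worker_id: str) -> Dict[str, Any]:
    """Build a schema dict whose index name AND key prefix include the worker ID.

    Scoping only the name would still let parallel workers write documents
    under the same key prefix, so both values are suffixed here.
    """
    scoped = f"{base_name}_{worker_id}"
    return {
        "index": {
            "name": scoped,
            "prefix": scoped,
            "storage_type": "hash",
        },
        # Placeholder field for illustration only.
        "fields": [{"name": "user", "type": "tag"}],
    }


config = worker_scoped_index_config("user_index", "gw0")
# The resulting dict is intended to be passed to something like
# AsyncSearchIndex.from_dict(config) in the fixture.
```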

tylerhutcherson · 2025-04-23T18:22:07Z

tests/integration/test_aggregation.py

+    # In xdist, the config has "workerid" in workerinput
+    workerinput = getattr(request.config, "workerinput", {})
+    worker_id = workerinput.get("workerid", "master")
+
    index = SearchIndex.from_dict(
        {
            "index": {
                "name": "user_index",


Ditto, index name

tylerhutcherson · 2025-04-23T18:22:31Z

tests/integration/test_async_search_index.py

-    return AsyncSearchIndex.from_yaml("schemas/test_json_schema.yaml")
+def async_index_from_yaml(request):
+    # In xdist, the config has "workerid" in workerinput
+    workerinput = getattr(request.config, "workerinput", {})


Maybe we can make worker_id a fixture in conftest?
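
A minimal sketch of that idea, assuming the lookup logic from the diff above is moved into one helper. `resolve_worker_id` is a hypothetical name; the fixture itself is shown as a comment so the snippet runs without pytest installed.

```python
from types import SimpleNamespace


def resolve_worker_id(config) -> str:
    """Return the xdist worker ID, or "master" when not running under xdist."""
    # In xdist, the config has "workerid" in workerinput
    workerinput = getattr(config, "workerinput", {})
    return workerinput.get("workerid", "master")


# In conftest.py, a fixture could wrap the helper:
#
#   @pytest.fixture(scope="session")
#   def worker_id(request):
#       return resolve_worker_id(request.config)

# Stand-ins for request.config with and without xdist:
plain = resolve_worker_id(SimpleNamespace())
parallel = resolve_worker_id(SimpleNamespace(workerinput={"workerid": "gw1"}))
```

Note that pytest-xdist provides its own session-scoped `worker_id` fixture when the plugin is installed, so a project-level fixture like this mainly matters if the tests also need to run without xdist.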

tylerhutcherson · 2025-04-23T18:28:52Z

redisvl/extensions/cache/llm/langcache_api.py

+import json
+from typing import Any, Dict, List, Optional, Union
+
+from langcache import LangCache as LangCacheSDK


Should these be treated as optional dependencies and lazy loaded?

abrookins · 2025-04-23

Yeah, definitely, and especially because you can't actually try the service yet.
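
One common way to do this, sketched with the standard library only: defer the import until the SDK is first needed and raise a helpful error if it is missing. `load_optional` and `LazySDKHolder` are hypothetical names for illustration, not existing RedisVL helpers, and the hint text is a placeholder.

```python
import importlib
from types import ModuleType
from typing import Optional


def load_optional(module_name: str, install_hint: str) -> ModuleType:
    """Import a module on demand, adding an install hint if it is missing."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"Optional dependency '{module_name}' is not available. {install_hint}"
        ) from exc


class LazySDKHolder:
    """Defers the import until the SDK is first accessed."""

    def __init__(self, module_name: str, install_hint: str) -> None:
        self._module_name = module_name
        self._install_hint = install_hint
        self._module: Optional[ModuleType] = None

    @property
    def module(self) -> ModuleType:
        if self._module is None:
            self._module = load_optional(self._module_name, self._install_hint)
        return self._module


# Constructing the holder never imports anything, so importing the package
# that defines it works even when the optional SDK is absent.
holder = LazySDKHolder("langcache", "Install the LangCache SDK to use this class.")
```

With this shape, users who never touch the LangCache-backed class pay no import cost and see no error.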

tylerhutcherson · 2025-04-23T18:31:05Z

redisvl/extensions/cache/llm/langcache_api.py

+    def check(
+        self,
+        prompt: Optional[str] = None,
+        vector: Optional[List[float]] = None,


should we make vector an optional **kwarg on the base model?

abrookins · 2025-04-25

Hmm. Isn't it already optional?

tylerhutcherson · 2025-04-23T18:32:23Z

redisvl/extensions/cache/llm/langcache_api.py

+            raise TypeError("return_fields must be a list of field names")
+
+        # Use the provided threshold or fall back to the instance default
+        threshold = (


are we sure we use the same metric for the threshold? I thought LangCache used similarity 0-1

abrookins · 2025-04-25

SemanticCache does, but the 0-2 range doesn't seem to be encoded anywhere else (in the base classes) as an assumption or validation. 🤔 Hmm! I guess we need to treat thresholds coming to this class as 0-2 if it's a swappable component with SemanticCache, and hopefully one day soon normalize the value everywhere.

abrookins · 2025-04-25

I normalized and denormalized. We'll have to return to this later, but hoping that does the trick.
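
The conversion code from the PR is not shown in this thread, so the following is only a sketch of one plausible mapping. It assumes thresholds arrive as cosine distances in [0, 2] and the backing service expects a similarity in [0, 1], with a linear relationship between the two. The function names are hypothetical.

```python
def distance_to_similarity(distance: float) -> float:
    """Map a cosine distance in [0, 2] to a similarity in [0, 1]."""
    if not 0.0 <= distance <= 2.0:
        raise ValueError("distance must be between 0 and 2")
    return 1.0 - distance / 2.0


def similarity_to_distance(similarity: float) -> float:
    """Inverse mapping: a similarity in [0, 1] back to a distance in [0, 2]."""
    if not 0.0 <= similarity <= 1.0:
        raise ValueError("similarity must be between 0 and 1")
    return 2.0 * (1.0 - similarity)


# Normalize on the way out to the service, denormalize on the way back,
# so callers keep working in the 0-2 distance convention.
outgoing = distance_to_similarity(0.5)
incoming = similarity_to_distance(outgoing)
```

Under this mapping a distance of 0 corresponds to similarity 1 (identical) and a distance of 2 to similarity 0, so a "maximum distance" threshold becomes a "minimum similarity" threshold on the other side.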

abrookins · 2025-04-25T22:15:02Z

We're going to pull this from 0.6.0 for now, but I'll roll the test fixes into a separate PR.

abrookins · 2025-04-26T00:26:43Z

Pushed my latest changes up before closing (for now). This PR will ride again. 🐴

abrookins added 2 commits April 16, 2025 17:02

Checkpoint

5021a42

Checkpoint

e9e4fad

abrookins changed the base branch from main to 0.6.0 April 18, 2025 00:56

abrookins added 9 commits April 18, 2025 13:43

Merge branch '0.6.0' into feat/RAAE-769-add-langcache-wrapper

0258519

Skip flaky tests

f0231f4

Skip flaky tests

3dc3faa

Skip a flaky test

ec5fa87

Attempt to fix dupe data in tests with unique indexes

abe0ecd

Fix missing variable

fb8ab19

First pass at central HF fixtures

eea6266

Merge branch '0.6.0' into feat/RAAE-769-add-langcache-wrapper

52d8df6

Fix SearchIndex tests to account for worker IDs in prefixes

890e655

abrookins changed the title ~~WIP on LangCache integration~~ Add LangCache API integration for semantic caching Apr 23, 2025

abrookins marked this pull request as ready for review April 23, 2025 17:34

abrookins commented Apr 23, 2025

tylerhutcherson reviewed Apr 23, 2025

abrookins added 2 commits April 23, 2025 15:34

Turn worker_id into a fixture

e545db0

Merge branch '0.6.0' into feat/RAAE-769-add-langcache-wrapper

26b960d

Fix ID vs. name usage

9a50d15

abrookins closed this Apr 26, 2025
