Skip to content
Change the repository type filter

All

    Repositories list

    • History for benchmark results
      Python
      Apache License 2.0
      1311Updated May 3, 2025May 3, 2025
    • lance

      Public
      Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
      Rust
      Apache License 2.0
      2914.6k56271Updated May 3, 2025May 3, 2025
    • lancedb

      Public
      Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
      Python
      Apache License 2.0
      4626.3k37230Updated May 1, 2025May 1, 2025
    • Lance Namespace Specification is an open specification on top of the storage-based Lance data format to standardize access to a collection of Lance tables (a.k.a. Lance datasets)
      Java
      Apache License 2.0
      31271Updated Apr 30, 2025Apr 30, 2025
    • Apache Flink connector for Lance
      Apache License 2.0
      0100Updated Apr 29, 2025Apr 29, 2025
    • Python
      1200Updated Apr 25, 2025Apr 25, 2025
    • research

      Public
      repository containing reproducibility code for R&D experiments and benchmarks
      Python
      1000Updated Apr 23, 2025Apr 23, 2025
    • High quality resources & applications for LLMs, multi-modal models and VectorDBs
      Jupyter Notebook
      Apache License 2.0
      13775821Updated Apr 21, 2025Apr 21, 2025
    • Lance Trino connector
      Apache License 2.0
      1000Updated Apr 15, 2025Apr 15, 2025
    • Spark integrations for working with Lance datasets
      Java
      Apache License 2.0
      2561Updated Apr 15, 2025Apr 15, 2025
    • YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds
      Python
      1912651Updated Apr 7, 2025Apr 7, 2025
    • ocra

      Public
      OCRA: Object-store Cache in Rust for All
      Rust
      Apache License 2.0
      5910Updated Apr 4, 2025Apr 4, 2025
    • Python
      0002Updated Apr 1, 2025Apr 1, 2025
    • assets

      Public
      LanceDB public assets for docs and presentations
      3000Updated Mar 17, 2025Mar 17, 2025
    • Research papers from Lance
      TeX
      0163Updated Mar 11, 2025Mar 11, 2025
    • Python
      1000Updated Mar 10, 2025Mar 10, 2025
    • ray

      Public
      Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
      Python
      Apache License 2.0
      6.3k000Updated Mar 4, 2025Mar 4, 2025
    • Rust
      0000Updated Feb 3, 2025Feb 3, 2025
    • A JavaScript client for FlightSQL
      JavaScript
      Apache License 2.0
      4920Updated Jan 7, 2025Jan 7, 2025
    • ragged

      Public
      Python
      Other
      31900Updated Oct 14, 2024Oct 14, 2024
    • Deep Learning how-to's using Lance file format
      Python
      Apache License 2.0
      51630Updated Sep 18, 2024Sep 18, 2024
    • datasets

      Public
      🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
      Python
      Apache License 2.0
      2.8k000Updated Jun 6, 2024Jun 6, 2024
    • trino

      Public
      Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
      Java
      Apache License 2.0
      3.2k000Updated Apr 26, 2024Apr 26, 2024
    • Build an AI chatbot with website context retrieved from a vector store like LanceDB.
      TypeScript
      258410Updated Mar 26, 2024Mar 26, 2024
    • Tantivy directory implementation backed by object_store
      Rust
      Apache License 2.0
      73301Updated Jan 22, 2024Jan 22, 2024
    • A Benchmark Tool for VectorDB
      Python
      MIT License
      205100Updated Nov 27, 2023Nov 27, 2023
    • Rust client for the vercel blob API
      Rust
      Apache License 2.0
      2310Updated Oct 13, 2023Oct 13, 2023
    • LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data.
      Python
      MIT License
      5.9k200Updated Jul 25, 2023Jul 25, 2023
    • Examples and guides for using the OpenAI API
      Jupyter Notebook
      MIT License
      10k200Updated Jul 18, 2023Jul 18, 2023
    • Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
      Python
      Apache License 2.0
      462000Updated Jul 18, 2023Jul 18, 2023