The "Small Vision-Language Model" (SVLM) is a compact multimodal model tailored for beginners or users with limited computational resources. Its main goal is to optimize the integration of visual a…

Python 13 Updated Sep 1, 2025

skyzh / tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,160 313 Updated Apr 24, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,121 617 Updated Mar 13, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 13,268 2,044 Updated Apr 26, 2026

Kuaishou-RecModel / Tri-Decoupled-GenRec

121 3 Updated Dec 25, 2025

towhee-io / towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,446 261 Updated Oct 18, 2024

microsoft / SPTAG

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,990 617 Updated May 6, 2026

thustorage / PipeANN

A low-latency, billion-scale, and updatable graph-based vector store on SSD.

Jupyter Notebook 116 40 Updated Apr 24, 2026

iDC-NEU / Greator

C++ 22 1 Updated Aug 30, 2025

hhy3 / awesome-vector-search

26 Updated Apr 22, 2026

antgroup / vsag

vsag is a vector indexing library used for similarity search.

C++ 470 92 Updated May 6, 2026

zilliztech / knowhere

Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.

C++ 347 140 Updated May 6, 2026

weaviate / weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 16,140 1,271 Updated May 6, 2026