Skip to content
View njhill's full-sized avatar

Organizations

@netty @kserve @vllm-project @llm-d @Inferact

Block or report njhill

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Early-stage Rust drop-in alternative frontend for vLLM

Rust 46 4 Updated May 6, 2026

Tools for Python coroutines and advanced scheduling for `asyncio`

Python 19 1 Updated Dec 29, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 319 178 Updated May 7, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,142 459 Updated May 6, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 79,231 16,497 Updated May 7, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,333 33,126 Updated May 7, 2026

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,699 1,084 Updated May 6, 2026

High-performance netty and thrift-based microservice RPC library for Java

Java 4 4 Updated Sep 17, 2025

Alternative etcd3 java client

Java 163 43 Updated Sep 17, 2025

Distributed Model Serving Framework

Java 189 79 Updated Apr 14, 2026

Controller for ModelMesh

Go 244 135 Updated Apr 14, 2026

Abstracted helper classes providing consistent key-value store functionality, with zookeeper and etcd3 implementations

Java 6 2 Updated Sep 17, 2025

Fake XRandR configurations for multi-head setups with crappy video drivers, like fakexinerama but with xrandr

Python 274 38 Updated Apr 29, 2024

Java utilities for working with CompletionStages

Java 59 13 Updated Jan 17, 2019
Java 3,825 587 Updated Apr 24, 2026

Netty project - an event-driven asynchronous network application framework

Java 34,938 16,247 Updated May 6, 2026

The Java gRPC implementation. HTTP/2 based RPC

Java 12,011 3,989 Updated May 7, 2026