Organizations

@thu-ml


Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.

Python 56 2 Updated Mar 12, 2026

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 306 19 Updated Feb 24, 2026

Official repo for vidar and vidarc: video foundation model for robotics.

Python 40 1 Updated Dec 22, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,489 254 Updated Apr 15, 2026

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

297 5 Updated Dec 1, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 943 57 Updated Dec 20, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 991 90 Updated Feb 25, 2026

PyTorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" for DeiT model pre-training

Python 39 4 Updated May 4, 2026

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,165 1,558 Updated Feb 13, 2023

Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)

Python 19 2 Updated Jul 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 114 11 Updated Dec 20, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,827 396 Updated Mar 27, 2026

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,342 406 Updated Jan 17, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,181 3,404 Updated May 8, 2026

Ongoing research training transformer models at scale

Python 16,254 3,920 Updated May 8, 2026

Triton-based implementation of Sparse Mixture of Experts.

Python 274 28 Updated Oct 3, 2025

Development repository for the Triton language and compiler

MLIR 19,123 2,836 Updated May 8, 2026

[TMLR 2024] Efficient Large Language Models: A Survey

1,258 98 Updated Jun 23, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,323 718 Updated May 8, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,281 4,823 Updated May 7, 2026

Official code for "Efficient Backpropagation with Variance Controlled Adaptive Sampling" (ICLR 2024)

Python 8 2 Updated Mar 8, 2024

Fast and memory-efficient exact attention

Python 23,675 2,696 Updated May 8, 2026

Low-bit optimizers for PyTorch

Python 138 9 Updated Oct 9, 2023

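A visual no-code/code-free web crawler/spider. 易采集 (EasySpider): a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.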

JavaScript 43,765 5,329 Updated Apr 22, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 57,698 9,897 Updated Nov 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,746 27,717 Updated May 8, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,303 1,145 Updated May 2, 2026

A JavaScript library that provides a program-friendly interface to the Tsinghua web portal

TypeScript 28 5 Updated Sep 24, 2023

Guidance for courses in the Department of Computer Science and Technology, Tsinghua University

HTML 37,006 7,850 Updated May 8, 2026