Starred repositories
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
A Python package for evaluating radiology report generation using multiple standard and medical-specific metrics.
A metric suite leveraging the logical inference capabilities of LLMs, for radiology report generation both with and without grounding
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
🦄🦄🦄AI赋能股票分析:AI加持的股票分析/选股工具。股票行情获取,AI热点资讯分析,AI资金/财务分析,涨跌报警推送。支持A股,港股,美股。支持市场整体/个股情绪分析,AI辅助选股等。数据全部保留在本地。支持DeepSeek,OpenAI, Ollama,LMStudio,AnythingLLM,硅基流动,火山方舟,阿里云百炼等平台或模型。
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
TradingAgents: Multi-Agents LLM Financial Trading Framework
[ACL2026] "MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,接入openClaw,真正的个人语音助手,时延低至800ms,Mac等低配置也可运行,支持打断
Medical o1, Towards medical complex reasoning with LLMs
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.
[COMMSENG'24, TMI'24] Interactive Computer-Aided Diagnosis using LLMs
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)
🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
Learning to Use Medical Tools with Multi-modal Agent
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"
The Largest-scale Chinese Medical QA Dataset: with 26,000,000 question answer pairs.
Universal LLM Deployment Engine with ML Compilation
A machine learning software for extracting information from scholarly documents
LlamaIndex is the leading document agent and OCR platform