flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Stars: 5.5k
Forks: 939
Claude Commits: 1
Language: Python
attention, cuda, distributed-inference, gpu, jit, large-large-models, llm-inference, moe, nvidia, pytorch
First Claude commit: Mar 20, 2026
Last Claude commit: 1mo ago
Discovered: Mar 20, 2026

Recent Claude Commits