Python AxmlParser
Fast CUDA Kernels for ResNet Inference. Using Winograd algorithm to optimize the efficiency of co...
AI-powered drum removal tool using Meta's Demucs. Drop in any song, get a drumless backing track ...
AI Tensor Engine for ROCm
super repo for rocm systems projects
SGLang is a fast serving framework for large language models and vision language models.
tabnotes
PTX ISA 9.1 documentation converted to searchable markdown. Includes Claude Code skill for CUDA d...
A high-throughput and memory-efficient inference and serving engine for LLMs
Simple app to learning the lure finish tech