Fast and memory-efficient exact attention
A high-throughput and memory-efficient inference and serving engine for LLMs
Development repository for the Triton language and compiler
snippet for code
Ongoing research training transformer models at scale
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。请勿发布涉及政治、广告、...
中文常用停用词表(哈工大停用词表、百度停用词表等)
leaked prompts of GPTs
Implementation of Nougat Neural Optical Understanding for Academic Documents