Yet another re-implement of jetson-containers, targeting for Jetson Thor, Spark, and x86.
UI for my LLM playground
A high-throughput and memory-efficient inference and serving engine for LLMs