Apache Spark - A unified analytics engine for large-scale data processing
LLsM: A Large Language small Model
A high-performance distributed training framework for Reinforcement Learning
Apache Iceberg
Spark job to run PDGF in parallel on a Spark cluster
Official repository of Trino, the distributed SQL query engine for big data, formerly known as Pr...
Apache Doris is an MPP-based interactive SQL data warehouse for reporting and analysis.
All the things about TPC-DS in Apache Spark
A non-validating SQL parser module for Python