An open research, friendly community expanding AI capability at edge.
We prioritize more quality than quantity — minimal overhead, public ilterations.
Modern AI performs well on familiar data but struggles with unseen domains. At Seton Labs, out-of-distribution challenges to build systems generalizing beyond training data.
We use simple and consistent naming syntax.
Difficulty levels: effortless · easy · mid · hard · ultra hard
Each level is based on three factors: number of rows · output size (tokens) · variety of categories
Dataset naming format:
bench-(tier)-(month)-(year)