Submitted by Qingchuan Ma 3 A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation MAC-AutoML 2 1