v.23.1 Performance Improvement

Optimize Column-Wise Ternary Logic Evaluation for 21x Performance Gain on Intel Xeon CPU

Optimize column-wise ternary logic evaluation by restructuring it so that the compiler can auto-vectorize it. In a microbenchmark of this evaluation, a peak performance gain of 21x was observed on the ICX device (Intel Xeon Platinum 8380 CPU). #43669 (Zhiguo Zhou).
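The change itself is not reproduced here. As a rough illustration of why a branch-free formulation tends to auto-vectorize, the sketch below assumes a hypothetical encoding of the three values as bytes ordered False < Null < True, so that ternary AND becomes an element-wise minimum and ternary OR an element-wise maximum. The names `kFalse`, `kNull`, `kTrue`, `ternaryAnd`, and `ternaryOr` are illustrative and are not taken from the PR.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical byte encoding, chosen so that False < Null < True.
constexpr std::uint8_t kFalse = 0;
constexpr std::uint8_t kNull = 1;
constexpr std::uint8_t kTrue = 2;

// Column-wise ternary AND: with the ordering above it is an element-wise min.
// The loop body has no data-dependent branch, which is the shape optimizing
// compilers can typically turn into SIMD min instructions.
void ternaryAnd(const std::vector<std::uint8_t> & a,
                const std::vector<std::uint8_t> & b,
                std::vector<std::uint8_t> & out)
{
    const std::size_t n = std::min(a.size(), b.size());
    out.resize(n);
    for (std::size_t i = 0; i < n; ++i)
        out[i] = std::min(a[i], b[i]);
}

// Column-wise ternary OR: element-wise max under the same encoding.
void ternaryOr(const std::vector<std::uint8_t> & a,
               const std::vector<std::uint8_t> & b,
               std::vector<std::uint8_t> & out)
{
    const std::size_t n = std::min(a.size(), b.size());
    out.resize(n);
    for (std::size_t i = 0; i < n; ++i)
        out[i] = std::max(a[i], b[i]);
}
```

Whether a given compiler actually vectorizes these loops depends on the optimization level and target flags. The speedup reported above refers to the original change and its microbenchmark, not to this sketch.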