AI Scores Well on Chart Comparison Test, But the Scoring Method May Be Flawed | type0