A New Technique Runs a Parallel-Generation Language Model 17x to 42x Faster on a Phone's AI Chip — type0