Apple's accessibility AI is useful precisely where chatbots are the wrong metaphor
When Apple talks about Apple Intelligence in mainstream contexts, it mostly sounds like a productivity feature: write your emails faster, retouch your photos, summarize your documents. The privacy argument is real but abstract. On-device processing is a selling point, not a necessity.
That calculus changes entirely when the person using the tool has no other way to interact with it.
Apple announced a slate of Apple Intelligence accessibility features on May 19, ahead of Global Accessibility Awareness Day on May 21. The company is bringing natural language navigation to Voice Control on iPhone and iPad, AI-powered image descriptions to VoiceOver and Magnifier, auto-generated subtitles to every screen in the Apple ecosystem, and eye-tracking wheelchair control to Apple Vision Pro. They arrive later this year.
The feature that will get the most attention is Voice Control with natural language. Instead of memorizing exact button labels or overlay numbers, users will be able to say things like "tap the guide about best restaurants" or "tap the purple folder." For someone navigating an iPhone entirely by voice, this is not a convenience upgrade. It is the difference between a tool that mostly works and one that actually works. Apple's current Voice Control can be brittle. If the wording does not match what the system expects, the command fails, and for a user who cannot reach the screen to correct it, that failure is total. The new model understands intent, not just keywords.
Image descriptions in VoiceOver and Magnifier are also running on-device. VoiceOver's Image Explorer will describe photos, scanned bills, and personal records in detail. Magnifier gains spoken controls, so a user can say "zoom in" or "turn on flashlight" rather than navigating a small touch interface. These descriptions are generated locally, which means the contents of your photo library, your medical documents, your financial records, never travel to a server.
Generated subtitles are perhaps the most quietly remarkable addition. Videos without captions — personal clips, content shared by friends, online media with missing subtitles — will now display auto-generated transcriptions on iPhone, iPad, Mac, Apple TV, and Vision Pro. Apple says the speech recognition runs on-device, privately, and the subtitles appear automatically. For a deaf or hard-of-hearing user, this is not a nice-to-have. It is access.
Apple Vision Pro can now control Tolt and LUCI alternative wheelchair drive systems via eye tracking, starting in the US with Bluetooth and wired connections. This is the most striking example of the on-device principle taken to its logical end. For a power wheelchair user who cannot operate a joystick, the alternative drive connection is deeply personal data: where you want to go, how fast, when to stop. Apple is arguing that this data should never leave the device. For users in this situation, the privacy argument is not a marketing claim. It is a functional requirement.
The feature also reveals the limits of the hardware in its current form. Colin Hughes, a wheelchair user and technology journalist at The Register, noted that Vision Pro as a wheelchair control form factor is impractical for extended daily wear. It is bulky, conspicuous, and creates fatigue for users who already manage physical strain. The $3,499 starting price adds another barrier for a community with high rates of device-related expense and, often, constrained incomes.
That is a fair objection. But it is worth naming what Apple is actually doing here, independent of whether the first-generation hardware is ready for prime time. The company is deploying its most advanced sensing system — precise, recalibration-free eye tracking that works in variable lighting — to solve a problem that has no good solution for the people affected by it. Alternative wheelchair controls exist, but they are often clunky, expensive, or unavailable for specific chair models. Vision Pro's eye tracking may not be the final answer, but it is a genuine attempt at a hard problem, and the on-device architecture means the data involved never becomes a product.
Sarah Herrlinger, Apple's senior director of Global Accessibility Policy and Initiatives, framed the approach around privacy as a design constraint, not a checkbox. "With these updates, we're bringing new, intuitive options for input, exploration, and personalization, designed to protect users' privacy at every step," she said in Apple's announcement.
Tim Cook made the broader case. "Now, with Apple Intelligence, we are bringing powerful new capabilities into our accessibility features while maintaining our foundational commitment to privacy by design."
The natural language Voice Control launches later this year in English in the UK, US, Canada, and Australia, but Apple has not announced Mac support, which will disappoint users who rely on voice to work on a computer. Generated subtitles are initially limited to English in the US and Canada.
The accessibility features arriving this year are not Apple's most important Apple Intelligence announcement. They may be its most honest one. When Apple says privacy is built into the architecture, the claim is easiest to dismiss in contexts where the alternative is merely inconvenient. It becomes much harder to dismiss when the alternative for the person using the tool is no tool at all.
Apple has not confirmed whether any of these features route any data through cloud infrastructure. If that confirmation does not come, or if the wheelchair control feature requires a cloud component for calibration, the architecture-as-privacy argument needs qualification. The features Apple described should be taken at face value when they ship, not when they are announced.
What Apple is quietly building, feature by feature, is a case that on-device AI is not a compromise. It is the correct engineering choice for the contexts where the stakes are highest.