Video interaction is already live in Singapore banking. DBS and others have deployed video teller services that let customers interact with bankers remotely — combining the convenience of digital with the richness of face-to-face. This is not a future use case. It is a present one.
The challenge is that conversation design for video interaction is genuinely different from chat or voice. Flows built for text do not translate. The cues customers rely on, the pacing, the expectations around visual presence — all of it changes the design requirements. Most AI advisory in the market does not yet account for this. The organisations building these experiences are working without a playbook.
AI does not define the environment it operates in. It inherits it. That is as true for video teller interactions and omnichannel banking as it is for a standard IVR replacement. Before any channel goes live with AI, the question is not "which platform?" It is: what does this channel demand, what does our data environment actually support, and where does the handover to a human need to happen — and why?