Having a native AV format that comes from ANSI, pre-rendered via FFmpeg, is the missing link for <video> support.
That seems to optimise for the usability/complexity ratio, while completely throwing coolness under the bus. But this is an ASCII video generator, I would've thought coolness was the point? I can't imagine a practical use case for it...