Insights, updates, and deep dives into AI technology and our journey.

Today we are releasing KiriTTS — a multilingual, expressive Text-to-Speech model built for real-world applications. Here is what it can do and why we built it.

When building Kiri OCR, we faced a huge challenge: getting the computer to 'see' where the text is on a page. This is the story of how we built our text detector using a method called DB (Differentiable Binarization).