Voice Transcriptions for ILTIS

This use case was realized with:

Developing and deploying scalable AI voice transcription

In 2024, Iltis partnered with Codesphere to implement and deploy a speech recognition and knowledge retrieval use case based on the latest open source AI technologies.

Everything at a glance

The main achievements of the project

What was done in <6 weeks →

Application architecture →

Sophisticated ingestion →

Open source stack →

"Together with Codesphere, we developed a working PoC in just 6 weeks, which allowed us to test and gather feedback incredibly quickly."

Alexander Ott

CEO @ ILTIS

Achievements in < 6 weeks

In under 6 weeks, the team managed to develop a fully functional, scalable AI voice transcription tools with 4 different services:

Real time speech recognition

Utilizing OpenAI’s Whisper models for real time voice transcription.

ERP integration

Fully integrated into ILTIS knowledge system with automated updates.

Semantic search

Using the transcribed information to search through a Vector DB.

Operator cockpit UI

UI for ILTIS employees to seamlessly interact with the application.

Weeks

>2000+

Documents embedded

100%

GDPR compliant

"Great product, amazing team behind, superb support at any time! We love Codesphere!"

Alexander Woelke

Co-Founder & Co-CEO @ SaaS Titans

via Product Hunt

Fully composable architecture

The application creates numerical representations of the transcribed data and stores them in a vector database.

Frontend

Records voice, handles interaction with services.

Transcription Server

Transcribes audio sequences into text.

Sentence Transformer

Creates numerical representations of spoken input.

PostgreSQL

Database for storing numerical input

Sophisticated ingestion

The application creates numerical representations of the transcribed data and stores them in a vector database.

Ingestion Pipeline Server

Records voice, handles interaction with services.

PDF Server

Transcribes audio sequences into text.

Sentence Transformer Server

Creates numerical representations of spoken input.

ILTIS ERP System

Database for storing numerical input

PostgreSQL

Database for storing numerical input