Open Access

Topic Identification in Voice Recordings

   | Dec 14, 2023

Cite

The paper shows the understanding of a topic recognition problem like the speech recognition system based on Natural Language Processing (NLP) and the steps of its implementation of a rules-based approach, which is able to classify given audio materials based on predefined topics in real-time. During implementation, a statistical vocabulary was developed. Google Speech API (Application Programming Interface) was employed for subtitling audio materials, and the most ideal time frame for reception was identified through several experiments. The motivation of this work is based on the deficiency of similar simple systems for Hungarian topic recognition, even though numerous international languages already utilize multiple Automatic Sound Recognition (ASR) systems.