Abstract: Speech Emotion Recognition (SER) plays a significant role in many applications such as psychology and speech therapy, customer service, as well as human-computer interaction. We serve this ...
1 Department of Computer and Instructional Technologies Education, Gazi Faculty of Education, Gazi University, Ankara, Türkiye. 2 Department of Forensic Informatics, Institute of Informatics, Gazi ...
Abstract: Classroom environments often struggle with maintaining optimal acoustic conditions for effective learning. Traditional monitoring methods are manual, time-consuming, and unreliable. This ...
HTK is a respected toolkit used mainly by the speech community to perform research in speech recognition. Although quite old, many newer systems emulate the same feature extraction pipeline as used in ...
The 21st-century digital ecosystem has evolved into a vast and dynamic visual information space. Every day, billions of images and videos are uploaded across social media platforms, news portals, ...
Despite advancements in technology such as applications like Shazam that can identify music within seconds, the trend mainly applies to well-known instruments. Cultural instruments are virtually ...
In marine ecology research, it is crucial to accurately identify the marine mammal species active in the target area during the current season, which helps researchers understand the behavioral ...
Audio forensics plays a major role in the investigation and analysis of audio recordings for legal and security purposes. The advent of audio fake attacks using speech combined with scene-manipulated ...
Mental health disorders (MHDs) have significant medical and financial impacts on patients and society. Despite the potential opportunities for artificial intelligence (AI) in the mental health field, ...
Previous studies have classified major depression and healthy control groups based on vocal acoustic features, but the classification accuracy needs to be improved. Therefore, this study utilized deep ...