<Photo1. (From left) Ph.D candidate Yong-hoo Kwon, M.S candidate Do-hwan Kim, Professor Jung-woo Choi, Dr. Dong-heon Lee>
'Acoustic separation and classification technology' is a next-generation artificial intelligence (AI) core technology that enables the early detection of abnormal sounds in areas such as drones, fault detection of factory pipelines, and border surveillance systems, or allows for the separation and editing of spatial audio by sound source when producing AR/VR content.
On the 11th of July, a research team led by Professor Jung-woo Choi of KAIST's Department of Electrical and Electronic Engineering won first place in the 'Spatial Semantic Segmentation of Sound Scenes' task of the 'DCASE2025 Challenge,' the world's most prestigious acoustic detection and analysis competition.
This year’s challenge featured 86 teams competing across six tasks. In this competition, the KAIST research team achieved the best performance in their first-ever participation to Task 4. Professor Jung-woo Choi’s research team consisted of Dr. Dong-heon, Lee, Ph.D. candidate Young-hoo Kwon, and M.S. candidate Do-hwan Kim.
Task 4 titled 'Spatial Semantic Segmentation of Sound Scenes' is a highly demanding task requiring the analysis of spatial information in multi-channel audio signals with overlapping sound sources. The goal was to separate individual sounds and classify them into 18 predefined categories. The research team plans to present their technology at the DCASE workshop in Barcelona this October.
< external_image >
<Figure 1. Example of an acoustic scene with multiple mixed sounds>
Early this year, Dr. Dong-heon Lee developed a state-of-the-art sound source separation AI that combines Transformer and Mamba architectures. During the competition, centered around researcher Young-hoo Kwon, they completed a ‘chain-of-inference architecture' AI model that performs sound source separation and classification again, using the waveforms and types of the initially separated sound sources as clues. This AI model is inspired by human’s auditory scene analysis mechanism that isolates individual sounds by focusing on incomplete clues such as sound type, rhythm, or direction, when listening to complex sounds.
Through this, the team was the only participant to achieve double-digit performance (11 dB) in 'Class-Aware Signal-to-Distortion Ratio Improvement (CA-SDRi)*,' which is the measure for ranking how well the AI separated and classified sounds, proving their technical excellence.
Prof. Jung-woo Choi remarked, "The research team has showcased world-leading acoustic separation AI models for the past three years, and I am delighted that these results have been officially recognized." He added, "I am proud of every member of the research team for winning first place through focused research, despite the significant increase in difficulty and having only a few weeks for development."
< external_image >
<Figure 2. Time-frequency patterns of sound sources separated from a mixed source>
The IEEE DCASE Challenge 2025 was held online, with submissions accepted from April 1 to June 15 and results announced on June 30. Since its launch in 2013, the DCASE Challenge has served as a premier global platform of IEEE Signal Processing Society for showcasing cutting-edge AI models in acoustic signal processing.
This research was supported by the Mid-Career Researcher Support Project and STEAM Research Project of the National Research Foundation of Korea, funded by the Ministry of Education, Science and Technology, as well as support from the Future Defense Research Center, funded by the Defense Acquisition Program Administration and the Agency for Defense Development.
<Professor Mikyoung Lim from KAIST Department of Mathematical Sciences> Professor Mikyoung Lim from KAIST Department of Mathematical Sciences gave a plenary talk on "Research on Inverse Problems based on Geometric Function Theory" at AIP 2025 (12th Applied Inverse Problems Conference). AIP is one of the leading international conferences in applied mathematics, organized biennially by the Inverse Problems International Association (IPIA). This year's conference was held from July 2
2025-08-14KAIST (President Kwang Hyung Lee) is leading the transition to AI Transformation (AX) by advancing research topics based on the practical technological demands of industries, fostering AI talent, and demonstrating research outcomes in industrial settings. In this context, KAIST announced on the 13th of August that it is at the forefront of strengthening the nation's AI technology competitiveness by developing core AI technologies via national R&D projects for generative AI led by the Minis
2025-08-13< (From left) Ph.D candidate Wonho Zhung, Ph.D cadidate Joongwon Lee , Prof. Woo Young Kim , Ph.D candidate Jisu Seo > Traditional drug development methods involve identifying a target protin (e.g., a cancer cell receptor) that causes disease, and then searching through countless molecular candidates (potential drugs) that could bind to that protein and block its function. This process is costly, time-consuming, and has a low success rate. KAIST researchers have developed an AI model th
2025-08-12<Photo1. Group photo at the end of the program> KAIST (President Kwang Hyung Lee) announced on the 11thof August that it successfully hosted the 'APEC Youth STEM Conference KAIST Academic Program,' a global science exchange program for 28 youth researchers from 10 countries and over 30 experts who participated in the '2025 APEC Youth STEM* Collaborative Research and Competition.' The event was held at the main campus in Daejeon on Saturday, August 9. STEM (Science, Technology, Eng
2025-08-11<Photo1. Group Photo of Team Atlanta> Team Atlanta, led by Professor Insu Yun of the Department of Electrical and Electronic Engineering at KAIST and Tae-soo Kim, an executive from Samsung Research, along with researchers from POSTECH and Georgia Tech, won the final championship at the AI Cyber Challenge (AIxCC) hosted by the Defense Advanced Research Projects Agency (DARPA). The final was held at the world's largest hacking conference, DEF CON 33, in Las Vegas on August 8 (local time)
2025-08-10