Research Webzine of the KAIST College of Engineering since 2014
Fall 2025 Vol. 25To address prevalent data quality issues in real-world AI training, this research focuses on three major challenges. First, it automatically detects and corrects label errors during training, reducing the need for manual data preprocessing. The related paper has had a significant academic impact, being cited over 1,200 times in the past two years. Second, it automatically infers missing labels in time-series data, significantly lowering the cost of manual label acquisition. Third, it removes redundant data and selects a core set of informative samples, achieving comparable model performance while reducing the training time by up to 90%. These technologies have been successfully applied to real-world social problems such as infectious disease prediction and economic impact forecasting. They have been granted patents in both South Korea and the United States.

The technology also addresses the label shortage issue. By analyzing changes in time-series data, the model can automatically detect transition points, such as when a person switches from walking to running. This approach outperformed traditional distance-based methods, improving the accuracy by up to 12.7%, also proving especially effective for wearable healthcare sensor data.
The issue of data redundancy was tackled by enabling the AI to automatically select only the most informative samples for training. As a result, the model can achieve performance comparable to that when using the full dataset while reducing the training time by up to 90%. These functionalities are integrated into a unified framework that combines error correction and core-set selection for greater overall efficiency.
Beyond algorithmic improvements, this technology has been practically applied to solving societal problems. The researchers developed an AI model that forecasts inbound COVID-19 cases, earning a U.S. patent, and another that predicts the economic impact of infectious disease outbreaks on local businesses, which is patented in Korea.
This foundational technology directly supports the “Efficient Learning and AI Infrastructure Advancement” pillar of Korea’s National Strategic Technologies in AI. It is also expected to play a key role in the rapidly expanding AIOps (Artificial Intelligence for IT Operations) market, which is projected to grow from $27.24 billion in 2024 to approximately $79.91 billion by 2029, at a CAGR of 24.01%. This advancement marks a critical step toward making AI not only faster and smarter but also more practical and deployable across real-world domains.
A New solution enabling soft growing robots to perform a variety of tasks in confined spaces
Read moreAI-Designed carbon nanolattice: Feather-light, steel-strong
Read moreDevelopment of a compact high-resolution spectrometer using a double-layer disordered metasurface
Read moreWearable hyperspectral photoplethysmography for the continuous monitoring of exercise-induced hypertension
Read moreSmarter AI through AI-generated feedback
Read more