Publications

2025

  1. Preprint
    AEGIS: Awareness-Enhanced Guidance for Iterative Safeguard
    Kyungwon Park, Sangmin Lee, Heejae Chon, and 1 more author
    2025
    Under Review
  2. UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching
    Woongjib Choi, Sangmin Lee, Hyungseob Lim, and 1 more author
    2025
    Accepted to ICASSP 2026
  3. Preprint
    SAGE-LD: Towards Scalable and Generalizable End-to-End Language Diarization via Simulated Data Augmentation
    Sangmin Lee, Woongjib Choi, Jihyun Kim, and 1 more author
    2025
    Under Review
  4. UniCoM: A Universal Code-Switching Speech Generator
    Sangmin Lee, Woojin Chung, Seyun Um, and 1 more author
    2025
    Accepted to Findings of EMNLP 2025
  5. LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration
    Sangmin Lee, Woojin Chung, and Hong-Goo Kang
    2025
    Accepted to AAAI 2025 as an oral presentation (top 4.6% of all submissions)

2024

  1. Preprint
    Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior
    Jaehoon Ko, Kyusun Cho, Joungbin Lee, and 4 more authors
    2024
    arXiv preprint