AI Computing Platform Laboratory

Department of Computer Science and Engineering, Artificial Intelligence Convergence, Ewha Womans University

main2.jpg

As AI models continue to grow in size, they require vast amounts of energy, making sustainable AI unfeasible if current trends persist. Consequently, the importance of robust computing HW/SW infrastructure underpinning AI will become critical.

Our main research goal is to computing AI model in a faster and energy-efficient way through HW/SW co-design. Specifically, our research interests include:

  • Neural Processing Unit (NPU), domain-specific hardware, FPGA
  • Quantization, pruning, and knowledge distillation
  • Hardware-aware neural architecture search (HW-Aware NAS) and neural architecture accelerator search (NAAS)
  • Processing-in-memory (PIM)
  • Efficient LLM serving including KV Caching and other optimizations
  • On-Device AI

현재 연구실 구성원 모집 정보 및 연구실 합류 희망자 안내

news

Dec 10, 2025 Our team (Yejin Lee, Suhyeon Kim, Eunjoung Yoo, Yoon Ji Choi) won 3rd place (우수상) at 2025 AI Chip Con. Congratulations!
Nov 17, 2025 The paper authored by our lab member Nadam Park — who is concurrently participating in a capstone design project and undergraduate research internship — has been accepted for presentation at the 2025 Korea Software Congress (2025 한국소프트웨어종합학술대회). Congratulations!
Sep 26, 2025 Miyeon Lee has joined our group as a Master student. Welcome!
Sep 22, 2025 Seunghee Song has joined our group as an undergraduate intern. Welcome!
Sep 19, 2025 Our lab has two papers accepted for presentation at 2025 International Conference on Communications, Computing, Cybersecurity, and Informatics (CCCI).

latest publications

  1. 학부생 성과
    ProgressiveServe: 서버리스 LLM 콜드 스타트 완화를 위한 점진적 모델 로딩 및 복구 기법
    Nadam Park ,  Nakyeong Lee ,  Juwon Lee , and Jaehyeong Sim
    In 2025 한국소프트웨어종합학술대회
  2. MAGNETO: A Genetic Algorithm-Based Power-Aware Mapping Optimization Framework for Mobile NPUs
    Eunjin Lee, Jiho Lee, Hayoung Lim, and Jaehyeong Sim
    In 2025 International Conference on Communications, Computing, Cybersecurity, and Informatics (CCCI)
  3. DS-CAE: a Dual-Stream Cross-Attentive Autoencoder for Robust and Cluster-Aware Retrieval-Augmented Generation
    Soeun Choi, Yejin Lee, Juhee Kim, Minji Kim, and Jaehyeong Sim
    In 2025 International Conference on Communications, Computing, Cybersecurity, and Informatics (CCCI)