Selected Papers


Conference Papers

  1. Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction
    Eunbi Jeong, Ipoom Jeong*, Myung Kuk Yoon*, and Nam Sung Kim
    The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
    *co-corresponding authors
  2. Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
    Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
    The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
  3. DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
    Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
    The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec. 9 - 15, 2024
  4. VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
    Jaebeom Jeon, Minseong Gil, Junsu Kim, Jaeyong Park, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
    The 53rd International Conference on Parallel Processing (ICPP 2024), Gotland, Sweden, August 12 - 15, 2024
    *co-corresponding authors
  5. INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core
    Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
    The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2023), Vienna, Austria, Oct. 21 - 25, 2023
  6. Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors (Best Paper Award)
    Jonghyun Jeong, Myung Kuk Yoon, Yunho Oh, and Gunjae Koo
    The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, USA, August 7 - 10, 2023
  7. Early-Adaptor: An Adaptive Framework For Proactive UVM Memory Management
    Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
    The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2023), Raleigh, NC, USA, Apr. 23 - 25, 2023
  8. Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
    Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
    The 37th AAAI Conference on Artificial Intelligence (AAAI-23), Washington DC, USA, Feb. 7 - 14, 2023
  9. Reconstructing Out-of-Order Issue Queue
    Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
    The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO 2022), Chicago, Illinois, USA, Oct. 01 - 05, 2022
  10. FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
    Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
    The 51st IEEE/ACM International Symposium on Microarchitecture (MICRO 2018), Fukuoka, Japan, Oct. 20 - 24, 2018
  11. Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
    Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
    The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
  12. APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
    Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
    The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
  13. Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
    Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
    The 22nd IEEE International Symposium on High-Performance Computer Architecture (HPCA 2016), Barcelona, Spain, Mar. 12 - 16, 2016
  14. DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
    Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
    The 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2015), Philadelphia, PA, USA, Mar. 29 - 31, 2015

Journal Papers

  1. TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs
    Minseong Gil, Jaebeom Jeon, Junsu Kim, Sangun Choi, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
    IEEE Embedded Systems Letters (ESL) (Accepted)
  2. Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
    Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
    IEEE Embedded Systems Letters (ESL) (Accepted)
    *co-corresponding authors
  3. Triple-A: Early Operand Collector Allocation for Maximizing GPU Register Bank Utilization
    Ipoom Jeong, Eunbi Jeong, Nam Sung Kim, and Myung Kuk Yoon
    IEEE Embedded Systems Letters (ESL), Vol. 16, Issue 2, pp. 206-209, June 2024
  4. Conflict-Aware Compiler for Hierarchical Register File on GPUs
    Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
    Journal of Systems Architecture (JSA), Vol. 149, pp. 103099, Apr. 2024
    *co-corresponding authors
  5. SAVector: Vectored Systolic Arrays
    Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon*, and Yunho Oh*
    IEEE Access, Vol. 12, pp. 44446 - 44461, March 2024
    *co-corresponding authors
  6. CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
    Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
    IEEE Embedded Systems Letters (ESL), Vol. 14, Issue 4, pp. 187-190, Dec. 2022
  7. Analyzing GCN Aggregation on GPU
    Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
    IEEE Access, Vol. 10, pp. 113046 - 113060, Oct. 2022
  8. GhostLeg: Selective Memory Coalescing for Secure GPU Architecture
    Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
    IEEE Access, Vol. 10, pp. 111449 - 111462, Oct. 2022
  9. TEA-RC: Thread Context-Aware Register Cache for GPUs
    Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
    IEEE Access, Vol. 10, pp. 82049 - 82062, Aug. 2022
  10. REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
    Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
    IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 31, Issue 5, pp. 1137-1151, May 2020
  11. Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
    Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
    IEEE Transactions on Computers (TC), Vol. 68, No. 4, pp. 609-616, Apr. 2019
  12. WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
    Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
    IEEE Transactions on Computers (TC), Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
  13. Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
    Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
    IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
  14. A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
    Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
    Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013

Patents

  1. Storage System And Operating Method Thereof
    KR-10-2017-0070960, US10671307B2