Selected Papers


Conference Papers

  1. INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core [LINK]
    Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
    The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2023), Vienna, Austria, Oct. 21 - 25, 2023
  2. Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors (Best Paper Award) [LINK]
    Jonghyun Jeong, Myung Kuk Yoon, Yunho Oh, and Gunjae Koo
    The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, USA, August 7 - 10, 2023
  3. Early-Adaptor: An Adaptive Framework For Proactive UVM Memory Management [LINK]
    Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
    The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2023), Raleigh, NC, USA, April 23 - 25, 2023
  4. Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism [LINK]
    Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
    The 37th Association for the Advancement of Artificial Intelligence (AAAI-23), Washington DC, USA, Feb. 07 - 14, 2023
  5. Reconstructing Out-of-Order Issue Queue [LINK]
    Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
    The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO 2022), Chicago, Illinois, USA, Oct. 01 - 05, 2022
  6. FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput [LINK]
    Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
    The 51st IEEE/ACM International Symposium on Microarchitecture (MICRO 2018), Fukuoka, Japan, Oct. 20 - 24, 2018
  7. Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit [LINK]
    Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
    The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
  8. APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs [LINK]
    Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
    The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
  9. Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding [LINK]
    Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
    The 22nd International IEEE Symposium on High Performance Computer Architecture (HPCA 2016), Barcelona, Spain, Mar. 12 - 16, 2016
  10. DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU [LINK]
    Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
    The 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2015), Philadelphia, PA, USA, Mar. 29 - 31, 2015

Journal Papers

  1. Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs [LINK]
    Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
    IEEE Embedded Systems Letters (ESL) (Accepted)
  2. Triple-A: Early Operand Collector Allocation for Maximizing GPU Register Bank Utilization [LINK]
    Ipoom Jeong, Eunbi Jeong, Nam Sung Kim, and Myung Kuk Yoon
    IEEE Embedded Systems Letters (ESL) (Accepted)
  3. Conflict-Aware Compiler for Hierarchical Register File on GPUs [LINK]
    Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh, and Myung Kuk Yoon
    Journal of Systems Architecture (JSA), Vol. 149, pp. 103099, April 2024
  4. SAVector: Vectored Systolic Arrays [LINK]
    Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon, and Yunho Oh
    IEEE Access, Vol. 12, pp. 44446 - 44461, March 2024
  5. CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs [LINK]
    Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
    IEEE Embedded Systems Letters (ESL), Vol. 14, Issue 4, pp. 187-190, Dec. 2022
  6. Analyzing GCN Aggregation on GPU [LINK]
    Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
    IEEE Access, Vol. 10, pp. 113046 - 113060, Oct. 2022
  7. GhostLeg: Selective Memory Coalescing for Secure GPU Architecture [LINK]
    Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
    IEEE Access, Vol. 10, pp. 111449 - 111462, Oct. 2022
  8. TEA-RC: Thread Context-Aware Register Cache for GPUs [LINK]
    Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
    IEEE Access, Vol. 10, pp. 82049 - 82062, Aug. 2022
  9. REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing [LINK]
    Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
    IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 31, Issue 5, pp. 1137-1151, May 2020
  10. Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs [LINK]
    Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
    IEEE Transactions on Computers (TC), Vol. 68, No. 4, pp. 609-616, Apr. 2019
  11. WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs [LINK]
    Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
    IEEE Transactions on Computers (TC), Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
  12. Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs [LINK]
    Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
    IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
  13. A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems [LINK]
    Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
    Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013

Patents

  1. Storage System And Operating Method Thereof
    KR-10-2276912-0000, US10671307B2