Selected Papers
Conference Papers
-
Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction
Eunbi Jeong, Ipoom Jeong*, Myung Kuk Yoon*, and Nam Sung Kim
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
*co-corresponding authors
-
Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
-
DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec. 9 - 15, 2024
-
VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing [LINK]
Jaebeom Jeon, Minseong Gil, Junsu Kim, Jaeyong Park, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
The 53rd International Conference on Parallel Processing (ICPP 2024), Gotland, Sweden, August 12 - 15, 2024
*co-corresponding authors
-
INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core [LINK]
Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2023), Vienna, Austria, Oct. 21 - 25, 2023
-
Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors (Best Paper Award) [LINK]
Jonghyun Jeong, Myung Kuk Yoon, Yunho Oh, and Gunjae Koo
The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, USA, August 7 - 10, 2023
-
Early-Adaptor: An Adaptive Framework For Proactive UVM Memory Management [LINK]
Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2023), Raleigh, NC, USA, Apr. 23 - 25, 2023
-
Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism [LINK]
Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
The 37th Association for the Advancement of Artificial Intelligence (AAAI-23), Washington DC, USA, Feb. 07 - 14, 2023
-
Reconstructing Out-of-Order Issue Queue [LINK]
Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO 2022), Chicago, Illinois, USA, Oct. 01 - 05, 2022
-
FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput [LINK]
Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
The 51st IEEE/ACM International Symposium on Microarchitecture (MICRO 2018), Fukuoka, Japan, Oct. 20 - 24, 2018
-
Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit [LINK]
Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
-
APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs [LINK]
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
-
Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding [LINK]
Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
The 22nd International IEEE Symposium on High Performance Computer Architecture (HPCA 2016), Barcelona, Spain, Mar. 12 - 16, 2016
-
DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU [LINK]
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
The 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2015), Philadelphia, PA, USA, Mar. 29 - 31, 2015
Journal Papers
-
TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs
Minseong Gil, Jaebeom Jeon, Junsu Kim, Sangun Choi, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
IEEE Embedded Systems Letters (ESL) (Accepted)
-
Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs [LINK]
Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
IEEE Embedded Systems Letters (ESL) (Accepted)
*co-corresponding authors
-
Triple-A: Early Operand Collector Allocation for Maximizing GPU Register Bank Utilization [LINK]
Ipoom Jeong, Eunbi Jeong, Nam Sung Kim, and Myung Kuk Yoon
IEEE Embedded Systems Letters (ESL), Vol. 16, Issue 2, pp. 206-209, June 2024
-
Conflict-Aware Compiler for Hierarchical Register File on GPUs [LINK]
Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
Journal of Systems Architecture (JSA), Vol. 149, pp. 103099, Apr. 2024
*co-corresponding authors
-
SAVector: Vectored Systolic Arrays [LINK]
Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon*, and Yunho Oh*
IEEE Access, Vol. 12, pp. 44446 - 44461, March 2024
*co-corresponding authors
-
CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs [LINK]
Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
IEEE Embedded Systems Letters (ESL), Vol. 14, Issue 4, pp. 187-190, Dec. 2022
-
Analyzing GCN Aggregation on GPU [LINK]
Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, Vol. 10, pp. 113046 - 113060, Oct. 2022
-
GhostLeg: Selective Memory Coalescing for Secure GPU Architecture [LINK]
Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, Vol. 10, pp. 111449 - 111462, Oct. 2022
-
TEA-RC: Thread Context-Aware Register Cache for GPUs [LINK]
Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
IEEE Access, Vol. 10, pp. 82049 - 82062, Aug. 2022
-
REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing [LINK]
Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 31, Issue 5, pp. 1137-1151, May 2020
-
Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs [LINK]
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
IEEE Transactions on Computers (TC), Vol. 68, No. 4, pp. 609-616, Apr. 2019
-
WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs [LINK]
Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
IEEE Transactions on Computers (TC), Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
-
Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs [LINK]
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
-
A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems [LINK]
Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
Patents
-
Storage System And Operating Method Thereof
KR-10-2017-0070960, US10671307B2
ALL Papers
Conference Papers
-
Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction
Eunbi Jeong, Ipoom Jeong*, Myung Kuk Yoon*, and Nam Sung Kim
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
*co-corresponding authors
-
Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
The 31st International IEEE Symposium on High Performance Computer Architecture (HPCA 2025), Las Vegas, United States, Mar. 1 - 5, 2025
-
DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec. 9 - 15, 2024
-
Performance Comparison of CNN Pruning Techniques Using NanoSAM Model on Jetson Orin Nano
Jaeeun Hwang, Seonwoo Kim, and Myung Kuk Yoon
2024 Autumn Annual Conference of IEIE, Jeongseon, Gangwon, Korea, November 22 - 23, 2024
-
VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
Jaebeom Jeon, Minseong Gil, Junsu Kim, Jaeyong Park, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
The 53rd International Conference on Parallel Processing (ICPP 2024), Gotland, Sweden, August 12 - 15, 2024
*co-corresponding authors
-
Twisted Bank Arbitrator for Balanced Register Bank Accesses on Graphics Processing Units
Eunbi Jeong, Ipoom Jeong, and Myung Kuk Yoon
2024 Summer Annual Conference of IEIE, Jeju, Korea, June 26 - 28, 2024
-
INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core
Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2023), Vienna, Austria, Oct. 21 - 25, 2023
-
Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors (Best Paper Award)
Jonghyun Jeong, Myung Kuk Yoon, Yunho Oh, and Gunjae Koo
The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, USA, August 7 - 10, 2023
-
Preloading Architecture for Graphics Processing Unit (Paper Award, 우수논문상)
Eun Seong Park, Eunbi Jeong, and Myung Kuk Yoon
2023 Summer Annual Conference of IEIE, Jeju, Korea, June 28 - 30, 2023
-
Reduced Precision Floating Point for Ray Tracing
Eun Soo Jung, Yeonhee Jung, and Myung Kuk Yoon
2023 Summer Annual Conference of IEIE, Jeju, Korea, June 28 - 30, 2023
-
Early-Adaptor: An Adaptive Framework For Proactive UVM Memory Management
Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2023), Raleigh, NC, USA, Apr. 23 - 25, 2023
-
Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
The 37th Association for the Advancement of Artificial Intelligence (AAAI-23), Washington DC, USA, Feb. 07 - 14, 2023
-
Reconstructing Out-of-Order Issue Queue
Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO 2022), Chicago, Illinois, USA, Oct. 01 - 05, 2022
-
Compiler-Assisted GPU Register File Power Management Technique
Myung Kuk Yoon
2022 International Conference on Electronics, Information, and Communication (ICEIC 2022), Jeju, Korea, Feb. 06 - 09, 2022
-
Analyzing Characteristics of Memory Side-Channels in GPU
Seungho Jung, Myung Kuk Yoon, and Gunjae Koo
2021 Korea Software Congress (KSC 2021), Pyeongchang, Korea, Dec. 20 - 22, 2021
-
FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
The 51st IEEE/ACM International Symposium on Microarchitecture (MICRO 2018), Fukuoka, Japan, Oct. 20 - 24, 2018
-
Optimizing Intersection and Reflection Step of Geometrical Optics using GPUs
Hyun Jin Chung, Myung Kuk Yoon, and Won Woo Ro
The 16th International Conference on Electronics, Information and Communication (ICEIC 2017), Phuket, Thailand, Jan. 11 - 14, 2017
-
Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
-
APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA 2016), Seoul, Korea, Jun. 18 - 22, 2016
-
Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
The 22nd International IEEE Symposium on High Performance Computer Architecture (HPCA 2016), Barcelona, Spain, Mar. 12 - 16, 2016
-
DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
The 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2015), Philadelphia, PA, USA, Mar. 29 - 31, 2015
-
Directory Centralized Ring-based Interconnection for Multi-Core Systems
Myung Kuk Yoon, Sangpil Lee, Deokho Kim, and Won Woo Ro
The 12th International Conference on Electronics, Information and Communication (ICEIC 2013), Bali, Indonesia, Jan. 30 - Feb. 2, 2013
Journal Papers
-
TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs
Minseong Gil, Jaebeom Jeon, Junsu Kim, Sangun Choi, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
IEEE Embedded Systems Letters (ESL) (Accepted)
-
Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon*, and Yunho Oh*
IEEE Embedded Systems Letters (ESL) (Accepted)
*co-corresponding authors
-
Advancements in GPUs for Maximizing AI Application Performance and Research Trends
Jane Rhee, Eunbi Jeong, and Myung Kuk Yoon
Communications of the Korean Institute of Information Scientists and Engineers, Vol. 42, Issue 9, pp. 8-13, Sep. 2024
-
Triple-A: Early Operand Collector Allocation for Maximizing GPU Register Bank Utilization
Ipoom Jeong, Eunbi Jeong, Nam Sung Kim, and Myung Kuk Yoon
IEEE Embedded Systems Letters (ESL), Vol. 16, Issue 2, pp. 206-209, June 2024
-
Conflict-Aware Compiler for Hierarchical Register File on GPUs
Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
Journal of Systems Architecture (JSA), Vol. 149, pp. 103099, Apr. 2024
*co-corresponding authors
-
SAVector: Vectored Systolic Arrays
Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon*, and Yunho Oh*
IEEE Access, Vol. 12, pp. 44446 - 44461, March 2024
*co-corresponding authors
-
Performance Analysis of Neural Processing Units with Emerging Memory Technologies
Sangun Choi, Seongjun Park, Jaeyong Park, Seokin Hong, Myung Kuk Yoon, and Yunho Oh
Journal of the Institute of Electronics and Information Engineers (IEIE), Vol. 60, No. 7, pp. 30-39, July 2023
-
Fairness Analysis of Multi-Tenant Applications on Multi-Instance GPUs
Jane Rhee and Myung Kuk Yoon
Journal of the Institute of Electronics and Information Engineers (IEIE), Vol. 60, No. 4, pp. 11-23, Apr. 2023
-
CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
IEEE Embedded Systems Letters (ESL), Vol. 14, Issue 4, pp. 187-190, Dec. 2022
-
Analyzing GCN Aggregation on GPU
Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, Vol. 10, pp. 113046 - 113060, Oct. 2022
-
GhostLeg: Selective Memory Coalescing for Secure GPU Architecture
Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, Vol. 10, pp. 111449 - 111462, Oct. 2022
-
TEA-RC: Thread Context-Aware Register Cache for GPUs
Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
IEEE Access, Vol. 10, pp. 82049 - 82062, Aug. 2022
-
REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 31, Issue 5, pp. 1137-1151, May 2020
-
Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
IEEE Transactions on Computers (TC), Vol. 68, No. 4, pp. 609-616, Apr. 2019
-
WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
IEEE Transactions on Computers (TC), Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
-
Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
-
Introduction to Researches on Performance Bottlenecks of Many-Core GPU Architectures
Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, and Won Woo Ro
Communications of KIISE, Vol. 32 No. 5, May, 2014
-
A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
Patents
-
Electronic Device for Pre-Allocating Operand Collector to Use Register File Efficiently and Operation Method Thereof
KR-10-2023-0110116
-
A Method for Determining Register Cache Index of Processor and Electroic Device Performing the Same
KR-10-2022-0096238
-
Storage System And Operating Method Thereof
KR-10-2017-0070960, US10671307B2
-
Method and Apparatus for Analyzing Radio Wave Environment in A Wireless Communication System
KR-10-2019-0083497