Publications
- Sowoong Kim, Eunyeong Sim, Youngsam Shin, YeonGon Cho, and Woongki Baek, “Activation Sequence Caching: High-Throughput and Memory-Efficient Generative Inference with a Single GPU,” in the Proceedings of the 33rd International Conference on Parallel Architectures and Compilation Techniques (PACT), Oct. 2024 (to appear).
- Myeonggyun Han, Eunseong Park, Youngsam Shin, Deok-Jae Oh, YeonGon Cho, and Woongki Baek, “COSMOS: Coordinated Management of Cores, Memory, and Compressed Memory Swap for QoS-Aware and Efficient Workload Consolidation for Memory-Intensive Applications,” in IEEE Access, 2023.
- Sowoong Kim, Myeonggyun Han, and Woongki Baek, “MARF: A Memory-Aware CLFLUSH-Based Intra- and Inter-CPU Side-Channel Attack,” in the Proceedings of the 28th European Symposium on Research in Computer Security (ESORICS), Sep. 2023.
- Myeonggyun Han and Woongki Baek, “SDRP: Safe, Efficient, and SLO-Aware Workload Consolidation through Secure and Dynamic Resource Partitioning,” in the IEEE Transactions on Services Computing (TSC), July-Aug., 2022.
- Sowoong Kim, Myeonggyun Han, and Woongki Baek, “DPrime+DAbort: A High-Precision and Timer-Free Directory-Based Side-Channel Attack in Non-Inclusive Cache Hierarchies using Intel TSX,” in the Proceedings of the 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA), Apr. 2022.
- Myeonggyun Han and Woongki Baek, “HERTI: a Reinforcement Learning-Augmented System for Efficient Real-Time Inference on Heterogeneous Embedded Systems,” in the Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques (PACT), Sep. 2021.
- Jinsu Park, Seongbeom Park, Myeonggyun Han, and Woongki Baek, “PALM: Progress- and Locality-Aware Adaptive Task Migration for Efficient Thread Packing,” in the Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 2021.
- Myeonggyun Han, Jinsu Park, and Woongki Baek, “Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications,” in the IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021.
- Myeonggyun Han, Jihoon Hyun, Seongbeom Park, and Woongki Baek, “Hotness- and Lifetime-Aware Data Placement and Migration for High-Performance Deep Learning on Heterogeneous Memory Systems,” in the IEEE Transactions on Computers (TC), 2020.
- Myeonggyun Han, Jihoon Hyun, Seongbeom Park, Jinsu Park, and Woongki Baek, “MOSAIC: Heterogeneity-, Communication-, and Constraint-Aware Model Slicing and Execution for Accurate and Efficient Inference,” in the Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques (PACT), Sep. 2019.
- Jinsu Park, Seongbeom Park, Myeonggyun Han, and Woongki Baek, “POSTER: The Performance Impact of Thread Packing on Synchronization-Intensive Applications,” in the Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques (PACT), Sep. 2019 (Poster).
- Jinsu Park, Seongbeom Park, and Woongki Baek, “CoPart: Coordinated Partitioning of Last-Level Cache and Memory Bandwidth for Fairness-Aware Workload Consolidation on Commodity Servers,” in the Proceedings of the 14th European Conference on Computer Systems (EuroSys), Mar. 2019.
- Jinsu Park and Woongki Baek, “Analyzing and Optimizing the Performance and Energy Efficiency of Transactional Scientific Applications on Large-Scale NUMA Systems with HTM Support,” in the Journal of Parallel and Distributed Computing (JPDC), 2019.
- Seontae Kim, Nguyen Pham, Woongki Baek, and Young-ri Choi, “Holistic VM Placement for Distributed Parallel Applications in Heterogeneous Clusters,” in the IEEE Transactions on Services Computing (TSC), 2019.
- Kyu Yeun Kim, Jinsu Park, and Woongki Baek, “Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management,” in the IEEE Transactions on Parallel and Distributed Systems (TPDS), 2019.
- Jinsu Park, Seongbeom Park, Myeonggyun Han, Jihoon Hyun, and Woongki Baek, “HyPart: A Hybrid Technique for Practical Memory Bandwidth Partitioning on Commodity Servers,” in the Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques (PACT), Nov. 2018.
- Kyu Yeun Kim and Woongki Baek, “BLPP: Improving the Performance of GPGPUs with Heterogeneous Memory through Bandwidth- and Latency-Aware Page Placement,” in the Proceedings of the 36th IEEE International Conference on Computer Design (ICCD), Oct. 2018.
- Jihoon Hyun, Jinsu Park, Kyu Yeun Kim, Seongdae Yu, and Woongki Baek, “CEML: a Coordinated Runtime System for Efficient Machine Learning on Heterogeneous Computing Systems,” in the Proceedings of the 24th International European Conference on Parallel and Distributed Computing (Euro-Par), Aug. 2018.
- Jinsu Park and Woongki Baek, “Quantifying the Performance and Energy-Efficiency Impact of Hardware Transactional Memory on Scientific Applications on Large-Scale NUMA Systems,” in the Proceedings of the 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 2018.
- Myeonggyun Han, Seongdae Yu, and Woongki Baek, “Secure and Dynamic Core and Cache Partitioning for Safe and Efficient Server Consolidation,” in the Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid), May 2018 (selected as a Best Paper Finalist).
- Jinsu Park, Seongbeom Park, and Woongki Baek, “RPPC: a Holistic Runtime System for Maximizing Performance under Power Capping,” in the Proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid), May 2018.
- Seongdae Yu, Seongbeom Park, and Woongki Baek, “Design and Implementation of Bandwidth-Aware Memory Placement and Migration Policies for Heterogeneous Memory Systems,” in the Proceedings of the 31st International Conference on Supercomputing (ICS), Jun. 2017.
- Seontae Kim, Nguyen Pham, Woongki Baek, and Young-ri Choi, “Machine Learning-Based Performance Estimation for Distributed Parallel Applications in Virtualized Heterogeneous Clusters,” in the Proceedings of 37th IEEE International Conference on Distributed Computing Systems (ICDCS), Jun. 2017 (Poster).
- Jihye Seo, Wook-Hee Kim, Woongki Baek, Beomseok Nam, and Sam H. Noh, “Failure-Atomic Slotted Paging for Persistent Memory,” in the Proceedings of the 22nd International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Apr. 2017.
- Myeonggyun Han, Jinsu Park, and Woongki Baek, “CHRT: a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications,” in the Proceedings of the 2017 Design, Automation & Test in Europe Conference & Exhibition (DATE), Mar. 2017.
- Kyu Yeun Kim, Jinsu Park, and Woongki Baek, “IACM: Integrated Adaptive Cache Management for High-Performance and Energy-Efficient GPGPU Computing,” in the Proceedings of the 34th IEEE International Conference on Computer Design (ICCD), Oct. 2016.
- Jinsu Park, Eunbi Cho, and Woongki Baek, “RMC: an Integrated Runtime System for Adaptive Many-Core Computing,” in the Proceedings of the 13th International Conference on Embedded Software (EMSOFT), Oct. 2016.
- Jinsu Park, Myeonggyun Han, and Woongki Baek, “Quantifying the Performance Impact of Large Pages on In-Memory Big-Data Workloads,” in the Proceedings of the 2016 IEEE International Symposium on Workload Characterization (IISWC), Sep. 2016.
- Jinsu Park and Woongki Baek, “HAP: a Heterogeneity-Conscious Runtime System for Adaptive Pipeline Parallelism,” in the Proceedings of the 22nd International European Conference on Parallel and Distributed Computing (Euro-Par), Aug. 2016.
- Seunghoe Kim and Woongki Baek, “HAPT: Hardware-Accelerated Persistent Transactions,” in the Proceedings of the 5th IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA), Aug. 2016.
- Jinsu Park and Woongki Baek, “RCHC: a Holistic Runtime System for Concurrent Heterogeneous Computing,” in the Proceedings of the 45th International Conference on Parallel Processing (ICPP), Aug. 2016.
- Kyu Yeun Kim and Woongki Baek, “Quantifying the Performance and Energy Efficiency of Advanced Cache Indexing for GPGPU Computing,” in the Journal on Microprocessors and Microsystems, 2016.
- Wook-Hee Kim, Jinwoong Kim, Woongki Baek, Beomseok Nam, and Youjip Won, “NVWAL: Exploiting NVRAM in Write-Ahead-Logging,” in the Proceedings of the 21st International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Apr. 2016.
- Kyu Yeun Kim, Seunghoe Kim, and Woongki Baek, “On the Feasibility of Advanced Cache Indexing for High-Performance and Energy-Efficient GPGPU Computing,” in the Proceedings of the 3rd ACM International Workshop on Manycore Embedded Systems (MES), Jun. 2015.
- Jaeyoung Yun, Jinsu Park, and Woongki Baek, “HARS: a Heterogeneity-Aware Runtime System for Self-Adaptive Multithreaded Applications,” in the Proceedings of the 52nd Design Automation Conference (DAC), Jun. 2015.
- Woongki Baek, Richard Yoo, and Christos Kozyrakis, “Enhanced Concurrency Control with Transactional NACKs,” the 8th ACM SIGPLAN Workshop on Transactional Computing (TRANSACT), Mar. 2013.
- Woongki Baek and Trishul Chilimbi, “Green: A Framework for Supporting Energy-Conscious Programming using Controlled Approximation,” in the Proceedings of the ACM SIGPLAN 2010 Conference on Programming Language Design and Implementation (PLDI), Jun. 2010 (won Most Influential PLDI Paper Award).
- Woongki Baek, Nathan Bronson, Christos Kozyrakis, and Kunle Olukotun, “Making Nested Parallel Transactions Practical using Lightweight Hardware Support,” in the Proceedings of the 24th International Conference on Supercomputing (ICS), Jun. 2010.
- Woongki Baek, Nathan Bronson, Christos Kozyrakis, and Kunle Olukotun, “Implementing and Evaluating Nested Parallel Transactions in Software Transactional Memory,” in the Proceedings of the 22nd ACM Symposium on Parallelism Architectures and Algorithms (SPAA), Jun. 2010.
- Woongki Baek, Nathan Bronson, Christos Kozyrakis, and Kunle Olukotun, “Implementing and Evaluating a Model Checker for Transactional Memory Systems,” in the Proceedings of the 15th IEEE International Conference on Engineering of Complex Computer Systems (ICECCS), Mar. 2010.
- Woongki Baek and Trishul Chilimbi, “Green: A System for Supporting Energy-Conscious Programming using Principled Approximation,” MSR-TR-2009-89.
- JaeWoong Chung, Woongki Baek, and Christos Kozyrakis, “Fast Memory Snapshot for Concurrent Programming without Synchronization,” in the Proceedings of the 23rd International Conference on Supercomputing (ICS), Jun. 2009.
- JaeWoong Chung, Woongki Baek, Nathan G.Bronson, Jiwon Seo, Christos Kozyrakis, and Kunle Olukotun, “ASeD: Availability, Security, and Debugging Support using Transactional Memory,” in the Proceedings of the 20th ACM Symposium on Parallelism Architectures and Algorithms (SPAA), Jun. 2008 (Poster).
- JaeWoong Chung, Jiwon Seo, Woongki Baek, Chi Cao Minh, Austen McDonald, Christos Kozyrakis, and Kunle Olukotun, “Improving Software Concurrency with Hardware-assisted Memory Snapshot,” in the Proceedings of the 20th ACM Symposium on Parallelism Architectures and Algorithms (SPAA), Jun. 2008 (Poster).
- Woongki Baek, Chi Cao Minh, Martin Trautmann, Christos Kozyrakis, and Kunle Olukotun, “The OpenTM Transactional Application Programming Interface,” in the Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT), Sep. 2007.
- Woongki Baek, Young-Jin Kim, Jinhyo Kim, and Jihong Kim, “A Measurement-based Automatic Energy Optimization Technique for Embedded Applications,” in the Journal of Low Power Electronics, Vol. 3, No. 2, 2007.
- Woongki Baek, JaeWoong Chung, Chi Cao Minh, Christos Kozyrakis, and Kunle Olukotun, “Towards Soft Optimization Techniques for Parallel Cognitive Applications,” in the Proceedings of the 19th ACM Symposium on Parallelism Architectures and Algorithms (SPAA), Jun. 2007 (Brief Announcement).
- Hassan Chafi, Jared Casper, Brian D. Carlstrom, Austen McDonald, Chi Cao Minh, Woongki Baek, Christos Kozyrakis, and Kunle Olukotun, “A Scalable, Non-blocking Approach to Transactional Memory,” in the Proceedings of the 13th International Symposium on High-Performance Computer Architecture (HPCA), Feb. 2007.
- Woongki Baek and Jihong Kim, “Load-Store Reordering for Low-Power Multimedia Data Transfers,” in the Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS), May. 2005.
- Woongki Baek, Young-Jin Kim, and Jihong Kim, “ePRO: A Tool for Energy and Performance Profiling for Embedded Applications,” in the Proceedings of International SoC Conference (ISOCC), Oct. 2004.