ICPP 2021 Program
All times are in CDT (Chicago time).

YouTube Channel

Overview | By Date | By Event Type | By Room | Author Index

A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z

A
Aerts, Kris · moreParatick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view
Afibuzzaman, Md · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view
Aktulga, Hasan Metin · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view
Al Saadi, Aymen · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Alam, Maksudul · moreDesign Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view
Alfe, Dario · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Alperen, Abdullah · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view
Anitescu, Mihai · moreDomain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view
Armejach, Adrià · moregem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view
Arrigoni, Viviana · moreEfficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view
Artiles, Oswaldo · moreTurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view
Ayguadé Parra, Eduard · moreCombining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view

B
Babuji, Yadu · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Bai, Yang · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
Ballard, Grey · moreAccelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view
Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view
Bangalore, Purushotham · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Baravdish, Gabriel George · moreGPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view
Barker, Kevin · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Beltran Querol, Vicenç · moreCombining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view
Berezun, Daniil · moreEfficient Parallel Algorithms for String Comparison · pdf, mp4 · view
Bernholdt, David · moreImplementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view
Bhati, Agastya · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Bischof, Christian · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Biswas, Swarnendu · moreExplaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view
Blaiszik, Benjamin · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Brace, Alexander · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Brettin, Thomas · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Buhler, Jeremy · moreEnabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view
Buluc, Aydin · moreScaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view

C
Cai, Lei · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Cai, Wei · moreFastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view
Cai, Wentao · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
Cai, Zhigang · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Cai, Zhiping · moreFedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view
Caíno-Lores, Silvina · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Cao, Guohua · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Cao, Huanqi · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Cao, Qiang · moreSPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view
Cartier, Hannah · moreOptimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view
Catalyurek, Umit · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view
Chang, Da-Wei · moreDual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view
Chang, Harry · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Chang, Liang · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Chapman, Barbara · moreA Virtual GPU as Developer-Friendly OpenMP Offload Target · view
Chapman, David · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Chard, Kyle · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Chard, Ryan · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Chen, Bangduo · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Chen, Hanhua · moreEfficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view
Chen, Jianxi · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Chen, Jieyang · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Chen, Kai · moreA Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view
Chen, Li · moreProphet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view
Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view
Chen, Quan · moreDubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Chen, Si · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view
Chen, Wei · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Chen, Wenguang · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Chen, Yanhao · moreBGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view
Chen, Yingwen · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Chen, YuAng · moreHiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view
Chen, Zhiguang · moreOptimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view
Cheng, Albert Mo Kim · moreA Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view
Cheng, Liangfeng · moreCoupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view
Chesterfield, Jon · moreShared Memory Remote Procedure Calls · view
Chiu, Kenneth · moreGVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view
Choi, Hyuckjin · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Choi, Jong · moreDYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view
Chung, Yeh-ching · moreHiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view
Ci, Yiwei · moreMatryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view
Clyde, Austin · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Codognet, Philippe · moreConstraint Solving by Quantum Annealing · pdf, mp4 · view
Cornelius, Melanie · moreAdvancing OpenMP Offload Debugging Capabilities in LLVM · view
Coveney, Peter · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

D
Dai, Guangli · moreA Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view
Deelman, Ewa · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Deng, Haiwei · moreA Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view
Deng, Tongliang · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view
Deng, Xun · moreAdapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view
Denis, Alexandre · moreInterferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view
Deodhar, Akshay · moreExplaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view
Dewald, Florian · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Dhandhania, Sunidhi · moreExplaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view
Dietz, Henry · moreTangled: A Conventional Processor Integrating A Quantum-Inspired Coprocessor · pdf, mp4 · view
Dinan, James · moreOptimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view
Ding, Xiaoning · moreParatick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view
Do, Tu Mai Anh · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Doerfert, Johannes · moreAdvancing OpenMP Offload Debugging Capabilities in LLVM · view
A Virtual GPU as Developer-Friendly OpenMP Offload Target · view
Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view
Dong, Dezun · moreCD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view
Dong, Pengmin · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Dosanjh, Matthew · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Du, Jingwen · moreFast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
Du, Mingzhe · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
Duan, Yubin · moreJoint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view

E
Eker, Ali · moreGVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view
Ellis, Marquita · moreScaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view
Elwasif, Wael · moreImplementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view
Enright Jerger, Natalie · moreGhostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view

F
F. Lorenzon, Arthur · moreCombining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view
Fan, Sijiang · moreCNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view
Fang, Liang · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Fang, Qiming · moreParallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view
Fathi, Arash · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Fei, Jiawei · moreCNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view
Fei, Xiang · moreRegu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view
Feng, Dan · moreA Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view
Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view
ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view
Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view
Feng, Ke · moreFMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view
Feng, Wu · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Feng, Xiaobing · moreLoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view
Feng, Zonghao · moreAccelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view
Ferreira da Silva, Rafael · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Firoz, Jesun · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Fohry, Claudia · moreTransparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view
Foster, Ian · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Fu, Qiang · moreAutomatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view
Fu, Song · moreBoosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view
Fujimoto, Manato · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Fujimoto, Noriyuki · moreEfficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view

G
Gao, Jiechao · moreMulti-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view
Gao, Liang · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Gao, Weiguo · moreProcessor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view
Gavirangaswamy, Vinay · moreTowards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view
Georgakoudis, Giorgis · moreTowards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view
Gerofi, Balazs · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Gerstlauer, Andreas · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Ghafoor, Sheikh · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Ghanim, Fady · moreImplementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view
Gibbs, Thomas · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Gite, Rahul · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Goel, Garvit · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Gondhalekar, Atharva · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Gong, Xiaoli · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Gourounas, Dimitrios · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Govil, Karan · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Grant, Ryan · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Groppe, Sven · moreCuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view
Groth, Tobias · moreCuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view
Guo, Minyi · moreDubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Guo, Xiao-Wei · moreCNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view
Guo, Yeting · moreFedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view
Guo, Zehua · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Gupta, Ajay · moreTowards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view

H
Hale, Kyle · moreCache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view
Halem, Milton · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Han, Yongguo · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Hanindhito, Bagus · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
He, Heng · moreFMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view
He, Shuibing · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
Hong, Yang · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Hossain, Md Maruf · morePostmortem Graph Analysis on the Temporal Graph · pdf, pdf · view
Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view
Hsu, Wei-Chung · moreIntra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view
Hu, Jing · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Hu, Yang · moreEnabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view
Hu, Yi · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Hu, Yongmin · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Hu, Yuchong · moreA Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view
Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view
Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view
Hua, Fei · moreBGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view
Hua, Qiang-Sheng · moreCommunication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view
Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view
Huang, Chenglong · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Huang, Chung-Wen · moreSupport Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view
Huang, Dan · moreOptimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view
Huang, H. Howie · moreAutomatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view
Huang, Jiawen · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Huang, Kaixin · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Huang, Min · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Huang, Tao · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Huang, Yizhi · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
Huber, Joseph · moreAdvancing OpenMP Offload Debugging Capabilities in LLVM · view
Hundt, Christian · moreMetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view
Hung, Ming-Yu · moreAccelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

I
Ikeda, Takuya · moreNew Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view
Ilic, Aleksandar · moreFourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view
Imamura, Toshiyuki · moreAccurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view

J
Jahic, Jasmin · moreArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view
Jarachanthan, Jananie · moreAMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view
Jayatilaka, Tarindu · moreTowards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view
Jeannot, Emmanuel · moreInterferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view
Jenkins, Louis · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
Jha, Shantenu · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Ji, Zhuoran · moreAccelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view
Jia, Ranhao · moreA Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view
Jia, Zhen · moreLoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view
Jiang, Dejun · moreUsing Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view
Jiang, Hao · moreXHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view
Jiang, Shizhi · moreMatryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view
Jin, Hai · moreCommunication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view
Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view
Jin, Yuwei · moreBGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view
Jin, Zheming · moreEvaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view
John, Lizy · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Jünger, Daniel · moreMetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

K
Kanayama, Yuta · moreNew Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view
Kao, Henry · moreGhostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view
Ke, Zhaokang · moreCoupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view
Ke, Zong-Ming · moreDual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view
Keipert, Kristopher · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Khan, Md Muhib · moreROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view
Kilpatrick, Peter · moreExploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view
Klein, Christoph · moreTridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view
Kobus, Robin · moreMetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view
Koohi Esfahani, Mohsen · moreExploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view
Koppehel, Martin · moreCuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view
Kozakai, Seiya · moreEfficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view
Kranzlmüller, Dieter · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Kruse, Michael · moreLoop Transformations using Clang's Abstract Syntax Tree · view
Kurth, Thorsten · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

L
Lai, Junjie · moreOptimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view
Lai, Jyun-Kai · moreHyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view
Lai, Wei-Chih · moreSupport Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view
Lai, Zhiquan · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Lan, Hao · moreAccelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view
Langguth, Johannes · moreExplaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view
Larkins, D. Brian · moreOptimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view
Leandro Nesi, Lucas · moreExploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view
Lee, Chao-Lin · moreAccelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view
Lee, Hyungro · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Lee, Jenq-Kuen · moreAccelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view
Legrand, Arnaud · moreExploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view
Lehr, Jan-Patrick · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Lei, Mengya · moreCrash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view
Leng, Jingwen · moreDubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Li, Ang · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Li, Angela · moreCache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view
Li, Baochun · moreAccelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view
Li, Baoqian · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Li, Bo · moreAMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view
Li, Chuanying · moreXHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view
Li, Dawei · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Li, Dongsheng · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Li, Fan · moreFast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view
Li, Guangli · moreLoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view
Li, Hongyan · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Li, Jiajia · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Li, Jiawei · moreProgressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view
Li, Jun · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Li, Li · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Li, Mingshu · moreMatryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view
Li, Mingzhen · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Li, Minjun · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Li, Qiliang · moreFast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view
Li, Renfa · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
Li, Ruihao · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Li, Shengwei · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Li, Weiguang · moreFast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
Li, Xiaowei · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Li, Xiaoying · moreMulti-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view
Li, Yongkun · moreProgressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view
Li, Yubo · moreExploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view
Li, Yun-Ze · moreDual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view
Li, Zhuozhao · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Li, Zirui · moreDubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Li, Zitong · moreParallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view
Liao, Hui-Hsin · moreSupport Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view
Liao, Jianwei · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Liao, Pin-Wei · moreIntra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view
Liao, Shih-Wei · moreIntra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view
Liao, Xiangke · moreCD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view
Lin, Che-Chia · moreAccelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Lin, Tzu-Chia · moreAutomated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view
Lin, Xiang · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Lin, Yonghua · moreExploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view
Liu, chengyu · moreFMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view
Liu, Fang · moreFedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view
Liu, Hanfeng · moreFastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view
Liu, Junhong · moreOptimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view
Liu, Sen · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Liu, Wenbin · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Liu, Wuji · moreNoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view
Liu, Xiaoyan · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Liu, Yan · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
Liu, Yi · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Liu, Zhiming · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
López-Paradís, Guillem · moregem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view
Lu, Hang · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Lu, Yutong · moreOptimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view
Luan, Dongming · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Luan, Zhongzhi · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Luo, Qiong · moreAccelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view
Luo, Yingwei · moreAn Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view
Lv, Pengze · moreCERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Lyu, Min · moreFast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view

M
Ma, Heng · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Maggioli, Filippo · moreEfficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view
Maldonado, Daniel Adrian · moreDomain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view
Mangalagiri, Jayalakshmi · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Mantel, Heiko · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Massini, Annalisa · moreEfficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view
Mathias, Gerald · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Matsui, Tomokazu · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Mehta, Kshitij · moreDYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view
Mei, Huiyao · moreEfficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view
Mello Schnorr, Lucas · moreExploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view
Merzky, Andre · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Meyer, Bruno · moreWarp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view
Miandji, Ehsan · moreGPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view
Mishin, Nikita · moreEfficient Parallel Algorithms for String Comparison · pdf, mp4 · view
Miyaji, Atsushi · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Moretó, Miquel · moregem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view
Morris, Nathaniel · moreCache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view
Mukunoki, Daichi · moreAccurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view
Müller, André · moreMetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

N
Navarro Muñoz, Antoni · moreCombining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view
Nguyen, Phuong · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Nobre, Ricardo · moreFourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view
Norouzi, Mohammad · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Nunan Zola, Wagner M. · moreWarp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view

O
Ogita, Takeshi · moreAccurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view
Ohtsuki, Kazuhiro · moreNew Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view
Ouyang, Shuo · moreCD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view
Ozaki, Katsuhisa · moreAccurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view
Ozkaya, M. Yusuf · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

P
Pacaud, François · moreDomain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view
Paluri, Pavan Kumar · moreA Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view
Park, EunJung · moreTowards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view
Partin, Alexander · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Patel, Atmn · moreA Virtual GPU as Developer-Friendly OpenMP Offload Target · view
Peng, Zhouxuan · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Perotin, Lucas · moreMulti-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view
Perovic, Vasilije · moreTowards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view
Perumalla, Kalyan · moreDesign Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view
Pionteck, Thilo · moreCuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view
Plano, Tom · moreEnabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view
Pogorelov, Konstantin · moreExplaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view
Ponomarev, Dmitry · moreGVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view
Posner, Jonas · moreTransparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view
Pottier, Loïc · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Pourjafarian, Monireh · moreArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view
Pozo, Aurora · moreWarp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view
Prema Soundararajan, Prema · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Q
Qi, Jingyuan · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Qi, Qiang · moreProphet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
Qian, Depei · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Qian, Kun · moreReceiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view
Qiao, Linbo · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Qiu, Kun · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Quan, Zhe · moreXHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view

R
Rabbi, Fazlay · moreAn Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view
Raghavan, Padma · moreMulti-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view
Ramanathan, Arvind · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Ramtin, Amir Reza · moreSelf-Stabilization with Selfish Agents · pdf, mp4 · view
Raugas, Mark · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Ren, Fengyuan · moreReceiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view
Ren, Runtian · moreGeneralized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view
Revell, Alistair · moreCNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view
Rodolà, Emanuele · moreEfficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view
Romero-Gainza, Eduardo · moreCache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

S
Saeed, Fahad · moreTurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view
Saleh, Hisham · moreTowards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view
San Miguel, Joshua · moreGhostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view
Santander-Jiménez, Sergio · moreFourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view
Saule, Erik · morePostmortem Graph Analysis on the Temporal Graph · pdf, pdf · view
Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view
Schafer, Derek · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Schanen, Michel · moreDomain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view
Schildermans, Stijn · moreParatick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view
Schmidt, Bertil · moreMetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view
Scott, Michael L. · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
SEN, TANMOY · moreContext-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view
Serhani, Mohamed Adel · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Shah, Ashka · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Shan, Jianchen · moreParatick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view
Shan, Tianyi · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Shang, Ruitao · moreProphet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
Shen, Haiying · moreContext-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view
Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view
Shen, Yijie · moreUsing Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view
Shi, Yang · moresRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view
Shivadekar, Samit · moreAn Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view
Singhal, Swati · moreDYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view
Skjellum, Anthony · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Song, Shuaiwen Leon · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Sousa, Leonel · moreFourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view
Stef, Graillat · moreXHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view
Stern, Abraham · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Stevens, Rick · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Stewart, Christopher · moreCache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view
Strzodka, Robert · moreTridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view
Sun, Ding · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Sun, Hongyang · moreMulti-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view
Sun, Hui · moreBoosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view
Sun, Min-Te · moreAutomated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view
Sun, Qingxiao · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Sussman, Alan · moreDYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view
Swartvagher, Philippe · moreInterferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view

T
Tan, Li · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Tang, Ruiqi · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Tang, Xiongchao · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Tang, Xueyan · moreGeneralized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view
Tang, Yuan · moreProcessor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view
Taufer, Michela · moreAssessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view
Tian, Shilei · moreA Virtual GPU as Developer-Friendly OpenMP Offload Target · view
Timmerman, David · moreGVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view
Tiskin, Alexander · moreEfficient Parallel Algorithms for String Comparison · pdf, mp4 · view
Titov, Mikhail · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Tong, Wei · moreCERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Towsley, Don · moreSelf-Stabilization with Selfish Agents · pdf, mp4 · view
Trahay, Francois · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Trenev, Dimitar · moreWave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view
Trifan, Anda · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Tsaris, Aristeidis · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Turilli, Matteo · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

U
Ueno, Hideto · moreTowards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view
Unger, Jonas · moreGPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view

V
Valpey, Benjamin · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
Van Dam, Huub · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Vandierendonck, Hans · moreExploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view
Vetter, Jeff · moreEvaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view

W
Wada, Koichi · moreEfficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view
Wada, Tomotaka · moreNew Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view
Wahib, Mohamed · moreIntra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view
Wan, Shunzhou · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Wang, Cho-Li · moreAccelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view
Wang, En · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Wang, Fang · moreASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view
Wang, Haojie · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Wang, Haoyu · moreMulti-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view
Wang, Howard · moreAccelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Wang, Jianda · moreEnabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view
Wang, Jiashu · moreAdapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view
Wang, Junsong · moreExploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view
Wang, Kai-Ting Amy · moreAdapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view
Wang, Kailun · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Wang, Qiang · moreBoosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view
Wang, Wei · moreFast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view
Wang, Wenwen · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Wang, Wenxu · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Wang, Xiang · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Wang, Xiaolin · moreAn Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view
Wang, Yi · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Wang, Yida · moreLoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view
Wang, Yuchen · moreEfficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view
Wang, Zhenlin · moreEfficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view
Wang, Zihe · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Wei, Xueliang · moreCrash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view
Weissenberger, Jack · moreAccelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view
Wen, Haosen · moreA Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view
Wen, Mei · moresRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view
Wen, Zeyi · moreFastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view
Wifling, David · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Williams, Barry · moreGVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view
Wolf, Felix · moreTool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view
Wolf, Matthew · moreDYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view
Worley, Andrew · moreDesign of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view
Wu, Chase Q. · moreNoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view
Wu, Chentao · moreA Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view
Wu, Hanpei · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view
Wu, Heng · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Wu, Jie · moreJoint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view
Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Wu, Panruo · moreA Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view
Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view
Wu, Weijie · moreProgressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view
Wu, Yadong · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Wu, Yuewen · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Wu, Zhongjie · moreCoupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view

X
Xiao, Renzhi · moreA Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view
XIE, CHENHAO · moreFast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view
Xie, Tao · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view
Xiong, Jin · moreUsing Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view
Xiong, Yufei · moreCERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Xu, ChengZhong · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Xu, Fei · moreProphet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
Xu, Liangliang · moreFast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view
Xu, Ming · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Xu, Nuo · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Xu, Yang · moreOptimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view
Xu, Yemao · moreCD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view
Xu, Yinlong · moreFast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view
Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view
Xu, Yuanjia · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Y
Yang, Canqun · moreCNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view
Yang, Dongxu · moreOptimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view
Yang, Hailong · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Yang, Junyao · moreEfficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view
Yang, Qing · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Yang, Qiusong · moreMatryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view
Yang, Wenxiang · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Yang, Wuu · moreHyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view
Yang, Yang · moreSPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view
Yang, Yongjian · moreDistributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view
Yao, Lulu · moreProgressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view
Yao, Yiping · moreA Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view
Yasumoto, Keiichi · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Ye, Qianwen · moreNoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view
Ye, Xiangyu · moreHippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view
Ye, ZiChun · moreAdapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view
Yelick, Katherine · moreScaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view
Yew, Pen-Chung · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Yi, Zhengming · moreA Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view
Yin, Junqi · moreIMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view
Yin, Shu · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view
Yin, Yanlong · moreA Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view
You, Xin · moreAutomatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view
Yu, Bowen · moreSparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view
Yu, Enda · moreCD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view
Yu, Huashan · moreAn Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view
Yu, Jie · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Yu, Jinyu · moreCERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Yu, Weikuan · moreROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view
Yu, Ya · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Yue, Yinliang · moreBoosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Z
Zeng, Hui · moreFedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view
Zhang, Eddy Z. · moreBGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view
Zhang, Jie · moreAutomated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view
Zhang, Jin · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Zhang, Luoping · moreAccelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view
Zhang, Mingzhe · moreBitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view
Zhang, Shaoshuai · moreRecursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view
Zhang, Shulai · moreDubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Zhang, Wenbo · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Zhang, Xiaofan · moreExploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view
Zhang, Xiaorong · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Zhang, Yiran · moreReceiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view
Zhang, Youhui · moreRegu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view
Zhang, Zhenwei · moreProphet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
Zhang, Zhicheng · moreComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view
Zhang, Zhihua · moreAnalysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view
Zhao, Yuhong · moreBoosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view
Zhao, Ziyi · moreAscetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view
Zheng, Kevin · moreMulti-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view
Zheng, Wenli · moreFIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view
Zhong, Hua · moreBest VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view
Zhou, Bing Bing · moreEfficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view
Zhou, Hai · moreMulti-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view
Zhou, Longfang · morePREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view
Zhou, Tongqing · moreFedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view
Zhou, Xiaohu · moreFMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view
Zhou, Yang · moreASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view
Zhu, Junhao · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Zhu, Lin · moreCommunication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view
Zhu, Wenjun · moreTeddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view
Zhu, Yifeng · moreParallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view
Zou, Xiaomin · moreHDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view
Zou, Yanliang · moreADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Created 2021-8-8 18:53