ICPP 2021 Program

A

Aerts, Kris · more

Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view

Afibuzzaman, Md · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Aktulga, Hasan Metin · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Al Saadi, Aymen · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Alam, Maksudul · more

Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view

Alfe, Dario · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Alperen, Abdullah · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Anitescu, Mihai · more

Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view

Armejach, Adrià · more

gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view

Arrigoni, Viviana · more

Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view

Artiles, Oswaldo · more

TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view

Ayguadé Parra, Eduard · more

Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view

Return to Top

B

Babuji, Yadu · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Bai, Yang · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

Ballard, Grey · more

Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view
Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view

Bangalore, Purushotham · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Baravdish, Gabriel George · more

GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view

Barker, Kevin · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Beltran Querol, Vicenç · more

Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view

Berezun, Daniil · more

Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view

Bernholdt, David · more

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view

Bhati, Agastya · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Bischof, Christian · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Biswas, Swarnendu · more

Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view

Blaiszik, Benjamin · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Brace, Alexander · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Brettin, Thomas · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Buhler, Jeremy · more

Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view

Buluc, Aydin · more

Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view

Return to Top

C

Cai, Lei · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Cai, Wei · more

FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view

Cai, Wentao · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

Cai, Zhigang · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Cai, Zhiping · more

FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view

Caíno-Lores, Silvina · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Cao, Guohua · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Cao, Huanqi · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Cao, Qiang · more

SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view

Cartier, Hannah · more

Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view

Catalyurek, Umit · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Chang, Da-Wei · more

Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view

Chang, Harry · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Chang, Liang · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Chapman, Barbara · more

A Virtual GPU as Developer-Friendly OpenMP Offload Target · view

Chapman, David · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Chard, Kyle · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Chard, Ryan · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Chen, Bangduo · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Chen, Hanhua · more

Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view

Chen, Jianxi · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Chen, Jieyang · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Chen, Kai · more

A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view

Chen, Li · more

Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view
AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view
Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view

Chen, Quan · more

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Chen, Si · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Chen, Wei · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Chen, Wenguang · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Chen, Yanhao · more

BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view

Chen, Yingwen · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view

Chen, YuAng · more

HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view

Chen, Zhiguang · more

Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view

Cheng, Albert Mo Kim · more

A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view

Cheng, Liangfeng · more

Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view

Chesterfield, Jon · more

Shared Memory Remote Procedure Calls · view

Chiu, Kenneth · more

GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view

Choi, Hyuckjin · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Choi, Jong · more

DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view

Chung, Yeh-ching · more

HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view

Ci, Yiwei · more

Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view

Clyde, Austin · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Codognet, Philippe · more

Constraint Solving by Quantum Annealing · pdf, mp4 · view

Cornelius, Melanie · more

Advancing OpenMP Offload Debugging Capabilities in LLVM · view

Coveney, Peter · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Return to Top

D

Dai, Guangli · more

A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view

Deelman, Ewa · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Deng, Haiwei · more

A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view

Deng, Tongliang · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Deng, Xun · more

Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view

Denis, Alexandre · more

Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view

Deodhar, Akshay · more

Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view

Dewald, Florian · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Dhandhania, Sunidhi · more

Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view

Dietz, Henry · more

Tangled: A Conventional Processor Integrating A Quantum-Inspired Coprocessor · pdf, mp4 · view

Dinan, James · more

Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view

Ding, Xiaoning · more

Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view

Do, Tu Mai Anh · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Doerfert, Johannes · more

Advancing OpenMP Offload Debugging Capabilities in LLVM · view
A Virtual GPU as Developer-Friendly OpenMP Offload Target · view
Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view

Dong, Dezun · more

CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view

Dong, Pengmin · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Dosanjh, Matthew · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Du, Jingwen · more

Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view

Du, Mingzhe · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

Duan, Yubin · more

Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view

Return to Top

E

Eker, Ali · more

GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view

Ellis, Marquita · more

Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view

Elwasif, Wael · more

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view

Enright Jerger, Natalie · more

Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view

Return to Top

F

F. Lorenzon, Arthur · more

Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view

Fan, Sijiang · more

CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view

Fang, Liang · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Fang, Qiming · more

Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view

Fathi, Arash · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Fei, Jiawei · more

CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view

Fei, Xiang · more

Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view

Feng, Dan · more

A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view
Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view
Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view
ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view
Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view

Feng, Ke · more

FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view

Feng, Wu · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Feng, Xiaobing · more

LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view

Feng, Zonghao · more

Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view

Ferreira da Silva, Rafael · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Firoz, Jesun · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Fohry, Claudia · more

Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view

Foster, Ian · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Fu, Qiang · more

Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view

Fu, Song · more

Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Fujimoto, Manato · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Fujimoto, Noriyuki · more

Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view

Return to Top

G

Gao, Jiechao · more

Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view

Gao, Liang · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view

Gao, Weiguo · more

Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view

Gavirangaswamy, Vinay · more

Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view

Georgakoudis, Giorgis · more

Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view

Gerofi, Balazs · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Gerstlauer, Andreas · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Ghafoor, Sheikh · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Ghanim, Fady · more

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view

Gibbs, Thomas · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Gite, Rahul · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Goel, Garvit · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Gondhalekar, Atharva · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Gong, Xiaoli · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Gourounas, Dimitrios · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Govil, Karan · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Grant, Ryan · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Groppe, Sven · more

CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view

Groth, Tobias · more

CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view

Guo, Minyi · more

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Guo, Xiao-Wei · more

CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view

Guo, Yeting · more

FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view

Guo, Zehua · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Gupta, Ajay · more

Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view

Return to Top

H

Hale, Kyle · more

Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

Halem, Milton · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Han, Yongguo · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Hanindhito, Bagus · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

He, Heng · more

FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view

He, Shuibing · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

Hong, Yang · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Hossain, Md Maruf · more

Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view
Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view

Hsu, Wei-Chung · more

Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view

Hu, Jing · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Hu, Yang · more

Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view

Hu, Yi · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Hu, Yongmin · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Hu, Yuchong · more

A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view
Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view
Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view

Hua, Fei · more

BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view

Hua, Qiang-Sheng · more

Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view
Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view

Huang, Chenglong · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Huang, Chung-Wen · more

Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Huang, Dan · more

Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view

Huang, H. Howie · more

Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view

Huang, Jiawen · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Huang, Kaixin · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Huang, Min · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Huang, Tao · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Huang, Yizhi · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

Huber, Joseph · more

Advancing OpenMP Offload Debugging Capabilities in LLVM · view

Hundt, Christian · more

MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

Hung, Ming-Yu · more

Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Return to Top

I

Ikeda, Takuya · more

New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view

Ilic, Aleksandar · more

Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view

Imamura, Toshiyuki · more

Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view

Return to Top

J

Jahic, Jasmin · more

ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view

Jarachanthan, Jananie · more

AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view

Jayatilaka, Tarindu · more

Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view

Jeannot, Emmanuel · more

Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view

Jenkins, Louis · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

Jha, Shantenu · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Ji, Zhuoran · more

Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view

Jia, Ranhao · more

A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view

Jia, Zhen · more

LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view

Jiang, Dejun · more

Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view

Jiang, Hao · more

XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view

Jiang, Shizhi · more

Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view

Jin, Hai · more

Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view
Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view

Jin, Yuwei · more

BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view

Jin, Zheming · more

Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view

John, Lizy · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Jünger, Daniel · more

MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

Return to Top

K

Kanayama, Yuta · more

New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view

Kao, Henry · more

Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view

Ke, Zhaokang · more

Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view

Ke, Zong-Ming · more

Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view

Keipert, Kristopher · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Khan, Md Muhib · more

ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view

Kilpatrick, Peter · more

Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view

Klein, Christoph · more

Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view

Kobus, Robin · more

MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

Koohi Esfahani, Mohsen · more

Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view

Koppehel, Martin · more

CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view

Kozakai, Seiya · more

Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view

Kranzlmüller, Dieter · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Kruse, Michael · more

Loop Transformations using Clang's Abstract Syntax Tree · view

Kurth, Thorsten · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Return to Top

L

Lai, Junjie · more

Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view

Lai, Jyun-Kai · more

Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view

Lai, Wei-Chih · more

Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Lai, Zhiquan · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Lan, Hao · more

Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view

Langguth, Johannes · more

Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view

Larkins, D. Brian · more

Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view

Leandro Nesi, Lucas · more

Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view

Lee, Chao-Lin · more

Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Lee, Hyungro · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Lee, Jenq-Kuen · more

Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view
Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Legrand, Arnaud · more

Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view

Lehr, Jan-Patrick · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Lei, Mengya · more

Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view

Leng, Jingwen · more

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Li, Ang · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Li, Angela · more

Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

Li, Baochun · more

Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view

Li, Baoqian · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Li, Bo · more

AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view

Li, Chuanying · more

XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view

Li, Dawei · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Li, Dongsheng · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Li, Fan · more

Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view
Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view

Li, Guangli · more

LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view

Li, Hongyan · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Li, Jiajia · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Li, Jiawei · more

Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view

Li, Jun · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Li, Li · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view

Li, Mingshu · more

Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view

Li, Mingzhen · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Li, Minjun · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Li, Qiliang · more

Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view

Li, Renfa · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

Li, Ruihao · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Li, Shengwei · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Li, Weiguang · more

Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view

Li, Xiaowei · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Li, Xiaoying · more

Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view

Li, Yongkun · more

Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view

Li, Yubo · more

Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view

Li, Yun-Ze · more

Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view

Li, Zhuozhao · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Li, Zirui · more

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Li, Zitong · more

Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view

Liao, Hui-Hsin · more

Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view

Liao, Jianwei · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Liao, Pin-Wei · more

Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view

Liao, Shih-Wei · more

Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view

Liao, Xiangke · more

CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view

Lin, Che-Chia · more

Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view

Lin, Tzu-Chia · more

Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view

Lin, Xiang · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Lin, Yonghua · more

Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view

Liu, chengyu · more

FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view

Liu, Fang · more

FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view

Liu, Hanfeng · more

FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view

Liu, Junhong · more

Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view

Liu, Sen · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Liu, Wenbin · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Liu, Wuji · more

NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view

Liu, Xiaoyan · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Liu, Yan · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

Liu, Yi · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Liu, Zhiming · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

López-Paradís, Guillem · more

gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view

Lu, Hang · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Lu, Yutong · more

Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view

Luan, Dongming · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Luan, Zhongzhi · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Luo, Qiong · more

Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view

Luo, Yingwei · more

An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view

Lv, Pengze · more

CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view

Lyu, Min · more

Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view

Return to Top

M

Ma, Heng · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Maggioli, Filippo · more

Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view

Maldonado, Daniel Adrian · more

Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view

Mangalagiri, Jayalakshmi · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Mantel, Heiko · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Massini, Annalisa · more

Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view

Mathias, Gerald · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Matsui, Tomokazu · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Mehta, Kshitij · more

DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view

Mei, Huiyao · more

Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view

Mello Schnorr, Lucas · more

Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view

Merzky, Andre · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Meyer, Bruno · more

Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view

Miandji, Ehsan · more

GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view

Mishin, Nikita · more

Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view

Miyaji, Atsushi · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Moretó, Miquel · more

gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view

Morris, Nathaniel · more

Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

Mukunoki, Daichi · more

Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view

Müller, André · more

MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

Return to Top

N

Navarro Muñoz, Antoni · more

Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view

Nguyen, Phuong · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Nobre, Ricardo · more

Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view

Norouzi, Mohammad · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Nunan Zola, Wagner M. · more

Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view

Return to Top

O

Ogita, Takeshi · more

Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view

Ohtsuki, Kazuhiro · more

New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view

Ouyang, Shuo · more

CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view

Ozaki, Katsuhisa · more

Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view

Ozkaya, M. Yusuf · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Return to Top

P

Pacaud, François · more

Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view

Paluri, Pavan Kumar · more

A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view

Park, EunJung · more

Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view

Partin, Alexander · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Patel, Atmn · more

A Virtual GPU as Developer-Friendly OpenMP Offload Target · view

Peng, Zhouxuan · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Perotin, Lucas · more

Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view

Perovic, Vasilije · more

Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view

Perumalla, Kalyan · more

Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view

Pionteck, Thilo · more

CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view

Plano, Tom · more

Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view

Pogorelov, Konstantin · more

Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view

Ponomarev, Dmitry · more

GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view

Posner, Jonas · more

Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view

Pottier, Loïc · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Pourjafarian, Monireh · more

ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view

Pozo, Aurora · more

Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view

Prema Soundararajan, Prema · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Return to Top

Q

Qi, Jingyuan · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Qi, Qiang · more

Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view

Qian, Depei · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Qian, Kun · more

Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view

Qiao, Linbo · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Qiu, Kun · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Quan, Zhe · more

XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view

Return to Top

R

Rabbi, Fazlay · more

An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view

Raghavan, Padma · more

Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view

Ramanathan, Arvind · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Ramtin, Amir Reza · more

Self-Stabilization with Selfish Agents · pdf, mp4 · view

Raugas, Mark · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Ren, Fengyuan · more

Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view

Ren, Runtian · more

Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view

Revell, Alistair · more

CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view

Rodolà, Emanuele · more

Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view

Romero-Gainza, Eduardo · more

Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

Return to Top

S

Saeed, Fahad · more

TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view

Saleh, Hisham · more

Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view

San Miguel, Joshua · more

Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view

Santander-Jiménez, Sergio · more

Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view

Saule, Erik · more

Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view
Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view

Schafer, Derek · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Schanen, Michel · more

Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view

Schildermans, Stijn · more

Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view

Schmidt, Bertil · more

MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view

Scott, Michael L. · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

SEN, TANMOY · more

Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view

Serhani, Mohamed Adel · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Shah, Ashka · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Shan, Jianchen · more

Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view

Shan, Tianyi · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Shang, Ruitao · more

Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view

Shen, Haiying · more

Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view
Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view

Shen, Yijie · more

Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view

Shi, Yang · more

sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view

Shivadekar, Samit · more

An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view

Singhal, Swati · more

DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view

Skjellum, Anthony · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Song, Shuaiwen Leon · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Sousa, Leonel · more

Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view

Stef, Graillat · more

XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view

Stern, Abraham · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Stevens, Rick · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Stewart, Christopher · more

Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view

Strzodka, Robert · more

Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view

Sun, Ding · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Sun, Hongyang · more

Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view

Sun, Hui · more

Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Sun, Min-Te · more

Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view

Sun, Qingxiao · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Sussman, Alan · more

DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view

Swartvagher, Philippe · more

Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view

Return to Top

T

Tan, Li · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Tang, Ruiqi · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Tang, Xiongchao · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Tang, Xueyan · more

Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view

Tang, Yuan · more

Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view

Taufer, Michela · more

Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view

Tian, Shilei · more

A Virtual GPU as Developer-Friendly OpenMP Offload Target · view

Timmerman, David · more

GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view

Tiskin, Alexander · more

Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view

Titov, Mikhail · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Tong, Wei · more

CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view

Towsley, Don · more

Self-Stabilization with Selfish Agents · pdf, mp4 · view

Trahay, Francois · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Trenev, Dimitar · more

Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view

Trifan, Anda · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Tsaris, Aristeidis · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Turilli, Matteo · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Return to Top

U

Ueno, Hideto · more

Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view

Unger, Jonas · more

GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view

Return to Top

V

Valpey, Benjamin · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

Van Dam, Huub · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Vandierendonck, Hans · more

Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view

Vetter, Jeff · more

Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view

Return to Top

W

Wada, Koichi · more

Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view

Wada, Tomotaka · more

New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view

Wahib, Mohamed · more

Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view

Wan, Shunzhou · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Wang, Cho-Li · more

Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view

Wang, En · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Wang, Fang · more

ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view

Wang, Haojie · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Wang, Haoyu · more

Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view

Wang, Howard · more

Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view

Wang, Jianda · more

Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view

Wang, Jiashu · more

Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view

Wang, Junsong · more

Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view

Wang, Kai-Ting Amy · more

Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view

Wang, Kailun · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Wang, Qiang · more

Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Wang, Wei · more

Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view

Wang, Wenwen · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Wang, Wenxu · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Wang, Xiang · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Wang, Xiaolin · more

An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view

Wang, Yi · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Wang, Yida · more

LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view

Wang, Yuchen · more

Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view

Wang, Zhenlin · more

Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view

Wang, Zihe · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Wei, Xueliang · more

Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view

Weissenberger, Jack · more

Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view

Wen, Haosen · more

A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view

Wen, Mei · more

sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view

Wen, Zeyi · more

FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view

Wifling, David · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Williams, Barry · more

GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view

Wolf, Felix · more

Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view

Wolf, Matthew · more

DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view

Worley, Andrew · more

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view

Wu, Chase Q. · more

NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view

Wu, Chentao · more

A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view

Wu, Hanpei · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Wu, Heng · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Wu, Jie · more

Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view
Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Wu, Panruo · more

A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view
Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view

Wu, Weijie · more

Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view

Wu, Yadong · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Wu, Yuewen · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Wu, Zhongjie · more

Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view

Return to Top

X

Xiao, Renzhi · more

A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view

XIE, CHENHAO · more

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view

Xie, Tao · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Xiong, Jin · more

Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view

Xiong, Yufei · more

CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view

Xu, ChengZhong · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view

Xu, Fei · more

Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view

Xu, Liangliang · more

Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view

Xu, Ming · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view

Xu, Nuo · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Xu, Yang · more

Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view

Xu, Yemao · more

CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view

Xu, Yinlong · more

Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view
Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view

Xu, Yuanjia · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Return to Top

Y

Yang, Canqun · more

CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view

Yang, Dongxu · more

Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view

Yang, Hailong · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Yang, Junyao · more

Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view

Yang, Qing · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Yang, Qiusong · more

Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view

Yang, Wenxiang · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Yang, Wuu · more

Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view

Yang, Yang · more

SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view

Yang, Yongjian · more

Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view

Yao, Lulu · more

Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view

Yao, Yiping · more

A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view

Yasumoto, Keiichi · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Ye, Qianwen · more

NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view

Ye, Xiangyu · more

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view

Ye, ZiChun · more

Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view

Yelick, Katherine · more

Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view

Yew, Pen-Chung · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Yi, Zhengming · more

A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view

Yin, Junqi · more

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view

Yin, Shu · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Yin, Yanlong · more

A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view

You, Xin · more

Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view

Yu, Bowen · more

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view

Yu, Enda · more

CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view

Yu, Huashan · more

An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view

Yu, Jie · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Yu, Jinyu · more

CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view

Yu, Weikuan · more

ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view

Yu, Ya · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Yue, Yinliang · more

Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Return to Top

Z

Zeng, Hui · more

FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view

Zhang, Eddy Z. · more

BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view

Zhang, Jie · more

Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view

Zhang, Jin · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Zhang, Luoping · more

Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view

Zhang, Mingzhe · more

BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view

Zhang, Shaoshuai · more

Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view

Zhang, Shulai · more

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Zhang, Wenbo · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Zhang, Xiaofan · more

Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view

Zhang, Xiaorong · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Zhang, Yiran · more

Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view

Zhang, Youhui · more

Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view

Zhang, Zhenwei · more

Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view

Zhang, Zhicheng · more

ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view

Zhang, Zhihua · more

Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view

Zhao, Yuhong · more

Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view

Zhao, Ziyi · more

Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view

Zheng, Kevin · more

Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view

Zheng, Wenli · more

FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view
Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view

Zhong, Hua · more

Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view

Zhou, Bing Bing · more

Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view

Zhou, Hai · more

Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view

Zhou, Longfang · more

PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view

Zhou, Tongqing · more

FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view

Zhou, Xiaohu · more

FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view

Zhou, Yang · more

ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view

Zhu, Junhao · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Zhu, Lin · more

Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view

Zhu, Wenjun · more

Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view

Zhu, Yifeng · more

Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view

Zou, Xiaomin · more

HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view

Zou, Yanliang · more

ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view

Return to Top