A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z
| Aerts, Kris · more Kris Aerts (KU Leuven) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
| Afibuzzaman, Md · more Md Afibuzzaman (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Aktulga, Hasan Metin · more Hasan Metin Aktulga (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Al Saadi, Aymen · more Aymen Al Saadi (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Alam, Maksudul · more Maksudul Alam (Oak Ridge National Laboratory) | Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view |
| Alfe, Dario · more Dario Alfe (University College London) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Alperen, Abdullah · more Abdullah Alperen (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Anitescu, Mihai · more Mihai Anitescu (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
| Armejach, Adrià · more Adrià Armejach (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
| Arrigoni, Viviana · more Viviana Arrigoni (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
| Artiles, Oswaldo · more Oswaldo Artiles (Florida International University) | TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view |
| Ayguadé Parra, Eduard · more Eduard Ayguadé Parra (Universitat Politècnica de Catalunya, Barcelona Supercomputing Center (BSC-CNS)) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
| Babuji, Yadu · more Yadu Babuji (University of Chicago, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Bai, Yang · more Yang Bai (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| Ballard, Grey · more Grey Ballard (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
| Bangalore, Purushotham · more Purushotham Bangalore (University of Alabama at Birmingham) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Baravdish, Gabriel George · more Gabriel George Baravdish (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
| Barker, Kevin · more Kevin Barker (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Beltran Querol, Vicenç · more Vicenç Beltran Querol (Barcelona Supercomputing Center Barcelona Supercomputing Center (BSC-CNS)) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
| Berezun, Daniil · more Daniil Berezun (Saint Petersburg State University, JetBrains Research) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
| Bernholdt, David · more David Bernholdt (Oak Ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
| Bhati, Agastya · more Agastya Bhati (UCL) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Bischof, Christian · more Christian Bischof (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Biswas, Swarnendu · more Swarnendu Biswas (Indian Institute of Technology Kanpur) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
| Blaiszik, Benjamin · more Benjamin Blaiszik (University of Chicago, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Brace, Alexander · more Alexander Brace (Argonne) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Brettin, Thomas · more Thomas Brettin (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Buhler, Jeremy · more Jeremy Buhler (Washington University) | Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view |
| Buluc, Aydin · more Aydin Buluc (Lawrence Berkeley National Lab, The University of California at Berkeley) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
| Cai, Lei · more Lei Cai (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Cai, Wei · more Wei Cai (School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen; Shenzhen Institute of Artificial Intelligence and Robotics for Society) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
| Cai, Wentao · more Wentao Cai (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| Cai, Zhigang · more Zhigang Cai (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Cai, Zhiping · more Zhiping Cai (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
| Caíno-Lores, Silvina · more Silvina Caíno-Lores (University of Tennessee at Knoxville) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Cao, Guohua · more Guohua Cao (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Cao, Huanqi · more Huanqi Cao (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Cao, Qiang · more Qiang Cao (Huazhong University of Science and Technology) | SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view |
| Cartier, Hannah · more Hannah Cartier (Rhodes College) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
| Catalyurek, Umit · more Umit Catalyurek (Georgia Institute of Technology) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Chang, Da-Wei · more Da-Wei Chang (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
| Chang, Harry · more Harry Chang (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Chang, Liang · more Liang Chang (University of Electronic Science and Technology of China) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Chapman, Barbara · more Barbara Chapman (Stony Brook University) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
| Chapman, David · more David Chapman (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Chard, Kyle · more Kyle Chard (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Chard, Ryan · more Ryan Chard (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Chen, Bangduo · more Bangduo Chen (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Chen, Hanhua · more Hanhua Chen (Huazhong University of Science and Technology) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
| Chen, Jianxi · more Jianxi Chen (Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Chen, Jieyang · more Jieyang Chen (Oak Ridge National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Chen, Kai · more Kai Chen (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
| Chen, Li · more Li Chen (University of Louisiana at Lafayette) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
| Chen, Quan · more Quan Chen (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Chen, Si · more Si Chen (West Chester University, PA, USA) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
| Chen, Wei · more Wei Chen (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Chen, Wenguang · more Wenguang Chen (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Chen, Yanhao · more Yanhao Chen (Rutgers University) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
| Chen, Yingwen · more Yingwen Chen (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
| Chen, YuAng · more YuAng Chen (Chinese University of Hong Kong, Shenzhen) | HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view |
| Chen, Zhiguang · more Zhiguang Chen (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
| Cheng, Albert Mo Kim · more Albert Mo Kim Cheng (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
| Cheng, Liangfeng · more Liangfeng Cheng (Huazhong University of Science and Technology) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
| Chesterfield, Jon · more Jon Chesterfield (AMD) | Shared Memory Remote Procedure Calls · view |
| Chiu, Kenneth · more Kenneth Chiu (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
| Choi, Hyuckjin · more Hyuckjin Choi (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Choi, Jong · more Jong Choi (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
| Chung, Yeh-ching · more Yeh-ching Chung (Chinese University of Hong Kong, Shenzhen) | HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view |
| Ci, Yiwei · more Yiwei Ci (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
| Clyde, Austin · more Austin Clyde (Argonne, Univ. of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Codognet, Philippe · more Philippe Codognet (Sorbonne University / CNRS, University of Tokyo) | Constraint Solving by Quantum Annealing · pdf, mp4 · view |
| Cornelius, Melanie · more Melanie Cornelius (Illinois Institute of Technology) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view |
| Coveney, Peter · more Peter Coveney (UCL) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Dai, Guangli · more Guangli Dai (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
| Deelman, Ewa · more Ewa Deelman (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Deng, Haiwei · more Haiwei Deng (Department of Computer Science and Engineering, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
| Deng, Tongliang · more Tongliang Deng (SenseTime Research, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
| Deng, Xun · more Xun Deng (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
| Denis, Alexandre · more Alexandre Denis (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
| Deodhar, Akshay · more Akshay Deodhar (College of Engineering, Pune) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
| Dewald, Florian · more Florian Dewald (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Dhandhania, Sunidhi · more Sunidhi Dhandhania (Indian Institute of Technology Kanpur) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
| Dietz, Henry · more Henry Dietz (University of Kentucky) | Tangled: A Conventional Processor Integrating A Quantum-Inspired Coprocessor · pdf, mp4 · view |
| Dinan, James · more James Dinan (NVIDIA) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
| Ding, Xiaoning · more Xiaoning Ding (New Jersey Institute of Technology) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
| Do, Tu Mai Anh · more Tu Mai Anh Do (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Doerfert, Johannes · more Johannes Doerfert (Argonne National Laboratory) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view A Virtual GPU as Developer-Friendly OpenMP Offload Target · view Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
| Dong, Dezun · more Dezun Dong (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
| Dong, Pengmin · more Pengmin Dong (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Dosanjh, Matthew · more Matthew Dosanjh (Sandia National Laboratories) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Du, Jingwen · more Jingwen Du (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view |
| Du, Mingzhe · more Mingzhe Du (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| Duan, Yubin · more Yubin Duan (Temple University) | Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view |
| Eker, Ali · more Ali Eker (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
| Ellis, Marquita · more Marquita Ellis (The University of California at Berkeley, Lawrence Berkeley National Lab) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
| Elwasif, Wael · more Wael Elwasif (Oak ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
| Enright Jerger, Natalie · more Natalie Enright Jerger (University of Toronto) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
| F. Lorenzon, Arthur · more Arthur F. Lorenzon (Federal University of Pampa) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
| Fan, Sijiang · more Sijiang Fan (National University of Defense Technology, University of Manchester) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
| Fang, Liang · more Liang Fang (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Fang, Qiming · more Qiming Fang (Wake Forest University) | Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
| Fathi, Arash · more Arash Fathi (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Fei, Jiawei · more Jiawei Fei (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
| Fei, Xiang · more Xiang Fei (Tsinghua University) | Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view |
| Feng, Dan · more Dan Feng (huazhong university of science and technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
| Feng, Ke · more Ke Feng (School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
| Feng, Wu · more Wu Feng (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Feng, Xiaobing · more Xiaobing Feng (Institute of Computing Technology, Chinese Academy of Sciences) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
| Feng, Zonghao · more Zonghao Feng (Hong Kong University of Science and Technology) | Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view |
| Ferreira da Silva, Rafael · more Rafael Ferreira da Silva (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Firoz, Jesun · more Jesun Firoz (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Fohry, Claudia · more Claudia Fohry (University of Kassel) | Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view |
| Foster, Ian · more Ian Foster (Argonne National Laboratory, University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Fu, Qiang · more Qiang Fu (George Washington University) | Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view |
| Fu, Song · more Song Fu (University of North Texas) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
| Fujimoto, Manato · more Manato Fujimoto (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Fujimoto, Noriyuki · more Noriyuki Fujimoto (Osaka Prefecture University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
| Gao, Jiechao · more Jiechao Gao (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
| Gao, Liang · more Liang Gao (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
| Gao, Weiguo · more Weiguo Gao (Fudan University) | Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view |
| Gavirangaswamy, Vinay · more Vinay Gavirangaswamy (Western Michigan University, Cray- A Hewlett Packard Enterprise Company) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
| Georgakoudis, Giorgis · more Giorgis Georgakoudis (Lawrence Livermore National Laboratory) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
| Gerofi, Balazs · more Balazs Gerofi (RIKEN Center for Computational Science) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Gerstlauer, Andreas · more Andreas Gerstlauer (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Ghafoor, Sheikh · more Sheikh Ghafoor (Tennessee Technological University) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Ghanim, Fady · more Fady Ghanim (Oak Ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
| Gibbs, Thomas · more Thomas Gibbs (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Gite, Rahul · more Rahul Gite (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Goel, Garvit · more Garvit Goel (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Gondhalekar, Atharva · more Atharva Gondhalekar (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Gong, Xiaoli · more Xiaoli Gong (NNankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Gourounas, Dimitrios · more Dimitrios Gourounas (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Govil, Karan · more Karan Govil (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Grant, Ryan · more Ryan Grant (Sandia National Laboratories) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Groppe, Sven · more Sven Groppe (Universität zu Lübeck) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
| Groth, Tobias · more Tobias Groth (Universität zu Lübeck) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
| Guo, Minyi · more Minyi Guo (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Guo, Xiao-Wei · more Xiao-Wei Guo (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
| Guo, Yeting · more Yeting Guo (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
| Guo, Zehua · more Zehua Guo (Beijing Institute of Technology) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Gupta, Ajay · more Ajay Gupta (Western Michigan University) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
| Hale, Kyle · more Kyle Hale (Illinois Institute of Technology) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
| Halem, Milton · more Milton Halem (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Han, Yongguo · more Yongguo Han (Southwest University of Science and Technology) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Hanindhito, Bagus · more Bagus Hanindhito (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| He, Heng · more Heng He (School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
| He, Shuibing · more Shuibing He (Zhejiang University, Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| Hong, Yang · more Yang Hong (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Hossain, Md Maruf · more Md Maruf Hossain (University of North Carolina at Charlotte) | Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view |
| Hsu, Wei-Chung · more Wei-Chung Hsu (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
| Hu, Jing · more Jing Hu (Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Hu, Yang · more Yang Hu (University of Texas at Dallas) | Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view |
| Hu, Yi · more Yi Hu (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Hu, Yongmin · more Yongmin Hu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Hu, Yuchong · more Yuchong Hu (Huazhong University of Science and Technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
| Hua, Fei · more Fei Hua (Rutgers Unversity) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
| Hua, Qiang-Sheng · more Qiangsheng Hua (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
| Huang, Chenglong · more Chenglong Huang (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Huang, Chung-Wen · more Chung-Wen Huang (MediaTek Inc) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Huang, Dan · more Dan Huang (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
| Huang, H. Howie · more H. Howie Huang (George Washington University) | Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view |
| Huang, Jiawen · more Jiawen Huang (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Huang, Kaixin · more Kaixin Huang (ByteDance Inc., Shanghai Jiao Tong University) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Huang, Min · more Min Huang (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Huang, Tao · more Tao Huang (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Huang, Yizhi · more Yizhi Huang (Hunan University, Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| Huber, Joseph · more Joseph Huber (Oak Ridge National Laboratory) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view |
| Hundt, Christian · more Christian Hundt (NVIDIA AI Technology Center Luxembourg) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
| Hung, Ming-Yu · more Ming-Yu Hung (MediaTek Inc) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Ikeda, Takuya · more Takuya Ikeda (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
| Ilic, Aleksandar · more Aleksandar Ilic (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
| Imamura, Toshiyuki · more Toshiyuki Imamura (RIKEN Center for Computational Science) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
| Jahic, Jasmin · more Jasmin Jahic (University of Cambridge) | ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view |
| Jarachanthan, Jananie · more Jananie Jarachanthan (University of Louisiana at Lafayette) | AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view |
| Jayatilaka, Tarindu · more Tarindu Jayatilaka (University of Moratuwa) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
| Jeannot, Emmanuel · more Emmanuel Jeannot (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
| Jenkins, Louis · more Louis Jenkins (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| Jha, Shantenu · more Shantenu Jha (Brookhaven National Laboratory, Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Ji, Zhuoran · more Zhuoran Ji (The University of Hong Kong) | Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view |
| Jia, Ranhao · more Ranhao Jia (Department of Computer Science and Engineering, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
| Jia, Zhen · more Zhen Jia (Amazon) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
| Jiang, Dejun · more Dejun Jiang (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
| Jiang, Hao · more Hao Jiang (National University of Defense Technology) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
| Jiang, Shizhi · more Shizhi Jiang (University of Chinese Academy of Sciences; Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
| Jin, Hai · more Hai Jin (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
| Jin, Yuwei · more Yuwei Jin (Rutgers Unversity) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
| Jin, Zheming · more Zheming Jin (ORNL) | Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view |
| John, Lizy · more Lizy John (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Jünger, Daniel · more Daniel Jünger (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
| Kanayama, Yuta · more Yuta Kanayama (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
| Kao, Henry · more Henry Kao (University of Toronto) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
| Ke, Zhaokang · more Zhaokang Ke (Huazhong University of Science and Technology) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
| Ke, Zong-Ming · more Zong-Ming Ke (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
| Keipert, Kristopher · more Kristopher Keipert (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Khan, Md Muhib · more Md Muhib Khan (Florida State University) | ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view |
| Kilpatrick, Peter · more Peter Kilpatrick (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
| Klein, Christoph · more Christoph Klein (University of Heidelberg, ZITI) | Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view |
| Kobus, Robin · more Robin Kobus (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
| Koohi Esfahani, Mohsen · more Mohsen Koohi Esfahani (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
| Koppehel, Martin · more Martin Koppehel (Otto-von-Guericke Universität Magdeburg) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
| Kozakai, Seiya · more Seiya Kozakai (Hosei University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
| Kranzlmüller, Dieter · more Dieter Kranzlmüller (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Kruse, Michael · more Michael Kruse (Argonne National Laboratory) | Loop Transformations using Clang's Abstract Syntax Tree · view |
| Kurth, Thorsten · more Thorsten Kurth (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Lai, Junjie · more Junjie Lai (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
| Lai, Jyun-Kai · more Jyun-Kai Lai (National Yang Ming Chiao Tung University) | Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view |
| Lai, Wei-Chih · more Wei-Chih Lai (MediaTek Inc) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Lai, Zhiquan · more Zhiquan Lai (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Lan, Hao · more Hao Lan (University of Toronto) | Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
| Langguth, Johannes · more Johannes Langguth (Simula Research Laboratory) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
| Larkins, D. Brian · more D. Brian Larkins (Rhodes College) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
| Leandro Nesi, Lucas · more Lucas Leandro Nesi (Institute of Informatics, Federal University of Rio Grande do Sul) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
| Lee, Chao-Lin · more Chao-Lin Lee (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Lee, Hyungro · more Hyungro Lee (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Lee, Jenq-Kuen · more Jenq-Kuen Lee (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Legrand, Arnaud · more Arnaud Legrand (University Grenoble Alpes, CNRS) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
| Lehr, Jan-Patrick · more Jan-Patrick Lehr (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Lei, Mengya · more Mengya Lei (huazhong university of science and technology) | Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
| Leng, Jingwen · more Jingwen Leng (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Li, Ang · more Ang Li (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Li, Angela · more Angela Li (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
| Li, Baochun · more Baochun Li (University of Toronto) | Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
| Li, Baoqian · more Baoqian Li (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Li, Bo · more Bo Li (Hong Kong University of Science and Technology) | AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view |
| Li, Chuanying · more Chuanying Li (Hunan University) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
| Li, Dawei · more Dawei Li (Montclair State University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Li, Dongsheng · more Dongsheng Li (Sun Yat-sen University) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Li, Fan · more Fan Li (huazhong university of science and technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
| Li, Guangli · more Guangli Li (Institute of Computing Technology, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
| Li, Hongyan · more Hongyan Li (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Li, Jiajia · more Jiajia Li (Pacific Northwest National Laboratory, William&Mary) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Li, Jiawei · more Jiawei Li (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
| Li, Jun · more Jun Li (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Li, Li · more Li Li (ShenZhen Institutes of Advanced Technology, Chinese Academy of Sciences) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
| Li, Mingshu · more Mingshu Li (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
| Li, Mingzhen · more Mingzhen Li (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Li, Minjun · more Minjun Li (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Li, Qiliang · more Qiliang Li (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
| Li, Renfa · more Renfa Li (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| Li, Ruihao · more Ruihao Li (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Li, Shengwei · more Shengwei Li (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Li, Weiguang · more Weiguang Li (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view |
| Li, Xiaowei · more Xiaowei Li (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Li, Xiaoying · more Xiaoying Li (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
| Li, Yongkun · more Yongkun Li (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
| Li, Yubo · more Yubo Li (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
| Li, Yun-Ze · more Yun-Ze Li (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
| Li, Zhuozhao · more Zhuozhao Li (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Li, Zirui · more Zirui Li (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Li, Zitong · more Zitong Li (Wake Forest University) | Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
| Liao, Hui-Hsin · more Hui-Hsin Liao (National Tsing Hua University) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
| Liao, Jianwei · more Jianwei Liao (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Liao, Pin-Wei · more Pin-Wei Liao (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
| Liao, Shih-Wei · more Shih-Wei Liao (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
| Liao, Xiangke · more Xiangke Liao (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
| Lin, Che-Chia · more Che-Chia Lin (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view |
| Lin, Tzu-Chia · more Tzu-Chia Lin (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
| Lin, Xiang · more Xiang Lin (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Lin, Yonghua · more Yonghua Lin (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
| Liu, chengyu · more chengyu Liu (Wuhan University of Science and Technology, School of Computer Science and Technology) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
| Liu, Fang · more Fang Liu (School of Design, Hunan University) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
| Liu, Hanfeng · more Hanfeng Liu (School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen; Shenzhen Institute of Artificial Intelligence and Robotics for Society) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
| Liu, Junhong · more Junhong Liu (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
| Liu, Sen · more Sen Liu (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Liu, Wenbin · more Wenbin Liu (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Liu, Wuji · more Wuji Liu (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
| Liu, Xiaoyan · more Xiaoyan Liu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Liu, Yan · more Yan Liu (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| Liu, Yi · more Yi Liu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Liu, Zhiming · more Zhiming Liu (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| López-Paradís, Guillem · more Guillem López-Paradís (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
| Lu, Hang · more Hang Lu (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Lu, Yutong · more Yutong Lu (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
| Luan, Dongming · more Dongming Luan (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Luan, Zhongzhi · more Zhongzhi Luan (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Luo, Qiong · more Qiong Luo (Hong Kong University of Science and Technology) | Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view |
| Luo, Yingwei · more Yingwei Luo (Department of Computer Science and Technology, Peking University; Peng Cheng Lab, Shenzhen) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
| Lv, Pengze · more Pengze Lv (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
| Lyu, Min · more Min Lyu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
| Ma, Heng · more Heng Ma (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Maggioli, Filippo · more Filippo Maggioli (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
| Maldonado, Daniel Adrian · more Daniel Adrian Maldonado (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
| Mangalagiri, Jayalakshmi · more Jayalakshmi Mangalagiri (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Mantel, Heiko · more Heiko Mantel (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Massini, Annalisa · more Annalisa Massini (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
| Mathias, Gerald · more Gerald Mathias (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Matsui, Tomokazu · more Tomokazu Matsui (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Mehta, Kshitij · more Kshitij Mehta (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
| Mei, Huiyao · more Huiyao Mei (Huazhong University of Science and Technology) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
| Mello Schnorr, Lucas · more Lucas Mello Schnorr (Institute of Informatics, Federal University of Rio Grande do Sul) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
| Merzky, Andre · more Andre Merzky (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Meyer, Bruno · more Bruno Meyer (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
| Miandji, Ehsan · more Ehsan Miandji (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
| Mishin, Nikita · more Nikita Mishin (Saint Petersburg State University, JetBrains Research) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
| Miyaji, Atsushi · more Atsushi Miyaji (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Moretó, Miquel · more Miquel Moretó (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
| Morris, Nathaniel · more Nathaniel Morris (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
| Mukunoki, Daichi · more Daichi Mukunoki (RIKEN Center for Computational Science) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
| Müller, André · more André Müller (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
| Navarro Muñoz, Antoni · more Antoni Navarro Muñoz (Barcelona Supercomputing Center (BSC-CNS), Universitat Politècnica de Catalunya) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
| Nguyen, Phuong · more Phuong Nguyen (UMBC, OpenKneck Inc) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Nobre, Ricardo · more Ricardo Nobre (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
| Norouzi, Mohammad · more Mohammad Norouzi (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Nunan Zola, Wagner M. · more Wagner M. Nunan Zola (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
| Ogita, Takeshi · more Takeshi Ogita (Tokyo Woman's Christian University) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
| Ohtsuki, Kazuhiro · more Kazuhiro Ohtsuki (Kobe University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
| Ouyang, Shuo · more Shuo Ouyang (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
| Ozaki, Katsuhisa · more Katsuhisa Ozaki (Shibaura Institute of Technology) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
| Ozkaya, M. Yusuf · more M. Yusuf Ozkaya (Georgia Institute of Technology) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Pacaud, François · more François Pacaud (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
| Paluri, Pavan Kumar · more Pavan Kumar Paluri (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
| Park, EunJung · more EunJung Park (Los Alamos National Laboratory) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
| Partin, Alexander · more Alexander Partin (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Patel, Atmn · more Atmn Patel (University of Waterloo) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
| Peng, Zhouxuan · more Zhouxuan Peng (Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Perotin, Lucas · more Lucas Perotin (ENS Lyon) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
| Perovic, Vasilije · more Vasilije Perovic (University of Rhode Island) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
| Perumalla, Kalyan · more Kalyan Perumalla (Oak Ridge National Laboratory) | Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view |
| Pionteck, Thilo · more Thilo Pionteck (Otto-von-Guericke Universität Magdeburg) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
| Plano, Tom · more Tom Plano (Washington University) | Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view |
| Pogorelov, Konstantin · more Konstantin Pogorelov (Simula Research Laboratory) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
| Ponomarev, Dmitry · more Dmitry Ponomarev (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
| Posner, Jonas · more Jonas Posner (University of Kassel) | Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view |
| Pottier, Loïc · more Loïc Pottier (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Pourjafarian, Monireh · more Monireh Pourjafarian (Technical University of Kaiserslautern) | ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view |
| Pozo, Aurora · more Aurora Pozo (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
| Prema Soundararajan, Prema · more Prema Prema Soundararajan (University of Alabama at Birmingham) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Qi, Jingyuan · more Jingyuan Qi (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Qi, Qiang · more Qiang Qi (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
| Qian, Depei · more Depei Qian (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Qian, Kun · more Kun Qian (Alibaba) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
| Qiao, Linbo · more Linbo Qiao (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Qiu, Kun · more Kun Qiu (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Quan, Zhe · more Zhe Quan (Hunan University) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
| Rabbi, Fazlay · more Fazlay Rabbi (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
| Raghavan, Padma · more Padma Raghavan (Vanderbilt University) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
| Ramanathan, Arvind · more Arvind Ramanathan (Argonne) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Ramtin, Amir Reza · more Amir Reza Ramtin (University of Massachusetts Amherst) | Self-Stabilization with Selfish Agents · pdf, mp4 · view |
| Raugas, Mark · more Mark Raugas (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Ren, Fengyuan · more Fengyuan Ren (Tsinghua university, Beijing National Research Center for Information Science and Technology (BNRist)) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
| Ren, Runtian · more Runtian Ren (Nanyang Technological University) | Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view |
| Revell, Alistair · more Alistair Revell (University of Manchester) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
| Rodolà, Emanuele · more Emanuele Rodolà (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
| Romero-Gainza, Eduardo · more Eduardo Romero-Gainza (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
| Saeed, Fahad · more Fahad Saeed (Florida International University) | TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view |
| Saleh, Hisham · more Hisham Saleh (Western Michigan University) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
| San Miguel, Joshua · more Joshua San Miguel (University of Wisconsin-Madison) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
| Santander-Jiménez, Sergio · more Sergio Santander-Jiménez (University of Extremadura) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
| Saule, Erik · more Erik Saule (University of North Carolina at Charlotte) | Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view |
| Schafer, Derek · more Derek Schafer (University of Tennessee at Chattanooga) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Schanen, Michel · more Michel Schanen (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
| Schildermans, Stijn · more Stijn Schildermans (KU Leuven) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
| Schmidt, Bertil · more Bertil Schmidt (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
| Scott, Michael L. · more Michael L. Scott (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| SEN, TANMOY · more TANMOY SEN (University of Virginia) | Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view |
| Serhani, Mohamed Adel · more Mohamed Adel Serhani (United Arab Emirates University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Shah, Ashka · more Ashka Shah (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Shan, Jianchen · more Jianchen Shan (Hofstra University) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
| Shan, Tianyi · more Tianyi Shan (University of California San Diego) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Shang, Ruitao · more Ruitao Shang (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
| Shen, Haiying · more Haiying Shen (University of Virginia) | Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
| Shen, Yijie · more Yijie Shen (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
| Shi, Yang · more Yang Shi (National University of Defense Technology) | sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view |
| Shivadekar, Samit · more Samit Shivadekar (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
| Singhal, Swati · more Swati Singhal (University of Maryland, College Park) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
| Skjellum, Anthony · more Anthony Skjellum (University of Tennessee, Chattanooga; SimCenter) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Song, Shuaiwen Leon · more Shuaiwen Leon Song (University of Sydney) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Sousa, Leonel · more Leonel Sousa (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
| Stef, Graillat · more Graillat Stef (Sorbonne Université) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
| Stern, Abraham · more Abraham Stern (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Stevens, Rick · more Rick Stevens (Argonne National Laboratory, University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Stewart, Christopher · more Christopher Stewart (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
| Strzodka, Robert · more Robert Strzodka (University of Heidelberg, ZITI) | Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view |
| Sun, Ding · more Ding Sun (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Sun, Hongyang · more Hongyang Sun (Vanderbilt University) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
| Sun, Hui · more Hui Sun (Anhui Universtiy) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
| Sun, Min-Te · more Min-Te Sun (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
| Sun, Qingxiao · more Qingxiao Sun (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Sussman, Alan · more Alan Sussman (University of Maryland, College Park) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
| Swartvagher, Philippe · more Philippe Swartvagher (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
| Tan, Li · more Li Tan (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Tang, Ruiqi · more Ruiqi Tang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Tang, Xiongchao · more Xiongchao Tang (Sangfor Technologies Inc. and Tsinghua Shenzhen International Graduate School) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Tang, Xueyan · more Xueyan Tang (Nanyang Technological University) | Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view |
| Tang, Yuan · more Yuan Tang (Fudan University) | Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view |
| Taufer, Michela · more Michela Taufer (University of Tennessee at Knoxville) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
| Tian, Shilei · more Shilei Tian (Stony Brook University) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
| Timmerman, David · more David Timmerman (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
| Tiskin, Alexander · more Alexander Tiskin (Saint Petersburg State University) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
| Titov, Mikhail · more Mikhail Titov (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Tong, Wei · more Wei Tong (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
| Towsley, Don · more Don Towsley (University of Massachusetts Amherst) | Self-Stabilization with Selfish Agents · pdf, mp4 · view |
| Trahay, Francois · more Francois Trahay (Telecom SudParis) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Trenev, Dimitar · more Dimitar Trenev (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
| Trifan, Anda · more Anda Trifan (University of Illinois at Urbana Champaign, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Tsaris, Aristeidis · more Aristeidis Tsaris (Oak Ridge National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Turilli, Matteo · more Matteo Turilli (Rutgers University, Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Ueno, Hideto · more Hideto Ueno (University of Tokyo) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
| Unger, Jonas · more Jonas Unger (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
| Valpey, Benjamin · more Benjamin Valpey (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| Van Dam, Huub · more Huub Van Dam (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Vandierendonck, Hans · more Hans Vandierendonck (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
| Vetter, Jeff · more Jeff Vetter (ORNL) | Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view |
| Wada, Koichi · more Koichi Wada (Hosei University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
| Wada, Tomotaka · more Tomotaka Wada (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
| Wahib, Mohamed · more Mohamed Wahib (National Institute of Advanced Industrial Science and Technology, RIKEN Center for Computational Science) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
| Wan, Shunzhou · more Shunzhou Wan (University College London) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Wang, Cho-Li · more Cho-Li Wang (The University of Hong Kong) | Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view |
| Wang, En · more En Wang (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Wang, Fang · more Fang Wang (huazhong university of science and technology) | ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view |
| Wang, Haojie · more Haojie Wang (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Wang, Haoyu · more Haoyu Wang (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
| Wang, Howard · more Howard Wang (MediaTek Inc.) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view |
| Wang, Jianda · more Jianda Wang (University of Texas at Dallas) | Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view |
| Wang, Jiashu · more Jiashu Wang (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
| Wang, Junsong · more Junsong Wang (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
| Wang, Kai-Ting Amy · more Kai-Ting Amy Wang (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
| Wang, Kailun · more Kailun Wang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Wang, Qiang · more Qiang Wang (Anhui University) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
| Wang, Wei · more Wei Wang (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
| Wang, Wenwen · more Wenwen Wang (University of Georgia) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Wang, Wenxu · more Wenxu Wang (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Wang, Xiang · more Xiang Wang (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Wang, Xiaolin · more Xiaolin Wang (Department of Computer Science and Technology, Peking University; Peng Cheng Lab, Shenzhen) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
| Wang, Yi · more Yi Wang (Peng Cheng Laboratory) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Wang, Yida · more Yida Wang (Amazon) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
| Wang, Yuchen · more Yuchen Wang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
| Wang, Zhenlin · more Zhenlin Wang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
| Wang, Zihe · more Zihe Wang (Renmin University of China) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Wei, Xueliang · more Xueliang Wei (huazhong university of science and technology) | Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
| Weissenberger, Jack · more Jack Weissenberger (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view |
| Wen, Haosen · more Haosen Wen (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
| Wen, Mei · more Mei Wen (National University of Defense Technology) | sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view |
| Wen, Zeyi · more Zeyi Wen (Department of Computer Science and Software Engineering, The University of Western Australia) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
| Wifling, David · more David Wifling (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Williams, Barry · more Barry Williams (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
| Wolf, Felix · more Felix Wolf (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
| Wolf, Matthew · more Matthew Wolf (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
| Worley, Andrew · more Andrew Worley (Tennessee Technological University) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
| Wu, Chase Q. · more Chase Q. Wu (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
| Wu, Chentao · more Chentao Wu (Department of Computer Science and Engineering, Shanghai Jiao Tong University; Sichuan Research Institute, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
| Wu, Hanpei · more Hanpei Wu (SIST, ShanghaiTech University, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
| Wu, Heng · more Heng Wu (Institute of Software, Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Wu, Jie · more Jie Wu (Temple University) | Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Wu, Panruo · more Panruo Wu (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view |
| Wu, Weijie · more Weijie Wu (Independent Researcher) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
| Wu, Yadong · more Yadong Wu (Sichuan University of Science and Engineering) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Wu, Yuewen · more Yuewen Wu (Institute of Software, Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Wu, Zhongjie · more Zhongjie Wu (Alibaba) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
| Xiao, Renzhi · more Renzhi Xiao (Huazhong University of Science and Technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view |
| XIE, CHENHAO · more CHENHAO XIE (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
| Xie, Tao · more Tao Xie (San Diego State University, CA, USA) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
| Xiong, Jin · more Jin Xiong (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
| Xiong, Yufei · more Yufei Xiong (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
| Xu, ChengZhong · more ChengZhong Xu (University of Macau) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
| Xu, Fei · more Fei Xu (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
| Xu, Liangliang · more Liangliang Xu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
| Xu, Ming · more Ming Xu (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
| Xu, Nuo · more Nuo Xu (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Xu, Yang · more Yang Xu (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
| Xu, Yemao · more Yemao Xu (National University of Defense Technology, Information and Communication Engineering Design Institute) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
| Xu, Yinlong · more Yinlong Xu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
| Xu, Yuanjia · more Yuanjia Xu (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Yang, Canqun · more Canqun Yang (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
| Yang, Dongxu · more Dongxu Yang (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
| Yang, Hailong · more Hailong Yang (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Yang, Junyao · more Junyao Yang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
| Yang, Qing · more Qing Yang (Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Yang, Qiusong · more Qiusong Yang (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
| Yang, Wenxiang · more Wenxiang Yang (College of Computer, National University of Defense Technology; Computational Aerodynamics Institute, China Aerodynamics Research and Development Center) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Yang, Wuu · more Wuu Yang (National Yang Ming Chiao Tung University) | Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view |
| Yang, Yang · more Yang Yang (Huazhong University of Science and Technology) | SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view |
| Yang, Yongjian · more Yongjian Yang (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
| Yao, Lulu · more Lulu Yao (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
| Yao, Yiping · more Yiping Yao (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
| Yasumoto, Keiichi · more Keiichi Yasumoto (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Ye, Qianwen · more Qianwen Ye (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
| Ye, Xiangyu · more Xiangyu Ye (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
| Ye, ZiChun · more ZiChun Ye (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
| Yelick, Katherine · more Katherine Yelick (The University of California at Berkeley, Lawrence Berkeley National Lab) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
| Yew, Pen-Chung · more Pen-Chung Yew (University of Minnesota at Twin Cities) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Yi, Zhengming · more Zhengming Yi (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
| Yin, Junqi · more Junqi Yin (Oak Ridge National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
| Yin, Shu · more Shu Yin (SIST, ShanghaiTech University, China; State Key Lab of High Performance Computing) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
| Yin, Yanlong · more Yanlong Yin (Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
| You, Xin · more Xin You (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
| Yu, Bowen · more Bowen Yu (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
| Yu, Enda · more Enda Yu (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
| Yu, Huashan · more Huashan Yu (Department of Computer Science and Technology, Peking University) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
| Yu, Jie · more Jie Yu (State Key Laboratory of Aerodynamics; Computational Aerodynamics Institute, China Aerodynamics Research and Development Center) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Yu, Jinyu · more Jinyu Yu (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
| Yu, Weikuan · more Weikuan Yu (Florida State University) | ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view |
| Yu, Ya · more Ya Yu (Wuhan National Laboratory for Optoelectronics,Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Yue, Yinliang · more Yinliang Yue (Institute of Information Engineering,Chinese Academy of Sciences) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
| Zeng, Hui · more Hui Zeng (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
| Zhang, Eddy Z. · more Eddy Z. Zhang (Rutgers Unversity) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
| Zhang, Jie · more Jie Zhang (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
| Zhang, Jin · more Jin Zhang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Zhang, Luoping · more Luoping Zhang (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view |
| Zhang, Mingzhe · more Mingzhe Zhang (State Key Laborotary of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
| Zhang, Shaoshuai · more Shaoshuai Zhang (University of Houston) | Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view |
| Zhang, Shulai · more Shulai Zhang (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Zhang, Wenbo · more Wenbo Zhang (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Zhang, Xiaofan · more Xiaofan Zhang (Unviversity of Illinois at Urbana-Champaign) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
| Zhang, Xiaorong · more Xiaorong Zhang (South West University of Science and Technology) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Zhang, Yiran · more Yiran Zhang (Tsinghua university, Beijing National Research Center for Information Science and Technology (BNRist)) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
| Zhang, Youhui · more Youhui Zhang (Tsinghua University) | Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view |
| Zhang, Zhenwei · more Zhenwei Zhang (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
| Zhang, Zhicheng · more Zhicheng Zhang (Stanford University) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
| Zhang, Zhihua · more Zhihua Zhang (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
| Zhao, Yuhong · more Yuhong Zhao (Institute of Information Engineering,Chinese Academy of Sciences) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
| Zhao, Ziyi · more Ziyi Zhao (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
| Zheng, Kevin · more Kevin Zheng (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
| Zheng, Wenli · more Wenli Zheng (Shanghai Jiao Tong University) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
| Zhong, Hua · more Hua Zhong (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
| Zhou, Bing Bing · more Bing Bing Zhou (The University of Sydney) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
| Zhou, Hai · more Hai Zhou (Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics) | Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
| Zhou, Longfang · more Longfang Zhou (Southwest University of Science and Technology, State Key Laboratory of Aerodynamics) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
| Zhou, Tongqing · more Tongqing Zhou (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
| Zhou, Xiaohu · more Xiaohu Zhou (School of Computing, Engineering, and Built Environment,Birmingham City University) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
| Zhou, Yang · more Yang Zhou (Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics) | ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view |
| Zhu, Junhao · more Junhao Zhu (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Zhu, Lin · more Lin Zhu (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view |
| Zhu, Wenjun · more Wenjun Zhu (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
| Zhu, Yifeng · more Yifeng Zhu (University of Maine) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
| Zou, Xiaomin · more Xiaomin Zou (Huazhong University of Science and Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
| Zou, Yanliang · more Yanliang Zou (SIST, ShanghaiTech University, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |