Aerts, Kris · more Kris Aerts (KU Leuven) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
Afibuzzaman, Md · more Md Afibuzzaman (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Aktulga, Hasan Metin · more Hasan Metin Aktulga (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Al Saadi, Aymen · more Aymen Al Saadi (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Alam, Maksudul · more Maksudul Alam (Oak Ridge National Laboratory) | Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view |
Alfe, Dario · more Dario Alfe (University College London) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Alperen, Abdullah · more Abdullah Alperen (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Anitescu, Mihai · more Mihai Anitescu (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
Armejach, Adrià · more Adrià Armejach (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
Arrigoni, Viviana · more Viviana Arrigoni (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
Artiles, Oswaldo · more Oswaldo Artiles (Florida International University) | TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality (BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view |
Ayguadé Parra, Eduard · more Eduard Ayguadé Parra (Universitat Politècnica de Catalunya, Barcelona Supercomputing Center (BSC-CNS)) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
Babuji, Yadu · more Yadu Babuji (University of Chicago, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Bai, Yang · more Yang Bai (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
Ballard, Grey · more Grey Ballard (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
Bangalore, Purushotham · more Purushotham Bangalore (University of Alabama at Birmingham) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Baravdish, Gabriel George · more Gabriel George Baravdish (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
Barker, Kevin · more Kevin Barker (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Beltran Querol, Vicenç · more Vicenç Beltran Querol (Barcelona Supercomputing Center (BSC-CNS)) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
Berezun, Daniil · more Daniil Berezun (Saint Petersburg State University, JetBrains Research) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
Bernholdt, David · more David Bernholdt (Oak Ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
Bhati, Agastya · more Agastya Bhati (UCL) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Bischof, Christian · more Christian Bischof (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Biswas, Swarnendu · more Swarnendu Biswas (Indian Institute of Technology Kanpur) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
Blaiszik, Benjamin · more Benjamin Blaiszik (University of Chicago, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Brace, Alexander · more Alexander Brace (Argonne) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Brettin, Thomas · more Thomas Brettin (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Buhler, Jeremy · more Jeremy Buhler (Washington University) | Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view |
Buluc, Aydin · more Aydin Buluc (Lawrence Berkeley National Lab, The University of California at Berkeley) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
Cai, Lei · more Lei Cai (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Cai, Wei · more Wei Cai (School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen; Shenzhen Institute of Artificial Intelligence and Robotics for Society) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
Cai, Wentao · more Wentao Cai (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Cai, Zhigang · more Zhigang Cai (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Cai, Zhiping · more Zhiping Cai (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
Caíno-Lores, Silvina · more Silvina Caíno-Lores (University of Tennessee at Knoxville) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Cao, Guohua · more Guohua Cao (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Cao, Huanqi · more Huanqi Cao (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Cao, Qiang · more Qiang Cao (Huazhong University of Science and Technology) | SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view |
Cartier, Hannah · more Hannah Cartier (Rhodes College) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
Catalyurek, Umit · more Umit Catalyurek (Georgia Institute of Technology) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Chang, Da-Wei · more Da-Wei Chang (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
Chang, Harry · more Harry Chang (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Chang, Liang · more Liang Chang (University of Electronic Science and Technology of China) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Chapman, Barbara · more Barbara Chapman (Stony Brook University) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
Chapman, David · more David Chapman (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Chard, Kyle · more Kyle Chard (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Chard, Ryan · more Ryan Chard (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Chen, Bangduo · more Bangduo Chen (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Chen, Hanhua · more Hanhua Chen (Huazhong University of Science and Technology) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
Chen, Jianxi · more Jianxi Chen (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Chen, Jieyang · more Jieyang Chen (Oak Ridge National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Chen, Kai · more Kai Chen (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
Chen, Li · more Li Chen (University of Louisiana at Lafayette) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
Chen, Quan · more Quan Chen (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Chen, Si · more Si Chen (West Chester University, PA, USA) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
Chen, Wei · more Wei Chen (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Chen, Wenguang · more Wenguang Chen (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Chen, Yanhao · more Yanhao Chen (Rutgers University) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
Chen, Yingwen · more Yingwen Chen (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
Chen, YuAng · more YuAng Chen (Chinese University of Hong Kong, Shenzhen) | HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view |
Chen, Zhiguang · more Zhiguang Chen (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
Cheng, Albert Mo Kim · more Albert Mo Kim Cheng (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
Cheng, Liangfeng · more Liangfeng Cheng (Huazhong University of Science and Technology) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
Chesterfield, Jon · more Jon Chesterfield (AMD) | Shared Memory Remote Procedure Calls · view |
Chiu, Kenneth · more Kenneth Chiu (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
Choi, Hyuckjin · more Hyuckjin Choi (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Choi, Jong · more Jong Choi (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
Chung, Yeh-ching · more Yeh-ching Chung (Chinese University of Hong Kong, Shenzhen) | HiPa: Hierarchical Partitioning for Fast PageRank on NUMA Multicore Systems · pdf, mp4 · view |
Ci, Yiwei · more Yiwei Ci (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
Clyde, Austin · more Austin Clyde (Argonne, Univ. of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Codognet, Philippe · more Philippe Codognet (Sorbonne University / CNRS, University of Tokyo) | Constraint Solving by Quantum Annealing · pdf, mp4 · view |
Cornelius, Melanie · more Melanie Cornelius (Illinois Institute of Technology) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view |
Coveney, Peter · more Peter Coveney (UCL) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Dai, Guangli · more Guangli Dai (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
Deelman, Ewa · more Ewa Deelman (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Deng, Haiwei · more Haiwei Deng (Department of Computer Science and Engineering, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
Deng, Tongliang · more Tongliang Deng (SenseTime Research, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
Deng, Xun · more Xun Deng (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
Denis, Alexandre · more Alexandre Denis (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
Deodhar, Akshay · more Akshay Deodhar (College of Engineering, Pune) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
Dewald, Florian · more Florian Dewald (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Dhandhania, Sunidhi · more Sunidhi Dhandhania (Indian Institute of Technology Kanpur) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
Dietz, Henry · more Henry Dietz (University of Kentucky) | Tangled: A Conventional Processor Integrating A Quantum-Inspired Coprocessor · pdf, mp4 · view |
Dinan, James · more James Dinan (NVIDIA) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
Ding, Xiaoning · more Xiaoning Ding (New Jersey Institute of Technology) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
Do, Tu Mai Anh · more Tu Mai Anh Do (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Doerfert, Johannes · more Johannes Doerfert (Argonne National Laboratory) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view A Virtual GPU as Developer-Friendly OpenMP Offload Target · view Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
Dong, Dezun · more Dezun Dong (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
Dong, Pengmin · more Pengmin Dong (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Dosanjh, Matthew · more Matthew Dosanjh (Sandia National Laboratories) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Du, Jingwen · more Jingwen Du (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view |
Du, Mingzhe · more Mingzhe Du (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Duan, Yubin · more Yubin Duan (Temple University) | Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view |
Eker, Ali · more Ali Eker (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
Ellis, Marquita · more Marquita Ellis (The University of California at Berkeley, Lawrence Berkeley National Lab) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
Elwasif, Wael · more Wael Elwasif (Oak Ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
Enright Jerger, Natalie · more Natalie Enright Jerger (University of Toronto) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
F. Lorenzon, Arthur · more Arthur F. Lorenzon (Federal University of Pampa) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
Fan, Sijiang · more Sijiang Fan (National University of Defense Technology, University of Manchester) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
Fang, Liang · more Liang Fang (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Fang, Qiming · more Qiming Fang (Wake Forest University) | Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
Fathi, Arash · more Arash Fathi (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Fei, Jiawei · more Jiawei Fei (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
Fei, Xiang · more Xiang Fei (Tsinghua University) | Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view |
Feng, Dan · more Dan Feng (Huazhong University of Science and Technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
Feng, Ke · more Ke Feng (School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
Feng, Wu · more Wu Feng (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Feng, Xiaobing · more Xiaobing Feng (Institute of Computing Technology, Chinese Academy of Sciences) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
Feng, Zonghao · more Zonghao Feng (Hong Kong University of Science and Technology) | Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view |
Ferreira da Silva, Rafael · more Rafael Ferreira da Silva (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Firoz, Jesun · more Jesun Firoz (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Fohry, Claudia · more Claudia Fohry (University of Kassel) | Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view |
Foster, Ian · more Ian Foster (Argonne National Laboratory, University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Fu, Qiang · more Qiang Fu (George Washington University) | Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view |
Fu, Song · more Song Fu (University of North Texas) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
Fujimoto, Manato · more Manato Fujimoto (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Fujimoto, Noriyuki · more Noriyuki Fujimoto (Osaka Prefecture University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
Gao, Jiechao · more Jiechao Gao (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
Gao, Liang · more Liang Gao (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
Gao, Weiguo · more Weiguo Gao (Fudan University) | Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view |
Gavirangaswamy, Vinay · more Vinay Gavirangaswamy (Western Michigan University; Cray, a Hewlett Packard Enterprise Company) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
Georgakoudis, Giorgis · more Giorgis Georgakoudis (Lawrence Livermore National Laboratory) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
Gerofi, Balazs · more Balazs Gerofi (RIKEN Center for Computational Science) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Gerstlauer, Andreas · more Andreas Gerstlauer (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Ghafoor, Sheikh · more Sheikh Ghafoor (Tennessee Technological University) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Ghanim, Fady · more Fady Ghanim (Oak Ridge National Labs) | Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM · pdf, mp4 · view |
Gibbs, Thomas · more Thomas Gibbs (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Gite, Rahul · more Rahul Gite (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Goel, Garvit · more Garvit Goel (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Gondhalekar, Atharva · more Atharva Gondhalekar (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Gong, Xiaoli · more Xiaoli Gong (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Gourounas, Dimitrios · more Dimitrios Gourounas (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Govil, Karan · more Karan Govil (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Grant, Ryan · more Ryan Grant (Sandia National Laboratories) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Groppe, Sven · more Sven Groppe (Universität zu Lübeck) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
Groth, Tobias · more Tobias Groth (Universität zu Lübeck) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
Guo, Minyi · more Minyi Guo (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Guo, Xiao-Wei · more Xiao-Wei Guo (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
Guo, Yeting · more Yeting Guo (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
Guo, Zehua · more Zehua Guo (Beijing Institute of Technology) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Gupta, Ajay · more Ajay Gupta (Western Michigan University) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
Hale, Kyle · more Kyle Hale (Illinois Institute of Technology) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
Halem, Milton · more Milton Halem (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Han, Yongguo · more Yongguo Han (Southwest University of Science and Technology) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Hanindhito, Bagus · more Bagus Hanindhito (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
He, Heng · more Heng He (School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
He, Shuibing · more Shuibing He (Zhejiang University, Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
Hong, Yang · more Yang Hong (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Hossain, Md Maruf · more Md Maruf Hossain (University of North Carolina at Charlotte) | Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view |
Hsu, Wei-Chung · more Wei-Chung Hsu (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
Hu, Jing · more Jing Hu (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Hu, Yang · more Yang Hu (University of Texas at Dallas) | Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view |
Hu, Yi · more Yi Hu (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Hu, Yongmin · more Yongmin Hu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Hu, Yuchong · more Yuchong Hu (Huazhong University of Science and Technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
Hua, Fei · more Fei Hua (Rutgers University) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
Hua, Qiang-Sheng · more Qiang-Sheng Hua (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
Huang, Chenglong · more Chenglong Huang (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Huang, Chung-Wen · more Chung-Wen Huang (MediaTek Inc) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Huang, Dan · more Dan Huang (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
Huang, H. Howie · more H. Howie Huang (George Washington University) | Automatic Generation of High-Performance Inference Kernels for Graph Neural Networks on Multi-Core Systems · pdf, mp4 · view |
Huang, Jiawen · more Jiawen Huang (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Huang, Kaixin · more Kaixin Huang (ByteDance Inc., Shanghai Jiao Tong University) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Huang, Min · more Min Huang (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Huang, Tao · more Tao Huang (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Huang, Yizhi · more Yizhi Huang (Hunan University, Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
Huber, Joseph · more Joseph Huber (Oak Ridge National Laboratory) | Advancing OpenMP Offload Debugging Capabilities in LLVM · view |
Hundt, Christian · more Christian Hundt (NVIDIA AI Technology Center Luxembourg) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
Hung, Ming-Yu · more Ming-Yu Hung (MediaTek Inc) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Ikeda, Takuya · more Takuya Ikeda (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
Ilic, Aleksandar · more Aleksandar Ilic (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
Imamura, Toshiyuki · more Toshiyuki Imamura (RIKEN Center for Computational Science) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
Jahic, Jasmin · more Jasmin Jahic (University of Cambridge) | ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view |
Jarachanthan, Jananie · more Jananie Jarachanthan (University of Louisiana at Lafayette) | AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view |
Jayatilaka, Tarindu · more Tarindu Jayatilaka (University of Moratuwa) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
Jeannot, Emmanuel · more Emmanuel Jeannot (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
Jenkins, Louis · more Louis Jenkins (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Jha, Shantenu · more Shantenu Jha (Brookhaven National Laboratory, Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Ji, Zhuoran · more Zhuoran Ji (The University of Hong Kong) | Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view |
Jia, Ranhao · more Ranhao Jia (Department of Computer Science and Engineering, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
Jia, Zhen · more Zhen Jia (Amazon) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
Jiang, Dejun · more Dejun Jiang (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
Jiang, Hao · more Hao Jiang (National University of Defense Technology) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
Jiang, Shizhi · more Shizhi Jiang (University of Chinese Academy of Sciences; Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
Jin, Hai · more Hai Jin (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
Jin, Yuwei · more Yuwei Jin (Rutgers University) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
Jin, Zheming · more Zheming Jin (ORNL) | Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view |
John, Lizy · more Lizy John (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Jünger, Daniel · more Daniel Jünger (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
Kanayama, Yuta · more Yuta Kanayama (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
Kao, Henry · more Henry Kao (University of Toronto) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
Ke, Zhaokang · more Zhaokang Ke (Huazhong University of Science and Technology) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
Ke, Zong-Ming · more Zong-Ming Ke (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
Keipert, Kristopher · more Kristopher Keipert (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Khan, Md Muhib · more Md Muhib Khan (Florida State University) | ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view |
Kilpatrick, Peter · more Peter Kilpatrick (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
Klein, Christoph · more Christoph Klein (University of Heidelberg, ZITI) | Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view |
Kobus, Robin · more Robin Kobus (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
Koohi Esfahani, Mohsen · more Mohsen Koohi Esfahani (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
Koppehel, Martin · more Martin Koppehel (Otto-von-Guericke Universität Magdeburg) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
Kozakai, Seiya · more Seiya Kozakai (Hosei University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
Kranzlmüller, Dieter · more Dieter Kranzlmüller (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Kruse, Michael · more Michael Kruse (Argonne National Laboratory) | Loop Transformations using Clang's Abstract Syntax Tree · view |
Kurth, Thorsten · more Thorsten Kurth (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Lai, Junjie · more Junjie Lai (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
Lai, Jyun-Kai · more Jyun-Kai Lai (National Yang Ming Chiao Tung University) | Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view |
Lai, Wei-Chih · more Wei-Chih Lai (MediaTek Inc) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Lai, Zhiquan · more Zhiquan Lai (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Lan, Hao · more Hao Lan (University of Toronto) | Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
Langguth, Johannes · more Johannes Langguth (Simula Research Laboratory) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
Larkins, D. Brian · more D. Brian Larkins (Rhodes College) | Optimizing Work Stealing Communication with Structured Atomic Operations · pdf, mp4 · view |
Leandro Nesi, Lucas · more Lucas Leandro Nesi (Institute of Informatics, Federal University of Rio Grande do Sul) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
Lee, Chao-Lin · more Chao-Lin Lee (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Lee, Hyungro · more Hyungro Lee (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Lee, Jenq-Kuen · more Jenq-Kuen Lee (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Legrand, Arnaud · more Arnaud Legrand (University Grenoble Alpes, CNRS) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
Lehr, Jan-Patrick · more Jan-Patrick Lehr (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Lei, Mengya · more Mengya Lei (Huazhong University of Science and Technology) | Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
Leng, Jingwen · more Jingwen Leng (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Li, Ang · more Ang Li (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Li, Angela · more Angela Li (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
Li, Baochun · more Baochun Li (University of Toronto) | Accelerated Device Placement Optimization with Contrastive Learning · pdf, mp4 · view |
Li, Baoqian · more Baoqian Li (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Li, Bo · more Bo Li (Hong Kong University of Science and Technology) | AMPS-Inf: Automatic Model Partitioning for Serverless Inference with Cost Efficiency. · pdf, mp4 · view |
Li, Chuanying · more Chuanying Li (Hunan University) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
Li, Dawei · more Dawei Li (Montclair State University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Li, Dongsheng · more Dongsheng Li (Sun Yat-sen University) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Li, Fan · more Fan Li (Huazhong University of Science and Technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
Li, Guangli · more Guangli Li (Institute of Computing Technology, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
Li, Hongyan · more Hongyan Li (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Li, Jiajia · more Jiajia Li (Pacific Northwest National Laboratory, William & Mary) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Li, Jiawei · more Jiawei Li (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
Li, Jun · more Jun Li (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Li, Li · more Li Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
Li, Mingshu · more Mingshu Li (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
Li, Mingzhen · more Mingzhen Li (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Li, Minjun · more Minjun Li (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Li, Qiliang · more Qiliang Li (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
Li, Renfa · more Renfa Li (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
Li, Ruihao · more Ruihao Li (The University of Texas at Austin) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Li, Shengwei · more Shengwei Li (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Li, Weiguang · more Weiguang Li (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Fast and Consistent Remote Direct Access to Non-volatile Memory · pdf, mp4 · view |
Li, Xiaowei · more Xiaowei Li (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Li, Xiaoying · more Xiaoying Li (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
Li, Yongkun · more Yongkun Li (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
Li, Yubo · more Yubo Li (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
Li, Yun-Ze · more Yun-Ze Li (National Cheng Kung University) | Dual-KV: Improving Performance of Key-value Caches on Multilevel Cell Non-volatile Memory · pdf, mp4 · view |
Li, Zhuozhao · more Zhuozhao Li (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Li, Zirui · more Zirui Li (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Li, Zitong · more Zitong Li (Wake Forest University) | Parallel Tucker Decomposition with Numerically Accurate SVD · pdf, mp4 · view |
Liao, Hui-Hsin · more Hui-Hsin Liao (National Tsing Hua University) | Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM · pdf, mp4 · view |
Liao, Jianwei · more Jianwei Liao (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Liao, Pin-Wei · more Pin-Wei Liao (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
Liao, Shih-Wei · more Shih-Wei Liao (National Taiwan University) | Intra- and Inter- Layer Transformation to Reduce Memory Traffic for CNN Computation · pdf, mp4 · view |
Liao, Xiangke · more Xiangke Liao (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
Lin, Che-Chia · more Che-Chia Lin (National Tsing Hua University) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view |
Lin, Tzu-Chia · more Tzu-Chia Lin (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
Lin, Xiang · more Xiang Lin (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Lin, Yonghua · more Yonghua Lin (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
Liu, Chengyu · more Chengyu Liu (Wuhan University of Science and Technology, School of Computer Science and Technology) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
Liu, Fang · more Fang Liu (School of Design, Hunan University) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
Liu, Hanfeng · more Hanfeng Liu (School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen; Shenzhen Institute of Artificial Intelligence and Robotics for Society) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
Liu, Junhong · more Junhong Liu (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
Liu, Sen · more Sen Liu (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Liu, Wenbin · more Wenbin Liu (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Liu, Wuji · more Wuji Liu (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
Liu, Xiaoyan · more Xiaoyan Liu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Liu, Yan · more Yan Liu (Hunan University) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
Liu, Yi · more Yi Liu (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Liu, Zhiming · more Zhiming Liu (Southwest University) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
López-Paradís, Guillem · more Guillem López-Paradís (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
Lu, Hang · more Hang Lu (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Lu, Yutong · more Yutong Lu (Sun Yat-sen University) | Optimizing Massively Parallel Winograd Convolution on ARM Processor · pdf, mp4 · view |
Luan, Dongming · more Dongming Luan (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Luan, Zhongzhi · more Zhongzhi Luan (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Luo, Qiong · more Qiong Luo (Hong Kong University of Science and Technology) | Accelerating Sequence-to-Graph Alignment on Heterogeneous Processors · pdf, mp4 · view |
Luo, Yingwei · more Yingwei Luo (Department of Computer Science and Technology, Peking University; Peng Cheng Lab, Shenzhen) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
Lv, Pengze · more Pengze Lv (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
Lyu, Min · more Min Lyu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
Ma, Heng · more Heng Ma (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Maggioli, Filippo · more Filippo Maggioli (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
Maldonado, Daniel Adrian · more Daniel Adrian Maldonado (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
Mangalagiri, Jayalakshmi · more Jayalakshmi Mangalagiri (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Mantel, Heiko · more Heiko Mantel (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Massini, Annalisa · more Annalisa Massini (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
Mathias, Gerald · more Gerald Mathias (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Matsui, Tomokazu · more Tomokazu Matsui (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Mehta, Kshitij · more Kshitij Mehta (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
Mei, Huiyao · more Huiyao Mei (Huazhong University of Science and Technology) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
Mello Schnorr, Lucas · more Lucas Mello Schnorr (Institute of Informatics, Federal University of Rio Grande do Sul) | Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application · pdf, mp4 · view |
Merzky, Andre · more Andre Merzky (Rutgers University) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Meyer, Bruno · more Bruno Meyer (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
Miandji, Ehsan · more Ehsan Miandji (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
Mishin, Nikita · more Nikita Mishin (Saint Petersburg State University, JetBrains Research) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
Miyaji, Atsushi · more Atsushi Miyaji (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Moretó, Miquel · more Miquel Moretó (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | gem5+RTL: A Framework to Enable RTL Models Inside a Full-System Simulator · pdf, mp4 · view |
Morris, Nathaniel · more Nathaniel Morris (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
Mukunoki, Daichi · more Daichi Mukunoki (RIKEN Center for Computational Science) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
Müller, André · more André Müller (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
Navarro Muñoz, Antoni · more Antoni Navarro Muñoz (Barcelona Supercomputing Center (BSC-CNS), Universitat Politècnica de Catalunya) | Combining Dynamic Concurrency Throttling with Voltage and Frequency Scaling on Task-based Programming Models · pdf, mp4 · view |
Nguyen, Phuong · more Phuong Nguyen (UMBC, OpenKneck Inc) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Nobre, Ricardo · more Ricardo Nobre (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
Norouzi, Mohammad · more Mohammad Norouzi (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Nunan Zola, Wagner M. · more Wagner M. Nunan Zola (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
Ogita, Takeshi · more Takeshi Ogita (Tokyo Woman's Christian University) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
Ohtsuki, Kazuhiro · more Kazuhiro Ohtsuki (Kobe University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
Ouyang, Shuo · more Shuo Ouyang (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
Ozaki, Katsuhisa · more Katsuhisa Ozaki (Shibaura Institute of Technology) | Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme · pdf, file · view |
Ozkaya, M. Yusuf · more M. Yusuf Ozkaya (Georgia Institute of Technology) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Pacaud, François · more François Pacaud (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
Paluri, Pavan Kumar · more Pavan Kumar Paluri (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view |
Park, EunJung · more EunJung Park (Los Alamos National Laboratory) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
Partin, Alexander · more Alexander Partin (Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Patel, Atmn · more Atmn Patel (University of Waterloo) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
Peng, Zhouxuan · more Zhouxuan Peng (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Perotin, Lucas · more Lucas Perotin (ENS Lyon) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
Perovic, Vasilije · more Vasilije Perovic (University of Rhode Island) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
Perumalla, Kalyan · more Kalyan Perumalla (Oak Ridge National Laboratory) | Design Considerations for GPU-based Mixed Integer Programming on Parallel Computing Platforms · pdf, mp4 · view |
Pionteck, Thilo · more Thilo Pionteck (Otto-von-Guericke Universität Magdeburg) | CuART - a CUDA-based, scalable Radix-Tree lookup and update engine · pdf, mp4 · view |
Plano, Tom · more Tom Plano (Washington University) | Enabling Real-Time Irregular Data-Flow Pipelines on SIMD Devices · pdf, mp4 · view |
Pogorelov, Konstantin · more Konstantin Pogorelov (Simula Research Laboratory) | Explaining the Classification Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection · pdf, mp4 · view |
Ponomarev, Dmitry · more Dmitry Ponomarev (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
Posner, Jonas · more Jonas Posner (University of Kassel) | Transparent Resource Elasticity for Task-Based Cluster Environments with Work Stealing · pdf, mp4 · view |
Pottier, Loïc · more Loïc Pottier (University of Southern California, Information Sciences Institute) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Pourjafarian, Monireh · more Monireh Pourjafarian (Technical University of Kaiserslautern) | ArchViMP – a Framework for Automatic Extraction of Concurrency-related Software Architectural Properties · pdf, mp4 · view |
Pozo, Aurora · more Aurora Pozo (Federal University of Paraná) | Warp-centric K-Nearest Neighbor Graphs construction on GPU · pdf, mp4 · view |
Prema Soundararajan, Prema · more Prema Prema Soundararajan (University of Alabama at Birmingham) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Qi, Jingyuan · more Jingyuan Qi (Virginia Tech) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Qi, Qiang · more Qiang Qi (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
Qian, Depei · more Depei Qian (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Qian, Kun · more Kun Qian (Alibaba) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
Qiao, Linbo · more Linbo Qiao (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Qiu, Kun · more Kun Qiu (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Quan, Zhe · more Zhe Quan (Hunan University) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
Rabbi, Fazlay · more Fazlay Rabbi (Michigan State University) | An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures · pdf, mov · view |
Raghavan, Padma · more Padma Raghavan (Vanderbilt University) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
Ramanathan, Arvind · more Arvind Ramanathan (Argonne) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Ramtin, Amir Reza · more Amir Reza Ramtin (University of Massachusetts Amherst) | Self-Stabilization with Selfish Agents · pdf, mp4 · view |
Raugas, Mark · more Mark Raugas (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Ren, Fengyuan · more Fengyuan Ren (Tsinghua University, Beijing National Research Center for Information Science and Technology (BNRist)) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
Ren, Runtian · more Runtian Ren (Nanyang Technological University) | Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view |
Revell, Alistair · more Alistair Revell (University of Manchester) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
Rodolà, Emanuele · more Emanuele Rodolà (Sapienza, University of Rome) | Efficiently Parallelizable Strassen-Based Multiplication of a Matrix by its Transpose · pdf, mp4 · view |
Romero-Gainza, Eduardo · more Eduardo Romero-Gainza (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
Saeed, Fahad · more Fahad Saeed (Florida International University) | TurboBC: A Memory Efficient and Scalable GPU Based Betweenness Centrality(BC) Algorithm in the Language of Linear Algebra · pdf, mp4 · view |
Saleh, Hisham · more Hisham Saleh (Western Michigan University) | Towards Faster Execution of Ensemble ML Bootstrap Based Techniques · pdf, mp4 · view |
San Miguel, Joshua · more Joshua San Miguel (University of Wisconsin-Madison) | Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications · pdf, mp4 · view |
Santander-Jiménez, Sergio · more Sergio Santander-Jiménez (University of Extremadura) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
Saule, Erik · more Erik Saule (University of North Carolina at Charlotte) | Postmortem Graph Analysis on the Temporal Graph · pdf, pdf · view Impact of AVX-512 Instructions on Graph Partitioning Problems. · pdf, mp4 · view |
Schafer, Derek · more Derek Schafer (University of Tennessee at Chattanooga) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Schanen, Michel · more Michel Schanen (Argonne National Laboratory) | Domain Decomposition Preconditioners for Unstructured Network Problems in Parallel Vector Architectures · pdf, mov · view |
Schildermans, Stijn · more Stijn Schildermans (KU Leuven) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
Schmidt, Bertil · more Bertil Schmidt (Johannes Gutenberg University Mainz) | MetaCache-GPU: Ultra-Fast Metagenomic Classification · pdf, mp4 · view |
Scott, Michael L. · more Michael L. Scott (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Sen, Tanmoy · more Tanmoy Sen (University of Virginia) | Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view |
Serhani, Mohamed Adel · more Mohamed Adel Serhani (United Arab Emirates University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Shah, Ashka · more Ashka Shah (University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Shan, Jianchen · more Jianchen Shan (Hofstra University) | Paratick: Reducing Timer Overhead in Virtual Machines · pdf, mp4 · view |
Shan, Tianyi · more Tianyi Shan (University of California San Diego) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Shang, Ruitao · more Ruitao Shang (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
Shen, Haiying · more Haiying Shen (University of Virginia) | Context-aware Data Operation Strategies in Edge Systems for High Application Performance · pdf, mp4 · view Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
Shen, Yijie · more Yijie Shen (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
Shi, Yang · more Yang Shi (National University of Defense Technology) | sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view |
Shivadekar, Samit · more Samit Shivadekar (UMBC) | An Intelligent Parallel Distributed Streaming Framework for Near Real-time Science Sensors and High Resolution Medical Images · pdf, mp4 · view |
Singhal, Swati · more Swati Singhal (University of Maryland, College Park) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
Skjellum, Anthony · more Anthony Skjellum (University of Tennessee, Chattanooga; SimCenter) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Song, Shuaiwen Leon · more Shuaiwen Leon Song (University of Sydney) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Sousa, Leonel · more Leonel Sousa (INESC-ID; Instituto Superior Técnico, Universidade de Lisboa) | Fourth-Order Exhaustive Epistasis Detection for the xPU Era · pdf, mp4 · view |
Stef, Graillat · more Graillat Stef (Sorbonne Université) | XHYPRE: A high-precision numerical software package for solving large-scale sparse linear equations · pdf, pdf · view |
Stern, Abraham · more Abraham Stern (NVIDIA Inc.) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Stevens, Rick · more Rick Stevens (Argonne National Laboratory, University of Chicago) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Stewart, Christopher · more Christopher Stewart (The Ohio State University) | Cache-Aware Data Management for Memory-Mapped Forests · pdf, mp4 · view |
Strzodka, Robert · more Robert Strzodka (University of Heidelberg, ZITI) | Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum Bandwidth · pdf, mp4 · view |
Sun, Ding · more Ding Sun (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Sun, Hongyang · more Hongyang Sun (Vanderbilt University) | Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints · pdf, mp4 · view |
Sun, Hui · more Hui Sun (Anhui University) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
Sun, Min-Te · more Min-Te Sun (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
Sun, Qingxiao · more Qingxiao Sun (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Sussman, Alan · more Alan Sussman (University of Maryland, College Park) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
Swartvagher, Philippe · more Philippe Swartvagher (INRIA) | Interferences between Communications and Computations in Distributed HPC Systems · pdf, mp4 · view |
Tan, Li · more Li Tan (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Tang, Ruiqi · more Ruiqi Tang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Tang, Xiongchao · more Xiongchao Tang (Sangfor Technologies Inc. and Tsinghua Shenzhen International Graduate School) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Tang, Xueyan · more Xueyan Tang (Nanyang Technological University) | Generalized Skyline Interval Coloring and Dynamic Geometric Bin Packing Problems · pdf, mp4 · view |
Tang, Yuan · more Yuan Tang (Fudan University) | Processor-Aware Cache-Oblivious Algorithms · pdf, mp4 · view |
Taufer, Michela · more Michela Taufer (University of Tennessee at Knoxville) | Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows · pdf, mp4 · view |
Tian, Shilei · more Shilei Tian (Stony Brook University) | A Virtual GPU as Developer-Friendly OpenMP Offload Target · view |
Timmerman, David · more David Timmerman (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
Tiskin, Alexander · more Alexander Tiskin (Saint Petersburg State University) | Efficient Parallel Algorithms for String Comparison · pdf, mp4 · view |
Titov, Mikhail · more Mikhail Titov (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Tong, Wei · more Wei Tong (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
Towsley, Don · more Don Towsley (University of Massachusetts Amherst) | Self-Stabilization with Selfish Agents · pdf, mp4 · view |
Trahay, Francois · more Francois Trahay (Telecom SudParis) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Trenev, Dimitar · more Dimitar Trenev (ExxonMobil) | Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory · pdf, mp4 · view |
Trifan, Anda · more Anda Trifan (University of Illinois at Urbana Champaign, Argonne National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Tsaris, Aristeidis · more Aristeidis Tsaris (Oak Ridge National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Turilli, Matteo · more Matteo Turilli (Rutgers University, Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Ueno, Hideto · more Hideto Ueno (University of Tokyo) | Towards Compile-Time-Reducing Compiler Optimization Selection via Machine Learning · view |
Unger, Jonas · more Jonas Unger (Linköping University) | GPU Accelerated SL0 for Multidimensional Signals · pdf, mp4 · view |
Valpey, Benjamin · more Benjamin Valpey (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Van Dam, Huub · more Huub Van Dam (Brookhaven National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Vandierendonck, Hans · more Hans Vandierendonck (Queen's University Belfast) | Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing · pdf, mp4 · view |
Vetter, Jeff · more Jeff Vetter (ORNL) | Evaluating the Performance of Integer Sum Reduction in SYCL · pdf, pptx · view |
Wada, Koichi · more Koichi Wada (Hosei University) | Efficient GPU-Implementation for Integer Sorting Based on Histogram and Prefix-Sums · pdf, mp4 · view |
Wada, Tomotaka · more Tomotaka Wada (Kansai University) | New Evacuation Guidance Using Augmented Reality for Emergency Rescue Evacuation Support System (ERESS) · pdf, mp4 · view |
Wahib, Mohamed · more Mohamed Wahib (National Institute of Advanced Industrial Science and Technology, RIKEN Center for Computational Science) | Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs · pdf, mov · view |
Wan, Shunzhou · more Shunzhou Wan (University College London) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Wang, Cho-Li · more Cho-Li Wang (The University of Hong Kong) | Accelerating DBSCAN Algorithm with AI Chips for Large Datasets · pdf, mp4 · view |
Wang, En · more En Wang (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Wang, Fang · more Fang Wang (Huazhong University of Science and Technology) | ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view |
Wang, Haojie · more Haojie Wang (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Wang, Haoyu · more Haoyu Wang (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
Wang, Howard · more Howard Wang (MediaTek Inc.) | Accelerate Binarized Neural Networks with Processing-in-Memory Enabled by RISC-V Custom Instructions · pdf, mp4 · view |
Wang, Jianda · more Jianda Wang (University of Texas at Dallas) | Enabling Efficient SIMD Acceleration for Virtual Radio Access Network · pdf, mp4 · view |
Wang, Jiashu · more Jiashu Wang (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
Wang, Junsong · more Junsong Wang (V-Origin) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
Wang, Kai-Ting Amy · more Kai-Ting Amy Wang (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
Wang, Kailun · more Kailun Wang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Wang, Qiang · more Qiang Wang (Anhui University) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
Wang, Wei · more Wei Wang (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
Wang, Wenwen · more Wenwen Wang (University of Georgia) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Wang, Wenxu · more Wenxu Wang (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Wang, Xiang · more Xiang Wang (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Wang, Xiaolin · more Xiaolin Wang (Department of Computer Science and Technology, Peking University; Peng Cheng Lab, Shenzhen) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
Wang, Yi · more Yi Wang (Peng Cheng Laboratory) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Wang, Yida · more Yida Wang (Amazon) | LoWino: Towards Efficient Low-Precision Winograd Convolutions on Modern CPUs · pdf, mp4 · view |
Wang, Yuchen · more Yuchen Wang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
Wang, Zhenlin · more Zhenlin Wang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
Wang, Zihe · more Zihe Wang (Renmin University of China) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Wei, Xueliang · more Xueliang Wei (Huazhong University of Science and Technology) | Crash-Consistency-Aware Encryption for Non-Volatile Memories · pdf, mp4 · view |
Weissenberger, Jack · more Jack Weissenberger (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view |
Wen, Haosen · more Haosen Wen (University of Rochester) | A Fast, General System for Buffered Persistent Data Structures · pdf, mp4 · view |
Wen, Mei · more Mei Wen (National University of Defense Technology) | sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration · pdf, mp4 · view |
Wen, Zeyi · more Zeyi Wen (Department of Computer Science and Software Engineering, The University of Western Australia) | FastPSO: Towards Efficient Swarm Intelligence Algorithm on GPUs · pdf, mp4 · view |
Wifling, David · more David Wifling (Leibniz Research Centre) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Williams, Barry · more Barry Williams (State University of New York at Binghamton) | GVT-Guided Demand-Driven Scheduling in Parallel Discrete Event Simulation · pdf, mp4 · view |
Wolf, Felix · more Felix Wolf (Technical University of Darmstadt) | Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization · pdf, mp4 · view |
Wolf, Matthew · more Matthew Wolf (Oak Ridge National Lab) | DYFLOW: A flexible framework for orchestrating scientific workflows on supercomputers · pdf, mp4 · view |
Worley, Andrew · more Andrew Worley (Tennessee Technological University) | Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives · pdf, mp4 · view |
Wu, Chase Q. · more Chase Q. Wu (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
Wu, Chentao · more Chentao Wu (Department of Computer Science and Engineering, Shanghai Jiao Tong University; Sichuan Research Institute, Shanghai Jiao Tong University) | A Graph-Assisted Out-of-Place Update Scheme for Erasure Coded Storage Systems · pdf, mp4 · view |
Wu, Hanpei · more Hanpei Wu (SIST, ShanghaiTech University, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
Wu, Heng · more Heng Wu (Institute of Software, Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Wu, Jie · more Jie Wu (Temple University) | Joint Optimization of DNN Partition and Scheduling for Mobile Cloud Computing · pdf, mp4 · view Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Wu, Panruo · more Panruo Wu (University of Houston) | A Virtualization Platform Designed for Irregular Multi-Process Applications · pdf, pdf · view Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view |
Wu, Weijie · more Weijie Wu (Independent Researcher) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
Wu, Yadong · more Yadong Wu (Sichuan University of Science and Engineering) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Wu, Yuewen · more Yuewen Wu (Institute of Software, Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Wu, Zhongjie · more Zhongjie Wu (Alibaba) | Coupling Right-Provisioned Cold Storage Data Centers with Deduplication · pdf, mp4 · view |
Xiao, Renzhi · more Renzhi Xiao (Huazhong University of Science and Technology) | A Log-Free and Consistent Chained Hashing for Non-volatile Memory · pdf, pdf · view |
Xie, Chenhao · more Chenhao Xie (Pacific Northwest National Laboratory) | Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures · pdf, mp4 · view |
Xie, Tao · more Tao Xie (San Diego State University, CA, USA) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
Xiong, Jin · more Jin Xiong (Institute of Computing Technology, CAS; University of Chinese Academy of Sciences) | Using Vectorized Execution to Improve SQL Query Performance on Spark · pdf, mp4 · view |
Xiong, Yufei · more Yufei Xiong (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
Xu, ChengZhong · more ChengZhong Xu (University of Macau) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
Xu, Fei · more Fei Xu (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
Xu, Liangliang · more Liangliang Xu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view |
Xu, Ming · more Ming Xu (National University of Defense Technology) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view |
Xu, Nuo · more Nuo Xu (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Xu, Yang · more Yang Xu (Fudan University) | Optimizing Flow Completion Time via Adaptive Buffer Management in Data Center Networks · pdf, mp4 · view |
Xu, Yemao · more Yemao Xu (National University of Defense Technology, Information and Communication Engineering Design Institute) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
Xu, Yinlong · more Yinlong Xu (University of Science and Technology of China) | Fast Reconstruction for Large Disk Enclosures Based on RAID2.0 · pdf, mp4 · view Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
Xu, Yuanjia · more Yuanjia Xu (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Yang, Canqun · more Canqun Yang (National University of Defense Technology) | CNN+LSTM Accelerated Turbulent Flow Simulation with Link-Wise Artificial Compressibility Method · pdf, mp4 · view |
Yang, Dongxu · more Dongxu Yang (NVIDIA) | Optimizing Winograd-Based Convolution with Tensor Cores · pdf, mp4 · view |
Yang, Hailong · more Hailong Yang (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Yang, Junyao · more Junyao Yang (Michigan Technological University) | Efficient Modeling of Random Sampling-Based LRU · pdf, mp4 · view |
Yang, Qing · more Qing Yang (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Yang, Qiusong · more Qiusong Yang (Institute of Software, Chinese Academy of Sciences) | Matryoshka: A Coalesced Delta Sequence Prefetcher · pdf, mp4 · view |
Yang, Wenxiang · more Wenxiang Yang (College of Computer, National University of Defense Technology; Computational Aerodynamics Institute, China Aerodynamics Research and Development Center) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Yang, Wuu · more Wuu Yang (National Yang Ming Chiao Tung University) | Hyperchaining Optimizations for an LLVM-Based Binary Translator on x86-64 and RISC-V Platforms · pdf, mp4 · view |
Yang, Yang · more Yang Yang (Huazhong University of Science and Technology) | SPMFS: A Scalable Persistent Memory File System on Optane Persistent Memory · pdf, mp4 · view |
Yang, Yongjian · more Yongjian Yang (Jilin University) | Distributed Game-Theoretical Route Navigation for Vehicular Crowdsensing · pdf, mp4 · view |
Yao, Lulu · more Lulu Yao (University of Science and Technology of China) | Progressive Memory Adjustment with Performance Guarantee in Virtualized Systems · pdf, mp4 · view |
Yao, Yiping · more Yiping Yao (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
Yasumoto, Keiichi · more Keiichi Yasumoto (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Ye, Qianwen · more Qianwen Ye (New Jersey Institute of Technology) | NoStop: A Novel Configuration Optimization Scheme for Spark Streaming · pdf, mp4 · view |
Ye, Xiangyu · more Xiangyu Ye (National University of Defense Technology) | Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training · pdf, mp4 · view |
Ye, ZiChun · more ZiChun Ye (Huawei Technologies Canada Co., Ltd.) | Adapting SYCL’s SIMT Programming Paradigm for Accelerators via Program Reconstruction · view |
Yelick, Katherine · more Katherine Yelick (The University of California at Berkeley, Lawrence Berkeley National Lab) | Scaling Generalized N-Body Problems, A Case Study from Genomics · pdf, mp4 · view |
Yew, Pen-Chung · more Pen-Chung Yew (University of Minnesota at Twin Cities) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Yi, Zhengming · more Zhengming Yi (National University of Defense Technology) | A Universal Construction to implement Concurrent Data Structure for NUMA-multicore · pdf, mp4 · view |
Yin, Junqi · more Junqi Yin (Oak Ridge National Laboratory) | IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads · pdf, mp4 · view |
Yin, Shu · more Shu Yin (SIST, ShanghaiTech University, China; State Key Lab of High Performance Computing) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |
Yin, Yanlong · more Yanlong Yin (Zhejiang Lab) | A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization · pdf, mp4 · view |
You, Xin · more Xin You (Beihang University) | Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors · pdf, mp4 · view |
Yu, Bowen · more Bowen Yu (Tsinghua University) | Sparker: Efficient Reduction for More Scalable Machine Learning with Spark · pdf, mp4 · view |
Yu, Enda · more Enda Yu (National University of Defense Technology) | CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation · pdf, mp4 · view |
Yu, Huashan · more Huashan Yu (Department of Computer Science and Technology, Peking University) | An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs · pdf, mp4 · view |
Yu, Jie · more Jie Yu (State Key Laboratory of Aerodynamics; Computational Aerodynamics Institute, China Aerodynamics Research and Development Center) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Yu, Jinyu · more Jinyu Yu (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | CERES: Container-Based Elastic Resource Management System for Mixed Workloads · pdf, mp4 · view |
Yu, Weikuan · more Weikuan Yu (Florida State University) | ROBOTune: High-Dimensional Configuration Tuning for Cluster-Based Data Analytics · pdf, mp4 · view |
Yu, Ya · more Ya Yu (Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Yue, Yinliang · more Yinliang Yue (Institute of Information Engineering, Chinese Academy of Sciences) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
Zeng, Hui · more Hui Zeng (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
Zhang, Eddy Z. · more Eddy Z. Zhang (Rutgers University) | BGPQ: A Heap-Based Priority Queue Design for GPUs · pdf, mp4 · view |
Zhang, Jie · more Jie Zhang (National Central University) | Automated Arrhythmia Detection using Hilbert-Huang Transform based Convolutional Neural Network · pdf, mp4 · view |
Zhang, Jin · more Jin Zhang (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Zhang, Luoping · more Luoping Zhang (Wake Forest University) | Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms · pdf, mp4 · view |
Zhang, Mingzhe · more Mingzhe Zhang (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences) | BitX: Empower Versatile Inference with Hardware Runtime Pruning · pdf, mp4 · view |
Zhang, Shaoshuai · more Shaoshuai Zhang (University of Houston) | Recursion Brings Speedup to Out-of-Core TensorCore-based Linear Algebra Algorithms: A Case Study of Classic Gram-Schmidt QR Factorization · pdf, mp4 · view |
Zhang, Shulai · more Shulai Zhang (Shanghai Jiao Tong University) | Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Zhang, Wenbo · more Wenbo Zhang (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Zhang, Xiaofan · more Xiaofan Zhang (University of Illinois at Urbana-Champaign) | Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs · pdf, mp4 · view |
Zhang, Xiaorong · more Xiaorong Zhang (Southwest University of Science and Technology) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Zhang, Yiran · more Yiran Zhang (Tsinghua University, Beijing National Research Center for Information Science and Technology (BNRist)) | Receiver-Driven Congestion Control for InfiniBand · pdf, mp4 · view |
Zhang, Youhui · more Youhui Zhang (Tsinghua University) | Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement · pdf, mp4 · view |
Zhang, Zhenwei · more Zhenwei Zhang (East China Normal University) | Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling · pdf, mp4 · view |
Zhang, Zhicheng · more Zhicheng Zhang (Stanford University) | ComputeCOVID19+: Accelerating COVID-19 Diagnosis and Monitoring via High-Performance Deep Learning · pdf, mp4 · view |
Zhang, Zhihua · more Zhihua Zhang (Nara Institute of Science and Technology) | Analysis on Nursing Care Activity Related Stress Level for Reduction of Caregiving Workload · pdf, mp4 · view |
Zhao, Yuhong · more Yuhong Zhao (Institute of Information Engineering, Chinese Academy of Sciences) | Boosting Compaction Performance of LSM-tree-based KV Stores in Multi-Near-Data Processing Systems · pdf, pdf · view |
Zhao, Ziyi · more Ziyi Zhao (Nankai University) | Ascetic: Enhancing Cross-Iterations Data Efficiency in Out-of-Memory Graph Processing on GPUs · pdf, mp4 · view |
Zheng, Kevin · more Kevin Zheng (University of Virginia) | Multi-Agent Reinforcement Learning based Distributed Renewable Energy Matching for Datacenters · pdf, mp4 · view |
Zheng, Wenli · more Wenli Zheng (Shanghai Jiao Tong University) | FIFL: A Fairness Incentive Framework for Federated Learning · pdf, mp4 · view Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection · pdf, mov · view |
Zhong, Hua · more Hua Zhong (Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences) | Best VM Selection for Big Data Applications across Multiple Frameworks by Transfer Learning · pdf, mp4 · view |
Zhou, Bing Bing · more Bing Bing Zhou (The University of Sydney) | Efficient Complete Event Trend Detection over High-Velocity Streams · pdf, mp4 · view |
Zhou, Hai · more Hai Zhou (Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics) | Multi-level Forwarding and Scheduling Recovery Technique in Heterogeneous Network for Erasure-coded Clusters · pdf, mp4 · view |
Zhou, Longfang · more Longfang Zhou (Southwest University of Science and Technology, State Key Laboratory of Aerodynamics) | PREP: Predicting Job Runtime with Job Running Path on Supercomputers · pdf, mp4 · view |
Zhou, Tongqing · more Tongqing Zhou (College of Computer, National University of Defense Technology) | FedCav: Contribution-aware Model Aggregation on Distributed Heterogeneous Data in Federated Learning · pdf, mp4 · view |
Zhou, Xiaohu · more Xiaohu Zhou (School of Computing, Engineering, and Built Environment, Birmingham City University) | FMSM: A Fuzzy Multi-keyword Search Scheme for Encrypted Cloud Data based on Multi-chain Network · pdf, mp4 · view |
Zhou, Yang · more Yang Zhou (Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics) | ASLDP: An Active Semi-supervised Learning method for Disk Failure Prediction · pdf, mp4 · view |
Zhu, Junhao · more Junhao Zhu (National University of Defense Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Zhu, Lin · more Lin Zhu (Huazhong University of Science and Technology) | Communication Avoiding All-Pairs Shortest Paths Algorithm for Sparse graphs · pdf, mp4 · view |
Zhu, Wenjun · more Wenjun Zhu (Intel APAC) | Teddy: An Efficient SIMD-based Literal Matching Engine for Scalable Deep Packet Inspection · pdf, mp4 · view |
Zhu, Yifeng · more Yifeng Zhu (University of Maine) | Parallel Multi-split Extendible Hashing for Persistent Memory · pdf, mp4 · view |
Zou, Xiaomin · more Xiaomin Zou (Huazhong University of Science and Technology) | HDNH: a read-efficient and write-optimized hashing scheme for hybrid DRAM-NVM memory · pdf, mp4 · view |
Zou, Yanliang · more Yanliang Zou (SIST, ShanghaiTech University, China) | ADA: An Application-Conscious Data Acquirer for Visual Molecular Dynamics · pdf, mp4 · view |