ICPP 2018 Program


Monday, August 13th


8:00am-5:00pm

ICPP Registration
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Registration

9:00am-12:30pm

EMS Workshop
Maple, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Workshop

QC Workshop
Oak, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Workshop

9:00am-6:30pm

NextGenClouds Workshop
Straub Hall, Room 145
Workshop

P2S2 Workshop
Straub Hall, Room 156
Workshop

SRMPDS Workshop
Straub Hall, Room 245
Workshop

10:30am-11:00am

Break
Straub Hall Lobby
Break

12:30pm-1:30pm

Lunch
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Lunch

2:00pm-3:30pm

AWASN Workshop
Maple, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Workshop

2:00pm-5:00pm

BIO-HPC Workshop
Oak, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Workshop

2:00pm-6:00pm

Introduction to Running AI Workloads on PowerAI
Gumwood, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Tutorial

3:30pm-4:00pm

Break
Straub Hall Lobby
Break

Tuesday, August 14th


8:00am-10:30am

ICPP Registration
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Registration

8:30am-9:00am

Welcome and Introduction
Straub Hall, Room 156
Allen Malony
Keynote

9:00am-10:00am

Keynote
Straub Hall, Room 156
Mark Robins
AI and HPC: Challenges and Opportunities
Keynote

10:00am-10:30am

Best Paper Session
Straub Hall, Room 156
Michele Weiland
ImageNet Training in Minutes
Paper

10:30am-11:00am

Break
Straub Hall Lobby
Break

11:00am-12:30pm

Graph Applications
Straub Hall, Room 245
Konstantinos Krommydas
ParaPLL: Fast Parallel Shortest-path Distance Query on Large-scale Weighted Graphs
HUS-Graph: I/O-Efficient Out-of-Core Graph Processing with Hybrid Update Strategy
A Distributed Infomap Algorithm for Scalable and High-Quality Community Detection
Paper

Monitoring and Network Analysis
Straub Hall, Room 156
Martin Schulz
Integrating Low-latency Analysis into HPC System Monitoring
Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics
Interference between I/O and MPI Traffic on Fat-tree Networks
Paper

Task Placement Algorithms
Straub Hall, Room 145
Jee Choi
Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous Clusters
Topology-induced Enhancement of Mappings
Charging Task Scheduling for Directional Wireless Charger Networks
Paper

12:30pm-2:00pm

ICPP Executive Meeting
Oak, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Allen Malony
Lunch

Lunch
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Lunch

1:00pm-2:00pm

Ph.D. Forum Introduction
Maple, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Ph.D. Forum

2:00pm-3:30pm

Astronomy and Earth Systems
Straub Hall, Room 245
Kevin Huck
Scalable Solutions for Automated Single Pulse Identification and Classification in Radio Astronomy
Communication-Avoiding for Dynamical Core of Atmospheric General Circulation Model
MPI-Vector-IO: Parallel I/O and Partitioning for Geospatial Vector Data
Paper

Networking Algorithms
Straub Hall, Room 145
Kamesh Madduri
NFV Middlebox Placement with Balanced Set-up Cost and Bandwidth Consumption
DAG-SFC: Minimize the Embedding Cost of SFC with Parallel VNFs
Heterogeneous Wireless Charger Placement with Obstacles
Paper

Performance Tools and Methodologies
Straub Hall, Room 156
Sameer Shende
Scalable Behavioral Emulation of Extreme-Scale Systems Using Structural Simulation Toolkit
Varbench: an Experimental Framework to Measure and Characterize Performance Variability
NumaMMA: NUMA MeMory Analyzer
Paper

3:30pm-4:00pm

Break
Straub Hall Lobby
Break

4:00pm-6:00pm

Algorithms
Straub Hall, Room 145
Wu Feng
MND-MST: A Multi-Node Multi-Device Parallel Boruvka's MST Algorithm
CSTF: Large-Scale Sparse Tensor Factorizations on Distributed Platforms
Reducing Communication in Proximal Newton Methods for Sparse Least Squares Problems
PBCS: An Efficient Parallel Characteristic Set Method for Solving Boolean Polynomial Systems
Paper

Performance on GPU Systems
Straub Hall, Room 156
Sameer Shende
Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs
Revisiting Multi-pass Scatter and Gather on GPUs
Matrix Factorization on GPUs with Memory Optimization and Approximate Computing
Massively Parallel Huffman Decoding on GPUs
Paper

Scheduling Algorithms
Straub Hall, Room 245
Michele Weiland
A Generic Approach to Scheduling and Checkpointing Workflows
ran-GJS: Orchestrating Data Analytics for Heterogeneous Geo-distributed Edges
Less Provisioning: A Fine-Grained Resource Scaling Engine for Long-Running Services with Tail Latency Guarantees
Improving Resource Utilization through Demand Aware Process Scheduling
Paper

6:30pm-9:30pm

Poster Reception and Dinner
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Kengo Nakajima
Delta-Stepping Synchronous Parallel Model
Linear Time Sorting for Large Data Sets with Specialized Processor
Exploring Memory Coalescing for 3D-Stacked Hybrid Memory Cube
A Low-Communication Method to Solve Poisson's Equation on Locally-Structured Grids
CGAcc: CSR-based Graph Traversal Accelerator on HMC
An Extensible Ecosystem of Tools Providing User Friendly HPC Access and Supporting Jupyter Notebooks
In-Depth Reliability Characterization of NAND Flash based Solid State Drives in High Performance Computing Systems
Leveraging Resource Bottleneck Awareness and Optimizations for Data Analytics Performance
Performance Improvements of an Event Index Distributed System
KeyBin2: Distributed Clustering for Scalable and In-situ Analysis
I/O Bottleneck Investigation in Deep Learning Systems
OpenMP 4.5 Implementations: Evaluation & Verification of Offloading Features
Cost-Time Performance of Scaling Applications on the Cloud
SOSflow: A Scalable Observation System for Introspection and In Situ Analytics
Abstractions for Specifying Sparse Matrix Data Transformations
Topologies and Adaptive Routing on Large-Scale Interconnects
Middleware for Data Intensive Analytics on HPC
Toward a Multi-GPU Implementation of the Modular Integer GCD Algorithm: Extended Abstract
DSAP: Data Structure-Aware Prefetching for Breadth First Search on GPU
Exploiting Inter-Phase Application Dynamism to Auto-Tune HPC Applications for Energy-Efficiency
Resource and Service Management in Fog Computing
Iterative Solver Selection Techniques for Sparse Linear Systems
Performance Analysis of DroughtHPC and Holistic HPC Workflows
Push-Pull on Graphs is Column- and Row-based SpMV Plus Masks
Adaptive auto-tuning in HPX using APEX
A HPC Framework for Big Spatial Data Processing and Analytics
A Computational Investigation of Redistricting Using Simulated Annealing
Fast and generic concurrent message-passing
Designing Domain-Specific Heterogeneous Manycores from Dataflow Programs
Identifying Carcinogenic Multi-hit Combinations using Weighted Set Cover Algorithm
Efficient Matching of GPU Kernel Subgraphs
Toward Footprint-Aware Power Shifting for Hybrid Memory Based Systems
Algorithm Design for Large Scale FFT-Based Simulations on CPU-GPU Platforms
WebNN: A Distributed Framework for Deep Learning
Utilization of Random Profiling for System Modeling and Dynamic Configuration
Performance evaluation of parallel cloud functions
Interval based Framework for Locking in Hierarchies
Models and Techniques for Green High-Performance Computing
Poster, Reception

Wednesday, August 15th


8:00am-10:00am

ICPP Registration
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Registration

9:00am-10:00am

Plenary
Straub Hall, Room 156
Manish Parashar
Transforming Science through Cyberinfrastructure
Plenary Talk

10:00am-10:30am

Break
Straub Hall Lobby
Break

10:30am-12:30pm

Machine Learning and Networks
Straub Hall, Room 145
Peter Pirkelbauer
Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU Cluster
GLP4NN: A Convergence-invariant and Network-agnostic Light-weight Parallelization Framework for Deep Neural Networks on Modern GPUs
KeyBin2: Distributed Clustering for Scalable and In-Situ Analysis
Disk Failure Prediction in Data Centers via Online Learning
Paper

Memory Performance
Straub Hall, Room 156
Martin Schulz
A Performance Model to Execute Workflows on High-Bandwidth-Memory Architectures
Optimizing for KNL Usage Modes When Data Doesn’t Fit in MCDRAM
Nemo: NUMA-aware Concurrency Control for Scalable Transactional Memory
SPECTR: Scalable Parallel Short Read Error Correction on Multi-core and Many-core Architectures
Paper

Networking
Straub Hall, Room 245
Federico Silla
Cache Assisted Randomized Sharing Counters in Network Measurement
Load-Balanced Slim Fly Networks
Toward Performant and Energy-efficient Queries in Three-tier Wireless Sensor Networks
Click-Based Asynchronous Mesh Network with Bounded Bundled Data
Paper

12:30pm-2:00pm

Lunch
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Lunch

Ph.D. Forum Discussion
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Ph.D. Forum

2:00pm-3:30pm

Machine Learning
Straub Hall, Room 145
Wu Feng
Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization
Energy-efficient Application Resource Scheduling using Machine Learning Classifiers
PRIONN: Predicting Runtime and IO using Neural Networks
Paper

Materials and Molecular Dynamics
Straub Hall, Room 245
Jose Canales
Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer
Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations
Task-parallel Analysis of Molecular Dynamics Trajectories
Paper

Performance Studies
Straub Hall, Room 156
Filippo Mantovani
A Multilevel Subtree Method for Single and Batched Sparse Cholesky Factorization
Vectorised Computation of Diverging Ensembles
Balanced k-means for Parallel Geometric Partitioning
Paper

3:30pm-4:00pm

Break
Straub Hall Lobby
Break

4:00pm-5:30pm

Performance of Sparse Algorithms
Straub Hall, Room 245
Sameer Shende
A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010
Bandwidth Reduced Parallel SpMV on the SW26010 Many-Core Platform
Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512
Paper

Ph.D. Career Planning
Gumwood, Ballroom Area, Erb Memorial Union (EMU), 2nd Floor
Ph.D. Forum

Programming Models
Straub Hall, Room 156
Olga Pearce
A Comprehensive Study on Bugs in Actor Systems
A Framework for Auto-Parallelization and Code Generation: An Integrative Case Study with Legacy FORTRAN Codes
Improving MPI Multi-threaded RMA Communication Performance
Paper

Resilience and Reliability
Straub Hall, Room 145
Allen Malony
Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-level Fault Injection
Leverage Redundancy in Hardware Transactional Memory to Improve Cache Reliability
Modeling Application Resilience in Large-scale Parallel Execution
Paper

6:30pm-9:30pm

Conference Dinner
Jordan Schnitzer Museum of Art (JSMA)
Reception

Thursday, August 16th


9:00am-10:00am

Plenary
Straub Hall, Room 156
Mary Hall
Bringing Sparse Computations into the Optimization Light
Plenary Talk

10:00am-10:30am

Break
Straub Hall Lobby
Break

10:30am-12:30pm

Memory and Caching
Straub Hall, Room 145
Sameer Shende
Memory Coalescing for Hybrid Memory Cube
CAMPS: Conflict-Aware Memory-Side Prefetching Scheme for Hybrid Memory Cube
Improving First Level Cache Efficiency for GPUs Using Dynamic Line Protection
Accelerating FM-index Search for Genomic Data Processing
Paper

Resource Management
Straub Hall, Room 156
Taisuke Boku
Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical Clouds
Performance & Energy Tradeoffs for Dependent Distributed Applications Under System-wide Power Caps
H2Cloud: Maintaining the Whole Filesystem in an Object Storage Cloud
Power Efficient High Performance Packet I/O
Paper

Runtime Systems
Straub Hall, Room 245
Olga Pearce
Efficient Runtime Support for a Partitioned Global Logical Address Space
FULT: Fast User-Level Thread Scheduling Using Bit-Vectors
Constructing Dynamic Policies for Paging Mode Selection
The Case for Semi-Permanent Cache Occupancy
Paper

12:30pm-2:00pm

Lunch
Erb Memorial Union (EMU) Ballroom, 2nd Floor
Lunch

2:00pm-3:30pm

Parallel and Distributed Algorithms
Straub Hall, Room 156
Kamesh Madduri
A Communication-Efficient Causal Broadcast Protocol
NumLock: Towards Optimal Multi-Granularity Locking in Hierarchies
IS-ASGD: Accelerating Asynchronous SGD using Importance Sampling
Paper

Performance of Graph Algorithms
Straub Hall, Room 145
Allen Malony
Parallelizing Pruning-based Graph Structural Clustering
An Empirical Comparison of k-Shortest Simple Path Algorithms on Multicores
C-Graph: A Highly Efficient Concurrent Graph Reachability Query Framework
Paper

Storage
Straub Hall, Room 245
Kevin Huck
Cross-Rack-Aware Updates in Erasure-Coded Data Centers
Duchy: Achieving Both SSD Durability and Controllable SMR Cleaning Overhead in Hybrid Storage Systems
Efficient SSD Caching by Avoiding Unnecessary Writes using Machine Learning
Paper

3:30pm-4:00pm

Break
Straub Hall Lobby
Break

4:00pm-5:30pm

Data Processing
Straub Hall, Room 145
Taisuke Boku
Dual-Paradigm Stream Processing
Index Shard Replication Strategies for Improving Resource Utilization in Large Scale Search Engines
FFS-VA: A Fast Filtering System for Large-scale Video Analytics
Paper

I/O and File Systems
Straub Hall, Room 245
Peter Pirkelbauer
Efficient Search for Free Blocks in the WAFL File System
A Write-efficient and Consistent Hashing Scheme for Non-Volatile Memory
Reference-distance Eviction and Prefetching for Cache Management in Spark
Paper

Matrix and Graph Algorithms
Straub Hall, Room 156
Jee Choi
Implementing Push-Pull Efficiently in GraphBLAS
UHCL-Darknet: An OpenCL-based Deep Neural Network Framework for Heterogeneous Multi-/Many-core Clusters
Optimization of the Spherical Harmonics Transform based Tree Traversals in the Helmholtz FMM Algorithm
Paper

Created 2018-8-10 9:31