# 2020 IEEE/ACM 10th Workshop on Irregular Applications: **Architectures and Algorithms** (IA3 2020)

**Virtual Conference 11 November 2020** 



IEEE Catalog Number: CFP20A47-POD **ISBN:** 

978-1-6654-4656-3

## Copyright © 2020 by the Institute of Electrical and Electronics Engineers, Inc. All Rights Reserved

*Copyright and Reprint Permissions*: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy beyond the limit of U.S. copyright law for private use of patrons those articles in this volume that carry a code at the bottom of the first page, provided the per-copy fee indicated in the code is paid through Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923.

For other copying, reprint or republication permission, write to IEEE Copyrights Manager, IEEE Service Center, 445 Hoes Lane, Piscataway, NJ 08854. All rights reserved.

### \*\*\* This is a print representation of what appears in the IEEE Digital Library. Some format issues inherent in the e-media version may also appear in this print version.

| IEEE Catalog Number:    | CFP20A47-POD      |
|-------------------------|-------------------|
| ISBN (Print-On-Demand): | 978-1-6654-4656-3 |
| ISBN (Online):          | 978-1-6654-1557-6 |

#### Additional Copies of This Publication Are Available From:

Curran Associates, Inc 57 Morehouse Lane Red Hook, NY 12571 USA Phone: (845) 758-0400 Fax: (845) 758-2633 E-mail: curran@proceedings.com Web: www.proceedings.com



### 2020 IEEE/ACM 10th Workshop on Irregular Applications: Architectures and Algorithms (IA3) IA3 2020

### **Table of Contents**

| Message from the Workshop Chairs .v.                                          |
|-------------------------------------------------------------------------------|
| Workshop Organization vi                                                      |
| Keynote - Research Challenges in Compiler Technology for Sparse Tensors .viii |
| Keynote - Memory Performance Optimization ix                                  |

#### Session 1

| Accelerating Domain Propagation: an Efficient GPU-Parallel Algorithm over Sparse Matrices .1<br>Boro Sofranac (Berlin Institute of Technology, Germany and Zuse<br>Institute Berlin, Germany), Ambros Gleixner (HTW Berlin, Germany and<br>Zuse Institute Berlin, Germany), and Sebastian Pokutta (Berlin<br>Institute of Technology, Germany and Zuse Institute Berlin, Germany)                                                                      |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Parallelizing Irregular Computations for Molecular Docking .12<br>Leonardo Solis-Vasquez (Technical University of Darmstadt, Germany),<br>Diogo Santos-Martins (The Scripps Research Institute, USA), Andreas F.<br>Tillack (The Scripps Research Institute, USA), Andreas Koch (Technical<br>University of Darmstadt, Germany), Jérôme Eberhardt (The Scripps<br>Research Institute, USA), and Stefano Forli (The Scripps Research<br>Institute, USA) |
| Reducing Queuing Impact in Irregular Data Streaming Applications .22<br>Stephen W. Timcheck (Washington University in St. Louis, USA) and<br>Jeremy D. Buhler (Washington University in St. Louis, USA)                                                                                                                                                                                                                                                |
| Supporting Irregularity in Throughput-Oriented Computing by SIMT-SIMD Integration .31<br>Daniel Thuerck (NEC Laboratories Europe, Heidelberg, Germany)                                                                                                                                                                                                                                                                                                 |

### Session 2

DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs .36..... Da Zheng (Amazon, USA), Chao Ma (Amazon, China), Minjie Wang (Amazon, China), Jinjing Zhou (Amazon, China), Qidong Su (Amazon, China), Xiang Song (Amazon, China), Quan Gan (Amazon, China), Zheng Zhang (Amazon, China), and George Karypis (Amazon, USA)

- Labeled Triangle Indexing for Efficiency Gains in Distributed Interactive Subgraph Search .45...... Tahsin Reza (Lawrence Livermore National Laboratory), Matei Ripeanu (University of British Columbia), Geoffrey Sanders (Lawrence Livermore National Laboratory), and Roger Pearce (Lawrence Livermore National Laboratory)
- Distributed Memory Graph Coloring Algorithms for Multiple GPUs .54..... Ian Bogle (Rensselaer Polytechnic Institute), Erik Boman (Sandia National Laboratories), Karen Devine (Sandia National Laboratories), Sivasankaran Rajamanickam (Sandia National Laboratories), and George Slota (Rensselaer Polytechnic Institute)
- Performance Evaluation of the Vectorizable Binary Search Algorithms on an FPGA Platform .63..... *Zheming Jin (ANL) and Hal Finkel (ANL)*

Author Index 69