This survey is intended to cover research on compilers for AI domain. Some of these compilers are widely known as NPU/TPU compilers.
It's built not by a professional, just for studying.
Of course, this survey is not the first one in this domain. Here is the quickly found list of similar GitHub surveys. You're better to see them first:
Several closely related adjacent areas are not covered here:
- AI for compilers, see awesome-machine-learning-in-compilers
- AI chips, see AI-Chip
- AI for software engineering, see Allamanis survey and former src-d survey
- Capra M. et al. Hardware and software optimizations for accelerating deep neural networks: Survey of current trends, challenges, and the road ahead (IEEE Access. 2020)
- Li et al. The Deep Learning Compiler: A Comprehensive Survey (Beihang & Tsinghua Universities. arXiv, 2020)
To the best of our knowledge, this is the first paper that provides a comprehensive survey on the design of DL compiler.
We pay special attention to papers devoted to products of Samsung company.
- Song J. et al. An 11.5 TOPS/W 1024-MAC butterfly structure dual-core sparsity-aware neural processing unit in 8nm flagship mobile SoC (Samsung Electronics, Hwaseong & SAIT. ISSCC, 2019)
- Kim Y.D. et al. A 7nm High-Performance and Energy-Efficient Mobile Application Processor with Tri-Cluster CPUs and a Sparsity-Aware NPU (Samsung Electronics, Hwaseong. ISSCC, 2020)
- Chae D., Kapoor P. Centralized Generic Interfaces in Hardware/Software Co-design for AI Accelerators (Samsung Research. ICSEW, 2020)
- Hwang J. et al. Low Power SOC Based on High Density MIM Capacitor for beyond Moore Era by Robust Power Integrity Achievement (Samsung S.LSI. ECTC, 2020)
- Lee H. et al. Extracting power supply current profile by using interposer-based low-noise probing technique for PDN design of high-density POP (Samsung S.LSI. ECTC, 2020)
- Hwang S. et al. Effective Substrate Thermal Conductivity Modeling Method Extracted from Detailed Pattern for Premium SOC Packages (Samsung Electronics, Hwaseong. ITherm, 2020)
- Ignatov A. et al. AI benchmark: All about deep learning on smartphones in 2019 (ETHZ, Google, Samsung, Huawei, Qualcomm, MediaTek, Unisoc. ICCVW, 2019)
we evaluate the performance and compare the results of all chipsets from Qualcomm, HiSilicon, Samsung, MediaTek and Unisoc that are providing hardware acceleration for AI inference
- Lee M.Y. et al. (in Korean) Trends in AI Processor Technology (ETRI. Electronics and Telecommunications Trends, 2020)
- compares Apple, Huawei, Samsung, Qualcomm, MediaTek, ARM, CEVA, Gyrfalcon & Hailo processors
- Das S. et al. A Systolic Dataflow Based Accelerator for CNNs (Samsung. ISCAS, 2020)
- Bhattacharya S. et al. Countering Acoustic Adversarial Attacks in Microphone-equipped Smart Home Devices (Samsung AI Center & Cambridge. IMWUT, 2020)
- cites a Song et al paper describing Samsung NPU device
- Jung J., Park J., Kumar A. A Verification Framework of Neural Processing Unit for Super Resolution (Samsung Research. MTV, 2019)
- Oh J. et al. Weight Equalizing Shift Scaler-Coupled Post-training Quantization (Samsung Research. arXiv, 2020)
- AI Benchmark - benchmark for DNN tasks on Mobile phones