Flynn’s reconciliation: Automating the register cache idiom for cross-accelerator programming 01.05.2021 Paper Daniel Thuerck, Nicolas Weber and Roberto Bifulco ACM Transactions on Architecture and Code Optimization (TACO)
SOL: Effortless Device Support for AI Frameworks without Source Code Changes 01.05.2020 Paper Nicolas Weber and Felipe Huici High Performance Machine Learning (HPML) ’20
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism 01.05.2018 Paper Nicolas Weber, Florian Schmidt, Mathias Niepert and Felipe Huici International Workshop on Embedded and Mobile Deep Learning
Detail-Preserving Pooling in Deep Networks 01.05.2018 Paper Faraz Saeedan, Nicolas Weber, Michael Goesele and Stefan Roth ArXiv, Source Code
MATOG: Array Access Auto-Tuning 01.01.2017 Paper Nicolas Weber and Michael Goesele ACM Transactions on Architecture and Code Optimization (TACO), Source Code
Prospect for Knowledge in Survey Data: An Artificial Neural Network Sensitivity Analysis 01.01.2017 Paper Patrick Weber, Nicolas Weber, Michael Goesele and Rüdiger Kabst
Adaptive GPU Array Layout Auto-Tuning 01.01.2016 Paper Nicolas Weber and Michael Goesele Software Engineering Methods for Parallel and High Performance Applications (SEM4HPC), Source Code