Flynn’s reconciliation: Automating the register cache idiom for cross-accelerator programming 01.05.2021 Paper Daniel Thuerck, Nicolas Weber and Roberto Bifulco ACM Transactions on Architecture and Code Optimization (TACO)
The YouTube player can not be loaded with disabled JavaScript. The following video is embedded here: https://youtube.com/watch?v=PrlUvsPra8o SOL: Transparent Neural Network Acceleration on NEC SX-Aurora TSUBASA 01.09.2020 Talk Nicolas Weber Slides
SOL: Effortless Device Support for AI Frameworks without Source Code Changes 01.05.2020 Paper Nicolas Weber and Felipe Huici High Performance Machine Learning (HPML)‘20
SOL4VE: Bringing Deep Neural Networks to the NEC SX-Aurora TSUBASA 01.01.2020 Talk Nicolas Weber NEC User Group Meeting
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism 01.05.2018 Paper Nicolas Weber, Florian Schmidt, Mathias Niepert and Felipe Huici
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism 01.05.2018 Paper Nicolas Weber, Florian Schmidt, Mathias Niepert and Felipe Huici International Workshop on Embedded and Mobile Deep Learning
Detail-Preserving Pooling in Deep Networks 01.05.2018 Paper Faraz Saeedan, Nicolas Weber, Michael Goesele and Stefan Roth ArXiv, Source Code