Search Results for author: Jörg Henkel

Found 19 papers, 6 papers with code

A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends

no code implementations23 Feb 2024 Abolfazl Younesi, Mohsen Ansari, Mohammadamin Fazli, Alireza Ejlali, Muhammad Shafique, Jörg Henkel

In today's digital age, Convolutional Neural Networks (CNNs), a subset of Deep Learning (DL), are widely used for various computer vision tasks such as image classification, object detection, and image segmentation.

6D Vision Image Classification +7

TransAxx: Efficient Transformers with Approximate Computing

no code implementations12 Feb 2024 Dimitrios Danopoulos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel

Vision Transformer (ViT) models which were recently introduced by the transformer architecture have shown to be very competitive and often become a popular alternative to Convolutional Neural Networks (CNNs).

Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization

no code implementations23 Dec 2023 Konstantinos Balaskas, Andreas Karatzas, Christos Sad, Kostas Siozios, Iraklis Anagnostopoulos, Georgios Zervakis, Jörg Henkel

We explore, for the first time, per-layer fine- and coarse-grained pruning, in the same DNN architecture, in addition to low bit-width mixed-precision quantization for weights and activations.

Quantization Reinforcement Learning (RL)

Federated Learning for Computationally-Constrained Heterogeneous Devices: A Survey

no code implementations18 Jul 2023 Kilian Pfeiffer, Martin Rapp, Ramin Khalili, Jörg Henkel

With an increasing number of smart devices like internet of things (IoT) devices deployed in the field, offloadingtraining of neural networks (NNs) to a central server becomes more and more infeasible.

Federated Learning Privacy Preserving

Model-to-Circuit Cross-Approximation For Printed Machine Learning Classifiers

1 code implementation14 Mar 2023 Giorgos Armeniakos, Georgios Zervakis, Dimitrios Soudris, Mehdi B. Tahoori, Jörg Henkel

Printed electronics (PE) promises on-demand fabrication, low non-recurring engineering costs, and sub-cent fabrication costs.

Co-Design of Approximate Multilayer Perceptron for Ultra-Resource Constrained Printed Circuits

1 code implementation28 Feb 2023 Giorgos Armeniakos, Georgios Zervakis, Dimitrios Soudris, Mehdi B. Tahoori, Jörg Henkel

Printed Electronics (PE) exhibits on-demand, extremely low-cost hardware due to its additive manufacturing process, enabling machine learning (ML) applications for domains that feature ultra-low cost, conformity, and non-toxicity requirements that silicon-based systems cannot deliver.

Approximate Computing and the Efficient Machine Learning Expedition

no code implementations2 Oct 2022 Jörg Henkel, Hai Li, Anand Raghunathan, Mehdi B. Tahoori, Swagath Venkataramani, Xiaoxuan Yang, Georgios Zervakis

In this work, we enlighten the synergistic nature of AxC and ML and elucidate the impact of AxC in designing efficient ML systems.

Descriptive

Energy-efficient DNN Inference on Approximate Accelerators Through Formal Property Exploration

no code implementations25 Jul 2022 Ourania Spantidi, Georgios Zervakis, Iraklis Anagnostopoulos, Jörg Henkel

Deep Neural Networks (DNNs) are being heavily utilized in modern applications and are putting energy-constraint devices to the test.

Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey

no code implementations16 Mar 2022 Giorgos Armeniakos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel

To enable efficient execution of DNN inference, more and more research works, therefore, exploit the inherent error resilience of DNNs and employ Approximate Computing (AC) principles to address the elevated energy demands of DNN accelerators.

Cross-Layer Approximation For Printed Machine Learning Circuits

1 code implementation11 Mar 2022 Giorgos Armeniakos, Georgios Zervakis, Dimitrios Soudris, Mehdi B. Tahoori, Jörg Henkel

In our extensive experimental evaluation we consider 14 MLPs and SVMs and evaluate more than 4300 approximate and exact designs.

BIG-bench Machine Learning

CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization

1 code implementation10 Mar 2022 Kilian Pfeiffer, Martin Rapp, Ramin Khalili, Jörg Henkel

To adapt to the devices' heterogeneous resources, CoCoFL freezes and quantizes selected layers, reducing communication, computation, and memory requirements, whereas other layers are still trained in full precision, enabling to reach a high accuracy.

Fairness Federated Learning +1

AdaPT: Fast Emulation of Approximate DNN Accelerators in PyTorch

1 code implementation8 Mar 2022 Dimitrios Danopoulos, Georgios Zervakis, Kostas Siozios, Dimitrios Soudris, Jörg Henkel

Current state-of-the-art employs approximate multipliers to address the highly increased power demands of DNN accelerators.

DISTREAL: Distributed Resource-Aware Learning in Heterogeneous Systems

no code implementations16 Dec 2021 Martin Rapp, Ramin Khalili, Kilian Pfeiffer, Jörg Henkel

We study the problem of distributed training of neural networks (NNs) on devices with heterogeneous, limited, and time-varying availability of computational resources.

Federated Learning

Positive/Negative Approximate Multipliers for DNN Accelerators

no code implementations20 Jul 2021 Ourania Spantidi, Georgios Zervakis, Iraklis Anagnostopoulos, Hussam Amrouch, Jörg Henkel

In addition, we propose a filter-oriented approximation method to map the weights to the appropriate modes of the approximate multiplier.

Control Variate Approximation for DNN Accelerators

no code implementations18 Feb 2021 Georgios Zervakis, Ourania Spantidi, Iraklis Anagnostopoulos, Hussam Amrouch, Jörg Henkel

In this work, we introduce a control variate approximation technique for low error approximate Deep Neural Network (DNN) accelerators.

Distributed Learning on Heterogeneous Resource-Constrained Devices

no code implementations9 Jun 2020 Martin Rapp, Ramin Khalili, Jörg Henkel

We consider a distributed system, consisting of a heterogeneous set of devices, ranging from low-end to high-end.

Federated Learning Reinforcement Learning (RL)

Hardware Trojan Detection Using Controlled Circuit Aging

no code implementations6 Apr 2020 Virinchi Roy Surabhi, Prashanth Krishnamurthy, Hussam Amrouch, Kanad Basu, Jörg Henkel, Ramesh Karri, Farshad Khorrami

Combining IC aging with over-clocking produces a pattern of bit errors at the IC output by the induced timing violations.

Cannot find the paper you are looking for? You can Submit a new open access paper.