This record comes from PubMed

On energy complexity of fully-connected layers

Šíma, Jiří (Author): Institute of Computer Science of the Czech Academy of Sciences, Pod Vodárenskou věží 271/2, Prague 8, 182 00, Czechia. Electronic address: sima@cs.cas.cz
Cabessa, Jérémie (Author): DAVID Laboratory, University of Versailles Saint-Quentin (UVSQ), University Paris-Saclay, 45 avenue des États-Unis, Versailles, 78035, France. Electronic address: jeremie.cabessa@uvsq.fr
Vidnerová, Petra (Author): Institute of Computer Science of the Czech Academy of Sciences, Pod Vodárenskou věží 271/2, Prague 8, 182 00, Czechia. Electronic address: petra@cs.cas.cz

Neural Networks. 2024 Oct; 178: 106419. [Epub 2024-05-31]

Source Neural Netw
ISSN 1879-2782 | 0893-6080

Language English
Country United States
Media print-electronic

Document type Journal Article

Persistent link https://www.medvik.cz/link/pmid38861836

PubMed 38861836
DOI 10.1016/j.neunet.2024.106419
PII: S0893-6080(24)00343-5

Keywords
Convolutional neural networks, Dataflow, Deep neural networks, Energy complexity, Energy consumption, Fully-connected layer
MeSH
Algorithms
Deep Learning
Neural Networks, Computer *
Computers
Programming, Linear
Publication type
Journal Article

The massive increase in the size of deep neural networks (DNNs) is accompanied by a significant increase in the energy consumption of their hardware implementations, which is critical for their widespread deployment in low-power mobile devices. In our previous work, we proposed and experimentally validated an abstract, hardware-independent model of energy complexity for convolutional neural networks (CNNs). Based on this model, we provide a theoretical analysis of the energy complexity of computing a fully-connected layer when its inputs, outputs, and weights are transferred between two kinds of memory (DRAM and Buffer). First, we establish a general lower bound on this energy complexity. Then, we present two dataflows and calculate their energy costs to obtain the corresponding upper bounds. In the case of a partitioned Buffer, we use the weak duality theorem from linear programming to prove that the lower and upper bounds coincide up to an additive constant, which establishes the optimal energy complexity. Finally, the asymptotically optimal quadratic energy complexity of fully-connected layers is experimentally validated by estimating their energy consumption on the Simba and Eyeriss hardware.
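
To make the idea of counting DRAM-to-Buffer transfers for a fully-connected layer concrete, here is a minimal Python sketch. It is not the energy model or either of the dataflows from the paper, which this record does not specify. It assumes a hypothetical unit cost per word moved between DRAM and Buffer, a single input vector, and a simple tiling that keeps a block of output accumulators in the Buffer. The names `fc_layer_dram_traffic`, `trivial_lower_bound`, and `buffer_words` are illustrative only.

```python
def fc_layer_dram_traffic(weights, inputs, buffer_words):
    """Compute y = W x tile by tile and count DRAM<->Buffer word transfers.

    Assumed cost model (hypothetical, not the paper's): each word read from
    or written to DRAM costs one unit. The Buffer holds `buffer_words` words:
    one input, one weight, and the remaining words as output accumulators.
    """
    n_out = len(weights)
    n_in = len(inputs)
    tile = buffer_words - 2  # accumulators that fit next to one input and one weight
    if tile < 1:
        raise ValueError("buffer_words must be at least 3")

    reads = 0
    writes = 0
    outputs = [0.0] * n_out
    for start in range(0, n_out, tile):
        stop = min(start + tile, n_out)
        acc = [0.0] * (stop - start)  # partial sums stay in the Buffer
        for j in range(n_in):
            x = inputs[j]
            reads += 1  # each input is re-read once per output tile
            for i in range(start, stop):
                w = weights[i][j]
                reads += 1  # each weight is read exactly once overall
                acc[i - start] += w * x
        for i in range(start, stop):
            outputs[i] = acc[i - start]
            writes += 1  # each output is written back once
    return outputs, reads + writes


def trivial_lower_bound(n_in, n_out):
    """Every weight, input, and output must cross the DRAM boundary at least once."""
    return n_in * n_out + n_in + n_out
```

Under these assumptions the total is `n_in * n_out + n_in * ceil(n_out / tile) + n_out` transfers. The weight term `n_in * n_out` dominates, which is in line with the quadratic complexity stated in the abstract, although the paper's actual bounds may differ in their constants and lower-order terms.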

