Library

  • 1
    Publication Date: 2023-12-14
    Description: Deep learning has received much attention lately due to the impressive empirical performance achieved by training algorithms. Consequently, a need for a better theoretical understanding of these problems has become more evident and multiple works in recent years have focused on this task. In this work, using a unified framework, we show that there exists a polyhedron that simultaneously encodes, in its facial structure, all possible deep neural network training problems that can arise from a given architecture, activation functions, loss function, and sample size. Notably, the size of the polyhedral representation depends only linearly on the sample size, and a better dependency on several other network parameters is unlikely. Using this general result, we compute the size of the polyhedral encoding for commonly used neural network architectures. Our results provide a new perspective on training problems through the lens of polyhedral theory and reveal strong structure arising from these problems.
    Language: English
    Type: Article