
First, a network was trained with Gaussian noise injection (Sec. 5.3) and subsequently tested using the delta network GRU formulation given in Sec. 3. A second network was trained …
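To make the delta-network idea concrete, here is a minimal sketch of a thresholded matrix-vector update in the spirit of the delta GRU formulation; the class and variable names are hypothetical, not the paper's code. Training with Gaussian noise injection, as described above, is what makes the network tolerant to the small errors this thresholded skipping introduces.

```python
# Minimal sketch of a delta-network matrix-vector update (hypothetical names).
import numpy as np

class DeltaMV:
    """Accumulates W @ x incrementally, skipping input components whose
    change since the last transmitted value is below a threshold."""
    def __init__(self, W, theta=0.05):
        self.W = W
        self.theta = theta
        self.x_last = np.zeros(W.shape[1])   # last transmitted input values
        self.acc = np.zeros(W.shape[0])      # running pre-activation estimate

    def step(self, x):
        delta = x - self.x_last
        active = np.abs(delta) > self.theta  # only these columns are computed
        self.acc += self.W[:, active] @ delta[active]
        self.x_last[active] = x[active]      # update memory only where transmitted
        return self.acc
```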
This paper presents OptNet, a network architecture that integrates optimization problems (here, specifically in the form of quadratic programs) as individual layers in larger end-to-end train …
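The following is a minimal sketch of a differentiable QP layer, restricted to equality constraints so the KKT conditions reduce to a single linear solve; the actual OptNet layer also handles inequality constraints, so this is an illustration of the idea, not the paper's implementation.

```python
# A differentiable equality-constrained QP layer: gradients flow through
# the KKT linear solve via autograd.
import torch

def qp_layer(Q, p, A, b):
    """Solve min_x 0.5 x^T Q x + p^T x  s.t.  A x = b
    via the KKT system [[Q, A^T], [A, 0]] [x; nu] = [-p; b]."""
    n, m = Q.shape[0], A.shape[0]
    K = torch.zeros(n + m, n + m, dtype=Q.dtype)
    K[:n, :n] = Q
    K[:n, n:] = A.T
    K[n:, :n] = A
    rhs = torch.cat([-p, b])
    sol = torch.linalg.solve(K, rhs)
    return sol[:n]  # optimal x*; sol[n:] holds the dual variables

# Usage: the QP acts as a layer inside a larger differentiable model.
Q = torch.eye(3, requires_grad=True)
p = torch.randn(3, requires_grad=True)
A = torch.ones(1, 3)
b = torch.tensor([1.0])
x_star = qp_layer(Q, p, A, b)
x_star.sum().backward()  # gradients reach Q and p through the solve
```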
Shallow-Deep Networks: Understanding and Mitigating Network Overthinking
For prediction transparency, we propose the Shallow-Deep Network (SDN), a generic modification to off-the-shelf DNNs that introduces internal classifiers. We apply SDN to four modern …
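A minimal sketch of the SDN idea, attaching internal classifiers to the stages of an off-the-shelf backbone; the head architecture and names here are illustrative assumptions, not the paper's code.

```python
# SDN-style wrapper: one internal classifier per backbone stage.
import torch
import torch.nn as nn

class SDNWrapper(nn.Module):
    def __init__(self, blocks, feat_dims, num_classes):
        super().__init__()
        self.blocks = nn.ModuleList(blocks)           # stages of the backbone
        self.exits = nn.ModuleList(                   # internal classifiers
            nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                          nn.Linear(d, num_classes))
            for d in feat_dims)

    def forward(self, x):
        logits = []
        for block, exit_head in zip(self.blocks, self.exits):
            x = block(x)
            logits.append(exit_head(x))               # prediction at this depth
        return logits                                 # internal + final outputs
```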
Adaptive Smoothing Gradient Learning for Spiking Neural Networks
Here, we propose a methodology in which training a prototype neural network gradually evolves into training an SNN, by fusing a learnable relaxation degree into the network with …
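As one way to picture a learnable relaxation degree, here is a minimal sketch of a spiking activation whose smoothness is a trainable parameter, interpolating between a soft sigmoid and a hard step; the exact parametrization and schedule in the paper may differ.

```python
# Spiking activation with a learnable relaxation (temperature) parameter.
import torch
import torch.nn as nn

class RelaxedSpike(nn.Module):
    def __init__(self, init_temp=2.0):
        super().__init__()
        # Learnable relaxation degree: large -> step-like, small -> smooth.
        self.temp = nn.Parameter(torch.tensor(init_temp))

    def forward(self, membrane_potential):
        # Smooth surrogate of the Heaviside spike; as self.temp grows
        # during training, the unit approaches a binary spike.
        return torch.sigmoid(self.temp * membrane_potential)
```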
One of the main appeals of neural network-based models is that a single model architecture can often be used to solve a variety of related tasks. However, many recent advances are based …
Analogously, to mitigate network overthinking, we propose two SDN-based heuristics: the confidence-based early exits (Section 5.1) and network confusion analysis (Section 5.2).
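A minimal sketch of the confidence-based early exit heuristic: inference stops at the first internal classifier whose softmax confidence clears a threshold. It assumes an SDN-style model returning a list of logits (as in the sketch above) and a single input (batch size 1).

```python
# Confidence-based early exit over a list of internal-classifier logits.
import torch

@torch.no_grad()
def early_exit_predict(model, x, threshold=0.9):
    for depth, logits in enumerate(model(x)):
        probs = torch.softmax(logits, dim=-1)
        conf, pred = probs.max(dim=-1)
        if conf.item() >= threshold:      # confident enough: exit here
            return pred, depth
    return pred, depth                    # fall through to the final classifier
```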
We proved that any target network with width n, depth L and inputs in Rd can be approximated by a network with width O(d), where the number of parameters increases by only a factor of L …
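Schematically, reading only what the snippet states (the multiplicative factor is truncated above, so it is written abstractly as C(L)):

```latex
% Schematic restatement of the quoted approximation result; f is the target
% network, \tilde{f} the approximating one, and C(L) stands in for the
% truncated factor "L ..." in the text above.
f \colon \mathbb{R}^d \to \mathbb{R}, \quad \mathrm{width}(f) = n,\ \mathrm{depth}(f) = L
\;\Longrightarrow\;
\exists\, \tilde{f}\ \text{with}\ \mathrm{width}(\tilde{f}) = O(d), \quad
\#\mathrm{params}(\tilde{f}) \le C(L)\cdot \#\mathrm{params}(f).
```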
Network Morphism
We present a systematic study on how to morph a well-trained neural network to a new one so that its network function can be completely preserved. We define this as network morphism in …
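A minimal sketch of one function-preserving morphing operation (widening a hidden layer by splitting a neuron), in the spirit of network morphism; the paper defines a broader family of such operators, so this is only one illustrative instance.

```python
# Function-preserving widening: duplicate a hidden unit and halve the
# outgoing weights of the two copies, leaving the network function unchanged.
import numpy as np

def widen(W_in, W_out, idx):
    W_in_new = np.vstack([W_in, W_in[idx:idx + 1]])  # new row: same inputs
    out_col = W_out[:, idx:idx + 1] / 2.0
    W_out_new = np.hstack([W_out, out_col])          # append duplicated column
    W_out_new[:, idx:idx + 1] = out_col              # halve the original column
    return W_in_new, W_out_new

# Sanity check: y = W_out @ relu(W_in @ x) is preserved after widening.
rng = np.random.default_rng(0)
W_in, W_out, x = rng.normal(size=(4, 3)), rng.normal(size=(2, 4)), rng.normal(size=3)
y0 = W_out @ np.maximum(W_in @ x, 0)
W_in2, W_out2 = widen(W_in, W_out, idx=1)
y1 = W_out2 @ np.maximum(W_in2 @ x, 0)
assert np.allclose(y0, y1)
```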
For a single network (depth 4, width 16), Figure 7 indicates that this distribution does not significantly change during training, although there appears to be a slight skew towards larger …
Our empirical results show that IPM-MPNNs can lead to reduced solving times compared to a state-of-the-art LP solver with time constraints and competing neural network-based …
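The following is a minimal sketch of one message-passing round on the constraint-variable bipartite graph of an LP (min c^T x s.t. A x <= b), loosely in the spirit of an IPM-MPNN; the actual architecture, features, and update rules differ, so treat this as an illustration of the graph structure only.

```python
# One round of message passing on the LP's constraint-variable bipartite graph.
import torch
import torch.nn as nn

class LPBipartiteMP(nn.Module):
    def __init__(self, hidden=32):
        super().__init__()
        self.var_init = nn.Linear(1, hidden)   # embed objective coefficients c_j
        self.con_init = nn.Linear(1, hidden)   # embed right-hand sides b_i
        self.v2c = nn.Linear(hidden, hidden)   # variable -> constraint messages
        self.c2v = nn.Linear(hidden, hidden)   # constraint -> variable messages

    def forward(self, A, b, c):
        v = torch.relu(self.var_init(c.unsqueeze(-1)))  # (n_vars, hidden)
        u = torch.relu(self.con_init(b.unsqueeze(-1)))  # (n_cons, hidden)
        u = torch.relu(u + self.v2c(A @ v))    # aggregate, weighted by A_ij
        v = torch.relu(v + self.c2v(A.T @ u))  # pass back to variables
        return v                               # per-variable embeddings
```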