IEEE Information Theory Society

A Fourier-Based Approach to Generalization and Optimization in Deep Learning

Submitted by admin on Wed, 10/23/2024 - 01:52

The success of deep neural networks stems from their ability to generalize well on real data; however, it has been observed that neural networks can easily overfit randomly generated labels. This observation raises the following question: why do gradient methods succeed in finding generalizable solutions for neural networks when solutions with poor generalization behavior also exist?

Sample Compression, Support Vectors, and Generalization in Deep Learning

Even though Deep Neural Networks (DNNs) are widely celebrated for their practical performance, they possess many intriguing properties related to depth that are difficult to explain both theoretically and intuitively. Understanding how weights in deep networks coordinate together across layers to form useful learners has proven challenging, in part because the repeated composition of nonlinearities has proved intractable. This paper presents a reparameterization of DNNs as a linear function of a feature map that is locally independent of the weights.

Learning-Based Coded Computation

Recent advances have shown the potential for coded computation to impart resilience against slowdowns and failures that occur in distributed computing systems. However, existing coded computation approaches are either unable to support non-linear computations, or can only support a limited subset of non-linear computations while requiring high resource overhead. In this work, we propose a learning-based coded computation framework to overcome the challenges of performing coded computation for general non-linear functions.

Solving Inverse Problems via Auto-Encoders

Compressed sensing (CS) is about recovering a structured signal from its under-determined linear measurements. Starting from sparsity, recovery methods have steadily moved towards more complex structures. Emerging machine learning tools such as generative functions that are based on neural networks are able to learn general complex structures from training data. This makes them potentially powerful tools for designing CS algorithms.

Harmless Interpolation of Noisy Data in Regression

A continuing mystery in understanding the empirical success of deep neural networks is their ability to achieve zero training error and generalize well, even when the training data is noisy and there are more parameters than data points. We investigate this overparameterized regime in linear regression, where all solutions that minimize training error interpolate the data, including noise.
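The claim that many different solutions interpolate the data when there are more parameters than data points can be checked on a toy problem. The sketch below is only an illustration under assumptions of our own, not the analysis from the paper: the 2-by-3 design matrix, the labels, the step size, and the function names are all hypothetical. It runs plain gradient descent from a zero initialization, which for least squares converges to the minimum-norm interpolating solution, and compares it with a second hand-picked interpolator.

```python
import math


def predict(X, w):
    """Linear predictions X @ w for a list-of-rows matrix X."""
    return [sum(x_ij * w_j for x_ij, w_j in zip(row, w)) for row in X]


def gradient_descent_from_zero(X, y, step=0.1, iters=2000):
    """Minimize 0.5 * ||X w - y||^2 by gradient descent starting at w = 0."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(iters):
        residual = [p - t for p, t in zip(predict(X, w), y)]
        # Gradient of the squared loss is X^T (X w - y).
        grad = [sum(X[i][j] * residual[i] for i in range(n)) for j in range(d)]
        w = [w_j - step * g_j for w_j, g_j in zip(w, grad)]
    return w


def norm(w):
    return math.sqrt(sum(v * v for v in w))


# Hypothetical toy data: 2 samples, 3 parameters (overparameterized).
X = [[1.0, 0.0, 1.0],
     [0.0, 1.0, 1.0]]
y = [1.0, 2.0]

w_gd = gradient_descent_from_zero(X, y)   # approaches the minimum-norm interpolator
w_other = [1.0, 2.0, 0.0]                 # another exact interpolator, chosen by hand

print("gradient descent solution:", [round(v, 6) for v in w_gd])
print("norms:", round(norm(w_gd), 6), "vs", round(norm(w_other), 6))
```

Both weight vectors fit the two training labels exactly, yet they differ in norm, which is why the choice among interpolating solutions matters in this regime.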

Qsparse-Local-SGD: Distributed SGD With Quantization, Sparsification, and Local Computations

The communication bottleneck has been identified as a significant issue in distributed optimization of large-scale learning models. Recently, several approaches to mitigate this problem have been proposed, including different forms of gradient compression, or computing local models and mixing them iteratively. In this paper, we propose the Qsparse-local-SGD algorithm, which combines aggressive sparsification with quantization and local computation, along with error compensation that keeps track of the difference between the true and compressed gradients.
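The error-compensation idea, keeping the difference between the true and compressed gradients and adding it back on the next round, can be sketched in a few lines. This is a minimal illustration under assumptions of our own rather than the algorithm from the paper: the top-k sparsifier, the sign-based quantizer that scales by the mean magnitude, and all function names are hypothetical choices, and the local-computation steps are omitted.

```python
def top_k_sparsify(vec, k):
    """Keep the k largest-magnitude entries of vec and zero out the rest."""
    keep = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)[:k]
    out = [0.0] * len(vec)
    for i in keep:
        out[i] = vec[i]
    return out


def sign_quantize(vec):
    """Replace each nonzero entry by its sign times the mean nonzero magnitude."""
    magnitudes = [abs(v) for v in vec if v != 0.0]
    if not magnitudes:
        return list(vec)
    scale = sum(magnitudes) / len(magnitudes)
    return [0.0 if v == 0.0 else (scale if v > 0 else -scale) for v in vec]


def compress_with_error_feedback(grad, memory, k):
    """Compress (grad + memory) and return the compressed vector and new memory.

    The memory stores whatever the compressor dropped, so that the lost
    information is re-injected into the next round instead of being discarded.
    """
    corrected = [g + m for g, m in zip(grad, memory)]
    compressed = sign_quantize(top_k_sparsify(corrected, k))
    new_memory = [c - q for c, q in zip(corrected, compressed)]
    return compressed, new_memory


# Hypothetical gradient and an empty error memory for the first round.
grad = [0.5, -2.0, 0.1, 1.0]
memory = [0.0, 0.0, 0.0, 0.0]
compressed, memory = compress_with_error_feedback(grad, memory, k=2)
print("sent over the network:", compressed)
print("kept locally as error memory:", memory)
```

By construction, the compressed vector plus the new memory equals the error-corrected gradient, so nothing is permanently lost; only the transmission of the residual is delayed.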

On Distributed Quantization for Classification

We consider the problem of distributed feature quantization, where the goal is to enable a pretrained classifier at a central node to carry out its classification on features that are gathered from distributed nodes through communication constrained channels. We propose the design of distributed quantization schemes specifically tailored to the classification task: unlike quantization schemes that help the central node reconstruct the original signal as accurately as possible, our focus is not reconstruction accuracy, but instead correct classification.

Inference With Deep Generative Priors in High Dimensions

Deep generative priors offer powerful models for complex-structured data, such as images, audio, and text. Using these priors in inverse problems typically requires estimating the input and/or hidden signals in a multi-layer deep neural network from observation of its output. While these approaches have been successful in practice, rigorous performance analysis is complicated by the non-convex nature of the underlying optimization problems.

Deepcode: Feedback Codes via Deep Learning

The design of codes for communicating reliably over a statistically well-defined channel is an important endeavor involving deep mathematical research and wide-ranging practical applications. In this work, we present the first family of codes obtained via deep learning, which significantly outperforms state-of-the-art codes designed over several decades of research.

DeepJSCC-f: Deep Joint Source-Channel Coding of Images With Feedback

We consider wireless transmission of images in the presence of channel output feedback. From a Shannon-theoretic perspective, feedback does not improve the asymptotic end-to-end performance, and separate source coding followed by capacity-achieving channel coding, which ignores the feedback signal, achieves the optimal performance.
