N-ReLU: Zero-Mean Stochastic Extension of ReLU
Positive · Artificial Intelligence
N-ReLU is a newly introduced activation function that addresses the dead-neuron problem of standard ReLU. Instead of zeroing out negative activations, it replaces them with zero-mean Gaussian noise, which preserves the expected output while allowing gradients to flow in otherwise inactive regions. Experiments on the MNIST dataset with both MLP and CNN architectures show accuracy that matches or slightly exceeds LeakyReLU and GELU at moderate noise levels, along with stable convergence and no dead neurons. Because N-ReLU requires no changes to network structure and adds no parameters, it serves as a lightweight mechanism for making optimization more robust in deep learning models.
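The paper's exact formulation is not given in this summary, but a minimal PyTorch-style sketch of the idea, assuming the noise is applied only during training and that the noise scale sigma is a tunable hyperparameter (both assumptions, not details confirmed by the source), might look like this:

```python
import torch
import torch.nn as nn

class NReLU(nn.Module):
    """Sketch of N-ReLU: positive inputs pass through unchanged; negative
    inputs are replaced with zero-mean Gaussian noise. The sigma value and
    the train/eval behaviour here are illustrative assumptions."""

    def __init__(self, sigma: float = 0.1):
        super().__init__()
        self.sigma = sigma

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.training:
            # Zero-mean Gaussian noise: its expectation matches ReLU's zero
            # output in the negative region, so the expected activation is preserved.
            noise = torch.randn_like(x) * self.sigma
            return torch.where(x > 0, x, noise)
        # Assumed inference behaviour: fall back to standard ReLU,
        # since the noise has zero mean.
        return torch.relu(x)
```

Under these assumptions, trying N-ReLU amounts to swapping `nn.ReLU()` for `NReLU(sigma=0.1)` in an existing MLP or CNN; the module introduces no learnable parameters, consistent with the summary's claim that no architectural changes are needed.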
— via World Pulse Now AI Editorial System
