Breaking the Cascade: Compact Nonlinear Optical Computing with Single-Layer Encoder-Decoder Co-Localization

TL;DR

Single-layer phase encoder-decoder system achieves nonlinear function approximation via interference, with error below 10^-8, no nonlinear materials needed.

physics.optics 🔴 Advanced 2026-05-31 61 views

Yuntian Wang Alexander Chen Md Sadman Sakib Rahman Aydogan Ozcan

AI Reader Arxiv Page Download PDF

optical computing nonlinear function approximation phase modulation diffractive optics neural network simulation

Key Findings

Methodology

This work introduces a co-localized phase-only diffractive architecture integrating a dynamic input encoder with a static optimized decoder on a single surface. The input-dependent phase profile encodes the target nonlinear function's argument, while the static decoder phase pattern is optimized via differentiable training to produce the desired output. During free-space propagation, the superposition of the encoder and decoder fields interferes coherently, and the intensity measurement at the output plane captures the nonlinear mapping. The theoretical analysis demonstrates that, under suitable physical conditions—such as sufficient decoder degrees of freedom, detector aperture, and propagation distance—the system acts as a universal approximator for arbitrary real-valued band-limited nonlinear functions. Numerical simulations validate the approach across multiple activation functions (ReLU, Sigmoid, GELU) and complex-valued nonlinear functions, achieving errors below 10^-8. Experimentally, the authors implement this framework in a visible-light setup, training the system in situ to simultaneously approximate nine different nonlinear functions in a single pass, confirming its practical viability and high fidelity.

Key Results

The system accurately approximates nine diverse nonlinear functions with mean squared error below 10^-8, outperforming multi-layer optical schemes and significantly simplifying hardware complexity.
Incorporating a trained, frozen phase bias in the encoder enhances approximation accuracy by approximately two orders of magnitude, especially for complex activation functions like GELU, and improves robustness against phase quantization constraints.
Adjusting the detector aperture and propagation distance further reduces approximation errors to the order of 10^-9, aligning with theoretical predictions about physical limits and optimization strategies.

Significance

This work overcomes the fundamental challenge of implementing nonlinear mappings in purely linear optical systems, eliminating the need for nonlinear optical materials that are slow and energy-intensive. By collapsing the nonlinear function approximation into a single diffractive surface, the approach drastically reduces hardware complexity and alignment demands, making high-speed, low-power optical analog computing more feasible. Its universal approximation capability and robustness pave the way for scalable optical neural networks, real-time signal processing, and photonic AI hardware, potentially transforming the landscape of optical information processing and machine learning acceleration.

Technical Contribution

The paper introduces a novel single-layer co-localized phase-only diffractive architecture that leverages interference to realize nonlinear mappings without nonlinear materials. Theoretical analysis proves its universal approximation property for band-limited functions, with physical factors such as decoder degrees of freedom, detector aperture, and propagation distance dictating fidelity. The addition of a trained, frozen phase bias in the encoder significantly enhances expressivity and robustness, especially under hardware quantization constraints. The system's end-to-end training via differentiable digital twins enables precise phase optimization, demonstrating high-fidelity approximation of multiple nonlinear functions in hardware. This represents a fundamental shift from multi-layer or nonlinear-material-based optical computing, opening new engineering possibilities for ultrafast, scalable photonic processors.

Novelty

This study is the first to demonstrate that a single, co-localized phase-only diffractive surface can approximate arbitrary nonlinear functions through interference, eliminating the need for multilayer stacking or nonlinear optical materials. The introduction of a trained, frozen encoder bias further enhances the system's expressive power and robustness, especially under hardware quantization constraints. Unlike previous works relying on nonlinear materials or cascaded layers, this approach leverages physical interference and phase optimization to achieve complex nonlinear mappings within a compact, easy-to-align platform, representing a significant innovation in optical computing.

Limitations

The system's approximation accuracy diminishes when dealing with highly complex or high-frequency nonlinear functions, primarily due to phase quantization and optical noise, limiting its ultimate precision.
Fabrication and alignment of the diffractive surface require high precision, which can be challenging at large scales or in mass production, potentially affecting scalability.
Currently demonstrated in 2D planar configurations, extending the approach to 3D or dynamic reconfigurable systems remains an open challenge, restricting its adaptability to more complex or real-time applications.

Future Work

Future research will focus on integrating multi-layer or multi-scale structures to extend the approximation capacity, employing deep learning-based end-to-end training for adaptive phase profiles, and exploring dynamic phase modulation for real-time control. Additionally, efforts to improve fabrication tolerances, reduce optical noise impact, and extend the architecture to three-dimensional or reconfigurable platforms will be pursued. These advancements aim to broaden the applicability of this approach to more complex tasks such as real-time neural network inference, adaptive signal processing, and large-scale photonic AI systems.

AI Executive Summary

In the rapidly evolving landscape of information processing, the limitations of electronic computers—particularly in speed and energy efficiency—have spurred interest in optical computing as a promising alternative. Light's inherent parallelism and high bandwidth make it ideal for high-speed, low-power operations, but traditional optical systems face significant hurdles in implementing nonlinear functions essential for neural networks and complex computations.

Conventional approaches rely heavily on nonlinear optical materials, which are often slow, lossy, and difficult to integrate at scale. Multi-layer diffractive systems have been proposed to approximate functions, but their complexity and alignment challenges hinder practical deployment. Recognizing these limitations, the authors propose a radically simplified architecture: a single, co-localized phase-only diffractive surface that combines a dynamic input encoder with a static optimized decoder.

This innovative design leverages interference effects to realize nonlinear mappings without nonlinear materials. The input-dependent phase profile encodes the argument of the target function, while the static decoder phase pattern is trained to produce the desired output through end-to-end optimization. During free-space propagation, the superimposed fields interfere coherently, and the intensity measurement at the output captures the nonlinear relationship. Theoretical analysis confirms that, under suitable physical conditions, this system acts as a universal approximator for arbitrary real-valued band-limited nonlinear functions.

Numerical simulations demonstrate that the system can accurately approximate a variety of activation functions used in neural networks, such as ReLU, Sigmoid, and GELU, as well as complex-valued nonlinear functions, achieving errors below 10^-8. The authors further validate their approach experimentally using a visible-light setup, training the system in situ to simultaneously approximate nine different functions in a single pass. The results show remarkable fidelity, robustness to phase quantization, and simplified hardware requirements.

This work represents a significant breakthrough in optical computing, collapsing what traditionally required multiple layers and nonlinear materials into a single, easy-to-align surface. It opens new avenues for ultrafast, scalable, and energy-efficient analog computation, with broad implications for optical neural networks, real-time signal processing, and photonic AI hardware. While current limitations include handling extremely complex functions and fabrication challenges, ongoing research aims to extend the architecture's capacity and robustness, promising a transformative impact on the future of optical information processing.

Deep Dive

Abstract

We demonstrate that nonlinear computing can be achieved with a single linear diffractive surface under coherent illumination. We introduce a compact encoder-decoder co-localization (E+D) architecture in which an input-dependent dynamic encoder and a static optimized decoder are integrated within the same phase-only diffractive plane. Following free-space propagation, coherent interference between the encoder and decoder fields, combined with intensity detection, generates programmable nonlinear input-output mappings without requiring nonlinear optical materials or multiple diffractive layers. We prove that the proposed E+D optical processor is a universal approximator for arbitrary real-valued band-limited nonlinear functions and identify the physical factors governing its approximation fidelity, including the decoder degrees-of-freedom, detector aperture, and axial propagation distance. Crucially, we demonstrate that introducing a trained, frozen phase bias to the encoder region systematically enhances functional expressivity, providing robustness against coarse phase quantization on spatial light modulators. Using this framework, we accurately synthesize diverse nonlinear functions, including commonly used neural network activation functions and complex-valued nonlinear functions. Finally, we experimentally validate the proposed approach using a visible-light optical set-up trained through in situ learning, demonstrating the parallel approximation of 9 nonlinear functions in a single optical forward pass. By collapsing nonlinear optical computation into a single diffractive surface, the E+D architecture substantially reduces hardware and alignment complexity while preserving powerful function-approximation capabilities, providing a compact and scalable framework for analog information processing.

physics.optics cs.NE physics.app-ph

References (20)

Large-scale nonlinear optical computing with incoherent light via linear diffractive systems

Alexander Chen, Yuntian Wang, Md. Sadman Rahman et al.

2026 1 citations ⭐ Influential View Analysis →

Massively parallel and universal approximation of nonlinear functions using diffractive processors

Md. Sadman Rahman, Yuhang Li, Xilin Yang et al.

2025 11 citations ⭐ Influential View Analysis →

Nonlinear processing with linear optics

Mustafa Yildirim, Niyazi Ulaş Dinç, Ilker Oguz et al.

2023 119 citations View Analysis →

All‐Optical Phase Recovery: Diffractive Computing for Quantitative Phase Imaging

Deniz Mengu, A. Ozcan

2022 91 citations View Analysis →

Review of nonlinear activation functions in optical neural networks

Wanxin Shi, Zheng Huang, Tingzhao Fu et al.

2025 5 citations

Gaussian Error Linear Units (GELUs)

Dan Hendrycks, Kevin Gimpel

2016 6842 citations View Analysis →

Overcoming the efficiency-bandwidth tradeoff for optical harmonics generation using nonlinear time-variant resonators

M. Shcherbakov, P. Shafirin, G. Shvets

2019 16 citations View Analysis →

Super-resolution diffractive neural network for all-optical direction of arrival estimation beyond diffraction limits

Sheng Gao, Hang Chen, Yichen Wang et al.

2024 31 citations

Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)

Djork-Arné Clevert, Thomas Unterthiner, Sepp Hochreiter

2015 6080 citations View Analysis →

The End of Moore's Law: A New Beginning for Information Technology

T. Theis, H. Wong

2017 722 citations

Roadmap on holography

J. Sheridan, R. Kostuk, A. Gil et al.

2020 103 citations

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, Geoffrey E. Hinton

2010 18715 citations

There’s plenty of room at the Top: What will drive computer performance after Moore’s law?

C. Leiserson, Neil C. Thompson, J. Emer et al.

2020 401 citations

Nonlinear optical encoding enabled by recurrent linear scattering

Fei Xia, Kyungduk Kim, Yaniv Eliezer et al.

2023 103 citations View Analysis →

To image, or not to image: class-specific diffractive cameras with all-optical erasure of undesired objects

Bijie Bai, Yilin Luo, Tianyi Gan et al.

2022 75 citations View Analysis →

Ultra-strong nonlinear optical processes and trigonal warping in MoS2 layers

A. Säynätjoki, L. Karvonen, H. Rostami et al.

2016 224 citations View Analysis →

Fully non-linear neuromorphic computing with linear wave scattering

C. C. Wanjura, F. Marquardt

2023 82 citations View Analysis →

Optimizing structured surfaces for diffractive waveguides

Yuntian Wang, Yuhang Li, Tianyi Gan et al.

2024 4 citations View Analysis →

The role of all-optical neural networks

M. Matuszewski, A. Prystupiuk, A. Opala

2023 14 citations View Analysis →

Diffractive optical computing in free space

J. Hu, Deniz Mengu, D. Tzarouchis et al.

2024 183 citations