This review describes recent theoretical and experimental advances in the area of multimode solitons, focusing primarily on multimode fibers. We begin by introducing the basic concepts such as the spatial modes supported by a multimode fiber and the coupled mode equations for describing the different group delays and nonlinear properties of these modes. We review several analytic approaches used to understand the formation of multimode solitons, including those based on the 3D+1 spatiotemporal nonlinear Schrödinger equation (NLSE) and its approximate 1D+1 representation that has been found to be highly efficient for studying the self-imaging phenomena in graded-index multimode fibers. An innovative Gaussian quadrature approach is used for faster numerical simulations of the 3D+1 NLSE. The impact of linear mode coupling is discussed in a separate section using a generalized Jones formalism because of its relevance to space-division multiplexed optical communication systems. The last section is devoted to the relevant experimental studies involving multimode solitons.
【AIGC One Sentence Reading】:This review summarizes recent advances in multimode solitons in multimode fibers, focusing on theoretical models, numerical simulations, and experimental studies, highlighting their relevance to optical communication systems.
【AIGC Short Abstract】:This review summarizes recent progress in multimode solitons, emphasizing multimode fibers. It introduces spatial modes, coupled mode equations, and analytic methods for soliton formation, including 3D+1 NLSE and its 1D+1 approximation. Linear mode coupling's impact on space-division multiplexed optical communication is discussed. Relevant experimental studies are also reviewed.
Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.
1. INTRODUCTION
In the realm of nonlinear optics, the phenomenon of temporal solitons has been a subject of fascination for over five decades [1,2], ever since the first prediction of temporal solitons in 1973 [3] and their experimental observation in 1980 [4]. These self-sustained wave packets exhibit remarkable properties, propagating over long distances without distortion due to a balance between chromatic dispersion and nonlinearity [see Fig. 1(a)]. Extensive research efforts have been dedicated to unraveling the mysteries of solitons [5] and harnessing them for various applications, spanning from telecommunications [6] to supercontinuum sources [7], ultrafast mode-locked fiber laser sources [8], and cavity soliton frequency combs [9]. Similarly, spatial optical solitons, akin to their temporal counterparts, are self-trapped light beams capable of maintaining their transverse shape while propagating through nonlinear media, avoiding the divergence associated with freely diffracting beams [see Fig. 1(b)]. The presence of light alters the properties of nonlinear materials, such as refractive index, resembling the intensity profile of the beam and forming an optical lens effect. The concept of self-trapped light beams was initially introduced by Askaryan in 1962 [10], followed by comprehensive investigations in both 1D and 2D diffractive systems [11–14].
Figure 1.(a) Temporal solitons propagate within a singlemode fiber, forming due to the equilibrium between chromatic dispersion induced temporal broadening and nonlinearity induced pulse compression. (b) Spatial solitons manifest in Kerr nonlinear media, shaping a beam with a consistent beam waist as it travels, emerging from the interplay of spatial diffraction induced beam divergence and nonlinear self-focusing. (c) Spatiotemporal MMSs propagate in graded-index multimode fibers, where the interaction between beam diffraction and fiber confinement creates spatial multimodal beams. MMS formation is more complex, necessitating a balance not only between chromatic dispersion broadening and nonlinear pulse compression, but also a balance between nonlinear beam trapping and modal walk-off due to the disparate group velocities of the fiber modes.
While traditional temporal or spatial solitons have primarily been explored in one or two dimensions, recent advancements have unveiled a fascinating realm of solitons involving both spatial and temporal degrees of freedom, known as spatiotemporal solitons [5,15,16], also called light bullets [17]. These solitons represent multidimensional states, as they necessitate the balance among diffraction, dispersion, and nonlinearity across spatial and temporal dimensions in the course of their propagation. This complexity presents new challenges as well as interesting opportunities in nonlinear dynamics. One significant challenge is that these multidimensional states are highly susceptible to perturbations [16]. A notable example of this fragility is the spatiotemporal wave collapse, which is commonly observed in multidimensional solitons within optical materials exhibiting Kerr nonlinearity [16,18–20]. Nevertheless, spatiotemporal solitons have been studied in various configurations [20–31]. Among them, multimode fibers (MMFs) provide a convenient test-bed [32] and have garnered considerable interest in the study of such solitons, owing to their cost-effectiveness and ease of alignment. The interaction of diffraction and spatial confinement in a multimode fiber leads to the formation of multiple co-propagating spatial modes, each with distinct dispersive properties, such as different phase and group velocities, second- and higher-order dispersions. The dispersion of these spatial modes can be balanced by Kerr nonlinearity, causing the modes to become locked together in time. As a result, the multimode field propagates as a single, shape-preserving “ball” of energy in both space and time. These equilibrium states are known as multimode solitons (MMSs) [see Fig. 1(c)].
There are two major motivations for studying MMSs: 1) the intrinsic scientific interest in propagation-invariant waves, and 2) the potential to interpret complex phenomena, especially in more than one dimension, through soliton dynamics. Interpreting instabilities, pulse formation, supercontinuum generation, and other phenomena in one dimension using the soliton concept in the frame of the 1D NLSE has proven to be highly beneficial. Similarly, MMSs could provide a foundation for understanding multidimensional nonlinear phenomena. While the community has demonstrated several instances where this approach is effective, further research is needed before we can definitively state that “a field can be described as a superposition of solitons and dispersive waves,” as is often done for one-dimensional systems.
Sign up for Photonics Research TOC Get the latest issue of Advanced Photonics delivered right to you!Sign up now
Following early work [27,28,33–35], MMSs have gained increasing attention, particularly in graded-index (GRIN) multimode fibers (GIMFs), which minimize intermodal dispersion and reduce modal walk-off time, thus facilitating the formation of an MMS. Many experiments involving multimode solitons in MMFs have explored phenomena such as soliton fission [36], walk-off solitons [37], singlemode spatiotemporal soliton attractors [38], collisions [39], dispersive radiation sideband generation [40,41], and MMS formation in multimode step-index fibers (SIMFs) [42,43]. The development of MMS in conservative systems has spurred interest in the study of the emergence of MMS in dissipative systems, such as spatiotemporal mode-locked lasers [44–46] and different multimode cavity configurations [47,48]. Furthermore, GIMFs have emerged as valuable testbeds for highly multimode systems [21], enabling the observation of a variety of phenomena such as beam self-cleaning [49], thermalization of optical waves [50–52], and supercontinuum generation [53].
This review paper focuses on recent theoretical and experimental advancements in MMSs in conservative systems, with a primary focus on MMFs. We begin by introducing the foundational concepts of MMF and MMS, elucidating the phenomena associated with MMS in relation to the MMF properties in Section 2. Many of these phenomena, such as modal walk-off and MMS formation, can be effectively modeled by the generalized coupled mode equation, with simple examples provided in Section 3. Next, we review analytic approaches to MMS using the two-coupled mode equations in Section 4, and discuss extensions to the 3D+1 spatiotemporal nonlinear Schrödinger equation (NLSE) in Section 5. To enhance understanding, the comprehensive 3D+1 model is simplified to a reduced 1D+1 NLSE representation in Section 6, which is highly efficient for studying the self-imaging-effect-induced phenomena in GIMFs. In Section 7, we introduce an innovative Gaussian quadrature approach tailored for numerical simulations of the generalized NLSE, potentially improving computational efficiency. In Section 8, we examine the propagation regime in space-division multiplexed (SDM) optical communication systems designed with MMFs, discussing the model using a generalized Jones formalism. This model highlights the impact of linear coupling effects, primarily due to manufacturing imperfections and environmental perturbations. Finally, in Section 9, we focus on the relevant experimental studies involving the MMS phenomena.
In essence, this review endeavors to offer a comprehensive overview of MMS, from MMF properties, their theoretical analysis, innovative numerical approaches, the impact of random linear coupling, and practical experiments. By exploring these domains, the paper aims to enrich understanding and acknowledgment of the significance of MMS, thus driving progress in MMS research and technology.
2. MMF EIGENMODES AND MODAL DISPERSIONS
In this section, we first focus on the spatial eigenmodes of two types of MMFs: the step-index multimode fiber (SIMF) and the GIMF. We discuss their key properties related to the MMS, setting the stage for understanding their impact on soliton formation. Subsequently, we introduce the generalized multimode NLSE (GMMNLSE), which is relevant for describing a broad variety of MMFs with identifiable eigenmodes. We provide concise examples, ranging from the propagation of linear multimode pulse fields to the formation of MMS, the Raman-induced MMS self-frequency shift, MMS fissions, and MMS collisions. These examples serve to effectively illustrate the discussed concepts, thereby laying a solid foundation for analytical and experimental studies.
A. Overview of the Relationship between MMF and MMS
MMFs are capable of supporting many guided modes simultaneously. To gain a comprehensive understanding of MMFs, several key concepts must be grasped, as depicted in Fig. 2. Eigenmodes. Eigenmodes represent distinct transverse profiles of the spatial modes that can propagate within an MMF, illustrating how optical fields distribute across the fiber’s cross-section. Calculation of the modal areas and their overlap is necessary to define the modal nonlinear coefficients and the strength of nonlinear mode coupling.Eigenvalues (). The modal eigenvalues, denoted as and representing a mode’s effective refractive index, quantify the increase in the propagation constant at a specific wavelength, when compared with the vacuum wave number . Since varies with frequency, for each mode of index the different derivatives with respect to frequency (or wavelength) directly affect the properties of guided modes such as modal pulse broadening, modal group velocity, and mode beating.nonlinearity. The nonlinearity of an MMF results from the complex third-order optical susceptibility of the fiber’s material (typically silica). There are two main categories of cubic nonlinear effects in fibers according to their response times: the virtually instantaneous Kerr effect, and the delayed nonlinear Raman and Brillouin scattering. As the light intensity increases, the Kerr effect may compensate for both intramodal pulse broadening and intermodal walk-off, leading to the MMS formation.
Subsequent subsections will discuss each of these concepts in more detail, providing a comprehensive understanding of MMF modes and their significance in MMS formation.
Figure 2.Overview of the relationships between the fundamental properties of MMFs and the key physical phenomena leading to the formation of an MMS.
Light travels longitudinally in fibers, while remaining confined transversely to the core through total internal reflection resulting from the refractive index difference between the core and the cladding. This confinement establishes boundary conditions, leading to specific guided propagation modes that depend on the fiber’s transverse refractive index profile. The diversity of index profiles gives rise to various types of MMF, each characterized by spatial modes with distinct transverse beam profiles and eigenvalues. This subsection introduces the eigenmodes by using the specific examples of the SIMF and GIMF with identical core sizes.
In the case of a SIMF, the fiber’s core has a larger refractive index and is surrounded by a cladding with the index such that . The SIMF’s transverse refractive index profile is given by where denotes the core’s radius. The refractive index of the fiber not only varies with the wavelength but also with the spatial coordinates . In the context of silica fibers, the wavelength dependence of is commonly characterized using a Sellmeier equation [54], which can provide the refractive index changes over a wide wavelength range. We use the following values for the SIMF parameters: the core’s radius and a refractive index difference compared to the cladding. The index profile is shown by a black curve in Fig. 3(a).
Figure 3.Refractive index profiles (black solid line) of SIMF in (a), (b) and GIMF in (c), (d) and their corresponding eigenmode profiles (), modal effective indices (). (e)–(h) Mode dispersion comparison between the SIMF and the GIMF with the same core radius of 50 μm at 1.45 μm, for (e) mode effective areas, (f) relative zero-order mode dispersion , (g) relative group velocity , (h) second-order dispersion . The fiber radius for the two fibers is .
Based on the fiber’s index profile, we calculate the first 15 eigenmodes at by employing the semivectorial finite-difference method [55]. These modes , designated as linearly polarized (LP) with indices (azimuthal) and (radial), are displayed on the top in Fig. 3. Degenerate modes sharing the same values are denoted by , . To simplify references to these modes, we assign a singlemode index . For instance, and correspond to modes and , respectively.
To illustrate the relationship among eigenmodes, eigenvalues, and refractive index distribution, we plot the mode profiles in Fig. 3(a) for three distinct radial modes (), each shifted vertically by its corresponding mode eigenvalue, using blue and green curves at wavelengths of 1.55 μm and 1.70 μm, respectively. The corresponding modal eigenvalues at three wavelengths are shown in Fig. 3(b). These modes primarily reside within the core of the fiber. Remarkably, higher-order modes, or those with longer wavelengths, have a smaller effective index, indicating reduced confinement for them. As a result, the number of supported modes varies with wavelength. While MMFs can accommodate more modes at shorter wavelengths, they may support only a single mode at much longer wavelengths, thereby classifying them as singlemode fibers when the wavelength surpasses a certain threshold value (cutoff wavelength).
The shape of the refractive index profile of an MMF influences its guided modes. This is illustrated in Fig. 3(c) using the example of a GIMF with a core radius . The GIMF’s refractive index follows a parabolic distribution:
This distribution leads to eigenmodes resembling Laguerre-Gaussian (LG) modes. The refractive index and LG mode profiles are depicted in Fig. 3(c) for modes (the first three radial modes: , , ) at wavelengths of 1.45 μm and 1.70 μm using red and green curves. While mode profiles for GIMF and SIMF fibers exhibit significant similarities, it is worth noting that the mode areas of a GIMF are considerably smaller than those of a SIMF. To rigorously characterize the influence of mode profiles, it is essential to define the effective areas of the MMF modes.
1. Effective Mode Areas
The mode’s area significantly influences MMF dynamics in the nonlinear (high-power) regimes. This is due to the fact that, for an identical optical power, modes with smaller spatial dimensions lead to elevated field intensities (optical power per unit area), thus inducing more pronounced refractive index changes through the Kerr effect. In the presence of nonlinear effects, the mode’s effective area (MEA) is introduced as follows [56]: where the mode profiles are both normalized and mutually orthogonal:
2. Modal Nonlinear Coefficients
The modal effective area (MEA) is not enough to quantify the strength of nonlinear effects in an MMF because the nonlinear interaction among the modes also depends on the wavelength of incident light and fiber material. Therefore, one needs to introduce the modal nonlinear coefficient (MNC), which can be calculated as where is the nonlinear refractive index of the MMF that produces a change in the refractive index, , which depends linearly on the local intensity of light and has the dimensions of . Different effective mode areas in an MMF lead to different MNCs for its various modes.
3. Modal Nonlinear Lengths
The modal nonlinear length (MNL) is the characteristic length scale over which significant nonlinear optical effects occur in an optical fiber. It provides a measure of the distance along the fiber at which a light pulse is significantly affected by nonlinear interactions. The nonlinear length of a mode is defined as [56] where is the input peak power of mode . This form indicates that a higher input power or a larger MNC leads to a shorter nonlinear length.
It is worth noting that the nonlinear length of a mode is proportional to its MEA, indicating that the MEA is a key parameter for each mode and serves as a critical parameter for distinguishing nonlinear performance across various modes of an MMF.
4. Nonlinear Responses
Optical pulses propagating inside an MMF with high peak powers may be affected by several nonlinear effects. We briefly introduce the effects associated with the third-order nonlinear optical susceptibility () for which the nonlinear polarization is proportional to the third power of the electric field. Effects associated with the nonlinearity can be grouped into two main categories according to their response time, which can be either very short or delayed in time (typically ). The former encompasses the well-known Kerr effect, which produces a nearly instantaneous change in the refractive index proportional to pulse intensity. This change alters the propagation constants of all excited modes and gives rise to nonlinear phenomena such as self-phase modulation (SPM), cross-phase modulation (XPM), and four-wave mixing (FWM).
A delayed nonlinear response implies that the nonlinear polarization not only depends on the electric field at the current time, but also on the field values at earlier times. This delay is attributed to light-induced molecular vibrations involving either optical phonons (Raman scattering) or acoustic phonons (Brillouin scattering). We refer to Ref. [56] for further details. Only Raman scattering is relevant for short pulses that are of interest in this review. As a result of Raman scattering, a longer-wavelength pulse, known as the Stokes pulse, undergoes amplification in the presence of a shorter-wavelength (pump) pulse. In silica fibers, maximum gain occurs for a relative frequency shift of about 13 THz (or 100 nm) from the pump. As the intensity of the Stokes pulse increases, it can serve as the pump and generate another pulse at a longer wavelength through a cascaded Raman process, thus potentially producing multiple pulses from a single pump pulse.
5. Raman-Induced Self-Frequency Shift
The Raman gain affects ultrashort pulses (width ) in a different way through a process called intrapulse Raman scattering [56]. The spectrum of such pulses is so wide that its low-frequency components can be amplified by the high-frequency components of the same pulse. Consequently, the entire spectrum of a short pulse continuously shifts towards longer wavelengths as it propagates along the fiber. This effect is particularly strong for solitons and is often referred to as the soliton self-frequency shift (SSFS). As we discuss later, it also occurs for an MMS forming inside an MMF.
C. Modal Dispersion
Let us introduce modal dispersion, a key property of MMFs, which is associated with the eigenvalues corresponding to different guided modes. These eigenvalues are crucial, as they directly influence the propagation of optical pulses through different guided modes. We initiate our discussion by focusing on the frequency dependence of eigenvalues of the modes, and obtain the associated modal dispersion properties. By considering different orders of modal dispersion for both GIMFs and SIMFs, we are able to find several specific properties of GIMFs. GIMFs exhibit a mode-beating distance that is nearly wavelength independent, a reduced chromatic-dispersion-induced pulse broadening, and a minimal intermodal walk-off. Such attributes render GIMFs a fertile ground for observing many intriguing phenomena, including periodic self-imaging [57], formation of multimode solitons [37,38], and collisions of such solitons [39].
The eigenvalues of MMF modes are the effective modal indices . They quantify the change in a mode’s propagation constant induced by the fiber’s refractive index profile. As shown in Figs. 3(b) and 3(d), for SIMFs and GIMFs decreases with the mode order and wavelength , indicating that a weaker waveguide confinement occurs for HOMs or at longer wavelengths.
To discuss modal dispersion, we consider the frequency dependence of the propagation constant for mode :
Different derivative orders of with respect to frequency provide us with several dispersion parameters, which play a crucial role when short pulses propagate inside an MMF. Utilizing these coefficients, we can elucidate the influence of dispersion on various physical phenomena in MMFs.
1. Modal Propagation Constants
The propagation constant (zeroth-order dispersion) in Eq. (8) quantifies the rate of phase change for mode with propagation. Figure 3(f) compares for SIMF and GIMF the phase difference, , between the propagation constant of mode and that of the fundamental mode of the SIMF. In the case of a GIMF, exhibits a distinct step-wise decrease as the mode order increases across different degenerate mode groups.
Changes in the modal propagation constant with play a crucial role in MMFs and lead to phenomena such as multimode interference or mode beating. This beating rate stems from the difference of for any two modes and can be used to define the mode-beating length (MBL) as
Mode beating is particularly pronounced in GIMFs, where the phase-rate differences between adjacent mode groups are nearly identical. For example, as shown in Fig. 3(f), the modes 1, 6, and 15 satisfy the relation . This feature enhances the mode-beating process, which involves multiple high-order modes. Whenever an optical beam excites multiple modes of a GIMF, the total intensity, obtained by adding the contributions of all modes [see Eq. (25)], exhibits a periodic evolution (the self-imaging effect). The period of mode beating is determined by the constant difference in propagation constants between adjacent groups of modes.
Interestingly, for femtosecond pulses, the intensity at points of maximum beam compression may become so high that it induces multiphoton absorption or ionization [58]. In turn, a visible photoluminescence is generated, which scatters out of the fiber as a series of bright spots, providing an experimental verification of the self-imaging phenomenon [57]. The self-imaging period does not depend on power and is equal to the beating length of different radial modes:
By using the data plotted in Fig. 3(f), we calculate the self-imaging distance to be , which closely matches the experimentally observed values [57].
A remarkable property of a GIMF is its near-constant MBL for different wavelengths. To elucidate this feature, Fig. 4(a) shows the wavelength dependence of for modes and for both the GIMF and the SIMF. Remarkably, as the wavelength shifts from 1.1 μm to 1.8 μm, in SIMF varies from 2.8 mm to 1.7 mm but its value for the GIMF remains fixed at 0.58 mm.
Figure 4.Characteristic lengths of GIMFs and SIMFs at different wavelengths: (a) mode beating length ; (b) modal walk-off length ; (c) dispersion length . In (d) we show the differential mode delay for modes and , for an input pulse temporal duration (FWHM: 184 fs). The significance of each length scale in (a)–(c) becomes apparent when the pulse propagation distance exceeds these characteristic lengths, indicating that the corresponding effect has a significant influence. detailed in (d) quantifies the temporal separation (ps) between two modes per unit of propagation distance (m).
From a different perspective, the self-imaging period in a GIMF depends on the core radius and the refractive index difference as [57], where is the relative index difference between the core and the cladding . Typically, the refractive index varies with wavelength, which is a common characteristic of optical materials. However, if undergoes minimal changes across a range of wavelengths, the self-imaging period remains nearly constant. This feature implies that the self-imaging phenomenon in GIMFs exhibits a remarkable wavelength stability.
When non-radial modes are excited in a GIMF by launching a beam off-set from the core’s center, the beam’s shape exhibits transverse oscillations with distance. This process doubles the spatial period of self-imaging, owing to the beating between radial and non-radial modes. The resulting self-imaging period results from the propagation constant difference between two such modes and is given by [see the blue curve in Fig. 4(a)].
The evolution of periodic mode beating leads to collective intensity oscillations, which in turn facilitates the emergence of spectral sidebands. This phenomenon is associated with resonant dispersive radiation in regions of anomalous dispersion [41,59] and with a so-called geometric parametric instability (GPI) in the region of normal dispersion [60]. The frequencies of the GPI-induced resonant spectral peaks adjacent to the pump frequency follow the relation
2. Intermodal Dispersion
The group velocity of a multimode optical pulse depends not only on the optical frequency (chromatic dispersion) but also on its constituent modes. For mode at wavelength , the first-order term in Eq. (8) is inversely proportional to the group velocity
Figure 3(g) shows the relative group velocity between a mode of order of either a GIMF or a SIMF, and the fundamental mode of a GIMF at . We may observe that, when compared to SIMF, modes of GIMF propagate faster and exhibit smaller group velocity differences. Additionally, for both fibers the group velocity of LOMs is larger than that of HOMs.
It may appear surprising that LOMs have higher group velocities than HOMs, since the effective index decreases as the mode number grows larger [see Figs. 3(b) and 3(d)]. Therefore, the phase velocity of LOMs is smaller than that of HOMs. However, the group velocity has an inverse dependence on the modal group index, which includes the effects of dispersion and is defined as
For a more detailed comparison, we illustrate in Fig. 5 the wavelength dependence of both the modal effective index and the modal group index for modes for both GIMF and SIMF. As can be seen, contrary to the effective index, the modal group index grows larger with mode number for both types of fibers. Moreover, the modal effective index decreases monotonically with increasing wavelength. In contrast, the modal group index reaches its minimum value near the zero-dispersion wavelength, where the largest group velocity is reached. In regions of anomalous dispersion, an increasing for mode means a decrease in the group velocity of that mode as its wavelength grows larger.
Figure 5.Modal effective index in (a) and modal group index in (b) as functions of wavelength for mode in both GIMF and SIMF.
The differential mode delay (DMD) quantifies the temporal delay between modes per unit distance; it is defined as
In Fig. 4(d) we show the wavelength dependence of , for the mode pairs (1, 6) and (1, 15) for both GIMF and SIMF. For both types of fiber, the DMD increases with wavelength. In addition, an increase in the relative mode number difference leads to larger values of . Consequently, the formation of MMS is much more difficult for modes with large DMDs, when compared with MMS composed of adjacent modes.
For GIMFs, the DMD values of are an order of magnitude smaller than those for SIMFs. For example, at a wavelength of 1.45 μm the temporal delay associated with modal dispersion is for a GIMF but approximately 2.4 ps/m for a SIMF. Utilizing allows for a rough estimate of the propagation delay between modes in the linear regime. For instance, when a pulse travels 5 km inside a GIMF, its relative delay is approximately 1.1 ns for two neighboring modes. This estimate agrees with experimental findings, as found in Ref. [61].
The walk-off distance for pulses carried by different modes depends not only on the relative group velocities of these modes but also on pulse duration. Longer pulses have less pronounced walk-off compared to shorter pulses. For a pulse of duration , the walk-off distance between modes and is defined as
Here stands for the distance over which two pulses separate in time by an amount equal to their temporal width. A longer walk-off length implies more interaction time due to their temporal overlap during propagation, while a shorter length suggests significant temporal separation over a shorter distance.
Low-order modes (LOMs) travel faster compared to high-order modes (HOMs), e.g., at any given wavelength. Whenever an MMS is formed by trapping different modes via the Kerr nonlinearity, the resulting group velocity of the MMS will be intrinsically linked to its modal composition. Therefore, one may expect that an MMS composed of HOMs will propagate more slowly than an MMS composed of LOMs.
The binding mechanism for different modes composing an MMS is that the Kerr nonlinearity induces a spectral blue shift for HOMs and a red shift for LOMs. This is due to the fact that modal group velocities decrease as wavelength increases in the case of anomalous dispersion, i.e., for . Consequently, the red-shifted LOMs decelerate while the blue-shifted HOMs accelerate, gradually approaching an equilibrium point where group velocities become balanced across all modes. An illustrative example of this phenomenon can be found in Fig. 7, as detailed in Section 2.B.1.
As previously mentioned, the Raman effect induces a progressive red shift of the soliton, which decreases its group velocity. In GIMFs, the Raman effect also contributes to the MMS formation, since it reduces the speed mismatch between the fast LOMs (which have higher nonlinear coefficients and undergo the largest SSFS) and the slow HOMs. On the other hand, when two multimode Raman solitons with distinct modal compositions are generated during the fission of a single MMS, their different group velocities may lead to collisions between them as outlined in Ref. [39]. This scenario is elucidated by the example presented in Fig. 11(d) in Section 2.B.4.
3. Chromatic Dispersion
Chromatic or group-velocity dispersion (GVD) characterizes the dispersive broadening of optical pulses within each mode of an MMF. As a consequence of material and waveguide dispersion, different frequency components within the pulse in a given mode travel at different speeds, resulting in pulse spreading in time upon its propagation. The GVD is governed by the second-order term in Eq. (8). In standard silica optical fibers, there is a zero-dispersion wavelength (ZDW), close to 1.3 μm, such that . Consequently, optical pulses propagating near this wavelength are little affected by GVD, but the impact of higher-order dispersions on the pulses may become significant.
For a transform-limited (i.e., unchirped) Gaussian pulse with initial temporal width , propagating in either the normal or anomalous dispersion regime, its temporal width increases with distance as [56] . It is useful to introduce a characteristic dispersion length for mode as
Consequently, for a pulse carried by mode the pulse broadening ratio, , becomes significant as it travels a distance greater than .
In Fig. 4(c), we plot as a function of wavelength for both GIMF and SIMF. As can be seen, for wavelengths around the ZDW of 1.26 μm, tends towards infinity because . The ZDW corresponds to an inflection point for , and optical pulses propagate at their fastest speed at this wavelength when compared with other wavelengths. In the normal dispersion regime (), the group velocity increases with wavelength; in the anomalous dispersion regime (), the opposite happens.
The GVD is different for different guided modes in MMFs. This is depicted in Fig. 3(h), where the value of at 1.45 μm is shown for both SIMF and GIMF. The GIMF has smaller GVD values and exhibits narrower gaps across its mode groups compared to the SIMF, as further illustrated by the dispersion lengths plotted in Fig. 4(c). Consequently, GIMFs are expected to produce much less temporal broadening compared to SIMFs, suggesting that a smaller input power will be necessary to form multimode solitons in GIMFs.
D. Soliton Formation Conditions
By comparing the relevant characteristic lengths, we can find interesting conditions for the formation of solitons. For simplicity, here we focus on a fundamental soliton involving only one mode of an MMF. Similar to the case of singlemode fibers, a soliton involving only the mode will have a hyperbolic secant profile: where and are the peak power and duration of the soliton, respectively. From this, energy of this soliton is found to be
By comparing the nonlinear length [Eq. (6)] with the dispersion length [Eq. (16)], we obtain the soliton order for mode [56]:
For , dispersive effects dominate over the nonlinear effects, preventing the formation of a soliton. When , higher-order solitons may form, exhibiting a periodic evolution. A fundamental soliton is formed when . Given that both the effective mode area () and dispersion () are different for different modes for an MMF, a soliton’s peak power will also depend on . By setting , we find the following relations for the peak power and the energy of a soliton of duration :
As the duration varies, and change as shown in Fig. 6(b) for the two types of fibers for the formation of solitons at . For nanosecond pulses, the peak power needed is in the milliwatt range (or picojoule energy). For picosecond pulses, it reaches kilowatt levels (or nanojoule energy). For femtosecond pulses, the soliton needs peak powers in the megawatt range and energies reaching microjoule levels.
Figure 6.(a) Peak power of a fundamental soliton carried by a mode with index in SIMF and GIMF, respectively, for a 105 fs pulse duration. (b) Required peak powers and energies for forming a fundamental soliton of varying duration in mode for SIMF and GIMF when . Fiber parameters are consistent with those in Fig. 3.
The dependence of the peak power on the mode number is shown in Fig. 6(a) for . The evolution of the required peak power closely matches the effective mode area in Fig. 3(e). Modes with larger effective mode areas require higher peak power (or energy) to form a soliton. The GIMF requires 4.6 kW peak power for mode 1 and 12 kW for mode 15, while the SIMF needs higher powers of around 22 kW for each mode.
The formation of an MMS inside an MMF requires compensation of temporal walk-off associated with different mode pairs. By equating the walk-off length from Eq. (15) with the dispersion length in Eq. (16) and the nonlinear length in Eq. (6), we can determine the temporal duration of an MMS involving any two specific modes:
Based on the dispersion parameters of the two fibers in Fig. 3, of the first two radial modes is found to be 115 fs for a GIMF and 12 fs for a SIMF. Therefore, based on the walk-off compensation condition for forming an MMS, the duration and peak power for GIMF should be around 115 fs and 3.8 kW (see Section 2.B.1 where we provide numerical examples of MMS formation). Since and vary with wavelength [see Eqs. (7) and (8)], the values of also change. As a result, the “walk-off” soliton duration will also depend on wavelength (see Fig. 32 in Ref. [37]). For a soliton with , walk-off compensation becomes more critical, but it may be compensated for higher-order MMS with larger peak powers. For example, observing MMS in a SIMF requires a peak power in the MW range (or energy ) [42].
In the case of a soliton propagating in the fundamental mode, by substituting the mode effective area with the beam area of the fundamental mode ( is the waist of the fundamental mode), one may rewrite the relation Eq. (21) between the soliton energy and duration as follows:
For MMSs, this relation can be approximated by replacing the mode energy with the total energy , the mode effective area with the beam area ( is the beam waist) in an MMF, and the second-order dispersion with that of the fundamental mode (supposing that the difference between chromatic dispersion of different modes can be neglected). Thus, the condition for the existence of an MMS can be expressed in terms of the beam size, pulse energy, and duration, as follows [62]:
As we shall discuss in Section 9, the experimental results have indeed demonstrated the validity of such a relationship (see Fig. 31 in Section 9).
E. Insights into the Relationship between MMS and MEA
Building on these foundational concepts, a thorough comparison of SIMF and GIMF can yield profound insights, facilitating a deeper understanding of MMS dynamics in multimode fibers.
An important consideration is that even when two pulses have identical durations and energies (or peak powers), their nonlinear lengths in the same MMF can differ vastly if their modal compositions are not the same. This suggests that the mode distribution of a pulse plays a crucial role in determining the nonlinear dynamics inside MMFs. For the same power and wavelength, the nonlinear length [Eq. (6)] of any mode is proportional to its MEA [Eq. (3)]. This feature indicates that modes with a smaller MEA have larger nonlinear coefficients [Eq. (5)] and they can excite a larger nonlinearity for the same given field power.
The MEA in a GIMF increases with the mode order, while the MEA in a SIMF decreases, as seen in Fig. 3(e). This suggests that, for the same peak power, nonlinear effects may preferentially affect LOMs in GIMFs and HOMs in SIMFs, respectively. As an example, LOMs in a GIMF may initially compensate for their dispersion-induced pulse broadening via the Kerr effect, leading to the formation of an MMS composed by these modes [38].
Conversely, SIMFs are more likely to favor the formation of an MMS composed of HOMs, because of their smaller MEAs [see Fig. 3(e)] and higher MNCs. This is the reason that HOM Raman solitons are preferentially observed in SIMFs [42], while the Raman-beam cleanup is typically observed in GIMFs, rather than SIMFs [63]. With this insight, we can also understand Ref. [43], where a lower input-energy threshold was observed for generating a Raman MMS when the input beam was offset from the core’s center to enhance the excitation of HOMs of the fiber, compared to the case of on-center beam’s coupling (see Fig. 36 in Section 9.F).
In general, linear LOMs propagate faster than HOMs at the same wavelength, due to their larger group indices [see Fig. 5(b)]. In GIMFs, the Raman effect can initially cause a stronger red-shift for LOM components in individual pulses, because of their relatively higher nonlinear coefficients compared with HOMs. This red-shift decreases the speed of pulses in the LOMs [see the explanation of in Section 2.C], thereby minimizing the group-velocity mismatch between LOMs and HOMs and facilitating the formation of an MMS.
Furthermore, for an MMS in GIMFs, the pulse in the fundamental mode is red-shifted the most, and HOMs with shorter wavelengths serve as pumps, thus facilitating a substantial energy transfer from HOMs to the fundamental mode via the Raman effect. This explains why a significant Raman beam clean-up is observed, as shown in Ref. [38]. This phenomenon is harder to occur in SIMFs because their fundamental mode does not have the largest nonlinear parameter.
The mode size increases in all MMFs as wavelength increases, which in turn decreases the MNCs. This feature potentially contributes to the observed decrease in the rate of red shift (SSFS) of a Raman MMS as it propagates over longer distances. Furthermore, at higher wavelengths the number of modes within an MMF diminishes, especially affecting HOMs, which lose their confinement as wavelength increases.
The nonlinear coupling among the modes of an MMF depends on the overlap of their mode profiles. Generally speaking, nonlinear coupling between two radial modes () tends to be stronger than that between a radial and a non-radial mode. Furthermore, within the same degenerate group, nonlinear coupling is stronger when compared with the coupling between two non-degenerate modes.
During mode competition within an MMF, modes with a larger MNC are more likely to dominate the nonlinear dynamics. This may partially explain the occurrence of self-cleaning observed in GIMFs [49]. It can also explain the observation of speckled patterns in SIMFs, which is indicative of the dominance of HOMs in SIMFs.
3. COUPLED MODE EQUATIONS
A. Coupled Multimode Nonlinear Schrödinger Equations
Building upon the insights gained from the preceding section, we discuss a widely used model for describing the evolution of optical pulses inside MMFs. As usual, the optical field of each pulse is written as the product of a rapidly evolving optical carrier and a slowly varying envelope , where denotes the frequency at which the pulse’s spectrum peaks. The envelope is normalized in a way that is expressed in units of . This envelope can be expanded in terms of the eigenmodes of an MMF as follows: where are the previously introduced transverse mode profiles and the mode amplitudes have units of .
The evolution of each modal component along the propagation direction is governed by the generalized multimode nonlinear Schrödinger equations (GMMNLSE) [64,65] where the dispersion operator is given by and the dispersion coefficients represent a Taylor-series expansion of in Eq. (7) evaluated at the frequency :
It is assumed that the pulse evolves in a reference frame moving at the velocity of the fundamental mode . All phase and group velocities are referenced to the corresponding values for the fundamental mode, i.e., the first two terms in Eq. (27) are relative to the fundamental mode.
The modes are coupled by the nonlinear effects such as SPM, XPM, and FWM, whose impact in Eq. (26) is included by the following nonlinear terms: where and are the nonlinear mode-coupling coefficients for the Kerr and Raman effects, respectively [64]. The values of and are in general different when the modes’ polarization is taken into account, whereas they become identical within the scalar description of the fiber modes that we adopt in this section [66]. Therefore, we can drop the superscripts K and R and express the nonlinear coefficients as where we made the further assumption of real-valued transverse mode profiles. Equation (29) incorporates the effects of Kerr nonlinearity, Raman scattering (associated with the coefficient ), and self-steepening (related to the coefficient ). The nonlinear index of silica is , and its Raman fraction is . The Raman response function, which is crucial for capturing the nonlinear response of an MMF to ultrashort pulses, can be approximated by where and are related to the frequency and the damping time of the Raman response in silica [56].
B. Numerical Examples of MMS Formation in a GIMF
This subsection provides concise examples illustrating various phenomena, such as the linear multimode walk off, formation of multimode solitons, their fission and creation of Raman solitons, and collision of two such solitons. These examples not only serve to clarify the basic concepts but also provide a basis for both analytical and experimental exploration.
We assume that the input conditions for all modes are the same in the form of Gaussian pulse, centered at with a duration characterized by its full width at half maximum (FWHM) . More specifically, where represents the initial modal coupling coefficient, and is the input energy. They are found by projecting an input Gaussian beam onto the eigenmode basis, expressed mathematically as
Input conditions are varied by adjusting the two width ratios for the input beam. They are defined as where is beam’s width, is its lateral offset, and is the width of the fundamental mode. The use of these dimensionless parameters allows for describing diverse input conditions for a given input pulse with parameters such as input energy , temporal duration , and the parameter values and .
When an input beam is aligned with the center of the fiber’s core, only a few radial modes are excited (specifically modes , as depicted in Fig. 3). For simplicity, we only take into account these three dominant radial modes corresponding to in the following examples.
1. Modal Walk-Off and the Formation of an MMS
In the linear regime, modal dispersion leads to temporal walk-off of pulses carried by different modes in a multimode GIMF; see Figs. 7(a) and 7(b). In this case, an input beam with can be decomposed into the first three radial modes (, 6, and 15). The input pulse has a Gaussian shape with a duration of (FWHM: 184 fs) and has an energy of . As the peak power of the input pulse is merely 27.5 W, nonlinear effects remain insignificant due to , but dispersion effects play a pivotal role. This can be seen by noting that the dispersion length is . The walk-off lengths are 0.82 m and 0.40 m for the mode pairs (1.6) and (1,15), which correspond to DMDs of 0.224 ps/m and 0.464 ps/m, respectively. As seen in Fig. 7(a), after propagating over a distance of , the multimode pulse undergoes significant temporal broadening. The peak delay between mode 1 and mode 6 is approximately 4.8 ps and it increases to 9.3 ps for modes 1 and 15. Given that dispersion solely alters the spectral phase, the spectral power distribution remains unaffected, as illustrated in Fig. 7(b).
Figure 7.Comparison of the time evolution of temporal and spectral fields in the linear and soliton regimes, respectively: (a), (b) linear regime, (c), (d) MMS regime; the input pulse has a beam size and a duration of (FWHM: 184 fs), corresponding to a dispersion length . The normalized temporal and spectral field profiles for modes 1, 6, and 15 are depicted using red, blue, and green curves, respectively, in each panel. For (a), (b) the input pulse energy is ; for (c), (d) ; here the Raman effect is neglected, and we set .
However, when the input energy is increased to 0.7 nJ, corresponding to a peak power , the peak powers of the three excited modes become 1.76 kW for mode 1, 1.10 kW for mode 6, and 0.72 kW for mode 15, and result in different nonlinear lengths for these modes: for mode 1, 2.55 m for mode 6, and 5.62 m for mode 15. Although without considering higher-order mode powers, a qualitatively distinct phenomenon is observed, as shown in Figs. 7(c) and (d). Mode 1 experiences significantly less temporal broadening, facilitating the trapping of a substantial portion of the energy from mode 6. This interaction leads to the formation of an MMS that evolves as a single entity. The group delay of the MMS in Fig. 7(c) falls in between that of modes 1 and 6 in Fig. 7(a). This feature shows that an averaging of the linear group velocities of the constituent modes occurs when an MMS forms. In order to sustain this balanced group velocity, the Kerr nonlinearity produces a red-shift for mode 1 (which decreases its speed) and a blue-shift for mode 6 (which increases its speed) [see Fig. 7(d)]. Note also that the peak power of mode 15 is not large enough to be trapped into the MMS. As a result, the pulse in this mode spreads its energy into a dispersive wave, without much change in its spectral characteristics, as seen in Fig. 7(d).
By varying the input pulse energy, we can determine the threshold for MMS formation. In Figs. 8(a)–8(c), we plot the temporal power distribution (normalized to its maximum value) for modes 1, 6, and 15 at the fiber’s output as a function of input pulse energy. In the quasi-linear regime (), three modes exhibit significant walk-off and pulse broadening. However, owing to its large nonlinear coefficient and its high peak power, mode 1 begins to exhibit less broadening as the input energy increases. To illustrate this feature quantitatively, we plot in Figs. 8(d)–8(f) the pulse peak temporal position, the peak power, and the pulse duration. The temporal position of peak power for mode 1 exhibits a large delay when an MMS forms by trapping mode 6 [see Fig. 8(d)], while the power of all modes keeps increasing [see Fig. 8(e)].
Figure 8.MMS within a GIMF: panels (a)–(c) illustrate the normalized temporal power for modes 1, 6, and 15, respectively, at the output of a 20 m long GIMF, versus input pulse energy. The input pulse has a Gaussian shape with a duration of (FWHM: 184 fs), a wavelength of 1.45 μm, and a beam width parameter . Panels (d)–(f) detail corresponding temporal positions of peak power, peak power, and pulse duration at the fiber output. Effects of higher-order dispersion (), Raman (), and self-steepening () are disregarded, in order to focus on MMS dynamics only.
However, only the duration of mode 1 is significantly reduced [see Fig. 8(f)]. At an energy of around 0.5 nJ, the peak position of mode 6 aligns with mode 1 [see Fig. 8(d)], which can be considered as the threshold for MMS generation. For mode 15, even though a bit of energy is trapped, most of its energy flows away as a dispersive wave [see Fig. 8(c)], until 0.8 nJ, which is also observed as the turning point of peak power changes in Fig. 8(e). As the MMS forms, the durations of pulses in all three modes become very close to each other, as shown in Fig. 8(f).
2. Influence of the Input Beam’s Size on MMS
Next, let us consider what is the influence of the input conditions (specifically, the input modal composition) on the formation and propagation characteristics of an MMS, for a given input energy. In Figs. 9(a)–9(c) we plot the output temporal profiles of power (normalized to their maximum values) in modes 1, 6, and 15, respectively, versus input beam waist width . Here the input energy is fixed at . The corresponding pulse peak temporal position, the pulse peak power, and the pulse duration are depicted in Figs. 9(d)–9(f), respectively. At this energy level, the input pulse energy is effectively redistributed among modes 1 and 6 in order to efficiently form an MMS (see Fig. 8), whereas a portion of input energy coupled to mode 15 only contributes to dispersive waves.
Figure 9.Output mode evolution versus input beam size (input modal content): the normalized temporal field powers of (a) mode 1, (b) mode 6, and (c) mode 15 at the output of a GIMF are plotted as a function of the normalized input beam size , when the input pulse energy is 1 nJ, and the pulse duration is (FWHM: 184 fs), corresponding to a dispersion length . Panels (d)–(f) detail the corresponding temporal positions of output peak position, peak power, and pulse duration. Here we neglect the effects of higher-order dispersion (), Raman scattering (), and self-steepening ().
When , only mode 1 is excited, since the input beams have the same size as mode 1. A singlemode soliton within mode 1 is formed, as indicated by the zero peak power for modes 6 and 15. As increases, the group velocity of the MMS decreases, which is reflected in the larger group delay at the output, as shown in Fig. 9(d). Moreover, with an increase in , the high-order modal content of the MMS in modes 6 and 15 also increases, as can be seen in Fig. 9(e). The duration of the MMS expands from 0.1 ps to 0.17 ps, wherein the duration of the pulse in the fundamental mode is slightly shorter than that of pulses in the higher-order modes, as demonstrated by Fig. 9(f).
3. Raman Multimode Solitons and Soliton Fission
In the previous examples, the Raman effect was not considered () in the GMMNLSE model. Now, we aim to illustrate how the Raman effect affects MMS. To achieve this, we introduce the Raman effect by setting the delayed nonlinear response coefficient to in Eq. (29). Once the input energy exceeds 1 nJ, the Raman effect becomes notably influential. Here, we provide two examples to showcase its impact on MMS.
In Fig. 10 we illustrate the evolution of the MMS for an input pulse with and a duration of (FWHM: 184 fs), propagating through a 2 m GIMF. At this energy level, nonlinear effects intensify, enhancing mode coupling and inducing significant temporal and spectral changes. The LOMs undergo a red-shift, whereas the HOMs experience a blue-shift at [see Fig. 10(b)], in order to maintain a GVD-induced balance of their different group velocities. As a result, a Raman MMS forms, with a relatively narrower temporal duration and a noticeably red-shifted spectrum, owing to SSFS. At , the central spectrum of the Raman MMS shifts to 1.52 μm. This is accompanied by a decrease in the soliton group velocity. When comparing with the MMS group delay in Fig. 7(c), where a 20 m propagation leads to approximately 1.5 ns delay, the MMS in Fig. 10(a) only propagates over 2 m.
Figure 10.Temporal and spectral evolution of Raman MMS in a 2 m GIMF, when the input pulse energy is (a), (b) 3 nJ and (c), (d) 7 nJ. Here ; other parameters are the same as Fig. 7.
When increasing the input energy up to 7 nJ, as depicted in Figs. 10(c) and 10(d), there is sufficient nonlinearity to reorganize the field around into a second MMS. This suggests that the initial high-order soliton undergoes fission, resulting in two different MMSs. The first soliton, which carries most of the energy, exhibits a greater delay (extending beyond the plotting window at 1.6 m and 2.0 m) and a larger SSFS. The second MMS carries less energy; however, this is sufficient to form another Raman MMS, featuring a reduced delay and SSFS. Further energy increments can lead the second MMS to gather more energy, resulting in larger group delays (slower group velocities) and heightened SSFS, as detailed in the subsequent subsection.
4. SSFS Control and Soliton Collisions
In previous subsections, we observed different dynamics of multimode solitons in MMFs, including modal walk-off, formation of an MMS, and its fission. We also discussed the dependence of the MMS properties on physical parameters such as the input pulse’s duration, its group velocity, and SSFS upon varying its energy or modal content. Here, we discuss the possibility of controlling the SSFS of an MMS by adjusting its modal content. Under specific conditions, two Raman MMSs with different group velocities may undergo inelastic collisions, resulting in an energy exchange between them.
As an example, we cite the result in Ref. [39]. In this study, the input beam led to the excitation of the first 15 modes, and the input pulse’s duration was 70 fs with a center wavelength of 1.4 μm. Figure 11(a) shows simulated output spectra for a 2 m long GIMF as a function of input pulse energy using and , corresponding to the excitation of only radial modes. The SSFS of the first MMS (S1) at the output is visible for input energies exceeding 1 nJ, while the SSFS of the second MMS (S2) becomes pronounced beyond 5 nJ. The SSFS of both Raman MMSs increases with growing input energy.
Figure 11.Output spectra are plotted as a function of input pulse energy for a 2 m GIMF under different input conditions: (a) , ; (b) , ; (c) , . The temporal and spectral field evolutions as a function of propagation distance are depicted in (d), (e), with inset panels showing the field spectrogram at specific distances. For better visualization, the dashed region in (d) is re-plotted in (f). Panels (g) and (h) illustrate the evolution of fundamental mode content and the sum of other mode contents (modes 2 to 15) with respect to the total in (f). Reprinted with permission from Ref. [39]. Copyright 2022, Optica.
When the input beam is slightly shifted from the center of the fiber axis, additional non-radial HOMs are introduced (, ). In this case, a similar spectral evolution occurs as seen in Fig. 11(b). However, at 15 nJ, S1 undergoes less SSFS-induced red-shift and temporal delay when compared with S2, owing to its reduced fundamental mode content. As the input beam undergoes further energy increase and axial position shift (, ), the special spectral evolution of S1 seen in Fig. 11(c) becomes evident. The corresponding experimental demonstration is shown in Fig. 40 of Section 9.H. As seen in Fig. 11(c), a surprising spectral feature occurs for input energies between 22.5 nJ and 26 nJ: the soliton spectra exhibit large separations before merging again.
To understand these dynamics, we show the temporal and spectral evolutions of the MMS in Figs. 11(d) and 11(e): here the input energy is 22.7 nJ. Initially, the input pulse splits into two solitons (S1 and S2), which collide and then separate due to their inelastic energy exchange. Despite S2 having a smaller peak power when compared with S1 before their collision [see Fig. 11(f)], S2 gradually exhibits a larger delay (and larger SSFS), eventually leading to its collision with S1.
The analysis of the modal contents reveals that S2 has a relatively larger contribution from the fundamental mode (S2M1) when compared with S1 (S1M1), as depicted in Fig. 11(g). This occurs even though the sum of other mode contents in S1 (S1M2-15) is larger than that of S2 (S2M2-15), as shown in Fig. 11(h). Since the fundamental mode dominates the nonlinear strength, S2 experiences a larger SSFS and group delay, resulting in its collision with S1. As a result of their inelastic collision, the two MMSs exchange energy and separate, because S1 gains more energy in the fundamental mode, thus exhibiting a larger group delay. In Section 9, we shall describe observations giving experimental evidence of the spectral evolution and associated dynamics of collisions between MMSs in multimode GRIN fibers [39].
Furthermore, MMS interactions have been theoretically studied in several papers. In Ref. [67], the interaction between two MMSs in a GIMF, each carrying only the degenerate modes and with consideration of their polarization states, was studied. In analogy with the case of single-mode fibers, these MMSs exhibit an attraction or a repulsion behavior, on the basis of their relative phase, despite belonging to different fiber modes. However, unlike the single-mode scenario, each soliton transfers some power to the other mode via intermodal four-wave mixing. Despite this intermodal power exchange, both MMSs maintain their bimodal nature, and interact as if they were traveling in a single-mode fiber.
In Ref. [68], the interaction between fundamental solitons in single-mode fibers under the influence of Raman scattering was extensively investigated. The study reveals a net energy transfer from the leading pulse to the trailing pulse, akin to the behavior that we have described for the multimode scenarios. Furthermore, the impact of different soliton amplitudes or phases on the fraction of transferred energy was analyzed.
C. Discussion
In this section, we first introduced the eigenmodes and eigenvalues of MMFs, which are important for understanding the nonlinear as well as linear properties of an MMF. By comparing two specific MMFs, a SIMF and a GIMF, we found that the GIMF has dominant nonlinear coefficients for LOMs, thus favoring MMSs composed of low-order modes. Moreover, a GIMF has much reduced modal dispersion, which greatly facilitates the MMS formation. In contrast, a SIMF favors the generation of HOM solitons.
Second, by analyzing the modal eigenvalues, we obtain a piece of key information about the propagation properties of MMFs. For instance, equally-spaced propagation constants in GIMFs enhance mode beating, giving rise to the self-imaging effect. Thus, analyzing for different modes and wavelengths helps us to better understand a key property of the MMS in GIMFs: LOMs propagate faster than HOMs in linear regimes. Therefore, the overall group velocity of MMSs is influenced by their modal content. Building on these fundamental notions, we introduced the GMMNLSE, which is a relevant model for describing ultrashort pulse propagation in a broad range of MMFs with identifiable eigenmodes.
We elucidated a series of phenomena typical of GIMF: MMS formation via compensation of modal walk-off, MMS fission, mode-based SSFS control, and MMS collisions. Specifically, we analyzed how the Raman effect interacts with MMS: it increases the soliton’s wavelength, and LOMs exhibit larger SSFS in GIMFs. This has two significant impacts: first, it reduces the velocity of LOMs, thus diminishing their group velocity mismatch with HOMs and favoring their nonlinear trapping. Second, LOMs with increased red shifts are fed energy by the HOMs with a reduced red-shift, resulting in significant energy transfer from HOMs to LOMs, and eventually leading to Raman beam cleanup into the fundamental mode: corresponding experiments are discussed in Section 9.
Lastly, we discussed the possibility of SSFS manipulation by changing the input modal composition of an MMS. Under specific conditions, two MMSs resulting from the fission of an input pulse and propagating with different modal contents may collide at some point along the fiber, owing to their different group velocities. This section lays the groundwork for deeper explorations into MMS, by either analytical methods or experiments, as discussed in subsequent sections.
4. ANALYTICAL APPROACHES FOR MULTIMODE SOLITONS
In this section, we will review the theoretical framework, leading to analytical results, for describing the dynamics of MMSs. Most of these results rely on the variational formulation of Eq. (26) and the derivation of effective theories for describing an MMS by means of a reduced number of degrees of freedom. In this context, most of the literature focuses on two-mode (or bimodal) soliton formation [69–72]. The multimode case, whose treatment is much more complex, has received much less attention [69]. In Section 4.A, we introduce first the variational formulation for the two-mode soliton case. The Kantarovitch, or average variational approach, which is our main method in this section, is presented in Section 4.B. The effective variational theory obtained with this method for the case of a two-mode bright soliton is presented in Section 4.C. Later, in Section 4.D we will present the virial, or moment method, reviewing the results presented in Ref. [72]. Finally, in Section 4.E we present a generalization of this analysis to a multimode context, following Ref. [69].
A. Variational Formulation of Two-Mode Solitons
When just considering two modes, and neglecting high-order effects (e.g., high-order dispersion, Raman effect), Eq. (26) can be written as where we have considered the tensor symmetries [35]
After a straightforward normalization (see, for example, Ref. [73]), Eq. (35) reduces to with for anomalous and for normal GVD. Note that, under the phase transformation Eq. (37) reduces to which is the starting point for most of the analytical research that has been performed in this system (see, e.g., Refs. [70,71]). A key parameter here is the cross-phase modulation (XPM) coefficient , which couples fields and . With , Eq. (39) is known as the Manakov equations [74]. Alternatively, one can get rid of the walk-off terms by performing the transformation which yields the system
In this context, the effect of the walk-off can be reintroduced as an initial splitting velocity between the modal components of a pulse as it is injected into the fiber. Thus, the effect of temporal walk-off owing to modal dispersion can be transferred to an initial relative frequency shift among the two input pulse components [75,76].
1. Lagrangian and Hamiltonian Formalisms
The set of Eq. (37) can be derived from the Lagrangian density with and the interaction contribution
Then Eq. (39) can be retrieved through the Euler-Lagrange equations
Similarly, the application of the transformation Eq. (40) to the Lagrangian Eq. (42) leads to the new Lagrangian density with from where Eq. (41) can be also derived.
By defining the generalized momenta this system can be alternatively described by Hamiltonian density obtained through the Legendre transform which leads to with and
2. Conservation Laws
From the momentum-energy tensor defined through our Lagrangian [77], the following conservation laws can be deduced. Conservation of power. The powers carried by each mode are defined as These quantities satisfy which means that they are conserved individually along propagation in . This is a direct consequence of the fact that all terms in Eq. (42) are real [72].Conservation of the total momentum. The momentum for each field , is defined as The total momentum is conserved along propagation, i.e., Conservation of the total energy. The last quantity to be conserved is the total energy or Hamiltonian function The conservation law in this case reads
B. Average Variational Approach: Rayleigh-Ritz or Kantarovitch Optimization
In this section, we introduce the averaged variational method, which is widely used in order to compute soliton solutions in an optical context [15]. Originally presented in an optical context by Anderson et al. [78], this method is also known as the Rayleigh-Ritz or Kantarovitch optimization approach and generalizes its stationary version: the Ritz approach [78–80]. This method allows us to compute approximate analytical solutions of a given nonlinear partial differential equation by applying the principle of least action to a parameter-dependent solution ansatz, based on either the Lagrangian or the Hamiltonian formalism. In what follows, we describe the main steps in either the Lagrangian or the Hamiltonian formalism.
1. Kantarovitch Method in the Lagrangian Formalism
The method consists of the following four main steps. First of all, we need to define an approximate ansatz solution, or trial function, that captures the main features and shape of the state that we want to compute. This ansatz is a function of the form , which depends on through a number of parameters: which are the generalized coordinates of the system.Given a Lagrangian density associated with a certain set of partial differential equations [see, e.g., Eq. (42)], the next step in this procedure is to compute the effective reduced Lagrangian function of the system, which, in the context of temporal MMSs studied here, is defined as After that, we obtain the dynamical system for the generalized coordinates by computing the Euler-Lagrange equations for each parameter with , where we have defined .Finally, in the last step, we study the dynamics of the reduced system Eq. (60), and rebuild the behavior of the original system by using the initial ansatz.
2. Kantarovitch Method in the Hamiltonian Formalism
Equivalently, one may consider the Hamiltonian formalism for obtaining an effective reduced system. The process is equivalent to that followed in the Lagrangian case, but now we use the Hamiltonian where are the generalized momenta defined as
Then, the effective dynamics of the system are captured by the Hamiltonian equations of motion for each .
C. Effective Variational Theories for Bimodal Bright Solitons: Escaping versus Bound-State Solitons
Once that variational machinery has been introduced, we will try to answer the following question: which kind of initial condition for Eq. (39) leads to the formation of a two-mode soliton?
Most of the works dealing with two-mode soliton formation focus on a regime where each mode in the fiber experiences anomalous GVD [69–71,75]. In comparison with this case, the mixed scenario has been much less studied [81].
In this section, we will mainly focus on the interaction of bright solitons, where both modal components are propagating in the anomalous dispersion regime. In what follows, we take . In this context, two main behaviors are found: either (i) the two solitons split and escape or (ii) they are trapped, forming a bound state, i.e., a two-mode soliton. These solitons, however, are not static upon propagation, but they undergo oscillations in their separation and peak powers [73]. An example of this regime is depicted in Fig. 12.
Figure 12.Bound state, or trapped, oscillatory regime for . Adapted from Ref. [81].
The threshold between these two regimes deeply depends on two main factors: the strength of temporal walk-off, or the initial relative frequency shift of the pulses, and the strength of the XPM. One of the first researchers who investigated these phenomena was Menyuk in 1987. He found, by performing numerical simulations, that the splitting could be arrested through XPM, and that the threshold for mutually trapped solitons increases with increasing walk-off [73]. After that, many theoretical works focused on understanding this phenomenon and on the analytical determination of that threshold [69–72].
Instead of reviewing these results in chronological order, we have decided to organize them in terms of the variational formalism that was applied.
1. Lagrangian Formalism
Let us first review some results based on a Lagrangian approach. One of the first works to tackle this problem was that of Ueda and Kath [70]. In the following, we will use these results as the main guiding example for didactic purposes.
Their approach considered the Lagrangian Eq. (44), which is more adequate for investigating soliton formation, in detriment of the explicit appearance of the linear walk-off in the results.
The general soliton ansatz they considered was with the phases where , , and are the amplitude, width, and central position of the fields and , respectively. These parameters correspond to the three lowest-order moments of the soliton envelopes. , , and , represent the three lowest-order moments of the phase . These are the nonlinear velocity of the soliton central position, the chirp of the soliton (related to the change of the soliton temporal width), and the linear phase, respectively.
By computing the Euler-Lagrange Eq. (60) with this ansatz, the authors obtained an eight-dimensional reduced dynamical system for describing the evolution of the corresponding ansatz parameters. This effective theory allowed the study of both symmetric and asymmetric soliton configurations. However, due to the complexity of the former scenario, the authors focused on the symmetric case, which is the one that we review in the following paragraphs.
Let us consider two identical solitons that are symmetrically located in time on opposite points around their mean position, and that propagate in opposite directions (in a reference frame propagating with the average group velocity). This implies: , , , , and . With these considerations, the effective Lagrangian yields the Euler-Lagrange equations where and is the conserved mode power [see Eq. (53)]. The fixed points of this system correspond to a shape-preserving two-mode soliton, which is a center of the dynamics; hence it is neutrally stable: any small fluctuation of the system will drive it away from this point [82]. Interestingly, around this point a continuous family of oscillatory states may appear, which correspond to dynamical two-mode solitonic states. We will come back to this in a few paragraphs.
A way to obtain more information about this bimodal soliton system is by studying the effective Hamiltonian associated with our ansatz. After some algebra, the Hamiltonian can be reorganized as , where is a measure of the internal energy in each soliton, which is taken up by pulse-width oscillations and represents the kinetic and potential energy due to the coupling between the and solitons [70]. As we know from the conservation law Eq. (58), , where is the initial energy of the system, that is, .
Depending on the sign of , two behaviors are possible. : bound state regime. The two pulses attract each other, and form a bound state, wherein the pulse central position and width undergo periodic oscillations in [70,73]. This bound state corresponds to a dynamic two-mode soliton.: escaping or splitting regime. When , however, the two pulses have enough initial velocities, or frequency-shift (walk-off) to overcome the XPM-induced attraction, yielding two well-separated independent solitons.
The condition marks a separatrix, i.e., the threshold between these two behaviors. This condition provides the escape velocity .
In the chirp-free case (), the escape velocity for two initially superimposed solitons reads
Thus, the formation of two-mode solitons depends on the initial velocity, mode energy, and on the strength of the XPM interaction . In the phase plane defined by the parameters and [see Fig. 13(a)], the escape velocity is the separatrix between the bound state and the escaping soliton regime.
Figure 13.(a) Regions of bound and escaping solitons in the phase space for . (b) Comparison of simulations carried out using Eq. (39) (solid line), Eq. (65) for (dashed line) and for (dotted line). Adapted from Ref. [70].
In the bound-state regime, there is a continuous family of oscillatory states for continuous values of , which correspond to oscillations of the bimodal soliton central position [see closed orbit in Fig. 13(a)]. Such an oscillatory dynamic can be described by combining Eq. (65a) with (65b). This leads to the second-order ODE where is a constant [see Eq. (65c)].
In a strongly bound system, where is small (), and one gets the simplified equation which describes the relatively small amplitude oscillations of the soliton positions () with frequency . Thus, in the chirp-free case experiences harmonic oscillations in of the form
An example of this oscillation is shown in Fig. 13(b) using a dotted line. Note that the escape velocity can also be determined by transforming Eq. (70) into an energy conservation equation, as discussed in Refs. [71,83].
Obviously, the previous analysis only provides a qualitative description of the bimodal soliton system. A more realistic description requires the consideration of the chirp, and thus the modification of . Indeed, numerical results obtained with the original Eq. (39) reveal the presence of oscillations in both the pulse positions and widths (see Fig. 2 in Ref. [70]); moreover, these two oscillations are not commensurate, i.e., non-periodic.
To better understand the reason for these disagreements, it is necessary to consider the presence of pulse chirp (), and study the full system of Eqs. (65a)–(65d), which now allows for the variation of and thus . This extra degree of freedom shows that the periodic orbits in phase space are not periodic but quasiperiodic, and the pulse oscillations and width are not commensurate. The reason for this is that the kinetic and internal oscillations of this model are now governed not just by one frequency, but by the two frequencies
The comparison between the previous analysis and the numerical results is depicted in Fig. 13(b). The presence of this new frequency, due to the extra degree of freedom (), shows that the orbits do not close and are quasiperiodic: the pulse position and width are not commensurate.
These results were later revisited by Kaup et al. by considering a Gaussian ansatz [75]. Although quantitatively less accurate, this ansatz provides a deeper qualitative insight and analytical predictions not only for the symmetric, but for the asymmetric configuration as well, something that was not possible in previous works. In this context, a more general expression for was found {see Eq. (19a) in Ref. [75]}. Near the condition , this velocity can be approximated also by Eq. (69), but with .
In the bound-state regime, the linear stability analysis around the fixed point of the reduced effective dynamical system reveals the presence of not two, but three internal modes of the two-mode soliton that oscillates with the eigenfrequencies: analogous to in Eq. (72), related to , and a frequency that was not previously predicted, which is responsible for the antisymmetric shape oscillations of the symmetric two-mode soliton [75].
In 1990, Kivshar included explicitly the linear walk-off in the system description [see Eq. (37)]. In this case, he used the Lagrangian density Eq. (42) and a modified version of the ansatz Eq. (63), where the chirp is neglected and [71]. Similar to the work of Ueda and Kath [70], the author only considered the symmetric solution case (i.e., , ). In this context, the reduced Euler-Lagrange equations simplify to where is the relative distance between the two modes’ maxima [same as in Eq. (66)], and represents an effective interaction energy between the two modes, where we have used that .
In the strongly bound regime (), and the combination of Eq. (74) leads to the same Eq. (71), where the oscillatory frequency now reads which is twice the value predicted by Ueda and Kath [70].
By using the conservation of the total energy of the system, a separatrix between the bound state and the escaping soliton regimes was obtained in terms of the walk-off and width of the pulses. Thus, two-mode bound states will form if which leads to the threshold walk-off
In terms of the amplitude of the input pulse , this threshold reads which shows a linear dependence with , in contrast to previous results by Caglioti et al., where such a dependence is quadratic [69]. Note that within this approach, the gap amplitude cannot be directly derived: it was obtained by using a result from the inverse scattering method (see discussion in Ref. [72]).
Figure 14 compares, for , these results with those of Menyuk [73] and Hasegawa [33], who using an analogy based on quantum mechanics, derived a trapping condition with an amplitude threshold which is also proportional to (see the dashed line in Fig. 14). However, their theory does not predict the amplitude gap [33].
Figure 14.The solid line corresponds to the analytical results of Kivshar [71]. The dashed line shows the estimation of Hasegawa [33]. The numerical results of Menyuk are plotted using filled circles [73]. Here . Adapted from Ref. [72].
In our context, these results can be read as follows: when for fixed, the two modes interact slightly, and the symmetric soliton configuration splits into two independent solitons traveling in opposite directions; for the initial pulses form a bound state, i.e., a two-mode soliton, owing to the intermodal coupling described by the effective interaction energy .
Similar results were found by Karlsson et al. when investigating two soliton collisions in singlemode fibers, owing to the effect of XPM for [83]. We recommend this work for its completeness and clarity. A generalization of the Lagrangian formalism for two-mode dynamics when considering parameters in the soliton ansatz was also proposed by Caglioti et al. in 1990 [84].
2. Hamiltonian Formalism
Approaching the problem from a different perspective, one may also describe the dynamics of two-mode solitons in the context of the Hamiltonian formalism. The first time that this formulation was used to describe the dynamics of pulses in two-mode fibers was in 1988 with the work of Caglioti et al. [69]. By deriving an effective Hamiltonian for an arbitrary number of fiber modes (see Section 4.E), they were able to describe the interaction of a bimodal bright soliton complex in the presence of linear walk-off. The authors found that the threshold amplitude for two-mode soliton formation depends quadratically on . However, they were not able to predict the gap that appears when , in contrast to Kivshar [71].
One year later, by applying similar approaches, Afanasyev et al. found that the interaction of two bright solitons could be described by the interaction Hamiltonian [81]
The equation of motion associated with this quantity predicts periodic soliton oscillations with a frequency [81] which is proportional to , and not to as it was derived in other works [70,71].
In 1992, Mesentev and Turitsyn found a way to characterize the stability of two-mode solitons with equal amplitudes, by deriving integral conditions describing the trapping [85]. By applying Lyapunov stability theory, they were able to derive an exact condition for mutual soliton trapping. Thus, for any initial condition satisfying a two-mode soliton state will form. Note that this condition is general and it applies to initial pulses with any kind of shape, in contrast to other works that typically used a sech-type solution ansatz.
D. Moment Approach for Two-Mode Solitons
In 1993, Cao and McKinstrie applied a different approach, based on the virial theorem, for understanding Menyuk’s results [72]. The general idea behind this method, known as the moment or virial approach, is to use the conservation laws of the system (see Section 4.A.2) to obtain an expression of a suitable average value of a physical quantity in terms of the conserved quantities , , and [86]. In our problem, we are interested in the separation distance between pulses , whose first and second moments are given by with , respectively. From the virial equation, describing the evolution in of the variance , Cao and McKinstrie found a sufficient condition for a bound, or trapping state, to form: it reads as
They considered a general situation where the widths of the two pulses, and , can be different from each other. Note that all terms in the previous expression are constants of the motion. Therefore, one may analyze the two-mode behavior by simply considering these invariants, no matter how the pulse shape changes.
By considering a symmetric two-soliton input (i.e., , ), the authors were able to derive the amplitude threshold condition which is plotted in Fig. 14 for two different widths, namely, and 8.
Moreover, considering that pulses do not change their shape during propagation, which is valid for small, and supposing that their separation distance is also small (), they obtained a different value for the frequency at which oscillates: which is also proportional to , consistent with other works (see, e.g., Refs. [70,71]).
E. Multimode Generalization
The generalization of the previous theoretical findings to the case of a number of modes greater than two is practically absent. Only in the work by Caglioti et al. was an effective Hamiltonian derived based on a general ansatz for the description of each mode [69]. However, the Hamiltonian system was only utilized for the study of a single and two-mode scenario.
Neglecting high-order effects, the dimensional coupled-mode equations that Caglioti et al. considered read asfor .
By writing the general ansatz for the th mode as and taking with where is any positive-definite even function normalized to one with unit variance, , , and are the zeroth-, first-, and second-order moments of , namely, and , are the conjugate variables to and [69,84], the effective Hamiltonian associated to this system reads , with and
This Hamiltonian possesses degrees of freedom, and the Hamiltonian equations of motion, become
This reduced Hamiltonian system could be used for investigating, at least qualitatively, the behavior of MMSs in MMFs. However, in their original work, Caglioti et al. only focused on the single and two-mode configurations [84]. As far as we know, this effective theory has not been used yet for studying the dynamics of MMS with .
Furthermore, to the best of our knowledge, the virial approach has not been systematically applied for studying MMS formation and dynamics.
5. SPATIOTEMPORAL SOLITON FORMATION IN THE 3D+1 NLSE DESCRIPTION
In this section, we will review the main theoretical results on the formation and stability of spatiotemporal solitons, hereafter STSs, in GIMF. These states, also known as light bullets [17], form due to a double balance between Kerr nonlinearity and the simultaneous effect of spatial diffraction and chromatic dispersion [5]. In the following, we will focus on the variational formulation as the main tool for obtaining some analytical insight into the properties of these states. Unless stated differently, here we will focus on spatiotemporal spinningless solitons, which bifurcate from the fundamental (i.e., Gaussian) spatial mode, as done in the original works [20,27,28,31,87].
A. 3D+1 Nonlinear Schrödinger Equation
An alternative approach for characterizing the formation and dynamics of MMS considers a 3D+1 NLSE: where , , is the core radius, is the electric field component of the wave propagating along the -direction, represents diffraction, represents the group-velocity dispersion with its strength , and is the 2D parabolic potential describing the transverse spatial profile of the linear refractive index of the fiber [5].
By making the scaling transformations with the previous equation becomes where we have dropped the (′) from all the dimensionless variables. Here , for anomalous/normal dispersion, , and , with () being chosen for guiding (antiguiding) materials.
Note that this equation can also be used in the context of BECs, in order to describe nearly 1D condensates, with a cigar-shaped trapping potential () if we exchange the coordinate with . In this context, models a self-attractive nonlinearity [15]. In the context of BECs, Eq. (95) is commonly referred to as the Gross-Pitaevskii equation [5].
Equation (95) possesses the Lagrangian density [20,28] which contains all the relevant information about the system dynamics, including its conservation laws [77]. Indeed, from the Lagrangian density one recovers Eq. (95) from the Euler-Lagrange equations [20]
Here, the Hamiltonian density reads
Note that an alternative way for deriving Eq. (95) can rely on this functional.
B. Shape-Preserving Spatiotemporal Spinningless Solitons
Here we will focus on studying shape-preserving STSs in the absence of vorticity [15]. Vortex solitons in the context of cigar-shaped Bose–Einstein condensates, which is equivalent to our case, have been investigated by Malomed et al. in Ref. [88]. Thus, we expect these vortex states will be present in our optical scenario.
The shape preserving spatiotemporal states of Eq. (95) can be written in the form , where is the propagation constant, and is a real-valued function, describing the steady-state field [5,15]. When applied to Eq. (95), this transformation leads to the -independent (real) partial differential equation
Similarly to the previous case, the -independent Eq. (99) can be derived from the steady-state Lagrangian density which can also be derived from Eq. (96) by the application of the previous scaling transformation. This new Lagrangian [Eq. (100)] depends explicitly on , in contrast to the Lagrangian density Eq. (96).
1. Ritz Optimization through the Variational Approach
At this stage, we will apply the Ritz-optimization method for finding steady-state STS solutions of Eq. (99). This method, extensively used in the soliton literature [15], is equivalent to the Kantarovitch approach that we have described in Section 5.B: the only difference is that now the solution ansatz parameters do not depend on , i.e., they remain constant upon propagation.
An essential task before continuing is to define a proper ansatz for capturing the main features of these states. The selection of an ansatz is not entirely arbitrary, but it is justified by different preliminary observations, mostly related to the symmetries of the system. For example, in the absence of a 2D parabolic potential () or with radially symmetric potentials, Eq. (95) may have radially symmetric 3D solitons, whose shape can be captured by just considering the radius as the only independent variable of the system [89,90]. However, in our current case, the potential is 2D, and the -dependent ansatz is not valid.
Most of the analytical works on the formation of STSs based on the variational approach have focused on spinningless solitons whose spatial shape is mainly based on a single mode: the fundamental LG mode (), which has a Gaussian profile [20,27,28,91].
An appropriate ansatz for the spinningless solitons that we are considering has the form with the Gaussian profile of width , is the inverse of the temporal width (i.e., ), and is the amplitude of the pulse. The justification of the temporal shape of this ansatz is based on the observation that, in the absence of diffraction and spatial potential, Eq. (95) possesses a sech-shape bright soliton solution in the anomalous GVD regime [5]. Solitons with this shape could also be analyzed by just considering a spatiotemporal Gaussian ansatz, which makes the calculations simpler. This approach was followed by Shtyrina et al. in Ref. [91].
By using the definition of the pulse energy we obtain that and we can make our ansatz [i.e., Eq. (101)] energy-dependent. In this way, the pulse energy becomes the most important control parameter for the STS solutions. Thus, the main parameters of the system are .
With this ansatz, the static effective Lagrangian reduces to
Moreover, the effective Euler-Lagrange Eq. (60) becomes
By combining these expressions, we obtain that the shape-preserving soliton parameters satisfy [20] whereas the propagation constant reads as
Note that all of the previous quantities are parameterized by the spatial width coefficient .
Besides, the soliton parameters allow us to compute the peak soliton intensity (i.e., the intensity at the center of the STS) as
At this stage, we can already obtain some general insights about our system. From Eqs. (104) and (105) we find that, in order to obtain real solutions, it is required that . Furthermore, from Eq. (105) we see that and must have the same sign, in order for to be positive. This last condition yields two regimes, which correspond to the cases of anomalous-self-defocusing regime, and normal GVD-self-focusing regime, respectively. Figure 15 shows the modifications of and with for the former (in red) and the latter (in blue) scenarios.
Figure 15.Bifurcation diagrams for STS states as a function of for the anomalous (in red) and normal (in blue) GVD regimes. The green line represents the CW state of the system. (a) shows the spatial width of the STS as a function of ; (b) shows the inverse of the temporal width ; (c) represents the STS peak intensity . Unstable states are represented with dashed lines, while neutrally stable by solid lines. The horizontal black line corresponds to the fundamental mode. The right column shows an example of the STS solution by using three isosurfaces at different intensities (top), and a cross-section of the same state at constant . Adapted from Ref. [20].
Continuing with the examination of Eqs. (104) and (105), we can also see that when (i.e., if ), and become zero. This means that, with , the temporal width of the state , and the STS becomes the continuous-wave (CW) state of the system, which corresponds to the fundamental Gaussian mode with (see black horizontal line in Fig. 15). In other words, STSs in both regimes bifurcate from the fundamental mode when . This means that , STSs are basically singlemode based. When increasing , however, STSs separate from the state and its multimode composition increases. In what follows, we will consider guiding media, and therefore we shall take .
Equation (104) can also be written in the form or in terms of as and they may have either one or two positive real roots, depending on the signs of , , and . Unfortunately, this equation, in general, does not possess exact analytical solutions, and we need to solve it either by using approximate analytical methods, or numerically. However, for , i.e., very close to the CW where the STS is basically singlemode, one can use the approximation [28]
Let us discuss briefly these two regimes. Anomalous GVD-self-focusing regime. This scenario was initially analyzed by Yu et al. in Ref. [27]. In this regime, Eq. (108) has, for a fixed value of , two real solutions, which correspond to the solution branches (solid red line) and (dashed red line), respectively (see the red dot in Fig. 15). While is neutrally stable, is unstable [20]. These two solution branches coexist between and the fold, or turning point, taking place at . The fold position can be calculated analytically, by solving the equation , which leads to , providing that . The solution of this equation yields the fold parameters Normal GVD-self-defocusing regime. This scenario was partially analyzed by Raghavan and Agrawal in Ref. [28]. In this regime, Eq. (108) has a single real solution, and the solution ansatz parameters are single-valued in . Thus, here there exists just a single STS for any fixed value of . In contrast with the anomalous GVD/self-focusing case, we can see that for the same energy interval, the STS is spatially wider, thinner in time and possesses a lower peak intensity (see Fig. 15).
2. Vakhitov-Kolokolov Stability Criterion
The stability of STSs can be estimated by applying different criteria to the reduced equations for the parameters [20]. With the information that we have so far, we may apply the Vakhitov-Kolokolov stability (VKS) criterion [5,92], which relies on the propagation constant dependence with [see Eq. (106)]. This dependence is graphically illustrated in Fig. 16(a) for the anomalous dispersion regime. The VKS criterion establishes that an STS state is linearly stable (i.e., with respect to small perturbations), if the derivative of the energy with respect to is a positive quantity (i.e., ), and unstable otherwise. The main idea behind this criterion is based on the analysis of the properties of the linear operator associated with Eq. (95), evaluated on the soliton solution. We recommend the interested reader to consult Section 2.3 in Ref. [5] for details.
Figure 16.(a) Dependence of the energy on for different values of in the anomalous GVD regime. Stable (unstable) branches are plotted by using solid (dashed) lines. (b) Region of existence of STSs as a function of (see green shadowed area). The line limiting that area is . Vertical dashed lines correspond to the three cases plotted in (a). Adapted from Ref. [20].
The red curve in Fig. 16(a) corresponds to , the same case that we have studied in the previous section [see red curves in Figs. 15(a)–15(c)]. This criterion shows that is stable (see solid line), while is unstable (see dashed line). The instability threshold occurs whenever , which leads to
This point corresponds to the fold occurring at : in the following, we shall write . This point is marked by means of a red bullet in Fig. 16(a).
Let us briefly study how the STS stability is impacted by the parabolic potential. To do so, in Fig. 16(a) we plot the modification for the other two characteristic values of : specifically, (in blue) and (in black). By increasing , the region of existence of STSs broadens, as the critical point moves towards higher values of . The dependence of upon is shown in Fig. 16(b). Here, the green shadowed area corresponds to the region of existence of STSs. For , only the branch survives, and STSs are always unstable. These results show that the presence of a non-vanishing parabolic potential is essential for the stabilization of STSs. This description was already present in the work of Shtyrina et al. [91].
C. Kantarovitch Optimization Method in the Lagrangian and Hamiltonian Formalism
In the previous section, we have focused on the self-preserving -independent states. However, a description of the propagation of STSs can be also obtained by considering the Kantarovitch approach, where the ansatz parameters are allowed to depend on . This was the approach originally used by Yu et al. [27], and Raghavan and Agrawal [28].
Our new ansatz is a product of the static one [see Eq. (101)] and a spacetime-dependent phase contribution, namely, with where represents the spatial chirp, the temporal chirp, and the phase. This ansatz leads to the effective -dependent Lagrangian
In this case, the Euler-Lagrange Eq. (60) becomes the 4D dynamical system
Note that the equilibria, or fixed points, of this system lead to the same set of Eqs. (104)–(106). Besides, the system Eq. (116) provides information about the propagation behavior and spectral stability of the STS, which was not provided by the shape-preserving analysis carried out in Section 5.B.1. The transient and permanent dynamics of STSs by using this set of equations were investigated in Refs. [20,27,28].
Alternatively, the Kantarovitch approach can be also applied in the framework of the Hamiltonian formalism. In this regard, the reduced Hamiltonian can be retrieved from the density Eq. (98) and the chirp-dependent ansatz Eq. (114), as discussed in Section 5.B. The Hamiltonian Eq. (62) then yields the same system of Eq. (116). A detailed study of our system by using this approach can be found in the work by Parra-Rivas et al. [31].
1. Spectral Linear Stability and Types of Equilibria
As previously mentioned, Eq. (116) provides information about the STS spectral stability. For obtaining this information we may perform a linear stability analysis of the system Eq. (116) about the equilibria . This analysis allows us to determine how the equilibria of the system react against perturbations of the form , where and . This, moreover, allows for their classification according to their behavior.
Very close to a fixed point , the dynamics of the system Eq. (116) are captured by the linear dynamical system where is the Jacobian matrix of the vector field [see Eq. (116)], which is defined by its components as follows:
In our case, the Jacobian matrix becomes with
The stability of the steady STS can be evaluated by solving the linear eigenvalue problem where and are the eigenvalue and eigenvector associated with , respectively. The eigenvalues satisfy the bi-quadratic characteristic polynomial with the coefficients , and
Figure 17 shows the modification of the eigenspectrum associated with stable STSs on and unstable STSs on in the versus diagram in the anomalous GVD regime.
Figure 17.Distribution of the eigenvalues associated with the dynamical system Eq. (116) in the anomalous GVD regime. Adapted from Ref. [20].
In , the spectrum consists of four pure imaginary eigenvalues , with . A fixed point with these eigenvalues is known as a center [20]. Center points are neutrally stable, in the sense that nearby trajectories (i.e., soliton parameter perturbations) are neither repelled nor attracted to it, but they undergo permanent oscillations. To the lowest order in , these four eigenvalues can be approximated by [28]
Further examination of the eigenvectors associated with these frequencies shows that parameters and oscillate at the higher frequency , while and oscillate according to . These two oscillations lead to quasiperiodic oscillations [82], analogous to those predicted for the case of two-mode solitons (see Section 4.C). A numerical study of Eq. (116) regarding this oscillatory regime can be found in Refs. [20,28]. The eigenspectra for STSs on , in contrast, correspond to two pure imaginary and two pure real eigenvalues, namely, with . In this case, the associated equilibria are known as saddle-centers, and are unstable [20].
D. Numerical Confirmation through 3D+1 Simulations
Let us see how the effective theory developed in the previous section matches with the results obtained by solving our original Eq. (95). Here we will only review the results regarding the anomalous GVD regime. We recommend the interested reader to take a look at Ref. [20] for further details.
To solve this initial value problem, we consider as the initial condition the approximate analytical STS solution of Eq. (101), together with Eqs. (128a) and (128b). The -evolution of the initial stable chirp-free STS solution is shown in Figs. 18(a)–18(c) for . In both cases, the top and middle panels compare the -evolution of the STS intensity at its center (blue curve), with the analytically predicted intensity value. As we can see, the evolution of the STS intensity is not constant, but it fluctuates around a value that is slightly larger than what is predicted by the analytical theory. We could understand this, if we remember the STS is a center equilibrium: therefore, it is neutrally stable, which means that even numerical noise may perturb such equilibrium, leading to quasiperiodic oscillations. Our simulations reveal the presence of fast, small amplitude intensity fluctuations as shown in Figs. 18(a) and 18(b). This behavior agrees with that obtained when studying the effective reduced system Eq. (116) [20]. These intensity fluctuations are depicted in more detail in Fig. 18(b) for the reduced interval . The shape evolution of the STS in such an interval is illustrated in Fig. 18(c) by considering two isosurfaces at intensities and , respectively.
Figure 18.Evolution with distance of a stable STS for [see (a)–(c)]. Panel (a) shows the variation of the peak STS intensity versus the propagation distance. Panel (b) shows a close-up view of (a) for the interval . Panel (c) shows the evolution of the STS along the interval shown in (b), obtained by plotting two isosurfaces at (red) and (blue). The dashed gray straight line in (a) and (b) represents the theoretical value of the STS intensity. Adapted from Ref. [20].
The discrepancy between the analytically-obtained stable STS and numerical results becomes larger when we increase STS energy. In Fig. 19(a) we compare the center intensity of STS obtained from the effective theory (in orange) and from numerical simulations in the interval (blue circles). The blue circles and the error bars represent the time-averaged intensity values and the corresponding standard deviation for stable states. For low values of , the agreement is quite good, but it worsens with increasing . Eventually, for energy values above the system undergoes full wave collapse. This region is illustrated by using a red-shaded box. This collapse occurs much earlier than the analytical existence limit predicted by the theory at .
Figure 19.Panels (a) and (b) show the evolution of peak intensity of stable STS with energy for the pure quadratic and pure quartic dispersion scenarios, respectively. In (a), the green line shows the analytical values, while the blue circles and the error bars represent the average intensity values and the standard deviation for stable states. Adapted from Ref. [87].
An example of such destructive dynamics is illustrated in Figs. 20(a) and 20(b) for . The theoretically stable STS maintains its stability for a very short propagation length, but eventually, at , it undergoes wave collapse. This is characterized by a very fast growth rate of the peak intensity. This concentration of the field intensity at the center of the state can also be observed in the STS shell evolution which is shown in Fig. 20(b).
Figure 20.Wave collapse started from an STS for . Panel (a) shows the evolution of the , while panel (b) shows the modification of the STS with the propagation.
The disagreement between theory and numerical results, which to our knowledge was not disclosed in earlier works, might be corrected by considering higher-order self-defocusing nonlinearities (e.g., quintic-order nonlinear terms), or high-order dispersive effects.
E. Stabilization through High-Order Dispersion Effects
At this point, the interesting question arises about what potential mechanism could be exploited for arresting, fully or partially, the wave collapse that we discussed in the previous section. It has been demonstrated before that high-order dispersion effects impact positively soliton stabilization in both conservative and dissipative systems [93–100]. Inspired by these works, Parra-Rivas et al. proposed pure quartic chromatic dispersion as a main stabilization mechanism [87]. In this section we review these results, by showing that pure quartic dispersion is able to increase considerably the stability range of STSs, far beyond the limit that is obtained in the case of pure quadratic dispersion, thus delaying the occurrence of wave collapse.
In the presence of pure quartic dispersion, the normalized version of the 3D+1 NLSE reads with , with being the quartic dispersion coefficient [87]. All other parameters are defined as in Eq. (95). In this case, the Lagrangian density reads as
By applying the shape-preserving transformation and the STS ansatz Eq. (101), one may compute the effective steady-state Lagrangian which yields, through the Euler-Lagrange equation [see Eq. (103)], the following parameter relations:
By combining Eqs. (128a) and (128b), we obtain with , which relates and . By inserting Eqs. (129) into Eq. (128b), we find that is also completely parameterized in terms of the spatial width .
In the guiding () self-focusing regime () with , the system also presents two STSs branches and which coexist within the same energy range, extending from up to the fold () located at , where in this case with . This fold marks an upper energy limit, or threshold, for the STS existence.
The modification of the STS peak intensity with energy in this case is illustrated in Fig. 19(b). The comparison between Figs. 19(a) and 19(b) shows that the STS existence region for pure quartic dispersion is larger than in the quadratic case. The stability of these states was confirmed by applying the VKS and Lyapunov stability [87] criteria.
In this case, the authors also compare the effective theory and the full numerical simulations by using Eq. (126) [see Fig. 19(b)]. This comparison shows that pure quartic dispersion significantly delays the appearance of wave collapse, by increasing by more than twice the -range of STS existence. Although engineering an MMF with pure quartic dispersion appears to be challenging, this is relatively straightforward to do in a ring cavity configuration, by using a waveshaper [98].
6. REDUCED (1D + 1) NLSE AND GRIN SOLITONS
We saw in Section 5.A that a multidimensional (3D+1) NLSE must be used to study the evolution of short optical pulses inside a GIMF. As numerical solutions of this equation are time consuming, one may ask whether the propagation problem could be simplified in some specific situations. It turns out that this is indeed possible in view of the self-imaging property of a GRIN medium. We have seen in Section 2.C.1 that optical beams in the form of CW evolve inside a GRIN fiber in a periodic fashion with a relatively small period (). This feature suggests that we may able to find multimode solitons that maintain a constant temporal width, but whose spatial width evolves periodically along the length of a GRIN medium. It is important to stress that such temporal solitons are quite different from the spatiotemporal solitons discussed in Section 3 because they maintain a constant spatial width as well.
A. Effective (1D+1) NLSE
To examine this possibility, we consider a Gaussian beam launched into a GRIN fiber with a planar wavefront with an electric field oscillating at the frequency . In the CW case, the propagation problem can be solved analytically [101], and the electric field associated with the Gaussian beam can be written in the form where is the initial amplitude of the beam and is its propagation constant. The function , governing the spatial self-imaging of the Gaussian beam, is given by where the beam’s width varies in a periodic fashion as
The parameter , defined as , is a measure of the index gradient, where represents the refractive index difference between core and cladding, and is the core radius. The parameter is defined as where represents the spot size of the fundamental mode of a GRIN fiber, with a typical value close to 5 μm, and is the nonlinear index. The phase-front curvature of the beam in Eq. (132) is obtained using . The phase also varies periodically but is not of concern here. The parameter includes the effects of self-focusing. In practice, its value is such that . Notice that only the fundamental mode of the GRIN fiber is excited when the spot size of the incoming Gaussian beam is narrow enough that . In this situation, the beam’s width does not change with because is close to 1, resulting in .
To apply the CW solution given in Eq. (132) to a pulsed Gaussian beam, we assume that the bandwidth of the pulse is narrow enough that the spatial profile of the beam is not affected much by the nonlinear effects. This assumption is reasonable because the periodic spatial pattern of the beam in Eq. (133) maintains the same periodicity when , and its amplitude is only affected through the parameter given in Eq. (134). As long as the peak power of the pulse remains below the self-focusing critical power , i.e., one has , the pulsed beam should follow the same periodic self-imaging pattern that a CW beam follows inside a linear GRIN medium. With this approximation, we look for solutions of the (3D+1) NLSE Eq. (94) in the form of Eq. (131) but allow the amplitude to depend on both and by using [102] where describes temporal evolution of each pulse within the beam, whose spatial evolution depends on the spatial coordinates and through the known function given in Eq. (132). It is important to realize that does not correspond to any specific mode but can be thought of as a superposition of many modes of a GRIN fiber that are excited by the incident Gaussian beam.
To obtain an equation for , we follow a well-known procedure [56]. When we substitute Eq. (135) in Eq. (94), we obtain
The expression in the second line vanishes because is its solution. We multiply Eq. (136) with and integrate over the transverse coordinates and . The final result can be written as [102] where has units of power and the effective nonlinear parameter is defined as
Physically, represents the local effective cross section of the beam, which varies with as the Gaussian beam undergoes periodic focusing during each self-imaging cycle.
Equation (137) is remarkable because it has the form of a 1D NLSE with the only difference that its effective nonlinear parameter varies with . It shows that the evolution of pulses inside a GRIN fiber can be studied under certain conditions by solving a single 1D NLSE in the time domain, and represents a drastic simplification of the original (3D+1) NLSE given in Eq. (94). According to this equation, periodic oscillations of the spatial width of a pulsed Gaussian beam resulting from self-imaging give rise to an effective nonlinear parameter , which is also periodic in with the same period [see their typical values in Fig. 4(a)]. This is not surprising, because the intensity at a given distance depends on the beam’s width, becoming larger when the beam compresses and smaller as it expands. One can interpret this effect as the formation of a Kerr-induced nonlinear index grating within the GRIN medium. The local refractive index of the medium increases in a periodic fashion only near the focal planes where the beam’s intensity is enhanced.
The integrals in Eq. (138) can be carried out using from Eq. (132). The resulting effective mode area has the form , where the periodic function is defined as
The functional form of becomes clear when we note that at any location scales with the spatial width as . We can use this relation to define , where is its initial value at . This step allows us to write Eq. (137) in the form which reduces to the standard NLSE when . As seen from Eq. (137), occurs when . It turns out that this requirement is met when the width of the incident Gaussian beam is matched to that of the fundamental mode of a GRIN fiber, so that only this mode is excited by the incoming beam.
Similar to the case of singlemode fibers, Eq. (140) needs to be modified for pulses shorter than a few picoseconds [56]. The reason is related to the nature of the nonlinear response of the fiber’s material to an electromagnetic field (see Fig. 2 and detailed description in Section 2.B.4). In general, vibrations of silica molecules also produce a nonlinear response, in addition to that of electrons bound to each atom of the molecule. This response, known as the Raman response, is delayed in time, compared to the nearly instantaneous response of electrons, and this delay becomes relevant for short optical pulses. In the case of silica-based GRIN fibers, the Raman response is included by replacing the Kerr-induced index change in Eq. (94) with where represents the fractional contribution of the delayed response and is the Raman response function of silica molecules, normalized such that . Its use in Eq. (94) leads to the following modified form of the effective NLSE: where the sum includes the second and higher-order dispersion terms in the Taylor expansion of . The spectral width of ultrashort pulses becomes large enough that the contribution of the term may not remain negligible. Often, it is sufficient to consider the first two terms by choosing , but the and higher-order terms may also be needed, depending on the situation.
The Raman response in Eq. (142) affects short pulses through the Raman gain, as detailed in Section 2.B.5. It is worth noting that the functional form of the Raman response depends on the Raman gain spectrum that has been measured for silica fibers. It is useful in practice to have an approximate analytic form for . Assuming a single vibrational frequency of silica molecules is involved in the Raman process, can be approximated as Eq. (31) [103]. Although Eq. (31) is used often in practice, its predictions do not always agree with experiments. For this reason, a modified form of was proposed in 2006 [104]: where and . This form requires and explains better the observed Raman-induced frequency shifts of short optical pulses inside silica fibers.
In the following subsection, we use the 1D NLSE Eq. (140) and its generalized form in Eq. (142) to investigate the formation of multimode solitons inside GRIN fibers. These NLSEs involve only two variables ( and ) and can be solved numerically much faster than the full 3D problem for GRIN fibers involving also the spatial variables and . They include the effects of periodic self-imaging, occurring invariably inside a GRIN fiber, through the function and the two parameters and appearing in its definition in Eq. (139). However, they neglect the impact of temporal dynamics on the spatial features of the beam and cannot be applied under all experimental conditions. It is thus important to summarize the conditions under which the effective 1D NLSE can be used in practice. Weakly guiding approximation requiring must hold. This approximation is often valid in practice as for most GRIN fibers.The core size of the GRIN fiber must be large enough for it to support a large number of modes. As the core radius exceeds 20 μm for most GRIN fibers, this is often the case in practice.The spatial width of the input beam must be smaller than the core diameter but still large enough that many low-order modes are excited simultaneously.The spectral bandwidth of pulses must be narrow enough, so that the spatial cross section of the optical beam does not change much over its entire range at any point within the GRIN fiber.The preceding situation is possible only for multicycle input pulses. Even a femtosecond pulse should be wide enough to contain ten or more optical cycles within its temporal envelope.
Under such conditions, spatial width variations of an optical beam caused by the self-imaging phenomenon affect its temporal evolution but the reverse does not occur, i.e., temporal field profile changes have no effect on the spatial evolution of the optical beam.
B. Temporal Solitons with Spatial Self-Imaging
The NLSE Eq. (140) differs from the standard NLSE because the effective nonlinear parameter is not constant but varies in a periodic fashion. The presence of in this equation indicates that it is not integrable by the inverse scattering method. As a result, it is not expected that this equation would have solutions in the form of ideal solitons that propagate without any change in their shape, or evolve in a periodic fashion while preserving their energy over long distances. Nevertheless, it may support, under certain conditions, the soliton-like evolution of an optical pulse such that its energy is nearly preserved over considerable lengths of GRIN fibers. Indeed, an equation similar to Eq. (140) also occurs in the context of singlemode fiber links employing periodically spaced optical amplifiers for compensating fiber’s losses. It was found in 1990 that a new kind of soliton, called the guiding-center soliton, can form in such fiber links [105]. It is also known as the loss-managed soliton [106].
To find the conditions under which soliton formation is possible inside a GRIN fiber, we solve Eq. (140) numerically with the split-step Fourier method [56]. For this purpose, it is useful to normalize this equation using the variables where and are the width and the peak power of input pulses, and is the dispersion length. The resulting equation takes the form where we assumed and introduced the soliton order as . The periodically varying function given in Eq. (139) can be written as where is the self-imaging period of a GRIN fiber (about 1 mm for typical GRIN fibers). This NLSE reduces to its standard form for because for this value of . A sech-shaped pulse launched with a peak power such that forms a fundamental soliton in this situation and propagates without any change in its shape and spectrum. Clearly, this will not be the case when deviates from one.
As an example, Fig. 21 shows the evolution of such a pulse with over 10 dispersion lengths. These results were obtained by solving Eq. (140) numerically using , , and the initial field , where was chosen to ensure . As expected, both the shape and spectrum change as the pulse propagates inside the GRIN fiber. A nearly periodic evolution seen in Fig. 21 provides a clue that the pulse is forming a higher-order soliton. This feature suggests that the peak power of the input pulse is larger than what is needed for a fundamental soliton, and it should be lowered by reducing to below one. The reason for the behavior seen in Fig. 21 can be understood by recalling the periodic spatial focusing of the optical beam owing to its self-imaging. As the beam is compressed during each self-imaging cycle, its peak power increases considerably, and the nonlinear effects are enhanced.
Figure 21.Temporal and spectral evolutions of an optical pulse, launched into a GRIN fiber with a peak power such that using the parameter values and .
The solitons evolve on a length scale provided by the fiber’s dispersion length , whereas self-imaging occurs on a length scale related to the index gradient of the GRIN fiber as . The ratio of these two lengths, , plays an important role as is also evident from Eq. (146). Typically, is close to 1 mm, whereas the dispersion length exceeds 1 m even for if we use , a typical value for silica fibers at wavelengths near 1550 nm. As a result, is a large number () under typical experimental conditions. Physically, it represents the number of times self-imaging of the input beam occurs inside a GRIN fiber over a distance of one dispersion length.
Solitons cannot respond to beam-size changes taking place on a scale of 1 mm or less when exceeds 1 m. This suggests that we should average Eq. (145) over one self-imaging period. If we write its solution as , where represents average over one self-imaging period , perturbations induced by spatial variations remain small enough that they can be neglected (as long as ). In other words, the dynamics of the soliton can be captured by solving the averaged NLSE where is the effective soliton order defined as
As Eq. (147) has the form of the standard NLSE, it follows that it has a solution in the form of a fundamental soliton when , or . Figure 22 shows the numerical results obtained when we solved Eq. (145) with this value of using . As seen there, both the shape and spectrum of the launched pulse do not change much over a distance of . This is also apparent from the bottom panels of Fig. 22, where the shape and spectrum of the pulse at are compared with their input shapes. The temporal intensity profiles overlap almost perfectly. Perturbations caused by the periodic self-imaging can be seen in the spectra but only at a level of below 20 dB. We call such solitons GRIN solitons because a GRIN medium is required for their formation. Note also that GRIN solitons can form for a wide range of pulse widths, as long as the peak power of pulses is adjusted to satisfy the condition .
Figure 22.Temporal and spectral evolutions of a fundamental soliton, launched into a GRIN fiber with a peak power such that using the parameter values and . Corresponding intensity profiles are compared at a distance of 0 and in the bottom panel.
An important question is related to the stability of GRIN solitons. They are not expected to remain stable over long distances simply because Eq. (140) is not integrable by the inverse scattering method. Small perturbations caused by the periodic spatial self-imaging of the solitonic beam build up over long distances and eventually begin to lose energy through the emission of dispersive radiation. This feature is seen clearly in Fig. 23, where the evolution of a GRIN soliton inside a GRIN fiber is shown over 100 dispersion lengths using the same parameter values used for Fig. 22. A comparison of these two figures shows that the GRIN soliton begins to emit dispersive radiation after 80 dispersion lengths. A comparison of temporal and spectral profiles of a fundamental GRIN soliton at the input and output ends shows that noticeable changes indeed occur at a distance of .
Figure 23.Temporal and spectral evolutions of a fundamental soliton over a distance of using the same parameter values used for Fig. 22. Corresponding intensity profiles are compared at a distance of 0 and in the bottom panel.
Higher-order solitons for which both the spatial and temporal widths evolve periodically can also form inside a GRIN fiber [107]. The period of temporal oscillations is set by the dispersion length and is much longer than the period of spatial-width oscillations. This feature is seen clearly in Fig. 24, where the evolution of a second-order GRIN soliton inside a GRIN fiber is shown over 10 dispersion lengths using the same parameter values used for Fig. 22. In contrast with the fundamental soliton, a second-order soliton is perturbed very strongly by the periodic spatial self-imaging of the optical beam. This is apparent from the bottom panel in Fig. 24, where the temporal and spectral profiles of the soliton are compared to those of the input pulse at a distance of . Among other things, the pulse’s spectrum broadens considerably and develops an oscillatory structure in its wings. The temporal intensity profile also develops fluctuations outside of its central region whose magnitude exceeds 10% of the peak level. Higher-order solitons also break up into multiple fundamental solitons if third-order dispersion is included in Eq. (140).
Figure 24.Temporal and spectral evolutions of a second-order soliton over a distance of using the same parameter values used for Fig. 22. Corresponding intensity profiles are compared at a distance of 0 and in the bottom panel.
As we discussed earlier, higher-order effects that become important for short pulses can be included through the generalized NLSE given in Eq. (142). If we use dimensionless variables and include both the third-order dispersion and intrapulse Raman scattering, we can write this equation in the form where . This equation allows us to study the Raman-induced spectral shift of pulses shorter than 1 ps. This shift is enhanced in GRIN fibers, compared to step-index fibers, because of the periodic self-imaging of a pulsed optical beam inside such fibers [108].
The impact of higher-order effects is even more interesting for higher-order solitons because of a fission process that breaks them into multiple fundamental solitons of different widths. We expect the fission of higher-order solitons to occur even in the case of GRIN fibers because the underlying dynamics remain the same, except for the enhancement of the nonlinear effects when the spatial width of the beam shrinks in each self-imaging period. An important question is whether the self-imaging leads to new features in GRIN fibers during the fission of a higher-order soliton.
Figures 25 and 26 compare the evolution of a fourth-order soliton (width 100 fs) over one dispersion length in the time and spectral domains by solving Eq. (149) numerically using , , and . The parameter has a value of one in Fig. 25 and 0.5 in Fig. 26. Recall that only the fundamental mode of a GRIN fiber is excited when , and no periodic self-imaging or spatial focusing of the beam occurs in this situation. In contrast, multiple modes of a GRIN fiber are excited for , and the pulsed beam undergoes spatial focusing during each self-imaging period.
Figure 25.Temporal and spectral evolutions of a fourth-order soliton () over one dispersion length inside a GRIN fiber with and . Higher-order effects are included using and . Periodic spatial focusing of the pulsed beam does not occur for .
Figure 26.Temporal and spectral evolution of a fourth-order soliton () over one dispersion length inside a GRIN fiber. Parameter values are identical to those used in Fig. 25 except that the value of has been reduced to 0.5. Periodic spatial focusing of the pulsed beam occurs when because multiple modes of the GRIN fiber are excited when .
A comparison of Figs. 25 and 26 shows that the fission of the fourth-order soliton occurs near in both cases. The original soliton breaks into multiple fundamental solitons of different widths. Intrapulse Raman scattering shifts their spectra toward lower frequencies by different amounts and reduces each soliton’s speed, resulting in a bent trajectory. The shortest soliton with the largest spectral shift slows down the most, resulting in a bent trajectory that shifts to the right most. The spectrum becomes considerably broader at the output end of the GRIN fiber and exhibits multiple peaks. A new feature is the appearance of a blue-shifted peak just after the fission occurs. This peak belongs to a dispersive wave, generated because solitons shed radiation when perturbed by third-order dispersion, a well-known feature in single-mode fibers [56].
The spectral evolution seen in Fig. 26 develops considerably more structures and has multiple sidebands that result from the periodic spatial focusing of a pulsed beam inside a GRIN medium for . The origin of the sidebands is related to the phenomenon of resonant dispersive radiation [41,59] and geometrical parametric instability [60], as discussed in Section 2.C. In the time domain, one can see low-intensity dispersive waves emitted at the frequencies of these sidebands.
7. QUADRATURE NUMERICAL MODEL
In this section we describe a Gaussian quadrature approach for the numerical simulation of the generalized nonlinear Schrödinger equation (GNLSE). This approach is similar to the split-step Fourier method that is ubiquitously used for simulation of the standard NLSE, but it uses a mode decomposition based on the natural eigenmodes of the fiber and performs the transformation to this modal basis using an optimized Gaussian quadrature scheme.
A mode decomposition is generally accomplished by projecting the field inside the fiber onto an orthogonal set of linear eigenmodes. These modes maintain a fixed transverse profile and propagate with only a phase shift in the absence of any nonlinearity. The projection is achieved by the evaluation of an integral that can be approximated by a weighted sum of field values that are sampled on a discrete grid of points. In Gaussian quadrature [109] one selects both the weighting coefficients and the location of the sample points (abscissas) to enable an exact mode projection for the first modes. The transformation to and from the modal basis, as well as the linear propagation step, can then be carried out as a single multiplication with a predetermined matrix.
A similar numerical quadrature approach to propagation in nonlinear multimode waveguides was previously introduced by Lægsaard; see Refs. [110,111]. This method has its starting point in the coupled mode model and simulates the GMMNLSE by using on-the-fly real-space Gaussian quadrature integration of the nonlinear polarization. This alleviates the scaling problem common to more conventional approaches that compute the nonlinear polarization by the summation of mode overlap integrals. In Ref. [110] Lægsaard considered a multimode fiber with a step-index profile, and showed that the numerical complexity of the Gaussian quadrature approach scales linearly or at most quadratically with the number of modes. In particular, it was found that this scheme enables superior computational performance for the nonlinear propagation in a step-index fiber having more than six guided modes, when compared to the overlap integral approach. It was moreover demonstrated that the Gaussian quadrature approach can be extended to accurately account for mode profile dispersion and the associated frequency dependence of the effective area.
A. Radially Symmetric Model
To illustrate the method, we first consider the non-dispersive case where the beam dynamics is modeled by a radially symmetric GNLSE with a parabolic refractive index profile:
In the linear limit when the power is sufficiently low or when the nonlinear-index coefficient one finds that the last term can be neglected, and that the GNLSE has an orthogonal set of Laguerre-Gaussian eigenmodes: where and are Laguerre polynomials with integer mode number . These ring-shaped modes can be found, e.g., by variable separation and have a constant normalization factor that has been omitted for clarity. Because of the functional dependence of the Gauss-Laguerre modes, it is convenient to rescale the radial distance and introduce the dimensionless variable and the nonlinear coefficient , so that Eq. (150) becomes
The different eigenmodes can then be written more compactly as with the propagation constant .
The Laguerre polynomials and the Gauss-Laguerre functions form a complete basis for the linear combination of guided modes that can propagate inside the fiber and have the important property of being mutually orthogonal with respect to the weight function . This means that the scalar product where is the Kronecker delta that vanishes unless the indices of the two modes are equal. The orthogonality is used in the Gaussian quadrature method to approximate the mode projection integral by a finite term sum with weighting coefficients that are evaluated on a discrete grid of non-equidistantly spaced points . This approximation can be shown to be exact when the field is a linear superposition of modes in the form of weighted orthogonal polynomials of degree or less [109]. In Gauss-Laguerre quadrature the abscissas are chosen as the different zeros of the Laguerre polynomial , i.e., as the roots of the equation with being the th zero. The corresponding weighting coefficients are given by [112]
To apply the method we expand the spatially varying electric field envelope using the eigenmode functions as where the complex spectral coefficients determine the longitudinally evolving mode occupancy. These are obtained from Eq. (155) by identifying the integrand and approximating the projection operation using the orthogonality relation Eq. (154) with
This summation can be efficiently performed as a matrix multiplication of the row vector with the matrix . Given the spectral coefficients we can perform the inverse transformation and obtain the field values by simply evaluating the summation in Eq. (158) as another multiplication with the matrix . In particular, it is seen that the product of these two operations corresponds to the orthogonality relation given in Eq. (154).
The benefit of projecting the field onto the eigenmode basis is that one directly can simulate radially symmetric fields, without using a Cartesian coordinate system where the Laplacian operator must be evaluated either in 2D or approximated by the second derivative in 1D. The method can moreover be extended to enable spatiotemporal simulations of modal dispersion, as we will see in the next subsection. Because the linear and nonlinear terms in the GNLSE Eq. (150) have different natural bases where their solution is particularly easy, we use operator splitting to separately determine the linear and nonlinear evolution of the field in the same way as in the ordinary split-step Fourier method. The equation governing the linear propagation step for , viz., has the solution given by a linear combination of Eq. (153) in the eigenmode basis. Assuming a fixed propagation distance we can therefore evolve the field in the spectral domain by multiplying each component with a simple phase factor that may be written in the form of a diagonal matrix . Each step of transforming to the eigenmode basis, applying the propagation operator, and performing the inverse transformation is a linear operation that can be reduced to a single matrix multiplication: where we have introduced the constant matrix operator that is calculated beforehand. Note that this also applies to methods using the equidistant Fourier basis when the discrete Fourier transform is represented by the matrix operator , but where the steps are usually performed in sequence to take advantage of the faster scaling of the FFT algorithm.
Conversely, the nonlinear propagation step is governed by the equation which describes self-phase modulation and which has the real space solution which can be represented by the nonlinear matrix operator . Here, we assume a simple dielectric fiber material with an instantaneous Kerr nonlinearity, but the method can easily be extended to handle Raman, self-steepening, and other higher-order effects.
With the separate solutions for both the linear propagation step Eq. (161) and the nonlinear step Eq. (163) determined, we sample the field at the grid points and apply them alternately. Specifically, we use the fact that the evolution equation where and are linear differential and nonlinear operators, respectively, has the formal solution , which can be approximated by the factorized solution with second-order accuracy for small [56].
To apply the method it is necessary to have function values for the Laguerre polynomials at the grid points . These values can be generated recursively from and using the recurrence relation
After selecting the number of modes , we proceed with these preprocessing steps: determine the abscissas by solving Eq. (156) numerically;generate a matrix with the Laguerre polynomials from the recurrence relation Eq. (165);calculate the quadrature weight coefficients from Eq. (157);generate the forward and inverse transformation matrices and .
The results of these steps are saved, so that they can be reused in multiple simulations. They need only be repeated if the number of modes is changed in order to, e.g., adjust the trade-off between numerical precision and computation time.
A disadvantage of the above Gauss-Laguerre quadrature is that the endpoints of the radial interval are not included among the sampling points. The lower endpoint at the origin can be particularly important since the optical field will often have its maximum intensity in the center of the fiber and the field at this point will therefore experience a large nonlinear phase shift. To include the origin we can employ an alternative Radau-Laguerre quadrature scheme for the radially symmetric mode. Modes of higher angular order are not considered since they are identically zero on axis and have a singularity in the corresponding quadrature scheme.
The modified Radau-Laguerre quadrature relation is given by [113] where are now the zeros of the generalized Laguerre polynomial and
Expanding the field as before, we have the spectral coefficients which again can be written as a matrix multiplication. In fact by extending the set of abscissas with one can use the same numerical propagation code with either quadrature scheme by simply changing the definition of the matrix operators.
When performing any type of numerical simulations, it is important to consider issues of accuracy, numerical errors, and convergence. The transformed GNLSE Eq. (152) conserves two integral quantities corresponding to the power and the Hamiltonian
These invariants are useful to check on the accuracy of the numerical scheme by comparison of their values for the initial and final fields. The derivatives may be evaluated from knowledge of the expansion coefficients by using and the property that , where .
B. (3D+1) Quadrature Model
The field in a multimode fiber can generally have both angular and temporal dependence. Here, we consider the spatiotemporal dynamics in a parabolic GIMF with an instantaneous Kerr nonlinearity that is governed by the (3D+1) GNLSE where is an operator for transformation from the linear eigenmode basis, which denotes that the argument should be applied individually to each mode , and the longitudinally evolving field is a function of the two transverse coordinates with radius and time . The nonlinear term can be neglected in the linear limit when the input power is sufficiently low, while the modal dispersion term can be disregarded by considering a CW field in the form of a single frequency beam. The remaining terms in the model are then the diffractive transverse Laplacian operator and the parabolic refractive index potential. In this limit, the equation has an orthogonal set of separable Gauss-Hermite eigenmode solutions that in Cartesian coordinates are given by where are Hermite polynomials of integer order and we have introduced the dimensionless scaled variables and . We note that the function values of the Hermite polynomials can be generated from and through the recurrence relation
Alternatively, one can use polar coordinates and the generalized Gauss-Laguerre eigenmode solutions with as in the previous subsection. However, these are less suitable as a basis for implementing a Gaussian quadrature scheme, since the abscissa points are not the same for modes of different angular orders. Each order requires the use of a separate sampling grid. This makes the numerical implementation inefficient: it is preferable to expand the field by using the Gauss-Hermite basis instead. The spectral expansion coefficients for the Gauss-Hermite modes can be converted for problems where it is desirable to work in the Gauss-Laguerre basis; see Ref. [114].
Analogous to the radially symmetric model, the Gauss-Hermite functions are mutually orthogonal for each component and form a complete basis. They satisfy the orthogonality relation with the basis functions
The quadrature relation for the Gauss-Hermite functions is given by [112] where the abscissas are the different zeros of the Hermite polynomial , i.e., , and the weight coefficients
Importantly, the Hermite basis functions are separable, and the abscissas for the - and -directions are therefore independent of each other. This permits the transverse field to be sampled on a single two-dimensional grid in the form of a rectangle. The abscissas are located symmetrically for positive and negative values, and the origin can be included by simply choosing an odd number of points.
Using the basis functions Eq. (176), the field envelope is expanded as a double sum: with a matrix of longitudinally evolving spectral coefficients . These are obtained from the quadrature relation Eq. (177) with as which can be implemented by matrix multiplication with and from the left and right, respectively. Meanwhile, the inverse transformation is given by where and . It is convenient to consider a quadratic grid with the same number of sampling points for each transverse dimension. It is then possible to use the same abscissas and and the two and matrices become the transpose of each other, i.e., and .
The (3D+1) GNLSE includes modal dispersion that is taken into account by storing the field in a three-dimensional array where two dimensions are used for the transverse spatial coordinates and one dimension is used for the temporal field variation. A major advantage of using Gaussian quadratures instead of spatial Fourier transforms is that separate dispersive phase shifts can be applied to each individual mode . This is done through another frequency-dependent matrix multiplication in the linear step after transforming the field to the eigenmode basis. The modal field is sampled on an equidistant grid in time and is subsequently transformed to and from the frequency domain by applying FFTs along the temporal dimension of the array.
The time-independent version of the evolution Eq. (171) has three invariant integral quantities: with , that corresponds to the conservation of power, angular momentum, and Hamiltonian, respectively. These can be evaluated from the spectral coefficients using the relation , which follows from , and are useful for verifying the numerical accuracy.
C. Simulation Results
To illustrate the use of quadrature models for the simulation of spatiotemporal solitons, we numerically solve the GNLSE for a GIMF with a parabolic refractive index profile using a radially symmetric Radau-Laguerre split-step code with constant temporal group-velocity dispersion. As initial conditions we consider the stationary light bullet ansatz where is the peak amplitude on the radial axis at the center of the bullet, is the temporal pulse duration, and is the beam width. These parameters determine the pulse energy (see Section 5.B.1) that can be used as a control parameter for specifying the soliton solution, as previously shown in Section 5.B.
Working in normalized variables, we assume a pulse energy and set the initial soliton parameters using the semi-analytical expressions from the variational approach. The simulations are performed with 50 radial modes and 1024 points in time. The spatiotemporal intensity profile of the soliton obtained after a normalized distance is shown in Fig. 27. The longitudinal evolution of the peak amplitude, spatial beam width, and temporal pulse duration is meanwhile shown in Fig. 28. Here, the beam width and pulse duration are extracted as root-mean-square values from the two moments respectively. The results show oscillations due to a mismatch of the initial ansatz with the actual light bullet solution. These consist both of radial self-imaging oscillations with spatial period and temporal breather oscillations, similar to those that occur in the emission of radiation by conservative solitons with a non-integer soliton number in the standard (1D+1) NLSE.
Figure 27.Space-time plot, showing intensity of a light bullet with normalized energy .
The time required to simulate the light bullet evolution using the quadrature model was about 30 s on a laptop computer.
8. MULTIMODE PROPAGATION IN THE PRESENCE OF RANDOM MODE MIXING
In this section, we focus on the propagation regime that is encountered in the realm of space-division multiplexed (SDM) optical communications in fibers supporting multiple modes [115]. Space-division multiplexed transmission has been in the spotlight of the optical communication research for more than a decade, as it is considered the only viable solution to scale the throughput of fiber-optic communication systems in a sustainable way [116]. An important difference between SDM and transmission in parallel singlemode systems is that SDM entails some form of integration of the transceivers, the optical amplifiers, the network elements, and, ultimately, the lightpaths, as is the case when few-mode fibers (FMFs) and multi-core fibers (MCFs) are utilized. Compared to the propagation regimes discussed in the previous sections of this paper, SDM transmission differs primarily in the targeted reach, which extends from tens to thousands of kilometers, and in the properties of the transmitted signals, whose average power is of the order of mW, and whose bandwidth is of a few tens of GHz, in typical wavelength-division multiplexed transmission systems.
On the one hand, the significantly longer reach magnifies the impact of linear propagation effects, primarily stemming from manufacturing imperfections and environmental perturbations, which are unavoidable in long fibers. These perturbations are random in nature and therefore they are responsible for inducing random linear coupling between the propagating modes. On the other hand, the relatively small transmission power reduces the impact of nonlinear propagation effects, which are typically modeled within a perturbation approach. Most importantly, nonlinear propagation effects are characterized by a much greater length scale than linear propagation effects, with the result that the nonlinear dynamics is influenced by the random mode coupling. Specifically, the effect of linear mode mixing is to average out the coherent terms in the coupled NLSEs. These terms are responsible for nonlinear power transfer between modes, while the remaining incoherent terms produce cross-mode phase modulation, which is therefore the dominant nonlinear propagation effect in SDM fibers.
The modeling of linear coupling requires supplementing Eq. (26) with additional terms that can be conveniently described by using a generalized Jones formalism [117]. In the frequency domain, neglecting mode-dependent loss [118], linear propagation is described by the following equation [note that, in contrast to Eq. (26), here no moving reference frame is assumed]:
Here is the electric field’s generalized Jones vector, whose components are the Fourier transforms of the complex envelopes describing the excitation of the individual modes and polarizations. (In this section we account for the two-fold polarization degeneracy of each spatial mode, consistent with the need of addressing all space and polarization modes in multiple-input-multiple-output SDM transmission.) The vector collecting the complex envelopes is sometimes referred to as the field hyper-polarization vector. The term is a Hermitian matrix describing frequency-dependent mode coupling. In all cases of practical relevance, it is sufficient to expand to the second-order around the central frequency : where . In the absence of perturbations to the ideal fiber structure, the matrices , , and reduce to a diagonal form and their elements are the propagation constants, group delays per unit fiber length, and chromatic-dispersion coefficients of the individual modes, respectively, with the modal dependence of the diagonal terms of being responsible for the phenomenon of modal dispersion. In the presence of perturbations, off-diagonal terms appear, along with additional traceless diagonal terms [117]. The effect of the perturbations is most relevant on , and less on , while it is typically negligible on . This is consistent with the observation that perturbing the modes’ optical phases requires less effort than perturbing their group velocities and even less group-velocity dispersion. Therefore, in the following we neglect the effect of perturbations on .
An important property of multimode propagation in SDM fibers is the existence of groups of modes with very similar propagation constants. Modes belonging to the same group are referred to as degenerate or quasi-degenerate, and they undergo strong random coupling during propagation. On the other hand, modes belonging to different groups couple weakly even over long propagation distances, and they are referred to as non-degenerate. This situation can be conveniently described in the field hyper-polarization vector by grouping together quasi-degenerate modes, so that the vector results from stacking on top of other hyper-polarization vectors with lower dimensions. Consistently, the matrix can be divided into blocks, each describing either intra-group coupling or inter-group coupling, while the matrices and are block-diagonal, as is clarified in the following.
Moving to the modeling of the Kerr nonlinearity, here we restrict the discussion to considering the instantaneous contribution only, in which case the coupled propagation equation can be recast in the following vector form: with where, by , we denote a vector whose -th element is equal to one and all the others to zero, and is the nonlinearity coefficient of the fundamental mode defined in Eq. (5) (in this section we drop the superscript for ease of notation). The definition of the nonlinearity coefficients accounts for polarization, and their expressions can be found in Eq. (17) of Ref. [66].
A. Single Group of Strongly Coupled Modes
A particularly relevant propagation regime in SDM systems is the one where all the transmitted modes undergo strong and random mixing along the fiber. This is the case in coupled-core MCFs [119], where the fiber cores are spaced to the extent that the supermodes have sufficiently similar propagation constants to behave almost as degenerate modes. In this regime, the matrix can be expressed as , where is the mode-averaged propagation constant, while accounts for the modes’ propagation-constant mismatch relative to , as well as for the effect of the perturbations. The matrix can be expressed as , where is the mode-averaged mode delay per unit propagation length, while accounts for the differential mode delays relative to , as well as for the effect of the perturbations. Finally, the matrix is typically approximated as , where is the mode-averaged chromatic-dispersion coefficient. Note that the terms involving and are responsible for a mode-independent phase and group delay, respectively, and therefore they can be dropped from the propagation equation with no consequences on the analysis.
As for the nonlinear term , a dramatic simplification is obtained by noting that the perturbations produce full randomization of the propagating field hyper-polarization state on a length-scale over which nonlinear effects are not appreciable for typical transmission signal power levels. As a result of this separation of length scales, can be replaced with its average, performed with respect to the isotropic distribution of the field hyper-polarization in the generalized Jones space. The only plausible form of resulting from this procedure is , where is the total optical power carried by the multi-modal field, given by the sum of the optical power in each mode. This follows from noting that since the generalized Jones vector is isotropically oriented in its space, there can be no privileged direction other than its own, and hence the nonlinear term must be aligned with . On the other hand, owing to the isotropic argument, the magnitude of can only depend on the magnitude of . Therefore, is the only form of that at the same time is compatible with the above requirements and with Eq. (191). This argument indicates that the nonlinear interaction between strongly coupled spatial modes takes the form of cross-phase modulation, similar to the nonlinear interaction between polarization modes in singlemode fibers [56].
The derivation of the expression for the nonlinearity coefficient is described in Refs. [66,120]. It is based on the observation that should ensure that the average infinitesimal phase shift over a length obtained with the averaged expression is preserved, that is, or equivalently, using Eq. (191),
The problem at hand is to find such that Eq. (193) is valid when averaged over an isotropic distribution, for a fixed amplitude of the field vector . The derivation starts by noting that an isotropically distributed unit vector can be described as , where is a vector whose components are identically distributed, statistically independent zero-mean complex variables with Gaussian distribution and correlations and . The angular degrees of freedom of are independent of the intensity , so that they can be averaged separately. Being the angular distribution of equal to that of , if we average Eq. (193) with replaced by and divide both sides by the average of , we remove the dependence on the statistics of and obtain the same result as if the averages were performed over the isotropic distribution of the unit modulus vector .
The above discussion suggests that can be derived by averaging both sides of Eq. (193) with replaced by . In doing so, we utilize at the left-hand side the independence of the components of and the property of independent Gaussian variables that allow us to average them pairwise, so that we get for the average inside the sum . In averaging the right-hand side, we exploit that the distribution of is a chi-square with degrees of freedom (the real and imaginary parts of modes), each one with variance 1/2, resulting in . Equating the results of these two averages yields
The propagation equation for hence simplifies to [120]
In the case of singlemode fibers, the nonlinearity coefficient reduces to , and to the familiar value of 8/9 when the instantaneous Raman contribution is ignored [121]. It is customary expressing Eq. (195) in a reference frame that accounts for the random mode coupling, that is, by defining the field through , with . This results in the disappearance of the first term on the right-hand side of Eq. (195), which, dropping the primes for ease of notation, becomes where . This equation is the generalization to the multi-dimensional case of the equation that describes nonlinear propagation in single-mode fibers with polarization-mode dispersion (PMD), also known as the PMD-Manakov equation [121]. The first term on the right-hand side describes the phenomenon of spatial mode dispersion (MD), which is the multi-modal equivalent of PMD [122].
In the absence of MD, Eq. (196) reduces to the form of a generalized multi-component Manakov equation [123], which in the singlemode case coincides with the famous Manakov equation [124], which was found to describe nonlinear propagation in singlemode fibers in the presence of random polarization-mode coupling [125]. In this limit, Eq. (196) is known to be integrable by using the inverse-scattering transform method, and to admit vector-soliton solutions. The existence of this class of solutions was used in Ref. [120] to test the accuracy of the multi-component Manakov equation in describing multimode propagation in the regime of strong mode mixing.
Modal dispersion renders Eq. (196) non-integrable by introducing frequency-dependent random-mode coupling. In SDM systems, this effect is one of the key factors that determine the complexity of the multiple-input-multiple-output (MIMO) digital-signal processing (DSP) that is required to equalize the fiber-optic channel [126]. The magnitude of MD is typically quantified in terms of the duration of the fiber intensity impulse response (IIR) [127], which grows proportionally to the square-root of propagation distance as , where the proportionality coefficient depends on the fiber design and perturbation statistics [128]. As a result, MD prevents the existence of soliton solutions. Its effect on the propagation of an input waveform corresponding to the fundamental soliton of Eq. (196) in the absence of MD is illustrated in Fig. 29 in the case of a fiber with strongly coupled modes. The three panels show the evolution of for the input field , with and . The numerical values assumed for the plot are , , and [120], while the values of are displayed in the individual panels. The plot shows that when MD is sufficiently small, the evolving field preserves its input intensity profile, whereas this is disrupted already after a few dispersion lengths as MD increases.
Figure 29.Evolution of an input hyperbolic-secant waveform , which in the absence of mode dispersion would result in a fundamental soliton, in a fiber supporting strongly coupled modes for the displayed values of the mode-dispersion coefficient . Time and propagation distance are normalized to the input-waveform width and the fiber dispersion length , respectively.
A more general propagation regime in fibers for SDM is the one in which multiple groups of non-degenerate modes propagate with little-to-negligible inter-group local coupling. This is the case of both SIMF and GIMF, where the two-fold degenerate fundamental mode (with two degenerate polarization modes) and the four-fold degenerate mode group (with its two degenerate modes and ) form a typical example of non-degenerate mode groups.
Nonlinear propagation in this regime can be conveniently described by organizing the elements of into vectors of reduced dimensions, each describing a single-mode group. Without loss of generality, in the following we consider the case of two mode groups, which we denote by and (the generalization to the more general case of an arbitrary number of mode groups is straightforward). The propagation Eq. (190) can therefore be expressed as
Here and are vectors of dimensions and , respectively, with collecting the first and the last elements of . All the matrices are either or dimensional, except for the matrix describing instantaneous inter-group linear coupling, whose dimension is . (Note that only instantaneous inter-group coupling has been taken into account, whereas high-order terms in the expansion of off-diagonal blocks are neglected, as is appropriate in cases of practical relevance.) The nonlinear terms have the following form:
Owing to the strong intra-group mode mixing, similar arguments used for the derivation of the multi-component MD-Manakov Eq. (196) can be used to simplify the nonlinear terms in Eqs. (199) and (200). To this end it is critical to assume statistically independent random coupling processes in the two mode groups. Under this assumption, the only plausible forms for and are and . This indicates that, similar to the case of a single group of modes, inter-group nonlinear interaction takes the form of cross-phase modulation, with each mode group imposing a phase modulation proportional to its overall optical intensity onto each of the modes of the other mode group.
The derivation of the expressions for , , , and is described in Ref. [129]. In this case too, the derivation is based on the observation that the average infinitesimal phase shift over a length obtained with the averaged expression is preserved, that is, for , or, using Eq. (191),
The only terms whose average is nonzero are those for which either and , or and . Moreover, all terms for which , and range between one and contribute to , with the same result obtained for the case of a single group of modes, that is,
When and , with , the average of splits into the product of the two averages of and . Similarly, when and with , the average of reduces to the product of the two averages of and . These terms contribute to , with the result
Along the same lines, one may also find that and . Assuming a moving reference frame that accounts for intra-group random-mode coupling in each mode group, Eqs. (197) and (198) simplify to the following form: where the matrices and describe intra-group mode dispersion, and and are the intra-group mode-averaged group delays per unit propagation length. Note that when , Eqs. (206) and (207) may be used to describe nonlinear propagation in weakly coupled core MCFs [130] or in fibers guiding optical-angular momentum modes. It is also interesting to note that in the absence of intra-group MD and inter-group linear coupling, Eqs. (206) and (207) predict soliton trapping (see the previous Sections 3 and 4 for its detailed discussion), the phenomenon where two solitary waves, each propagating within distinct groups of modes, dynamically adjust their center frequency to eliminate the group velocity mismatch between the two groups of modes [120,131].
9. MMS EXPERIMENTS
A. Quasi-Single-Mode Regime
In spite of the widespread interest in soliton science, virtually no experimental studies of MMS propagation in MMF have been reported until 11 years ago, when Renninger and Wise carried out the first systematic experimental study of MMSs in GIMF [132]. In their experiments, 300 fs pulses at 1550 nm, with energies of up to a few nJ, were injected into a standard GIMF with 62.5 μm core diameter. Since the input beam came from a single-mode fiber, its diameter (11.5 μm) was smaller than that of the fundamental mode of the MMF (). Although three radially symmetric nondegenerate modes were excited at the fiber input, more than 90% of the input energy remained coupled into the fundamental mode. Under these conditions, numerical simulations demonstrated that the modes are trapped and propagate collectively in the temporal domain. In other words, the group-velocity walk-off between the fundamental and the HOMs was compensated for by the presence of nonlinear cross-phase modulation, as elaborated in the examples provided in Section 3.B.1. Consequently, this leads to a blue shift of the HOMs, hence a compensation of group-velocity mismatch via group-velocity dispersion. The situation is analogous to the case observed for solitons in birefringent optical fibers in the presence of polarization mode dispersion [73,133].
Whenever one replaces the description based on the multicomponent or coupled NLSEs with the collective approach, based on the 3D+1 NLSE or the Gross-Pitaevskii equation, one may exploit the variational approach to find a stable spatiotemporal soliton, or optical bullet (see methodology in Section 5). In the conditions of the experiments in Ref. [132], the MMS can be well approximated by a single-mode soliton solution of the NLSE, with an effective area equal to that of the fundamental mode of the GIMF. The quasi-single-mode nature of the generated MMSs was confirmed by measurements of their output temporal duration versus pulse energy, as well as of the pulse energy dependence of the Raman-induced SSFS (see Fig. 30). A red-shift of the MMS frequency was observed that could be accurately fitted by single-mode soliton perturbation theory, as well as a compression of the pulse width as its energy grows larger, according to Eq. (21) in Section 2.D. The fundamental Raman soliton wavelength shift, which grows larger with the fourth power of pulse energy (theoretical dependence for single-mode solitons), is given by the red solid curve in Fig. 30(c).
Figure 30.Experimental study of MMS versus input pulse energy. (a) Output spectra. (b) Measured output pulse temporal duration. (c) Measured peak wavelength. Reprinted with permission from Ref. [134]. Copyright 2013, Nature Group.
The generation of highly multimode MMS, that is, by exciting a relatively large number of HOMs at the fiber input, was experimentally investigated by Wright et al. in 2015 [62]. In that study, the input pulse width at 1550 nm was 500 fs, and a 62.5 μm core diameter, 25 m long GRIN fiber was used. The relatively larger input beam size leads to the generation of MMSs comprising up to 8–13 modes. Under these conditions, a relatively complex interplay of MMS fission and subsequent SSFS was observed. Specifically, the process of SSFS is accompanied by a transfer of energy from the HOMs towards the fundamental mode of the fiber. This occurs because, as a result of the previously described mechanism of nonlinear modal trapping leading to MMS formation, the fundamental mode is slightly red-shifted with respect to the HOMs. Hence, the fundamental mode is a Stokes wave for HOMs, which act as Raman pumps (see detailed discussion in Section 2.B.5). Moreover, the fundamental mode also exhibits the strongest modal overlap with the dispersive waves that remain around the input pump wavelength, so it preferentially drags energy from them as well [63]. As a result, a Raman MMS beam amplification and spatial cleanup are observed, at the expense of HOMs.
The generation of an MMS rests on the compensation of intermodal dispersion (which is much stronger than intramodal, or chromatic dispersion) by nonlinear phase shifts. As a result, the cross-phase modulation among the different modal components of an MMS should be much stronger than the self-phase modulation for a singlemode soliton (which is only sufficient to counteract chromatic dispersion). This condition was confirmed in the experiments [62]: by measuring the relation between the MMS pulse energy and its temporal width for a variety of different spatial beam widths . Wright et al. have shown that, for a given temporal soliton duration, its energy is always significantly larger than that of a pure single-mode soliton with the same beam width (see Fig. 31), in qualitative agreement with the early findings by Grudinin et al. [135]. From a multimodal perspective, the nonlinear lengths for HOMs in GIMF, as defined in Eq. (6), are longer when compared to that of the fundamental mode, owing to their larger effective areas. Consequently, generating MMS with HOMs requires a relatively higher field peak power. On the other hand, because of their essential single-mode nature, the energy of the first MMS that was studied in Ref. [132] was still very close to that of single-mode fiber solitons. The experimental observation is that highly multimode MMSs are formed at comparatively higher pulse energies than that of single-mode solitons with the same pulse width. This is because stronger nonlinear trapping forces are required to balance modal walk-off [33–35]. The results of Fig. 31 imply that for highly multimode MMSs the pulse duration can be continuously varied, provided the pulse energy scales in inverse proportion, according to Eq. (24).
Figure 31.Experimental demonstration that MMSs require more energy than single-mode solitons, for a given temporal duration. In the vertical axis, the figure reports experimental and simulation results for the slope of the linear relation between pulse energy and inverse temporal duration, as a function of the average size of the beam waist (the waist of the fundamental mode is ). All data points are higher than the curve for single-mode solitons [solid black curve shows as given by Eq. (23)]; the green point indicates the case of quasi-single-mode soliton observed in Ref. [134]. Reprinted with permission from Ref. [62]. Copyright 2015, Optical Society of America.
Multimode solitons result from a balance between (modal and chromatic) dispersion and Kerr nonlinearity. This means that, on the one hand, as discussed in the previous subsection, a stronger nonlinearity, due to the larger mode area of HOMs, is required for an MMS to form, compared with singlemode fiber solitons. Simultaneously, the nonlinearity must counteract not only chromatic dispersion-induced pulse broadening, but also modal walk-off effects.
Typically (i.e., for pulses of picosecond duration or longer), the walk-off length between fiber modes [Eq. (15)] is much shorter than the chromatic dispersion length [Eq. (16)] [see Figs. 4(b) and 4(c)]. On the other hand, as discussed in Sections 3 and 4, the most favorable condition for the formation of an MMS by nonlinear trapping of pulses carried in different fiber modes occurs when modal (or walk-off) and chromatic dispersion lengths are equal [see Eq. (22) in Section 2.D]. In this case, one minimizes the amount of nonlinearity, which is necessary for trapping a pulse via the simultaneous nonlinear compensation of both intramodal and intermodal dispersions. It turns out that the condition can be achieved with femtosecond pulses, by taking advantage of the different scalings with pulsewidth of and [37]. In this case, the temporal duration, say, , of the MMS turns out to be fixed by the condition where is a constant of the order of unity. An experimental verification of Eq. (208) was carried out in Ref. [37]: in Fig. 32 we show that the MMS temporal duration is a constant at each wavelength, irrespective of the initial femtosecond pulse duration. The variation of with wavelength is due to the chromatic dispersion slope. The experimental points refer to: 1 m of GRIN fiber, with input wavelength and pulse width 1300 nm and 61 fs, or 1420 nm and 70 fs, respectively; or 6 m of GRIN fiber, with input wavelength and pulse width 1550 nm and 67 fs, or 1680 nm and 96 fs, respectively.
Figure 32.Soliton pulse duration versus wavelength, compared with measured (green diamonds), simulated (empty red circles and blue squares, respectively), and with Eq. (208) (solid line). Reprinted with permission from Ref. [37]. Copyright 2021, Nature Group.
For relatively large input pulse energies, so that multiple MMSs are generated, the initial pulse gets temporally compressed at first, and subsequently undergoes Raman-induced fission into multiple fundamental solitons and dispersive waves [62]. As shown in Fig. 33, an initial beam mostly composed of HOMs is progressively cleaned into a bell-shaped beam composed of LOMs, as the input pulse energy grows larger. Moreover, long-pass filtering permits to leave out the Raman-shifted soliton: as can be seen, the soliton emerges from the input high-order MMS fission with a spatial bell shape, indicating that it is essentially carried by the fundamental mode of the fiber [135].
Figure 33.Experimental study of high-order soliton fission in GRIN fiber. Left, autocorrelation of the output pulse versus input energy; middle, corresponding output beam profiles; right, output spectrum. Reprinted with permission from Ref. [62]. Copyright 2015, Optical Society of America.
The process of soliton fission has been extensively investigated by using a short span (20 cm) of GRIN fiber and input femtosecond pulses with up to megawatt peak power, approaching the level for catastrophic self-focusing [36]. Figure 34 compares experimental data and numerical simulations for the variation of the output pulse width as a function of its energy for the first and the second Raman MMSs that are generated by the fission. In Fig. 34, the black dashed line (black crosses) is obtained from Eq. (24) with the nominal (experimental) soliton parameters.
Figure 34.Experimental data (empty squares and empty circles for first and second Raman solitons, respectively) versus numerical simulations (solid curves) for the output soliton time width versus its energy; black crosses and black dashed line are obtained from Eq. (24) by using experimental or simulation soliton parameters, respectively. Reprinted with permission from Ref. [36]. Copyright 2020, Optical Society of America.
As can be seen, to the left of the vertical dashed line (here called the “linear-loss” regime), for soliton energies , although the trend is the same, for a given pulse duration the energy of Raman solitons is always much larger than the value obtained from the single-mode relationship of Eq. (24).
On the other hand, Fig. 34 shows that to the right of the vertical dashed line (here called the “nonlinear-loss” regime, because multiphoton absorption effects are present) the experimentally observed soliton pulse durations remain nearly constant as the pulse energy grows larger. This is confirmed by the experimental soliton spectra (not shown here), showing that all MM Raman-shifted solitons converge to nearly the same and constant pulse width, between 50 fs and 60 fs.
E. Few-Mode Fiber Solitons
The intermediate case, i.e., between quasi-single-mode and highly multimode solitons, was experimentally studied in a so-called few-mode fiber (FMF) that supports the propagation of , , and modes only, for a total of three spatial eigenmodes [136]. FMFs are of interest for optical communication links using the SDM technique [137]. For ultrashort pulses, the Raman effect splits the input pulse into dispersive waves, and temporally compressed red-shifting Raman solitons. The dependence of the SSFS on input pulse energy, and the modal composition of the generated MMS were investigated, and found to be in qualitative agreement with numerical simulations based on coupled GNLSEs. Also in this case, the experiments have shown that, for a given pulse width, the MMS energy is typically found to be larger than that associated with a single-mode soliton carried within individual modes. Again, this result can be interpreted by considering that, for trapping modes with different linear group velocities, i.e., for overcoming intermodal dispersion, nonlinearity should be larger than the value that is necessary for compensating for the much weaker intramodal dispersion.
As can be seen in Fig. 35, these experiments have shown that the spatial beam width increases with the soliton energy. At low energies () the soliton beam has a Gaussian shape. On the other hand, at higher pulse energies the soliton is carried by a multimode superposition [136]. This can be explained by the fact that higher pulse energies permit to bind a larger proportion of HOMs.
Figure 35.Experimental dependence of MMS beam area versus soliton energy. Reprinted with permission from Ref. [136]. Copyright 2016, Optical Society of America.
Those experiments [136] have also shown that the MMS temporal duration remains inversely proportional to its energy, as described by the 1D soliton formula Eq. (21). This behavior is different from the case of (3D+1) spatiotemporal solitons in bulk media under the combined action of chromatic dispersion and diffraction: here the beam compresses both in time and in space when its energy grows larger.
F. Solitons in Step-Index Fibers
The generation of MMS in SIMF has been a subject of different experimental studies [42,43,138,139]. In experiments by Zitelli et al. [43], ultrashort pulses of 70 fs duration at 1450 nm wavelength were introduced into a SIMF with a 50 μm core diameter. That experiment involved two distinct input beam coupling conditions: axial and non-axial coupling. Clearly, non-axial coupling facilitates the excitation of a larger amount of HOMs into the fiber. It was observed that non-axial coupling in SIMF leads to a lower energy threshold for MMS formation, when compared with axial coupling. An opposite behavior was observed when using GIMF. The reason for this difference lies in the fact that HOMs in SIMF (GIMF) have relatively higher (lower) nonlinear coefficients, owing to their smaller (larger) effective mode areas, as detailed in Section 2.E. The evolution of the generated MMS temporal width with increasing input energy is depicted in Fig. 36. As can be seen, the initial trend of a decreasing pulse width is followed by an oscillation of the output pulse duration around a fixed value as the pulse energy grows larger. Figure 36 also shows that a GIMF has a much lower energy threshold for MMS generation when compared with a SIMF. In any case, for both input coupling conditions, fundamental mode-based Raman MMSs emerge at the fiber output.
Figure 36.Dependence of MMS temporal width on input pulse energy for GIMF (red curve), SIMF with axial (black curve) or non-axial (blue curve) input beam. Insets show examples of autocorrelation traces. Reprinted with permission from Ref. [43]. Copyright 2019, Optical Society of America.
More recently, Wu et al. [42] have theoretically and experimentally studied the emergence of highly multimode MMSs in SIMFs in a high input pulse energy regime. These MMSs are characterized by superpositions of 5–10 temporally aligned transverse modes, resulting in speckled beam profiles and complex spatio-spectral variations across the MMS, as illustrated in Fig. 37. In a representative experiment, a Gaussian pulse at 1550 nm wavelength with a 500 fs duration was launched into a SIMF, requiring an input pulse energy reaching a maximum of up to 720 nJ, in order to induce a sufficient nonlinearity: this is a notably higher requirement when compared to the previous study [43]. Remarkably, irrespective of the excitation conditions that were employed, the resulting MMSs consistently comprised superpositions of HOMs. Again, this phenomenon can be understood by recognizing that HOMs exhibit a relatively smaller effective mode area, as shown in Fig. 3(e) in Section 2.E. Consequently, HOMs possess heightened nonlinearity compared to LOMs during mode competition, thereby facilitating the formation of highly multimode MMSs.
Figure 37.(a) Total (black curve) and bandpass filtered (red curve) output spectrum from a step-index MMF; output spatial intensity profile before (b) and after (c) filtering; (d) spatially integrated pulse measurement; (e) simulated half width at half maximum of the spatial correlation function for randomly generated beam patterns from the last six modes (red) and the last ten modes (green), with highest-order mode indicated on -axis; purple dashed line: half-width of the experimental spatial correlation function. Reprinted with permission from Ref. [42]. Copyright 2023, American Institute of Physics.
Utilizing SIMFs, it has been demonstrated both numerically and experimentally that Raman gain can facilitate energy transfer between single-mode solitons carried by different HOMs [138,139]. This phenomenon is referred to as soliton self-mode conversion (SSMC). In SIMFs, the group velocities of different spatial modes exhibit a considerable variation. This allows spectrally separated pulses in different modes to co-propagate at the same group velocity. By meticulously selecting the modes and central frequencies of the pulses, it becomes possible to match the frequency separation required for group velocity equalization with the Raman gain peak, typically occurring near 13 THz. Under these optimized conditions, efficient energy transfer from one mode to another can occur through intermodal Raman scattering when solitons are launched into HOMs of a SIMF. The results of cutback experiments depicted in Fig. 38 illustrate the process [139]. A soliton pulse, with a duration of 100 fs and a wavelength of 1045 nm, is initially launched into the mode of a SIMF with a core diameter of 97 μm. This specific mode is excited in the fiber by encoding the spatial phase onto a Gaussian beam using a spatial light modulator. Upon propagation, the soliton formed in the mode undergoes red-shifting via conventional SSFS to 1141 nm after 33 cm. At this point, it becomes group-index matched to the mode with a frequency separation of 16 THz. Consequently, power transfers from the pulse to the mode pulse. After approximately 40 cm of propagation, the mode and wavelength conversion process is completed. Subsequently, the new fundamental soliton in the mode experiences further red-shifting to 1257 nm via SSFS. At this wavelength, it becomes group-index matched to the mode with a 14 THz frequency separation. Again, through SSMC, full power transfer to the mode is achieved with further propagation. These alternating SSFS and SSMC processes repeat along the fiber, ultimately resulting in obtaining a pulse at 1587 nm in a pure mode at the longest fiber length.
Figure 38.Cutback experiments. (a) Spectra as a function of fiber length with input pulse at 1045 nm in the mode. (b)–(f) Output mode images from different bandpass filters. Reprinted with permission from Ref. [139]. Copyright 2019, Optical Society of America.
G. Emergence of Soliton Attractors from Long Multimode Fibers
A pioneering experimental study of soliton generation in a GIMF has shown that femtosecond Raman solitons emerge from a relatively long length of fiber [135]. In their study, Grudinin et al. launched into the fiber 150 ps pulses from a -switched and mode-locked Nd:YAG laser: these pulses excite a multitude of modes at this wavelength. Surprisingly, the experiments have shown that, as a result of the generation of multiple Raman scattering Stokes sidebands, a supercontinuum is formed in the anomalous dispersion region of the fiber. From the supercontinuum a single Raman shifted soliton emerges, with a nearly single-mode beam, whose waist is close to that of the fundamental mode of the fiber. It was speculated that such Raman soliton beam cleaning is a manifestation of a universal property of nonlinear multimode systems [140]. Namely, energy redistribution among all fiber modes, or thermalization, eventually gives way to confinement into the first few low-order modes only.
To date, the properties of the original fascinating discovery remain yet to be fully explained: they appear to be a combined manifestation of Raman beam cleanup [63] and Kerr-induced beam self-cleaning [49]. In those experiments, it was observed that Raman solitons that are formed in GIMF have a peak power (or energy) about six times larger than that of a singlemode fiber soliton with the same pulse duration. Moreover, the spatial field distribution of these multimode Raman solitons is stable and quasi-single-mode, whenever the power of the input pulses grows larger than a certain threshold value. Equivalently, the same properties are observed whenever the GIMF length increases from 10 m to 500 m [135].
The emergence of solitons in MMFs can be seen as a fundamental manifestation of the robustness of solitons as attractors in weakly nonlinear turbulent wave systems, as discussed by Zakharov et al. [141,142]. This hypothesis has received a recent interesting experimental confirmation in studies of femtosecond soliton propagation of relatively long (up to 1 km) spans of GIMF [38]. In those experiments, ultrashort (70 fs) pulses at 1550 nm wavelength were injected in a 50/125 standard GIMF. The input laser beam was focused at the fiber end face with an approximately 15 μm waist. Moreover, the beam was injected at different small angular tilts with respect to the fiber axis for controlling the excitation of higher-order modes besides the fundamental.
Figure 39 illustrates experimentally observed dependence on input pulse peak power of the output beam waist after 1 km of fiber [38]. For 0° input tilt, a nearly monomode soliton (with a waist close to that of the fundamental mode) is formed at 15 kW. For initially tilted and highly multimode beams, the monomode fundamental soliton still emerges, but at a comparatively higher power, as shown by the beam compression effect, which occurs at about 80 kW. In all cases, the beam quality parameter of the emerging soliton was .
Figure 39.Experimental study of MMS attractor in GIMF. Here we show the output beam waist after 1 km of GRIN fiber, versus input peak power, and 0°, 2.3°, and 4.6° input beam tilt angles; the dashed horizontal line indicates the fundamental beam waist. Insets illustrate the output beam shapes at selected powers. Reprinted with permission from Ref. [38]. Copyright 2021, Optical Society of America.
Understanding soliton interactions in optical fibers, particularly how they evolve over propagation distance, poses a significant experimental challenge due to the intricacies of cutback experiments. Even slight adjustments in fiber positioning can yield vastly different outcomes. To circumvent this issue, in Ref. [39], researchers have turned to studying the interaction of two MMSs, generated by Raman MMS fission, by varying the input pulse energy. This approach, motivated by their simulations in Ref. [39], enables the examination of soliton interactions by observing variations in output spectra and temporal traces as the input energy changes. The positioning of soliton interactions is manipulated by adjusting the input pulse energy (see Fig. 11 in Section 3.B.2).
In their experimental investigation, Sun et al. [39] study the dynamics of soliton collisions within GIMF by varying the energy of an input pulse with 70 fs duration at 1400 nm wavelength. To capture high-resolution output spectrum evolution with input energy, the experimental setup integrates motorized-polarizer-based input power control with automated recording of output spectra and input power measurements using a spectrometer and power meter, respectively. These components are seamlessly coordinated and controlled by a computer, ensuring synchronization during data acquisition throughout the experiment. By slightly offsetting the input beam to introduce more HOMs, they obtained Fig. 40(a), illustrating the evolution of measured output spectra concerning input pulse energy () over a 10 m GRIN fiber length, which shows striking similarities to simulation results in Fig. 11. For , the input pulse’s fission produced MMSs S2 and S1. As the input energy increases, two MMSs experience significant SSFS. Further increasing leads to the unconventional SSFS in S1, characterized by a relative “blue-shift” as depicted in Fig. 40(a), due to the reduced LOM content in S1. Notably, the spectral overlap with fringe shape occurred at a region of , indicating that collisions [e.g., Figs. 11(d)–11(h)] may appear in the red dashed box. Finally, for , both solitons (S1 and S2) reemerged as distinct entities.
Figure 40.Soliton collision experiments. (a) Evolution of output spectrum versus input pulse energy for a 10 m GRIN fiber. (b) Output spectrum, (c) autocorrelation, and (b) soliton temporal separation versus input energy for a 2 m GRIN fiber. Reprinted with permission from Ref. [39]. Copyright 2022, Optical Society of America.
In order to further investigate the dynamics of MMS collisions in both temporal and spectral domains, a similar experiment was carried out by using an GRIN fiber. Figures 40(b) and 40(c) report the measured output spectra and their corresponding autocorrelation traces across various input pulse energies. For , two MMSs are visible in both spectral and temporal domains. As increases, the spectra of the two MMSs get progressively closer, until their separation becomes indistinguishable. Fringe patterns within the spectrum emerge for energies , corresponding to two MMSs with a temporal separation of less than 0.5 ps. For , a dominant peak in the middle of the autocorrelation trace and a weak peak at are observed, suggesting the presence of non-overlapping solitons at the fiber output.
The temporal separation of two MMS peaks is plotted in Fig. 40(d). As previously predicted in simulations, this temporal separation is due to a collision at a position within the fiber [e.g., Figs. 11(d)–11(h)]. As a result, at the fiber output, the two MMSs emerge with a large temporal separation, owing to energy transfer from the leading MMS S2 to the trailing MMS S1 at the collision point, which enhances the red-shift and group delay for S1. For input energies exceeding the collision region (), Figs. 40(b) and 40(c) indicate a clear separation between the MMSs at the fiber output.
I. Dispersive Wave Generation
Another fundamental mechanism that is known to play a key role in supercontinuum in single-mode fibers is the process of dispersive wave emission from MMSs, which can be interpreted in analogy with Cherenkov radiation in electrodynamics [143]. Experiments by Wright et al. revealed the formation of broad multi-octave supercontinuum spectra, which are characterized by the presence of an ultra-wideband series of sharp spectral peaks extending from the visible into the mid-infrared regions [59]. As the physical origin of these spectral peaks was not yet clear, they were referred to as mystery peaks.
A subsequent study [41] revealed that the mechanism underlying the formation of a series of spectral peaks (see Fig. 41) in the anomalous dispersion regime of the MMF is that of dispersive wave generation, resonantly phase-matched by the spatiotemporal intensity oscillations, due to self-imaging, of MMSs along the fiber. The mechanism is fully analogous to the multiple dispersive wave resonances of single-mode solitons that occur in fiber lasers or in periodically amplified optical fiber links [144]. Figure 41 shows that the supercontinuum featuring a series of unequally spaced dispersive wave peaks can be qualitatively well reproduced by using the generalized multimode coupled NLSEs [145]. As a matter of fact, the observed sideband peak positions can be well predicted by a simple reduction of the Gross-Pitaevskii equation Eq. (126) into a 1D NLSE with a periodically varying (because of MMS intensity oscillations due to self-imaging) nonlinear coefficient [41], as discussed in Section 6.
Figure 41.Simulated (using the MM GNLSE or the 1D NLSE model, respectively) and experimental generation of dispersive wave peaks in multimode GRIN fiber. Reprinted with permission from Ref. [41]. Copyright 2015, American Physical Society.
This phenomenon is clearly illustrated through comparable numerical examples presented in Figs. 25 and 26 in Section 6.C. These figures demonstrate that distinct peaks emerge as a result of the periodic evolution induced by the self-imaging effect, and conversely, these peaks vanish upon its removal. The resulting dynamics are essentially single-mode, and the sideband frequencies are determined by the phase-matching condition between the wave numbers of the soliton, , and that of the dispersive wave, , which reads as where is the self-imaging period, is the frequency shift from the input pump pulse, and is associated with third-order dispersion [41].
J. Solitons in Hollow-Core and Capillary Multimode Fibers
Another frontier for MMS and spatiotemporal soliton phenomena is provided by intense optical pulse propagation in multimode hollow-core, gas-filled fibers, and capillaries [146]. Figure 42 presents a chronological overview of the dramatic (more than 10 orders of magnitude) peak power increase of fiber solitons when moving from standard single-mode fibers to specialty fibers such as bandgap, antiresonant hollow-core, and capillary fibers [147].
Figure 42.Temporal evolution of soliton peak power in different fiber structures (in the insets, glass is depicted in gray, while white indicates empty spaces). Reprinted with permission from Ref. [147]. Copyright 2024, Elsevier.
In gas-filled hollow-core fibers (HCFs), preferential guiding by the fundamental mode occurs at low powers, because of its low loss. As the input light intensity grows larger, ionization leads to plasma defocusing, which in turn excites HOMs. The resulting nonlinear mode coupling enables pulse compression, and sustains spatiotemporal localization leading to MMS propagation at powers below the self-focusing threshold [148–150].
In an interesting work by Safaei et al. [151], a high-energy (5 mJ) 700 fs laser pulse at 780 nm was coupled into the fundamental mode of a 3 m long HCF with a 500 μm core size, filled with nitrogen. Because of self-focusing, the progressively spatially compressed pulse within the HCF coupled its energy away from the fundamental mode and into radially symmetric HOMs. The resulting MMS (also called a “multi-dimensional solitary state”) experienced a substantial Raman SSFS until a wide supercontinuum was generated. The emitted pulses were accompanied by a high spatial beam quality and exhibited a negative quadratic spectral phase, which permitted their subsequent dispersive compression down to 10 fs, thus providing a compact source for high-harmonic-generation applications.
When considering gas-filled hollow capillary fibers using the lightest noble gas with the highest ionization potential, helium, the MMS intensity is limited to about , before excessive gas ionization takes place [152]. For higher intensities, most of the input pulse energy is absorbed; moreover, a huge amount of self-phase modulation is produced, which, in association with complex spatial mode-coupling, ultimately breaks down the soliton dynamics. Nevertheless, soliton generation in hollow-core optical fibers has enabled a new class of light sources, with unique properties. These include: extreme pulse compression; generation of tunable, few-cycle ultraviolet to visible pulses through resonant dispersive wave emission; and the production of tunable visible and infrared light via soliton frequency shifting. Light beams produced from these soliton-based sources have distinctive characteristics that are unachievable by other methods, with far-reaching potential for both fundamental science and technological applications [147].
10. CONCLUSIONS AND PERSPECTIVES
The exploration of MMSs in MMFs has revealed a rich landscape of nonlinear optical phenomena, advancing the understanding of soliton dynamics in complex media. In this review paper, we started with a tutorial on MMSs within MMFs in Section 2. This includes an analysis of the linear eigenmode properties of the two main types of multimode fibers: SIMFs and GIMFs. We highlight the key fiber properties that influence the formation of multimode solitons, thereby setting the groundwork for understanding their role in soliton generation. Of course, fiber types are not limited to SIMFs and GIMFs; however, the framework to evaluate a fiber’s performance remains consistent, as illustrated in Fig. 2. One can design a fiber based on the transverse refractive index geometry and the nonlinear material response. Generally, dispersion management strategies are crucial for optimizing system performance. These strategies, including minimizing the modal walk-off length and increasing nonlinear coefficients, may reduce the power threshold for MMS formation. By carefully managing these parameters, the overall efficiency and effectiveness of optical fiber systems can be significantly enhanced.
After familiarizing with the fundamental concepts, we introduced the GMMNLSE in Section 3, which is relevant to describing a wide range of MMFs with distinguishable eigenmodes. We then provided succinct examples, ranging from linear multimode pulse walk-off to the formation of an MMS, including phenomena such as the SSFS, soliton fission, and soliton collision processes. These examples effectively elucidate the concepts discussed in Section 2, thereby establishing a robust foundation for further analytical and experimental exploration. These concepts and the model are not only applicable to studying MMSs in MMFs but also prove valuable in the design of spatiotemporal mode-locked lasers [44–46].
Understanding the impact of modal walk-off is particularly crucial in the formation of MMSs. In Section 4, we provide a comprehensive review of the theoretical framework underpinning the dynamics of MMSs with two modes, utilizing a simplified coupled mode model. Much of these findings can be derived from the variational formulation of the GMMNLSE, which aids in the creation of effective theories that describe an MMS with a reduced number of degrees of freedom. Our primary focus is on the formation of two-mode solitons, and we employ the variational formulation, the Kantorovich approach, and the virial (or moment) method to analyze these solitons. This analysis is further extended to a multimode context, thereby generalizing the previously discussed theories. Nevertheless, theoretical investigations of MMS with multiple modes present significant challenges and opportunities.
Similar analytical approaches can be applied to study STS in the 3D+1 NLSE within GIMFs. In Section 5, we review the primary theoretical results concerning the formation and stability of STSs in the 3D+1 dimensional framework for GIMFs. Our focus is on the variational formulation as the main tool for gaining analytical insights into the properties of these states. This approach allows us to obtain analytical solutions for spatiotemporal MMSs in regimes below the energy threshold. High-order dispersion is found to stabilize these STS states. Furthermore, these STSs primarily arise from solitons carrying fundamental modes with a small amount of high-order mode content, as we mainly consider ansatzes expressed in terms of fundamental modes. Potential directions for studying higher-order STS can be explored by considering ansatzes that include high-order modes.
To reduce the complexity of high-dimensional systems and enhance computational efficiency, in Section 6, we simplify the 3D+1 NLSE to a 1D+1 NLSE form by leveraging the periodic evolution of short optical pulses within a GIMF. We discuss the practical conditions under which the effective 1D NLSE can be applied, and provide several examples of MMSs exhibiting self-imaging, including GRIN soliton formation, and the influence of higher-order effects. This method is particularly efficient numerically and simplifies the implementation of analytical analysis, making it a valuable tool for studying complex multimode fiber dynamics.
The complexity of the coupling terms in the GMMNLSE imposes significant computational challenges when considering a large number of modes. One approach to mitigate this issue is to simulate the full field in the temporal domain, rather than dealing directly with the coupling terms of the mode amplitudes. In Section 7, we present an innovative Gaussian quadrature approach for numerical simulations of the generalized NLSE. This method employs a mode decomposition based on the fiber’s natural eigenmodes, utilizing an optimized quadrature scheme for weighted orthogonal polynomials to enhance computational efficiency. We begin with a radially symmetric model and then extend our approach to a 3D+1 quadrature model, providing illustrative examples of the results. A major advantage of using Gaussian quadratures, instead of relying on spatial Fourier transforms, is that separate dispersive phase shifts can be efficiently handled for each mode through frequency-dependent matrix multiplication during the linear step.
When considering long-distance propagation in SDM optical communications within multimode fibers, perturbations caused by fiber imperfections and environmental factors can amplify the influence of linear propagation effects. These linear perturbation effects can significantly impact the dynamics of MMS via random mode coupling. To accurately model this phenomenon, additional terms are required, which can be effectively described using a generalized Jones formalism. In Section 8, we review the model and illustrate its application with examples demonstrating how mode-dispersion coefficients degrade MMS propagation.
In Section 9, we overview recent experiments involving MMS, spanning various regimes such as the quasi-single-mode and fully multimode domains. These experiments encompass a diverse array of phenomena, including femtosecond walk-off solitons, MMS fission, and solitons in SIMF. Additionally, we explore the emergence of soliton attractors from long MMF, investigate soliton-soliton collision dynamics, and examine dispersive wave generation. We envisage that potentially new research directions for MMS may involve additional topics such as single pulse mode decomposition techniques, high-power fiber damage threshold, and high-order mode MMS spatial beam shaping.
Despite evident advances in understanding MMSs, numerous questions persist. For instance, how can we quantitatively assess the formation conditions of MMSs for modal walk-off compensation, considering varying mode orders and modal group velocities, especially when there is a significant disparity in mode order? Does mode beating degrade or enhance the stability of MMSs? Are there qualitatively distinct types of MMSs, such as those featuring high-order modes with coherent beam quality? Can optical frequency conversion be achieved by exploiting the self-imaging effect? What strategies can enhance our ability to control MMSs interactions by manipulating their propagation speed? How can we better manipulate the propagation speed of MMSs to control their interaction? How do MMSs propagate under conditions of weak mode coupling or mode-dependent loss? How do MMSs evolve in environments with strong or weak disorder? What impact do plasma interactions have on MMSs? These questions represent intriguing areas for further exploration. Additionally, for modeling very-high-energy soliton dynamics, particularly in hollow-core fibers [153], the full vector polarization model [154,155] demonstrates strong similarities between simulations and experiments even when the peak power reaches terawatt levels [156].
To efficiently explore diverse soliton solutions within the framework of the GMMNLSE, 3D+1 NLSE, and 1D+1 NLSE models, both analytical and numerical methods are crucial. In this review, we combined analytical techniques (e.g., the variational approach; see Sections 4 and 5) with direct numerical solutions based on the familiar split-step method (Sections 2, 3, and 6) or using newly developed techniques such as the Gaussian quadrature approach (Section 7). Recently, the strong data-fitting capabilities of machine learning (ML) have gained popularity in ultrafast optics, based on the use of neural networks [157,158]. These ML algorithms establish relationships between the input conditions and the output variables within a constrained dynamical space, based on output data from physical models, which may not adhere to the system dynamics beyond this limited space. Since 2019, physics-informed neural networks (PINNs) have garnered significant attention for solving partial differential equations (PDEs) [159–161]. PINNs integrate model equations, like PDEs, directly into the neural network architecture, resulting in physics-supervised neural networks. Substantial research has focused on nonlinear optics within the NLSE framework [162–168], investigating vector solitons in birefringent fibers [162–164], higher-order terms [168], GMMNLSE [165], and three-component coupled NLSE [167]. Future research could further benefit from applying PINNs to 3D+1, 1D+1, and GMMNLSE models for MMS studies.
We would like to dedicate this review to Bruno Crosignani, a pioneer in multimode fiber solitons, who recently passed away. To some of us, he has been a mentor in soliton research, a dear friend, and a precious source of inspiration in science. We are deeply grateful to all of our many collaborators in the field of multimode optical solitons. We also thank the anonymous reviewers for their constructive remarks.
Acknowledgment
Acknowledgment. We acknowledge the financial support from the European Research Council Advanced Grant STEMS; Marie Sklodowska-Curie Actions; Sapienza University; European Union under the Italian National Recovery and Resilience Plan (NRRP) of NextGenerationEU, partnership on “Telecommunications of the Future”; European Union - NextGenerationEU under the Italian Ministry of University and Research (MUR) National Innovation Ecosystem; and Olle Engkvist Foundation.
APPENDIX A: ABBREVIATION
Here we list the abbreviations in Table 1.
List of Abbreviations
Abbreviations
Definitions
CW
Continuous wave
DMD
Differential mode delay
FMF
Few-mode fiber
FWHM
Full width at half maximum
FWM
Four-wave mixing
GIMF
Graded-index multimode fiber
GMMNLSE
Generalized multimode NLSE
GNLSE
Generalized NLSE
GPI
Geometric parametric instability
GRIN
Graded index
GVD
Group-velocity dispersion
HOM
High-order mode
IIR
Fiber intensity impulse response
LG modes
Laguerre-Gaussian modes
LOM
Low-order mode
MCF
Multi-core fiber
MD
Spatial mode dispersion
MEA
Mode’s effective area
MIMO
Multiple-input-multiple-output
MMF
Multimode fiber
MMS
Multimode soliton
MNC
Modal nonlinear coefficient
MNL
Modal nonlinear length
NLSE
Nonlinear Schrödinger equation
PDE
Partial differential equation
PINN
Physics-informed neural network
PMD
Polarization-mode dispersion
SDM
Space-division multiplexing
SIMF
Step-index multimode fiber
SPM
Self-phase modulation
STS
Spatiotemporal soliton
VKS
Vakhitov-Kolokolov stability
XPM
Cross-phase modulation
ZDW
Zero-dispersion wavelength
APPENDIX B: MODE DECOMPOSITION OF AN INPUT GAUSSIAN PULSE
In this appendix, we detail the procedure for computing the input mode distribution, when taking into account the specific input beam coupling conditions, which are encountered experimentally.
In order to determine the input mode distribution, it is crucial to know the waist of the input laser beam (supposed to be Gaussian) and its displacement relative to the central axis of the fiber. Let us first focus on the specific case of a Gaussian beam that is injected into the fiber core, with an offset () from the fiber axis. Hence, the beam at the fiber input facet is described by where is related to the beam full width at half maximum (FWHM) of its intensity , or beam size , under the normalization condition
Projecting this input beam onto the eigenmode basis, we obtain the input modal coefficients
Given that the eigenmodes are normalized, the coefficients quantify the power fraction of mode , ensuring energy conservation:
For a nuanced analysis of the input beam size and displacement, we normalize both parameters against the FWHM or waist of the fundamental mode, by introducing the two parameters and :
By manipulating and , we may explore the variation of the input mode content for different Gaussian beam profiles. In Fig. 43, we illustrate how the input mode content shifts with across four distinct values, when considering up to modes. Notably, when the Gaussian beam width matches that of the fundamental mode ( and ), the mode content simplifies to with for , indicating a pure fundamental mode excitation. Note that radial modes, specifically those indexed as , are predominantly excited when the beam is precisely centered at the fiber axis, providing that there is no lateral shift, i.e., . Conversely, deviations in either or from unity yield a more complex input mode distribution.
Figure 43.Decomposition of a Gaussian beam into the first modes of a GIMF, with panels (a)–(d) showing mode coefficients against the beam lateral offset for beam widths . Radial modes () are predominantly excited at precise center alignment (). The blue lines in (a)–(d) indicate the total power fraction as a function of . Values below one result from the finite number of modes considered.
The blue lines in Fig. 43 depict the variation of the quantity as a function of the beam lateral offset from the fiber axis. A decline in the blue curve values occurs because beams characterized by relatively large values of or cannot be fully represented by merely modes. Naturally, enhancing the number of modes considered in the decomposition analysis improves its precision, thereby enabling the accurate decomposition of beams with larger and dimensions, albeit at the cost of an increased computational time.
For numerical implementations, if a beam cannot be fully represented by the limited number of total modes , indicated by , it becomes necessary to adjust the mode coefficients to for all modes. This adjustment ensures that the ratio of mode energies remains a constant, while the corrected coefficients satisfy , thus preserving the total energy distribution among the modes.
To express the temporal input profile of the field, we assume that at the fiber input (), all modes share identical temporal shapes. Specifically, a Gaussian temporal profile is presumed, centered at and characterized by a duration , outlined as
Thus, we may obtain the input condition for all modal amplitudes for a given pulse with input energy and beam values and . Given that the mode field power is defined by , the total energy of the input pulse satisfies , ensuring the conservation of the total input energy across all modes.
[111] J. Laegsgaard. Efficient simulation of symmetric field propagation in parabolic-index fibers. Photonics & Electromagnetics Research Symposium, 2416-2423(2019).
[112] M. Abramowitz, I. A. Stegun. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables(1964).
[113] P. J. Davis, P. Rabinowitz. Methods of Numerical Integration(1984).
[124] S. V. Manakov. On the theory of two-dimensional stationary self-focusing of electromagnetic waves. Zhurnal Eksperimentalnoi i Teoreticheskoi Fiziki, 65, 505-516(1973).
[126] S. O. Arik, D. Askarov, J. M. Kahn. MIMO DSP complexity in mode-division multiplexing. Optical Fiber Communications Conference and Exhibition (OFC)(2015).
[135] A. B. Grudinin, E. Dianov, D. Korbkin. Nonlinear mode coupling in multimode optical fibers; excitation of femtosecond-range stimulated-Raman-scattering solitons. JETP Lett., 47, 356-359(1988).
Yifan Sun, Pedro Parra-Rivas, Govind P. Agrawal, Tobias Hansson, Cristian Antonelli, Antonio Mecozzi, Fabio Mangini, Stefan Wabnitz, "Multimode solitons in optical fibers: a review," Photonics Res. 12, 2581 (2024)