Lightweight augmented reality (AR) eyeglasses have been increasingly integrated into human daily life for navigation, education, training, healthcare, digital twins, maintenance, and entertainment, just to name a few. To facilitate an all-day comfortable wearing, AR glasses must have a small form factor and be lightweight while keeping a sufficiently high ambient contrast ratio, especially under outdoor diffusive sunlight conditions and low power consumption to sustain a long battery operation life. These demanding requirements pose significant challenges for present AR light engines due to the relatively low efficiency of the optical combiners. We focus on analyzing the power consumption of five commonly employed microdisplay light engines for AR glasses, including micro-LEDs, OLEDs, liquid-crystal-on-silicon, laser beam scanning, and digital light processing. Their perspectives and challenges are also discussed. Finally, adding a segmented smart dimmer in front of the AR glasses helps improve the ambient contrast ratio and reduce the power consumption significantly.
【AIGC One Sentence Reading】:AR glasses face challenges in power consumption; microdisplay light engines are analyzed, and a smart dimmer is proposed to enhance contrast and save power.
【AIGC Short Abstract】:AR glasses face challenges in achieving low power consumption for long battery life while maintaining high ambient contrast, especially outdoors. Five microdisplay light engines are analyzed for their power usage. Adding a smart dimmer can enhance contrast and reduce power consumption.
Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.
Lightweight augmented reality (AR) smart glasses are emerging and have been gradually integrated into human life for navigation, education, training, healthcare, maintenance, and entertainment, just to name a few. As AR devices strive to become more compact for all-day comfortable wearing, minimizing power consumption is essential to extend the battery life and reduce thermal effect while maintaining high image quality. Light engines, responsible for generating and projecting images onto the AR display, often account for a substantial portion of the total power usage.1Figure 1(a) depicts a waveguide-based AR device structure. The light from the microdisplay is collected by a projection lens, coupled into the waveguide through an in-coupler, propagating in the waveguide by total internal reflection (TIR), and then coupled out to human eyes.3,4 Nevertheless, a significant loss happens during this process resulting from the light leakage and angular-dependent diffraction efficiency of the in-coupler and out-coupler, absorption and scattering losses during TIR, and duplication from exit pupil expansion (EPE).5 Presently, to achieve a 30-deg field-of-view (FoV), the diffractive waveguide efficiency is limited to , or 1300 nits/lm. However, under daylight conditions, the outdoor illuminance can achieve . As a result, a -nit luminance to the eye is required to distinguish the projected image from the ambient light,6 which implies the light engine must deliver 3-lm luminous power to the in-coupler.
Figure 1.(a) Working principle of diffractive waveguide-based AR device. (b) Temperature distribution at different power consumption. Reprinted from Ref. 2 under a Creative Commons license.
In addition to battery life, the thermal effect of the AR glasses is another concern. Figure 1(b) shows the temperature rise near the ear when the power consumption increases from 1, 5, to 10 W.2 Given the additional power demands from components such as the CPU, GPU, and Wi-Fi module,7 the power consumption of each microdisplay panel should be capped at 1 W.
The objective of this review paper is to provide a comprehensive analysis of the efficiency and power consumption of five commonly used light engines in current AR devices. We begin with two emissive displays: micro-LED (μLED) and micro-OLED (μOLED), focusing on their distinct approaches to achieving high-resolution full-color displays. Next, we will highlight the working principles and power efficiency improvements of two reflective light modulation microdisplays: liquid-crystal-on-silicon (LCoS) and digital light processing (DLP). Afterward, we will discuss the high-efficiency laser beam scanning (LBS) technology and then compare the power consumption of these waveguide-based AR light engines using a 3-lm light source whose radiation pattern is confined within ±15 deg when displaying different image contents. Finally, adding a segmented smart dimmer helps improve the ambient contrast ratio and reduce power consumption significantly.
Sign up for Advanced Photonics TOC Get the latest issue of Advanced Photonics delivered right to you!Sign up now
2 μLED
2.1 Working Principle
Light-emitting diode (LED) is an inorganic semiconductor device that emits light when an electric current passes through its p-n junction. This junction consists of an n-type region, doped with extra electrons, and a p-type region, doped with extra holes. When a positive voltage is applied to the LED, electrons in the conduction band () of the n-type semiconductor and holes in the valence band () of the p-type semiconductor move towards the active region, where they recombine to form excitons [Fig. 2(a)]. This recombination can proceed via two processes: non-radiative and radiative. The non-radiative recombination channels include Shockley-Read-Hall (SRH) recombination, which results from carrier trapping at crystal defects, and Auger recombination, where the recombination energy is transferred to a third carrier, leading to heat dissipation.9 The radiative recombination leads to photon emission, generating light at a frequency , where is the energy band gap and is Planck’s constant.
are LEDs with a chip size smaller than and have found wide range applications, including modular large-screen televisions (panel size ), transparent displays, and microdisplays.10 For ultra-high resolution density AR applications, the pixel size is usually smaller than , and the complementary metal-oxide-semiconductor (CMOS) backplane technology is required for the driving circuits in an active-matrix (AM) display.11
To activate an AM panel, two driving methods are commonly used: pulse amplitude modulation (PAM) and pulse width modulation (PWM). For PAM, the driving circuits can be quite compact, consisting of two transistors and one capacitor (2T1C).12,13 However, emission wavelength and external quantum efficiency (EQE) vary under different driving current densities, leading to wavelength shift and increased power consumption at low gray levels when adopting the PAM driving method. By contrast, the PWM driving method keeps the driving current fixed and modulates the gray level by adjusting the emission time. PWM can be categorized into analog and digital types, depending on different data input methods and pixel circuits. Typically, analog PWM requires a sawtooth-like sweep signal generated from the external driver integrated circuit (IC) or internal circuit.14,15 The driving transistor operates in the saturation region as a current source, determining the driving current. The drain-to-source voltage () is relatively high (), which increases the voltage across to () and leads to a higher power consumption.14,16Figure 2(b) describes a basic 3T1C subpixel circuit for PWM.8 For digital PWM, the driving transistor operates in the linear region, functioning as an on/off switch, depending on the input data level. Therefore, the power consumption of driving transistors in digital PWM is much less than that in analog PWM. The data input time for each pixel is dependent on the display’s bit depth, as described in Fig. 2(c), and higher bit depths require increased data driver IC frequencies, leading to higher power consumption.17
The total power consumption of a panel with digital PWM driving consists of four parts: where , , , , and represent the power consumption of the individual chip, transistors (, ), scan line, data line, and peripheral integrated circuit (IC), respectively. During simulation, we assume that the scan voltage () and the data voltage () are 5V and 4V, respectively, in reference to . In Eq. (1), refers to the duty ratio, which is defined as the pixel-on time in each frame, and its value is dependent on the required luminous power. The typical under resolution of is about 40 to 60 mW when all pixels are off.18 In Eqs. (2) and (3), and bits denote the frame rate and the bit depth of the display, and , , and correspond to the capacitances of row wire, column wire, and storage capacitor, respectively. In our simulations, we assume , , and .14,19
The total optical power emitted from a chip is related to its wall-plug efficiency (WPE) and EQE as10where , and represent the Planck’s constant, speed of light in vacuum, electron charge, and emission wavelength, respectively.
In an AR glass, the acceptance angle of the projection lens is typically within , whereas the light emission from the μLED is Lambertian. To increase the coupling efficiency of into the imaging system, a microlens array is often laminated on top of the array for a better collimation.20 The dependence on the luminous power (unit: lumen) provided by a chip can be calculated as where () is the aperture ratio of the microlens array,21 () is the luminous intensity ratio confined within ,20 is the luminous intensity dependence on the polar angle, represents the spectrum distribution, and is the spectral human eye sensitivity function.
In the following sections, we will introduce and analyze various commercial technologies, including monochromatic [Fig. 2(d)], X-cube [Fig. 2(e)], parallel RGB [Fig. 2(f)], and quantum dot color conversion (QDCC) [Fig. 2(g)]. In addition, we will explore the potential of several emerging high-resolution technologies, such as nanowire LED [Fig. 2(h)] and full-color subpixel [Fig. 2(i)].
2.2 Monochromatic μLED
The fabrication of monochromatic involves monolithic integration technology such as wafer bonding and flip-chip bonding without mass transfer as shown in Figs. 3(a) and 3(b), respectively.22 For wafer bonding, epi wafer and CMOS wafer are first bonded through metal layers, then the pixels are formed by inductively coupled plasma reactive ion etching (ICP-RIE) and surface passivation. Nevertheless, the light extraction efficiency (LEE) is typically lower for the inverted trapezoidal structure due to the unavoidable shadow effect during the etching process.1 By contrast, with flip-chip bonding, the pixels are formed prior to bonding. However, this method requires precise alignment, which limits its applicability to high-resolution density AR displays.
For both wafer bonding and flip-chip bonding, ICP-RIE dry etching has been widely applied to precisely control the mesa size. However, the high-energy ions in the plasma cause material damage, especially at the sidewalls, resulting in an increased non-radiative recombination rate and leakage current.23 As the size continues to shrink, the increased surface-to-volume ratio leads to a higher percentage of the defect region, as Fig. 3(c) illustrates. Therefore as Fig. 3(d) depicts, the EQE of is highly dependent on the chip size due to sidewall defects.24 Various methods have been proposed to mitigate these material defects. In 2009, Yang et al. proposed a potassium hydroxide (KOH) wet etching method after dry etching to remove the defect area in GaN-based LEDs.25 In addition, surface passivation by dielectric atomic layer deposition (ALD) has been investigated to reduce the dangling bonds of gallium and nitrogen atoms.26 In 2018, Wong et al. improved the surface passivation by replacing conventional PECVD with ALD of the passivation layer in Fig. 3(e), increasing the EQE of a from 24% to 33%.27 Later, in 2019, the same group reported a size-independent EQE of 23% for blue down to by combining KOH chemical treatment with ALD surface passivation.28 Furthermore, a proper electrode design can also avoid the defect regions.29 For example, Hsu et al. improved the current confinement effect on a blue array and achieved an optimized EQE of 9.95% by modifying the electrode contact size to .30 Several novel designs can avoid defects induced by ICP etching. For instance, bottom-up fabrication processes through selective epitaxy growth can avoid sidewall damage.31,32 Another example is that, in 2024 SID Display Week, Jade Bird Display reported a novel continuous MQW AlGaInP red design [Fig. 3(f)] to achieve a higher power efficiency than conventional one.33 However, optical crosstalk [Fig. 3(g)] degrades image quality and causes blurring.
A EQE may diminish after bonding to the panel due to misalignment and increased black matrix aspect ratio, which becomes more pronounced as resolution density rises. The wavelength and size-dependent EQEs are plotted in Fig. 4(a). In 2022, Wu et al. developed a resolution blue panel with a mesa diameter of , achieving EQE of 6.5% at a current density of and a peak wavelength of .34 Typically, green would exhibit a lower EQE than the blue ones because they require a higher indium concentration to reduce the bandgap. However, the increased indium content leads to a greater lattice mismatch between InN and GaN, reducing crystal quality. Despite this, green with ultra-small mesa sizes can outperform blue in efficiency, as the size-dependent non-radiative surface recombination rate for indium is less significant than for gallium.24 In 2024, Wu et al. fabricated a 0.39-in. green panel and achieved 7.17% peak EQE for green with a mesa size of .35 For InGaN red , the even richer indium concentration further reduces the EQE. As Feng et al. reported in 2022, the peak EQE of InGaN red LEDs fabricated by selective epitaxy growth is only 1.75% on a wafer at .39 In addition, the broadened FWHM (full width at half maximum) due to inhomogeneous indium distribution and carrier localization reduces the color purity.40 Alternatively, AlInGaP red LED can achieve a wider color gamut and higher EQE for large LEDs (e.g., ), but the efficiency drops significantly as the size decreases.41 In 2022, Wu et al. fabricated a red AlGaInP panel with a mesa size, achieving an EQE of 5.3% at a current density of .42
Figure 4.(a) Size-dependent EQE on panel.24,33–38" target="_self" style="display: inline;">–38 (b) Calculated power consumption of panels with a subpixel size of to provide 3-lm luminous power to the in-coupler. (c) Power consumption comparison between the calculated X-cube 3-panel and measured TCL RayNeo X2 when producing 2-lm luminous power.
Due to its high sensitivity to human vision and superior efficiency at ultra-small pixel sizes, green InGaN/GaN LED is the favored choice for lightweight monochromatic AR glasses, such as the Even Realities G1, which weighs only 40 g. The power consumption of the green with pixel pitch at three-lumen brightness is depicted as the green line in Fig. 4(b), which increases with the average pixel lit (APL) linearly. The power consumption at APL = 0% is about 50 mW, which is attributed to the high power consumption of the driving IC.43 This method helps simplify the circuit design; however, the tradeoffs are reduced image quality and increased power consumption in the low APL region.
2.3 3-panel RGB μLEDs
At the present stage, integrating full-color with CMOS onto a single display panel without mass transfer remains a challenge.44 Although monolithic integration of full-color can be achieved by multiple bonding processes,45 its circuit design, pixel alignment, and angular color shift are the main problems. Thus far, the most mature commercial technology is to implement three monolithic RGB panels as Figs. 2(e) and 2(f) show.
At the 2023 Display Week, JBD introduced an X-cube-based “Hummingbird” projector, which integrates three RGB monochromatic panels using spatial integration through dichroic mirrors.46 This compact design, with a volume of merely 0.4 cc for a display panel, achieves a full-color pixel pitch of and weighs only 1 g, making it a successful commercial product currently available. For instance, TCL RayNeo X2 AR glasses utilize the X-cube projector and a diffractive waveguide to achieve 1500-nit brightness and 25 deg FoV. The next generation RayNeo X3 uses a similar design, and the weight is reduced from 120 to 70 g. However, the RGB pixel alignment is challenging for these high-resolution display panels. In 2024, Meta Orion AR glasses integrate three parallel panels with a SiC waveguide ( at ), and the device only weighs 98 g. Such a design alleviates the misalignment issue and further reduces the volume, although it increases the complexity and cost of the waveguide design.
To fairly compare the power consumption of full-color microdisplay light engines, we consider a 3-lm white light source consisting of RGB colors at 4:10:1 mixing ratios.47 The optical power of the RGB and the luminous power of the white light () can be expressed as ∶∶
The calculated APL-dependent power consumption is plotted in Fig. 4(b), and it is significantly higher than that of green displays due to the lower EQE of red . The duty ratio for the RGB is 25%, 7%, and 15%, respectively. To validate our calculation, we compare the calculated power consumption of a 2-lm X-cube (blue line) to the experimental data from the TCL RayNeo X2 light engine (blue dots) in Fig. 4(c). The agreement is reasonably good (within 20%), considering such a sophisticated system, including RGB μLED materials, chip size effects, driving currents, and optics. Moreover, the power consumption data of μLED are rarely reported in the literature, especially at a specific APL.
2.4 Quantum Dot Color Conversion (QDCC) μLED
Quantum dot color conversion (QDCC) technology holds significant promise in modern display applications, especially when paired with technology. The core mechanism involves using blue to excite QDs, which convert a portion of the blue light into red and green. This method not only enhances the color purity for a wide color gamut but also eases the mass transfer of , leading to reduced manufacturing cost and improved display resolution.48 At 2024 Display Week, PlayNitride demonstrated a 0.49-in. μLED display with 4536 pixels-per-inch (PPI), over 100,000-nit luminance, and 106% NTSC color space with color filters.49
However, despite these attractive features, QDCC technology faces several technical barriers, particularly in addressing degradation caused by blue light excitation. First, prolonged exposure to blue light can degrade the optical properties of QDs, lowering the conversion efficiency and shortening the device’s lifespan. This degradation is primarily due to the formation of photo-generated electrons and holes, which in turn accelerates the photo-oxidation and deterioration of the QD emissive layer. Several strategies have been developed to enhance QD stability, including surface ligand passivation,50 core-shell engineering,51 surface encapsulation,52 and material development of heavy metal-free QDs. Second, there is a tradeoff between color purity and energy efficiency. Although QDs offer excellent color purity, maintaining a proper balance between energy conversion efficiency and display color purity during the light conversion process can be challenging. Third, the optical properties of QDs are highly sensitive to their size and size distribution, which must be precisely controlled to maintain a consistent performance. Large-scale manufacturing of high-resolution QDCCs requires precision in the coating or photo-lithography process to ensure even distribution of the QDs. To minimize the QDCC film thickness, additional optical coatings, such as scattering particles or distributed Bragg reflectors (DBRs), are required.53
Considering a QDCC display excited by blue , the optical power consumption for green and red pixels can be calculated as As reported by Nanosys, the power conversion efficiency can achieve 38% for both red and green heavy-metal-free QDs.54 The power consumption for a 3-lm output is plotted as the red line in Fig. 4(b). Here, the duty ratio of the RGB QDCC is 33%, 17%, and 15%, respectively. The power consumption of QDCC is slightly lower than that of X-cube–based because of the higher efficiency of red subpixels. However, the short lifetime of green-emitting QDs and reduced quality of QD film for microdisplays are major concerns.55
2.5 Emerging μLED Technology
The III-nitride bottom-up nanowire LED is epitaxially grown layer-by-layer on a substrate using atomic self-assembly methods.56,57 The selective area growth utilizes an etched template, and the nanowires are grown on a constrained area by metal-organic vapor-phase epitaxy (MOVPE), which can achieve precise control of nanowires.58,59 The dopant incorporation is accomplished during the growing process without the etching process (which causes extra damage) to improve internal quantum efficiency.60 Furthermore, the vertical structure behaves as a waveguide, which helps to boost the LEE and then EQE of nanowire μLEDs.61 For example, Aledia announced a blue core-shell nanowire μLED with 32% EQE grown by bottom-up method.62 To date, red and green nanowire μLEDs can achieve an EQE of 8.3%32 and 25.2%,63 respectively. However, the demanding fabrication procedure, challenging integration to CMOS backplane, and broad emission spectrum from red InGaN nanowire LEDs remain major barriers.
Integrating RGB colors into one subpixel can potentially triple the resolution density. The first approach is color tunable ,64 and its emission wavelength from InGaN MQW is controlled by varying the indium concentration in different regions. In 2022, Protech introduced a dynamic pixel tuning technology that enables each pixel to emit different colors by adjusting the current density directly. A similar concept is applied in the V-groove color-tunable , where the subpixels emit different colors by increasing the current density, ranging from to ,64,65 respectively. However, the WPE of red, green, and blue is limited to 0.6%, 3%, and 1%, respectively. Ideally, a specific color corresponds to a specific current density. However, any variation in current would lead to luminance fluctuation, which requires color detection and demanding pulse width control. Furthermore, designing the active-matrix electrical backplane is highly complex due to the production yield challenges of each LED. A simpler approach is to employ time-sequential RGB colors, whereas low brightness for red, color breakup, and reduced color gamut remains to be overcome.
Alternatively, three independent RGB p-n junctions can be vertically stacked and driven independently to triple the pixel density.66,67 In 2023, Shin et al. demonstrated a full-color vertical stacked panel with a pixel pitch.68 To avoid unwanted photoluminescence, wavelength-selective polyimide (PI) absorbers are laminated between each layer as color filters and adhesive interlayers, but the LEE is compromised. Jin et al. proposed to use DBR as a bonding layer to recycle the downward light, as a result, the luminous efficiency of the white color is enhanced by 1.6% to 7.4%,69 but the light leakage at large angle incidence and photoluminescence is unavoidable. In addition, the degraded color performance, demanding alignment accuracy, and complex active-matrix circuit design are existing challenges.
3 μOLED
3.1 Tandem Structures
Tandem on a silicon CMOS backplane is emerging as a promising self-emissive light engine for AR devices due to its high-resolution density and mature fabrication process. In conventional structures [Fig. 5(a)], electrons and holes are injected from the electron transport layer (ETL) and hole transport layer (HTL), respectively, leading to electron-hole pair recombination within the emissive layer (EML) and subsequent light emission. By contrast, tandem introduces an additional charge generation layer (CGL) between two or more EMLs.72,73 The n-type CGL generates and transfers electrons to the lowest unoccupied molecular orbital (LUMO) of EML-1, whereas the p-type CGL generates and transfers holes to the highest occupied molecular orbital (HOMO) of EML-2. This CGL effectively acts as an internal electrode, enabling electron-hole recombination in both EMLs of the tandem , thereby enhancing light emission, as depicted in Fig. 5(b).
Figure 5.(a) Working principle of OLED. (b) Working principle of tandem OLED. (c) Schematic of white . (d) Schematic of RGB . (e) Schematic of birdbath structure. (f) Power consumption of RGB and white . Dots are measured data from Refs. 70 and 71.
High-resolution tandem white displays can achieve a pixel density exceeding 3000 PPI, as the pixel structure is formed using color filters, as shown in Fig. 5(c). Such a high-resolution density is made through photolithography, whereas the use of color filters reduces the optical efficiency by more than 70%.74 In 2018, Sony developed a 0.5″ white display with a pixel pitch. They addressed the angular color shift by integrating an on-chip color filter and reduced driving voltage using a 4T2C circuit design.75 However, the peak luminance of this display was limited to 2000 nits, making it unsuitable for AR eyewear. In 2024, LG Display introduced a 1.3″ full-color 3-stacked white panel with a 4175-PPI resolution density and a luminance efficiency of 38 cd/A.70 Despite the high pixel density, the display operates at 11 V, and its peak brightness is constrained to 10,000 nits at a current density of . The device is intended for mixed reality applications, but the 10,000 nits brightness is still inadequate for AR glasses in bright ambient.
As illustrated in Fig. 5(d), RGB can obtain a higher light efficiency if the color filters can be eliminated. The ETL, HTL, and CGL can be uniformly applied across the RGB subpixels, simplifying the fabrication processes,71 though additional color filters are still required on top of the structure to reduce optical crosstalk. Alternatively, the ETL, HTL, and CGL can be designed individually for each subpixel, with a black matrix inserted between subpixels to minimize crosstalk.76 The primary challenge with RGB technology is to achieve high resolution. To pattern and define high-density subpixels, fine metal masks (FMMs) are necessary. However, FMMs must maintain strong mechanical stability to avoid sagging and pixel edge blurring. A thicker FMM, although more stable, introduces a shadow effect during fabrication, degrading pixel quality. To overcome this, APS Corp. has demonstrated a laser-patterned, thin-film FMM that achieves resolutions greater than 3000 PPI.77 Other techniques, such as maskless lithography78 and selective area growth of ITO,79 have been developed for high-resolution monochromatic displays. In 2016, eMagin introduced a direct-patterned full-color 2645 PPI RGB with a peak brightness of 11,300 nits.80 In 2024, Fukuzaki et al. reported a 3207 PPI tandem RGB fabricated through photolithography.81 This display demonstrated a luminance decay time (LT90) of 130 h at an initial brightness of 88,000 nits. However, the high cost and scalability of this technology for full-color RGB mass production still require further validation.
3.2 μOLED-based AR Glasses
The primary challenge for technology is its low peak luminance, largely due to the instability of organic materials. In a waveguide-based AR system, microdisplays must deliver luminance levels of up to 1,000,000 nits. However, organic bonds would degrade under an excessive current flow,82 high exciton density,83 and localized heating,84 leading to performance degradation. To overcome this problem, displays are often integrated with freeform optics, such as birdbath optics, which offer a higher optical efficiency (), as shown in Fig. 5(e).85 The optical loss mainly originates from the absorptive polarizer on top of the OLED panel and the half mirror. Despite this improvement, only about 25% of environmental light reaches the human eye due to the beam splitter and half-mirror configuration. In addition, concerns around weight distribution and bulkiness persist. For instance, the Lenovo ThinkReality A3, released in 2021, weighs 130 g. Although the more recent TCL RayNeo Air 2S reduced the weight to 78 g, the front-heavy design of birdbath optics combined with displays remains less comfortable than the weight-balanced designs of diffractive waveguide-based AR glasses. Moreover, in birdbath optics, the size of the panel typically ranges from 0.5″ to 1.0″ (e.g., 0.71″ in LG Nreal) to increase the étendue, which is significantly larger compared with used in diffractive waveguide systems.
3.3 Power Consumption
Because and other light engines employ different projection methods, the comparison of power consumption is typically based on the brightness perceived by the human eye. On the other hand, only 25% of environmental light can be received by the eye, instead of 70% for the optical combiner in waveguide-based systems. Considering birdbath optical efficiency, then nits would be required from the panel to achieve a similar image readability.6
The power consumption calculation for RGB is similar to that of , with the primary difference being the higher driving voltage due to the tandem structure. According to eMagin’s report,71 a 0.87-in. display can reach a peak luminance of over 25,000 nits. The power consumption of the RGB tandem is represented by the red curve in Fig. 5(f) for a full-screen white display. To achieve 9500 nits at a 100% duty cycle, the array consumes . In contrast, the power consumption of white dramatically doubles to 1813 mW due to the substantial losses from color filters and the reduced EQE at high current densities [blue line in Fig. 5(f)].70 To suppress the image blurs of active-matrix display devices, such as LCD, , and , a low-duty cycle () is required.86 Under such a condition, the peak luminance of the microdisplay will be higher and neither white nor RGB tandem can achieve such a demanding requirement.
4 Liquid-Crystal-on-Silicon (LCoS)
4.1 LCoS Panel
A typical LCoS device consists of a silicon backplane, pixelated aluminum electrodes, an LC layer, an ITO common electrode, and a cover glass, as Fig. 6(a) shows. When the incoming light traverses the LC layer twice, the polarization state can be modulated by voltage-induced LC reorientations. For amplitude-modulating LCoS, an analyzer or a polarizing beam splitter (PBS) is required to convert the phase retardation to amplitude modulation. To achieve a small pixel size, field sequential color (FSC) is widely adopted to eliminate color filters. More specifically, the panel is illuminated by external RGB LEDs and several subframe images are sequentially displayed, which will be integrated into a frame image by human eyes. Figure 6(b) shows two major LC modes: mixed-mode twisted nematic (MTN) and vertical alignment (VA), which are employed in amplitude-modulating LCoS. The former is normally white, and the latter is normally dark. For MTN in the voltage-off state, the LCs are twisted from the bottom substrate to the top substrate, thus changing the polarization state of the incoming light and generating a bright state. The LCs can be reorientated toward a vertical direction by applied voltages, thus generating gray levels. The contrast ratio of the MTN mode is limited to because the LCs near the substrate are hardly reoriented by the voltage owing to strong surface anchoring in the substrate. As a comparison, the dark state for VA mode is achieved without any applied voltage, and thus a high on-axis contrast ratio can be obtained. Despite these advantages, the viewing cone of VA mode is narrower than that of MTN mode, posing a constraint on the collection cone of the projection lens.
When the pixel pitch becomes closer to the cell gap, the fringing field effect (FFE) becomes more severe. Figure 6(c) shows that the fringe field is generated between the voltage-on pixel and the adjacent voltage-off pixel, which decreases the efficiency and contrast ratio for amplitude-modulating LCoS. Although VA mode can exhibit a higher reflectance than MTN mode, its severe FFE reduces the reflectance averaged over the entire pixel. In addition to LC mode-dependent reflectance ( for MTN after considering FFE), the total LCoS reflectance (optical throughput) is also determined by the mirror reflectance (), fill factor (), and zeroth-order diffraction efficiency (), as formulated in Eq. 11. Therefore, the total MTN LCoS reflectance is . To achieve a higher optical throughput, we can employ a distributed Bragg reflector to increase and reduce inter-pixel gaps to increase the FF and mitigate diffraction losses.
4.2 LCoS Illumination System
In numerous AR scenarios, only one portion of the panel has image contents. In other words, the projected images and videos in AR glasses are usually sparse. However, RGB LEDs in conventional LCoS systems illuminate the entire panel and do not depend on the image contents. To save power consumption, both Avegant and Meta87,88 proposed to turn on an LED array that only illuminates a certain portion of the LCoS panel with image contents, as shown in Fig. 6(d). The independently controlled (local dimming) zones are required. Compared with direct-view liquid crystal displays (LCDs) with local dimming zones,89 LCoS requires much smaller LED sizes to achieve the same function. Power saving depends on the image contents, zone design, and zone number. Besides power saving, it can also mitigate the issue of gray boxes. Compared with emissive displays such as μLEDs and tandem μOLEDs, the contrast ratio of LCoS is usually limited, leading to an imperfect dark state. When ambient light is not bright, and the APL of image contents is low, the gray box becomes much more evident as shown in Fig. 6(e). The zonal illuminated architecture can greatly eliminate the gray box by completely turning off the LEDs that originally illuminated the gray box. Finally, the local dimming also enables local primary desaturation (LPD)90 to mitigate the color breakup (CBU) caused by eye saccade or quick head rotation. This algorithm was originally proposed to suppress the CBU in FSC LCDs91 and is applicable to FSC LCoS as well. The key point is that the color gamut can be locally shrunk by adopting three desaturated primary colors. It is noted that LPD can increase the luminance by turning on RGB LEDs in each of the three subframes. Although zonal illumination has numerous benefits, how to design illumination systems and increase uniformity while keeping a compact form factor remains challenging.
Owing to its bulky PBS cube, the form factor of LCoS is much larger compared with self-emissive displays. To reduce the volume, Himax developed a front-lit LCoS92 with a volume of by introducing the micro-PBS array, as illustrated in Fig. 6(f). The light beam (S) emitted from the RGB mini-LED array is collimated and split into A and A′ by the coupling lens. Both A and A′ will be s-polarized by the employed wire-grid polarizer. The A-s will directly be reflected by the micro-PBS array toward the LCoS. The A’-s will first experience a total internal reflection, then hit the micromirror array, and is finally reflected toward the LCoS. Owing to the polarization modulation of the LCoS, A-s and A’-s are converted to B-p and B’-p and then passed through the micro-PBS array and the clean-up polarizer, which is used for further improving the contrast ratio. In addition to Himax, Avegant also demonstrated an ultracompact LCoS. However, each approach has its own pros and cons. For example, the optical efficiency of the Himax approach is only for a linearly polarized light, whereas the contrast ratio of the Avegant method is only about 100:1.
To reduce the volume of LCoS while keeping a high optical efficiency and high contrast ratio, Luo et al.93 proposed a novel illumination system. The key elements to enable the compact illumination system include an in-coupling prism and a light guide plate (LGP) with multiple parallelepiped extraction prisms, as shown in Fig. 6(g). The in-coupling prism couples the light emitted from the LEDs into the LGP, which will continue to travel inside the LGP owing to total internal reflection. The enlarged figure shows that some of the light beams will hit the extraction prisms and be reflected toward the bottom LCoS and other light beams continue to travel inside the LGP. For an FSC LCoS panel with pixel pitch, the simulated optical efficiency using a linearly polarized incident light can reach around 40%, and the volume (without the projection optics) is only . To ease the fabrication process, Luo et al. also proposed four thin PBS cuboids to reduce the volume to 25% of the conventional PBS-based system,94 as shown in Fig. 6(h). The optical efficiency is about 36.7% for an unpolarized input light.
4.3 LCoS Power Consumption
The total power consumption of an LCoS system consists of two parts: electrical power from the CMOS backplane to drive the LC molecules and optical power from the backlight. The electrical power consumption is mainly determined by the APL and LC mode. Different LC modes not only affect the contrast ratio but also lead to content-dependent electrical power consumption. For “normally white” MTN LCoS, a lower APL content (i.e., more pixels are in a dark state) consumes more electrical power because the applied voltage is required to reorientate LCs. To reduce electrical power consumption, Cho et al. presented an 8T static-random-access memory structure to reduce the power of a CMOS by 78%.95 Except for the improvements in circuit design, the power consumption for LCoS at the resolution of can be reduced from 200 to 50 mW by improving the CMOS process from to 55 nm.92
The LCoS panel is illuminated by RGB LEDs or a passive matrix (PM) mini-LED array. As stated in Secs. 4.1 and 4.2, the major optical losses come from the limited polarization conversion efficiency of the unpolarized LED array, imperfect reflectance from the LCoS panel, and illumination system. As reported in SID Display Week 2024, the optical efficiency of the Himax’s front-lit LCoS is 8.5%, including the LCoS panel and polarization conversion.92 This optical efficiency is defined by the ratio of total lumen received after the front-lit illumination system versus the collected light (in all angles) emitted from the LED. The LED optical power consumption can be calculated as where and correspond to the color sequential duty ratio and acceptance angle of the projection lens, respectively. is the human eye’s sensitivity to the light source and corresponds to the system efficiency of the illumination system.
The WPEs of commercial RGB-packed LEDs produced by LumiLED (model L1MC-RGB0028000MP0) are found to be 30.8%, 30.7%, and 57%, respectively. The overall optical efficiency to provide 3-lm white light is calculated to be 781 mW, which is close to the Himax estimated power consumption of 4 lm/W. The total power consumption for the Himax front-lit MTN LCoS to provide 3-lm light is about 831 mW to 781 mW as the APL increases from 0% to 100%. Assuming the polarization recycling efficiency is 70%, Luo’s design93 can further reduce the LED power consumption to only 312 mW to 262 mW.
5 Digital Light Processing (DLP)
5.1 DLP Working Principle
Similar to LCoS, DLP is based on a digital micromirror device (DMD) paired with micro-electro-mechanical system (MEMS) control, developed by Texas Instrument (TI). The DMD is composed of a micromirror array and each micromirror corresponds to a full-color pixel. Figure 7(a) depicts the working principle of DMD.96 The unpolarized FSC LEDs illuminate the DMD panel, and the on/off state is controlled by the rotation of mirrors. In the voltage-on state, the DMD directs the incident light to the projection lens, whereas in the voltage-off state, the reflected light is absorbed by the coated black paint. In comparison with LCoS, DMD typically has a higher optical efficiency because it can use unpolarized light, except that its illumination system composed of freeform optics is more complicated and bulkier.98
Similar to LCoS, the power consumption of a DLP projector is divided into two parts: optical power from LEDs and electrical power from MEMS. For TI’s DLP3010 with a resolution (pixel pitch ), the electrical power consumption is ranged from 162 to 219 mW. By reducing the pixel pitch to , the power consumption can be further reduced by 30%.99
The optical efficiency is mainly dependent on the DMD efficiency, illumination system efficiency, and multi-layer transmittance. For TI’s DLP3010 with pixel pitch, the DMD efficiency is calculated to be 68% with a consideration of 92% double path window transmission, 93% mirror filling factor, 86% diffraction, and 89% reflectance from mirrors. The illumination system is the most important part of a DLP panel. In 2017, TI proposed an R/B 2-in-1 illumination system for microdisplays and achieved over 70% geometric efficiency by Zemax ray-tracing.97 However, the illumination optics are still too bulky because two LED panels are included in the system as shown in Fig. 7(b). In 2023, TI proposed a new compact optical architecture by shrinking the illumination light source into a single full-color LED panel and removing the dichroic mirror [Fig. 7(c)]. However, the geometric efficiency of RGB channels is reduced to 42.7%, 40.4%, and 35.7%,90 respectively. Considering the material absorption and multi-layer transmittance, which is approximated to be 65%, as well as the 68% DMD efficiency, the optical engine efficiency () for the RGB LEDs is about 18.9%, 17.9%, and 15.8%, respectively. The LED optical power consumption can be calculated as where corresponds to the color sequential duty ratio. By considering the WPE of RGB-packed LEDs to be 30.8%, 30.7%, and 57%, respectively, our calculated overall optical efficiency to provide a 3-lm white light is 222.6 mW (i.e., 13.5 lm/W), which is in a very good agreement with the TI’s estimated power consumption of 14 lm/W.
6 Laser Beam Scanning (LBS)
6.1 LBS Working Principle
LBS is a mature technology in the realm of augmented reality (AR) displays, offering unique advantages in terms of high brightness, small volume (), and high power efficiency. An LBS is composed of an RGB laser module and a MEMS scanner. The RGB lasers are combined by dichroic mirrors and packed as described in Fig. 8(a).101 Different from DLP, the MEMS in LBS is composed of two 1D mirrors or one 2D mirror. From the viewpoint of compactness, one 2D MEMS mirror is preferred as shown in Fig. 8(b), but its driving power consumption is about 10% higher.102 The MEMS driving power consumption depends on the scanner specifications, such as scan frequencies and product of maximum scan half angle () and mirror diameter () ( product). These values are determined by the required display resolution and frame rate.103,104 The (unit: deg·mm) of a 1D scanner can be approximated as105where is the mirror shape factor ( for rectangular mirror and for circular mirror), is the number of pixels, and is the light wavelength.
The power consumption of the LBS system consists of two parts: the laser’s optical power consumption and the MEMS scanner’s electrical power consumption. Compared with LEDs, laser light sources are more directional with a narrower FWHM, but the downsides are lower WPE and speckles. The WPE of RGB lasers with emission wavelengths of [640 nm, 532 nm, 450 nm] is [25%, 14%, 10%], respectively.106 Although both DLP and LBS utilize MEMS, the illumination systems have several differences. Due to a higher directionality, the laser beams are collimated in the illumination system. However, the resolution is proportional to the scan mirror size and scan angle. Therefore, for a high-resolution LBS display, the efficiency may be reduced by a broader modulation transfer function (MTF).107 For a typical scanning display, the pixel size is chosen at the FWHM of the Gaussian spot103 and the MTF efficiency () is calculated approximately to be 76%. To reduce laser speckles, a despeckle film is typically applied at the input coupler to reduce the coherence and its efficiency can approach 80%.108 To provide a white light to the input coupler, the power consumption of the laser diodes in MEMS can be calculated as where , , , and represent the efficiency of despeckle film, MEMS mirror, multilayer transmittance, and MTF efficiency, respectively. Considering and , the required laser power is 373 mW for a 2D MEMS mirror.
The power consumption of the MEMS mirror is heavily dependent on the actuation methods, product, and frame rate. To activate a MEMS mirror, the power consumption is below 10 mW.109,110 However, due to the high driving voltage requirements, the driving electronics consume significantly more power. In 2007, Kim et al. demonstrated electromagnetic actuation MEMS, achieving a pixel size of , a resolution of , and a power consumption of 150 mW, though the frame rate was limited to 18.5 Hz.111 In 2012, Lemoptix used electromagnetic 2D MEMS mirrors to achieve a 1.5 cc micro projector with 100 mW power consumption and 20 kHz scanning rate to achieve a resolution of SVGA () at 60 Hz frame rate.102 To reduce power consumption, electrostatic actuation based on comb drivers or piezoelectric actuation with AlN actuators is preferred. In 2019, Microsoft introduced the HoloLens 2 with a field of view (FoV) of and a resolution of 47 pixels per degree (PPD). This design employed two vertical-stacked lasers per color, with fast mirrors using piezoelectric actuation and slow mirrors using electromagnetic actuation. The power consumption for the entire display and image processing unit was per panel.7 In 2022, STMicroelectronics reported that 1D MEMS could reduce the driving voltage from 200 to 17 V by transitioning from electrostatic to piezoelectric actuation. At a resonance frequency of 27.5 kHz, power consumption dropped to 20 mW, though the product was limited to 15 deg·mm.112 OQmented reported in 2021 and 2023 SPIE that Lissajous scan 2D MEMS mirrors with a resolution of at a 60 Hz frame rate only consume 200 mW at sparse content.100,113 Based on Eq. (14), the product of the 2D circular piezoelectric mirror is 22.73 and 11.36 deg·mm for the fast and slow axis, respectively. The oscillation frequency of the fast and slow axis is 35 kHz and 600 Hz, respectively, to ensure a good image quality. Because the electrical power consumption of MEMS is proportional to the oscillation frequency, and increases with the product, here, we estimate the electrical power consumption of MEMS based on the measured data.112Figures 8(c) and 8(d) indicate that our estimated power consumption of MEMS is 105 mW and 30 mW for the fast and slow axis, respectively. That means, the total MEMS electrical power consumption is 135 mW. Considering the abovementioned 373 mW optical laser power, at the sparse image content (), our estimated total power consumption is , which is close to the measured data (less than 200 mW) reported by OQmented.
7 Smart Dimmer
The ambient contrast ratio (ACR) of an AR eyewear is defined as where , and represent the on-state display luminance, off-state display luminance, ambient light luminance, and transmittance of the optical combiner. Achieving a high ambient contrast ratio (ACR) is crucial for users, as it enhances the visual sensitivity to image contents.114 According to Eq. (16), to enhance the ACR of AR eyewear, two approaches can be considered: (1) boosting by an adaptive brightness control and (2) adding a smart dimmer to reduce . The former will undoubtedly lead to increased power consumption, whereas the latter helps to reduce the ambient light.
Two types of smart dimmers have been developed: a single pixel for global dimming and segmented pixels for local dimming. Figures 9(a)–9(c) illustrate the AR eyewear without any dimmer, with a single pixel global dimmer, and with a segmented dimmer,115 respectively. Using a global dimmer [Fig. 9(b)], the transmittance of the ambient light changes uniformly. For a segmented dimmer, the dimmer consists of several zones so that the transparency of each zone can be individually manipulated by the applied voltage [Fig. 9(c)]. In addition, a smart dimmer is preferred to be “normally transparent” to ensure fail-safe operation. This means in the case of circuit malfunction; the dimmer remains in the see-through state.
In terms of device operation mechanisms, the electrochromic (EC) effect, film-compensated liquid crystal modulator, and dichroic dye-doped liquid crystal have been demonstrated. Figure 9(d) shows the device structure of a global smart dimmer using the EC effect.116 When a voltage is applied, ions from the storage layer are driven through the ion conductor layer toward the EC layer. Afterward, these ions interact with the EC material and cause a redox reaction with a change in the oxidation state of the EC material, resulting in a transmittance change. In addition, the organic EC material can maintain this final state without applying voltage, which helps to save power. The original state can be reversed by applying an opposite voltage. The transmittance of the bright state and the dark state can achieve 90% and 0.01%, respectively. However, the slow response time () of EC materials is a major challenge.117 To shorten the response time to , Kortz et al. developed complementary organic EC materials with transparent nano-particle layer electrodes, but the peak transmittance is reduced to .118
The second type of smart dimmer is a film-compensated homogeneous LC cell119 used in Fig. 9(b). The contrast ratio is greater than 300:1 and its fast response time (8 ms) enables 120 Hz frame rate. In 2022, such a segmented smart dimmer ( zones; aperture size = 1.57'') with a tunable transmittance from 22% to 0.3% has been implemented in Magic Leap 2 to improve the ACR and to enable occlusion.120 In 2024 SID Display Week, Liqxtal Technology reported a high contrast LC dimmer with 33% bright state transmittance and 0.1% dark state transmittance.115 To mitigate the diffraction patterns caused by the periodic pixels, they applied an S-curved scan and data lines with an impressive 96% aperture ratio.
The third type is dichroic dye-doped LC cell [Fig. 9(c)] whose major advantage is polarization independence. However, the contrast ratio is typically ∶ for a single-layer device because of the limited dichroic ratio of the dyes. In 2016, Zhu et al. first proposed a fast-response guest-host LC dimmer for AR displays with a transmittance tunable from 73% to 26%.121 In 2024, Liqxtal Technology also reported a dye-doped LC dimmer with 56% bright-state transmittance and 16% dark-state transmittance.115 By optimizing the dye material and concentration, LC cell gap, and device structure, the transmittance range can be extended from to or even wider.
The performance of all three types of smart dimmers is summarized in Table 1. The electrochromic effect can offer an excellent contrast ratio, but its response time is relatively slow. By contrast, LC-based smart dimmers have a much faster response time to enable real-time dynamic dimming; especially, the film-compensated LC dimmer provides both high contrast and fast response time and is useful for enhancing the immersion of AR eyewear. On the other hand, the major advantage of a dye-doped LC smart dimmer is its polarization independence.
Table 1. Performance of three types of smart dimmers. The LC dimmer data are taken from Ref. 115.
Table 1. Performance of three types of smart dimmers. The LC dimmer data are taken from Ref. 115.
Electrochromic
Film-compensated LC
Dye-doped LC
Bright state transmittance
60%
33%
70%
Dark state transmittance
0.01%
0.1%
10%
Contrast ratio
6000:1
330:1
7:1
Response time (ms)
8
12
8 Conclusions and Outlook
We have reviewed the recent advancements and challenges of six microdisplay light engines, particularly focusing on their power consumption. Table 2 summarizes their advantages, challenges, and possible solutions.
Table 2. Advantages, challenges, and possible solutions of six presented light engines.
Table 2. Advantages, challenges, and possible solutions of six presented light engines.
Advantages
Challenges
Solutions
3-panel
High efficiency of green monochromatic display.
Alignment of RGB colors.
Monolithic integration.
Image quality.
Low efficiency for red color.
CMQW structure.
Long lifetime.
High power consumption of driving IC.
Advanced CMOS process.
QDCC
Easier fabrication for single-color panel.
Stability and lifetime.
Surface and material engineering.
Wide color gamut.
Low conversion efficiency.
High-resolution density (6000 PPI).
Require birdbath optics due to insufficient brightness.
RGB OLED with improved mask fabrication.
Low efficiency at high current.
Bulkiness.
LCoS
High stability.
Compactness and efficiency trade-off.
Novel illumination architecture.
Mature technology.
Require a polarized light source.
Low cost.
DLP
High optical efficiency.
Difficult to further reduce pixel size.
Novel illumination architecture.
Bulky illumination system.
LBS
High optical efficiency.
Poor image quality.
Additional compensation film.
Low power consumption of MEMS mirrors.
Limited frame rate.
Faster resonance of piezoelectric actuation.
Figure 10(a) shows a comparison of our calculated power consumption of existing light engines based on 3-lm luminous power received at the in-coupler produced by a resolution microdisplay light engine. Considering both power consumption and image quality, a monochromatic display is an attractive option for entry-level lightweight AR glasses. For the X-cube–based full-color , constraining the power consumption below 1 W to prevent overheating would undoubtedly limit the image content to . Under the same scenarios, LCoS, LBS, and DLP can meet such criterion at , which represents the case of web browsers.
Figure 10.(a), (b) Calculated power consumption of commonly employed microdisplay light engines for AR glasses at the current stage when producing (a) 3-lm and (b) 1-lm luminous power. (c), (d) Calculated power consumption of microdisplay light engines with improved architectures when producing (c) 3-lm and (d) 1-lm luminous power. CMQW stands for JBD’s continuous MQW.
Implementing a smart dimmer helps to reduce power consumption and improve the ambient contrast ratio. If the ambient light transmittance can be reduced to 10%, then the display brightness of 1300 nits is adequate to achieve a 4:1 ambient contrast ratio, which is equivalent to coupling 1-lm luminous power into the in-coupler of the waveguide. In this scenario, the electrical power consumption remains the same, but the optical power consumption can be reduced to 1/3 of the original value. As described in Fig. 10(b), all the light engines are capable of avoiding the overheating threshold of 1 W/panel.
Both and LCoS hold great potential to be the ultimate solutions for next-generation AR glasses if the power consumption can be further reduced as shown in Figs. 10(c) and 10(d) without and with smart dimmer, respectively. For driving IC, the power consumption can be further reduced by using an advanced CMOS process to decrease the operating voltage and adopting a power management IC to shut down unused blocks and adjust the frequency and voltage based on the display content.122 By replacing CMOS with 55-nm CMOS, the power consumption can be reduced by . For X-cube–based , the major challenge comes from the low efficiency of small-size AlGaInP red . By implementing JBD’s novel continuous MQW design, which shows 8% WPE, the power consumption can be significantly reduced. With the help of a smart dimmer, it is possible to avoid overheating in outdoor conditions, even for “normally dark” circuit design. For the QDCC , the photostability issue must be overcome before it can be considered further for practical applications. In the case of LCoS, the novel front-lit design from Luo et al.85 can greatly reduce the power consumption to a level lower than DLP. By further implementing local dimming, the power consumption can be even lower than that of LBS. For DLP, the remaining challenges are in the fabrication of small micro-mirrors, the reduced optical efficiency for small pixels, and the increased overall system volume because of the employed freeform optics. LBS remains the most power-efficient light engine for microdisplays due to its highly collimated laser source and low-power piezoelectric MEMS actuation, whereas its limited frame rate and raster-scanned image quality require further improvement.