In computer vision and robotics, accurate 3D positioning and trajectory determination are crucial for a variety of applications, including industrial and clinical [
Journal of the European Optical Society-Rapid Publications, Volume. 20, Issue 1, 2024032(2024)
Digital holographic microscopy applied to 3D computer micro-vision by using deep neural networks
Deep neural networks (DNNs) are increasingly employed across diverse fields of applied science, particularly in areas like computer vision and image processing, where they enhance the performance of instruments. Various advanced coherent imaging techniques, including digital holography, leverage different deep architectures like convolutional neural networks (CNN) or Vision Transformers (ViT). These architectures enable the extraction of diverse metrics such as autofocusing reconstruction distance or 3D position determination, facilitating applications in automated microscopy and phase image restitution. In this work, we propose a hybrid approach utilizing an adapted version of the GedankenNet model, coupled with a UNet-like model, for the purpose of accessing micro-objects 3D pose measurements. These networks are trained on simulated holographic datasets. Our approach achieves an accuracy of 98% in inferring the 3D poses. We show that a GedankenNet can be used as a regression tool and is faster than a Tiny-ViT (TViT) model. Overall, integrating deep neural networks into digital holographic microscopy and 3D computer micro-vision holds the promise of significantly enhancing the robustness and processing speed of holograms for precise 3D position inference and control, particularly in micro-robotics applications.
1 Introduction
In computer vision and robotics, accurate 3D positioning and trajectory determination are crucial for a variety of applications, including industrial and clinical [
2 Theoretical background and context
2.1 Deep neural networks
DNNs inspired by biological neural networks, process, classify, and predict complex data through multi-layer structures. These networks employ non-linear transformations from input to output layers, enabling tasks like linearization in higher-dimensional spaces [
2.2 Digital holographic microscopy and computer micro-vision for micro-robotics
DH is an advanced imaging technique capturing both amplitude and phase of an object’s entire wavefield using a CMOS imaging sensor. In
Figure 1.(a) Lyncee-tec DHM observing a micro-structured pattern moved by a hexapod stage. (b) A typical experimental hologram of a pseudo-periodic pattern that allow 3D pose measurement [2]. Image reconstruction (c) in amplitude and (d) in phase at a numerical in-focus distance of 185 μm.
Digital hologram reconstruction relies on the Angular Spectrum Method [
3 Positioning models (X, Y and Z)
In this work, we combine previous autofocusing with DHM accelerated with DNN [
Figure 2.(a–c) Thumbnail reconstruction. (d–f) Assess the distance Z. (a) A ROI of 768 × 768 is cropped from the hologram at a fixed position. (b) XY Model (based on a UNet like model). (c) The reconstructed thumbnail of 64 × 64 pixels. (d) A ROI of 128 × 128 is randomly cropped from the hologram space. (e) Z model based on an adapted version of a GedankenNet model [3]. (f) The distance Z.
4 Methodology
We address this issue by applying DNNs to micro-vision measurement of 3D trajectories with DH. Recently, we demonstrated the ability of new generation of deep neural networks such as ViT to predict the in-focus distance with a high accuracy [
DNNs require training to realize expected tasks and to reach the best performances. In our work, the training step is conducted from a dataset constituted by simulated holograms. Various experimental parameters have been considered in simulations such as spherical aberration introduced by objective microscope lens, and has been implemented in simulated hologram datasets, with the aim of being able to mimic real experimental conditions. To rigorously evaluate the effectiveness of the proposed methodology, which integrates DH with DNNs and video-rate micro-vision, we conducted a comprehensive validation through simulation. Our primary objective was to assess the DNNs capability to predict a simulated 3D trajectory under precisely controlled conditions. For this purpose, we selected a Lissajous’ figure (result of superposing two harmonic motions on the X-Y plane). This complex trajectory served as a challenging yet well-defined path for rigorously testing the capabilities of the DH-DNN system. We simulated a complete 3D trajectory of 2D pseudo-periodic pattern with period of 9 μm, displaced by the hexapod stage (
5 Results
We present the results obtained from the DH-DNN system methodology for predicting 3D trajectories. The models (XY Model and Z Model) have been trained using a total of 65000 simulated holograms. The XY Model is using binary cross entropy loss. The Z Model has been trained using a cross-validation method using the TanhExp loss function [
Figure 3.(a) Outliers (in red), simulated (in blue) and estimated (in green) trajectory in the 3D space. (b) Z and X-Y errors in μm (absolute difference and L2-norm). The Z error is mostly below an error of 1 μm (red dashed line).
Figure 4.Matching rate associated to each 3D pose (red: outliers, green: right 3D poses).
6 Conclusions
We propose a method that enables the direct determination of 3D positions from hologram space with a mean error of 1 μm on Z and 12 μm on X-Y, effectively bypassing the need for full holographic image reconstruction. These errors must be compared to the complete encoded area of 11 × 11 cm2. Moreover, our study offers a thorough analysis of the matching rate levels attributed to each 3D pose. We believe it is the first time a GedankenNet model is used as a regression tool. The modified GedankenNet (Z Model) achieved an inference speed of 2.5 ms, contrasting with the over 20 ms required by a TViT [
[1] S. Yao, H. Li, S. Pang, B. Zhu, X. Zhang, S. Fatikow.
[2] S. Cuenat, L. Andréoli, A.N. André, P. Sandoz, G.J. Laurent, R. Couturier, M. Jacquot.
[3] L. Huang, H. Chen, T. Liu et al. Self-supervised learning of hologram reconstruction using physics consistency.
[5] T. Zeng, Y. Zhu, E.Y. Lam.
[6] A.N. André, P. Sandoz, B. Mauzé, M. Jacquot, G.J. Laurent.
[7] A.N. André, P. Sandoz, B. Mauzé, M. Jacquot, G.J. Laurent.
[8] J.W. Goodman.
Get Citation
Copy Citation Text
Stéphane Cuenat, Jesús E. Brito Carcaño, Belal Ahmad, Patrick Sandoz, Raphaël Couturier, Guillaume J. Laurent, Maxime Jacquot. Digital holographic microscopy applied to 3D computer micro-vision by using deep neural networks[J]. Journal of the European Optical Society-Rapid Publications, 2024, 20(1): 2024032
Category: Research Articles
Received: Jan. 31, 2024
Accepted: Jun. 13, 2024
Published Online: Dec. 16, 2024
The Author Email: Maxime Jacquot (maxime.jacquot@univ-fcomte.fr)