Authentication through residual attention-based processing of tampered optical responses

Blake Wilson; Yuheng Chen; Daksh Kumar Singh; Rohan Ojha; Jaxon Pottle; Michael Bezick; Alexandra Boltasseva; Vladimir M. Shalaev; Alexander V. Kildishev

doi:10.1117/1.AP.6.5.056002

1 Introduction

The semiconductor industry has grown into a $500 billion global market over the last 60 years. However, the semiconductor fabrication pipeline has become fragmented, inadvertently giving rise to a $75 billion counterfeit chip market that jeopardizes safety and security across multiple sectors dependent on semiconductor technologies, such as aviation, communications, quantum technologies, artificial intelligence, and personal finance.1–5 Several techniques aimed at affirming semiconductor authenticity have been introduced to detect counterfeit chips, largely leveraging physical security tags baked into the chip functionality or packaging.6–13 Central to many of these methods are physical unclonable functions (PUFs),14,15 which are unique physical systems that are difficult to replicate, either because of economic constraints or inherent physical properties. Rather than being grounded in cryptographic hardness, PUFs emphasize the economic and technological challenges of duplicating a given system’s physical characteristics.16 Optical PUFs, which capitalize on the distinct optical responses of random media, are especially promising. However, achieving scalability and maintaining accurate discrimination between adversarial tampering and natural degradation, such as physical aging at higher temperatures, packaging abrasions, and humidity, poses significant challenges.17–19

To combat these difficulties, this study focuses on an optical PUF model utilizing the distance matrix constructed from the positions and radii of random gold nanoparticles.20 An overview of the PUF tamper detection method is shown in Fig. 1. Due to the extreme difficulty of replicating large sets of nanoparticles with precise positions and radii, the distance matrix acts as the PUF signature. However, we demonstrate that current verification methods for distance matrix PUFs are neither sufficiently scalable nor robust enough for discriminating between natural disturbances and adversarial tampering. First, we take dark-field images of nanoparticles that are randomly distributed. The random positions and radii are extracted using semantic segmentation and labeled clustering. Then, the nanoparticles undergo treatment due to either natural degradation, e.g., minor thermal treatment and packaging abrasions, or adversarial tampering, e.g., substrate tearing, thermal tampering, and refilling. After the nanoparticles are exposed to either kind of treatment, the nanoparticle positions and radii are remeasured, and a new, posttampered distance matrix is compared against the pretampered distance matrices. Previous works use variations of the Hausdorff distance metric to compare pre- and posttampering measurements. In addition to the Hausdorff metric, we also apply the Procrustes matrix distance and average-Hausdorff-distance metrics21–30 as analytical, classical methods for discrimination.

Figure 1. PUF sampling process. An overview of the PUF tamper detection method using distance matrices of randomly positioned gold nanoparticles. The process consists of four primary stages. (i) Gold nanoparticles are randomly introduced, serving as a distinct physical system. (ii) The nanoparticles’ distance matrix is recorded and archived in a reference database. (iii) The system may experience external tampering or natural degradation that can modify its initial state. (iv) The distance matrix is reassessed and cross-referenced with the initial database to identify any potential tampering or other changes.

However, under more difficult assumptions of adversarial tampering, both the Hausdorff and Procrustes metrics can be provably tampered with, as we show in Sec. 4. Addressing this gap, we present a novel deep-learning approach using residual, attention-based processing of tampered optical responses (RAPTOR),31–33 showing marked improvements in both speed and accuracy under diverse adversarial tampering conditions.

Overall, the novelty of our approach lies in

(1) being the first method to apply an attention mechanism for PUF authentication, using the nanoparticle radii as soft weights and the posttamper distance matrix as a value matrix;
(2) developing data set generation methods for gold nanoparticle PUFs for which there is no existing public data set;
(3) achieving high verification accuracy under difficult, real-world tampering schema using machine learning to verify the gold nanoparticle PUFs.

We begin by discussing the importance of optical PUFs for semiconductor authentication and then spotlight the challenges in current verification methods. We then introduce a statistical approach to overcoming these challenges by formalizing the problem of adversarial tampering detection. We conclude by providing accuracy and speed results for both the average distance analysis and RAPTOR.

2 Background

2.1 Physical Unclonable Functions

PUFs are distinctive physical systems characterized by a unique, irreplicable, physical fingerprint. PUFs yield a probability distribution over random measurements of a system that is practically unclonable due to current technology, economic factors, or time constraints. That is, given two random physical systems, the probability of obtaining the same distribution of measurements is extremely low. An adversary will attempt to replicate the physical system that yields the measurement distribution in order to spoof any detection schemes. The detection of adversarial tampering features introduced during the spoofing process is based on the following steps: (1) PUF system preparation, (2) pretamper measurements, (3) random tampering, and (4) posttampering adversarial detection. Previous works primarily implement this detection method using optical PUFs, which construct unique scattering and/or spectral responses of random media.9,14 Optical PUFs are easy to fabricate and quick to measure, making them ideal for proof-of-concept experiments. Likewise, several other physical systems exhibit similar levels of randomness and measurability, including resonators,17 laser-induced speckle patterns,6 memristors,10 memtransistors,10 and intentional damage in glass.34 However, nanoscale metallic optical systems, otherwise known as plasmonic PUFs, have been rising in popularity due to their strong scattering response at optical wavelengths, increasing robustness during posttampering measurements. Among the early instances of plasmonic PUFs are responses from dichroic gold barcodes,35 anisotropic gold nanoparticles grown within thin silicon dioxide films,36 distinct surface plasmon resonance modes,37 unique molecular configurations embedded in multilayer structures,18,38 and 100 nm gold nanorods.39 Nevertheless, while serving as viable PUF prototypes, these methods grapple with scalability challenges, either in fabrication or measurement robustness.
To address these limitations, we reintroduce a streamlined, plasmonic PUF suitable for large-scale applications: the distance matrix verification of gold nanoparticles.20 As we argue in Appendix A (Sec. 6.4), gold nanoparticles are sufficiently random during fabrication and can easily be measured using dark-field microscopy, a readily available technique that can integrate seamlessly into any stage of the semiconductor fabrication pipeline.

2.2 Distance Matrix PUFs

Figure 2 shows the distance matrix extraction process based on gold nanoparticle PUFs from dark-field images. The detailed segmentation process is found in Sec. 4.1 and Appendix A (Sec. 6.3). Distance matrix PUFs are given by the distance matrix $D$ constructed by all pairwise distances between nanoparticle positions. Let $d (r_{i}, r_{j})$ be the Euclidean distance between nanoparticles $i$ and $j$ , with positions $r_{i}$ and $r_{j}$ , respectively; then the distance matrix elements $D_{i j}$ are defined as $D_{i j} ≜ d (r_{i}, r_{j})$ . The merit of the distance matrix as a PUF lies in its symmetry properties: it is rotationally and translationally invariant, renormalizable, and simple enough for computer-vision measurements across varying fields of view and orientations. It is important to note that the use of distance matrix PUFs makes an implicit assumption that the probability of introducing random translations and rotations during measurement is much higher than that of fabricating two systems that are identical under a rotation and translation symmetry. This ensures that in-plane distance matrices are uniquely associated with their system state, barring unlikely rotational and translational symmetries introduced during fabrication. This motivates our use of distance matrices as reliable PUFs as we now introduce their analysis. Smith et al.20 showed that the Hausdorff distance is robust in accounting for $5 μ m$ translations as well as illumination discrepancies in the imaging process. In this study, we expanded the tests with a wider range of adversarial tampering through simulation by increasing the translation and rotation of the imaging lens, increasing the noise perturbations of the nanoparticle positions, and introducing adversarial tearing and refilling, as described in detail in Sec. 3.2.
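
The invariance argument above can be checked numerically. The following is a minimal sketch, not the authors' code: the four particle centers, the rotation angle, and the shift are made-up values, and the helper names `distance_matrix` and `rigid_transform` are hypothetical. It builds $D_{ij} = d(r_i, r_j)$ and confirms that re-imaging under a rotation and translation leaves every entry unchanged up to floating-point error.

```python
import math

def distance_matrix(points):
    """Return D with D[i][j] = Euclidean distance between points i and j."""
    return [[math.dist(p, q) for q in points] for p in points]

def rigid_transform(points, angle, shift):
    """Rotate points by `angle` (radians) about the origin, then translate by `shift`."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y + shift[0], s * x + c * y + shift[1]) for x, y in points]

# Hypothetical nanoparticle centers in normalized substrate coordinates.
centers = [(0.10, 0.20), (0.75, 0.30), (0.40, 0.90), (0.55, 0.55)]
D = distance_matrix(centers)

# Simulate re-imaging with a rotated and shifted field of view.
moved = rigid_transform(centers, angle=0.7, shift=(0.3, -0.1))
D_moved = distance_matrix(moved)

# Largest change in any pairwise distance; it should be at rounding-error level.
max_change = max(
    abs(a - b) for row_a, row_b in zip(D, D_moved) for a, b in zip(row_a, row_b)
)
print(f"largest change in any pairwise distance: {max_change:.2e}")
```

Note that the invariance holds only when the particle ordering is preserved; a permutation of the particles permutes the rows and columns of $D$.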

Figure 2. Distance matrix extraction from dark-field images. Nanoparticle dark-field images of size $448 \times 448$ pixels are prepared using dark-field microscopy. Then, the segmentation process classifies pixels as belonging to either a nanoparticle pattern or the dark-field background. Next, nanoparticle pattern pixel regions are clustered into local particle patterns, and their centers of mass (purple points) are extracted. Finally, the distance matrix is generated by evaluating all pairwise distances between these nanoparticle patterns. We visualize the distance matrix using its minimum spanning tree, although the full distance graph is all-to-all. All scale bars represent $20 μ m$ .

3 Methods

Figure 3 presents our machine-learning-assisted authentication flowchart from fabrication to tampering detection. Consider a physical system state $x \sim p(x)$ generated by a fabrication process $p(x)$. A PUF gives a distribution over measurements $m \sim p(m | x)$ of the system conditioned on the system state. After recording a set of measurements $M = \{m_{0}, \dots, m_{|M| - 1}\}$, the system state $x$ evolves to a new state $x^{'}$ via either an adversarial tampering process $x^{'} \sim q_{a}(x^{'} | x)$ or a natural degradation process $x^{'} \sim q_{n}(x^{'} | x)$, e.g., natural thermal changes or packaging abrasions. An independent Bernoulli variable $β \sim B$ chooses which of the two distributions produces the state evolution. The general tampering distribution $q(x^{'} | x, β)$ is conditioned on the initial system state $x$ and the tampering indicator $β$, i.e., $q(x^{'} | x, β = 0) = q_{n}(x^{'} | x)$ and $q(x^{'} | x, β = 1) = q_{a}(x^{'} | x)$. Once the system has undergone the chosen tampering, we record the posttampering measurements $m^{'} \sim p(m^{'} | x^{'})$ in a new database $M^{'} = \{m_{0}^{'}, \dots, m_{|M^{'}| - 1}^{'}\}$. Using a discriminator function $Y_{θ}(m, m^{'})$, with variational parameters $θ$, we infer the tampering indicator $β$ to determine whether the system underwent a natural degradation process or the adversarial tampering process. Our objective for detecting adversarial tampering is to find the variational parameters $θ$ of the discriminator function $Y$ that minimize the expected classification error,
$\underset{θ}{\arg \min} \; E_{x \sim p(x)} \left[ E_{β \sim B, \; m \sim p(m | x), \; m^{'} \sim p(m^{'} | β, x)} \left[ \left| Y_{θ}(m, m^{'}) - β \right| \right] \right],$ (1)
where $p(m^{'} | β, x) = \int p(m^{'} | x^{'}) \, q(x^{'} | x, β) \, d x^{'}$ is the marginal distribution of the posttampering measurements $m^{'}$, given the initial system state $x$ and tampering indicator $β$, which enter the expectation implicitly.
We now apply this definition to distance matrix PUFs.

Figure 3. Machine-learning-assisted authentication is trained by classifying synthetic posttamper measurements as being either adversarially tampered or naturally degraded, indicated by $\hat{β}$ . We use a pretrained segmentation model, along with a labeled clustering algorithm, to compute the distance matrix and radii of the nanoparticles for both samples. Then, the discriminator network is trained by randomly choosing a synthetic tampering type according to the tampering Bernoulli distribution $β \sim B$ .

3.1 Nanoparticles for the PUF-D Problem

The gold nanoparticle positions are uniformly distributed on the substrate, $r_{i} \sim U[0,1]^{2}$, and their radii are normally distributed, $ρ_{i} \sim N(μ_{r}, σ_{r})$, which yields a system state $x = \{r, ρ\}$. Then, a database $M$ of randomly positioned dark-field images is created through dark-field microscopy. Due to the extremely large number of samples taken during dark-field microscopy, the measurement density is highly correlated to the fabrication prior through a narrow Gaussian peak and is approximately localized, $p(m | x) \approx δ(x - \hat{x}(m))$, where $\hat{x}(m) = \{\hat{r}, \hat{ρ}\}$ is our approximation to the true system state $x = \{r, ρ\}$. (Assuming dark-field microscopy is i.i.d. sampling, the law of large numbers dictates that the variance of the averaged measurement shrinks as $σ_{i}^{2} / n$, where $n$ is the number of measurements taken by the dark-field microscope on a single nanoparticle with variance $σ_{i}^{2}$.) Therefore, the problem objective in Eq. (1) can be approximated as
$E_{x \sim p(x)} \left[ E_{β \sim B, \; m^{'} \sim p(m^{'} | β)} \left[ \left| Y_{θ}(m, m^{'}) - β \right| \right] \right],$ (2)
by marginalizing out $x$ and $x^{'}$ from the inner expectations using the delta function. Taking the distance matrices of the inferred system state $\hat{x}$ and the evolved system state $x^{'}$ yields a distance matrix objective function,
$E_{D(x) \sim p(x)} \left[ E_{β \sim B, \; D(\hat{x}(m^{'})) \sim p(m^{'} | β)} \left[ \left| Y_{θ}(D(\hat{x}), D(x^{'})) - β \right| \right] \right],$ (3)
where $Y$ is now defined on the distance matrix space. (As mentioned previously, we assume here that random translations and rotations introduced during imaging are far more likely than two sets of nanoparticles producing the same distance matrix.) This becomes our objective function for constructing RAPTOR. Now, we explicitly consider features of the tampering distribution $q$.

3.2 Adversarial Tampering

During the random tampering step, the system may undergo either natural changes given by $q_{n}$ or adversarial tampering given by $q_{a}$. Thermal fluctuations may occur for both treatments, and they introduce varying degrees of random Gaussian translations of the nanoparticles, i.e., $r^{'} = r + r_{Δ}$ with $r_{Δ} \sim N(0, σ_{Δ})$. However, adversarial tampering introduces Gaussian translations as well as substrate tearing and refilling, as shown in Fig. 4. Adversarial tearing introduces a random cut through the plane, displacing each nanoparticle location $r_{i}$ by a magnitude of $\frac{w}{\sqrt{| r_{i} - α_{i} |}}$, orthogonal to a cut vector $α$ and weighted by a tearing coefficient $w$. As demonstrated in Fig. 4(c), introducing tears alters the average distance, thereby making adversarial tearing detectable by statistical discrimination. In the less ideal case, an adversary will attempt to refill the tear by introducing nanoparticles at a density similar to the fabrication density, to recover features similar to those of natural degradation. As shown in Fig. 4(c), filling the tear makes the average nanoparticle distance indistinguishable from natural degradation noise, apart from a constant offset. Therefore, a discrimination method between the tampering distributions $q_{n}$ and $q_{a}$ based purely on the expected distance is infeasible for small sample sizes under adversarial filling, and discrimination tasks necessitate conditioning on the measurements $M$ and $M^{'}$.
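
The two treatments can be simulated with a few lines of code. The sketch below is illustrative only and rests on stated assumptions: the cut is modeled as an infinite straight line, $| r_{i} - α_{i} |$ is read as the perpendicular distance from particle $i$ to that line, particles are pushed away from the cut along its normal, and a small clamp `eps` (our addition) avoids division by zero for particles lying on the cut. The refilling step is omitted. The function names and parameter values are hypothetical.

```python
import math
import random

def thermal_shift(points, sigma, rng):
    """Add i.i.d. Gaussian displacements of standard deviation sigma to every point."""
    return [(x + rng.gauss(0.0, sigma), y + rng.gauss(0.0, sigma)) for x, y in points]

def tear(points, cut_point, cut_angle, w, eps=1e-3):
    """Push each point away from a straight cut by w / sqrt(perpendicular distance)."""
    # Unit normal to the cut line; the cut itself runs along (cos(angle), sin(angle)).
    nx, ny = -math.sin(cut_angle), math.cos(cut_angle)
    torn = []
    for x, y in points:
        signed = (x - cut_point[0]) * nx + (y - cut_point[1]) * ny
        side = 1.0 if signed >= 0 else -1.0
        shift = w / math.sqrt(max(abs(signed), eps))  # eps clamps the shift near the cut
        torn.append((x + side * shift * nx, y + side * shift * ny))
    return torn

def mean_pairwise_distance(points):
    """Average Euclidean distance over all unordered pairs of points."""
    n = len(points)
    total = sum(math.dist(points[i], points[j]) for i in range(n) for j in range(i + 1, n))
    return total / (n * (n - 1) / 2)

rng = random.Random(0)
particles = [(rng.random(), rng.random()) for _ in range(200)]

# Natural degradation: thermal noise only.
natural = thermal_shift(particles, sigma=0.005, rng=rng)
# Adversarial tampering: thermal noise followed by a vertical tear through the center.
torn = tear(thermal_shift(particles, sigma=0.005, rng=rng),
            cut_point=(0.5, 0.5), cut_angle=math.pi / 2, w=0.05)

mean_natural = mean_pairwise_distance(natural)
mean_torn = mean_pairwise_distance(torn)
print(f"natural: {mean_natural:.3f}, torn without filling: {mean_torn:.3f}")
```

Under these assumptions the unfilled tear raises the mean pairwise distance, which is the effect that makes tearing alone statistically detectable.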

Figure 4. Adversarial tampering is introduced through tearing of the substrate, thereby separating the gold nanoparticles according to their distance from the tear line, and filling the tear with new nanoparticles uniformly distributed in the tear to match the original distribution. The tearing of the substrate is modeled as a random cut that shifts the nanoparticles based on the inverse square root of the perpendicular distance to the cut. (a), (b) The tearing coefficients $w = 0.01$ and $w = 0.05$ demonstrate the increased separation dependent on the tearing coefficient. (c) The normalized expected distance between nanoparticles is plotted for natural degradation, adversarial tearing without filling, and adversarial tearing with filling.

3.3 Distance Matrix Authentication

Three analytical distance metrics are explored for distance matrix authentication: Hausdorff distance, Procrustes distance, and the average Hausdorff distance (AHD). For each of these metrics, the binary classification threshold is determined via logistic regression. If the distance between two matrices is above the logistic threshold, the posttamper matrix is considered too dissimilar to arise from the environment or natural degradation. Otherwise, the matrix is considered to have an acceptable level of natural changes and is therefore authentic.
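
As an illustration of the thresholding step, the sketch below fits a one-dimensional logistic regression to scalar distances by plain gradient descent and reads off the decision threshold where the predicted probability equals 0.5. The distance values and labels are invented for the example, and the function name, learning rate, and step count are arbitrary choices rather than the settings used in this work.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic_threshold(distances, labels, lr=1.0, steps=5000):
    """Fit p(tampered | d) = sigmoid(w * d + b); return (w, b, threshold)."""
    w, b = 0.0, 0.0
    n = len(distances)
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for d, y in zip(distances, labels):
            err = sigmoid(w * d + b) - y  # gradient of the cross-entropy loss
            grad_w += err * d
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b, -b / w  # the decision boundary is where w * d + b = 0

# Made-up metric values: label 0 = natural degradation, 1 = adversarial tampering.
distances = [0.10, 0.20, 0.15, 0.25, 0.80, 0.90, 0.70, 1.00]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

w, b, threshold = fit_logistic_threshold(distances, labels)
predictions = [int(d > threshold) for d in distances]
print(f"threshold = {threshold:.3f}, predictions = {predictions}")
```

A posttamper matrix whose distance to the reference exceeds `threshold` would be flagged as adversarially tampered.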

3.3.1 Hausdorff metric

The Hausdorff distance metric $H$ is the maximum Euclidean distance $d(r_{i}, r_{j}^{'})$ between each point $r_{i}$ and its nearest neighbor $r_{j}^{'}$, as shown in Eq. (4). (Using the distance matrix elements $D$ and $D^{'}$ instead of $r$ and $r^{'}$ does not yield significant differences in results for our purposes.)
$H(r, r^{'}) = \max_{\forall r_{i} \in r} \left[ \min_{\forall r_{j}^{'} \in r^{'}} d(r_{i}, r_{j}^{'}) \right].$ (4)

3.3.2 Procrustes metric

An alignment matrix is a matrix that aligns two sets of multivariate data by transforming one into the other. Procrustes analysis is a statistical method that finds the optimal alignment matrix $A$ that minimizes the sum of squared distances between corresponding points in $A r$ and $r^{'}$, thus accounting for rotational, translational, and scaling discrepancies.40 The Procrustes distance $P$ is then given by the sum
$P(r, r^{'}) = \sum_{r_{i} \in r} d(A r_{i}, r_{i}^{'})^{2}.$ (5)

Ordering and data set size constraints make Procrustes a less reliable method for distance matrix matching. Likewise, finding the optimal alignment matrix is an iterative and time-consuming process compared to Hausdorff.
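
The ordering constraint can be seen in a small example. The sketch below does not use the iterative alignment described above; it swaps in a closed-form least-squares fit that is available for 2D point sets, where a rotation combined with uniform scaling is multiplication by a single complex number. The function names and the test points are hypothetical, and point $i$ in `source` is assumed to correspond to point $i$ in `target`.

```python
import math

def procrustes_distance(source, target):
    """Sum of squared residuals after the best similarity alignment of source onto target.

    Points are (x, y) tuples, and source[i] must correspond to target[i].
    """
    n = len(source)
    z = [complex(x, y) for x, y in source]
    t = [complex(x, y) for x, y in target]
    z_mean, t_mean = sum(z) / n, sum(t) / n
    z = [p - z_mean for p in z]  # centering removes the translation
    t = [p - t_mean for p in t]
    # Least squares over one complex factor a (rotation and scale):
    # a = sum(conj(z_i) * t_i) / sum(|z_i|^2).
    a = sum(p.conjugate() * q for p, q in zip(z, t)) / sum(abs(p) ** 2 for p in z)
    return sum(abs(a * p - q) ** 2 for p, q in zip(z, t))

def similarity_transform(points, angle, scale, shift):
    """Rotate by `angle`, scale uniformly by `scale`, then translate by `shift`."""
    a = scale * complex(math.cos(angle), math.sin(angle))
    moved = []
    for x, y in points:
        w = a * complex(x, y) + complex(*shift)
        moved.append((w.real, w.imag))
    return moved

pre = [(0.10, 0.20), (0.75, 0.30), (0.40, 0.90), (0.55, 0.55)]
post = similarity_transform(pre, angle=0.4, scale=1.3, shift=(0.2, -0.1))

aligned = procrustes_distance(pre, post)           # correspondences preserved
misordered = procrustes_distance(pre, post[::-1])  # same points, scrambled order
print(f"aligned: {aligned:.2e}, misordered: {misordered:.3f}")
```

The same point set yields a near-zero distance when the correspondences are known and a large one when they are scrambled, which is why Procrustes requires a consistent particle ordering across measurements.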

3.3.3 Average Hausdorff distance metric

An average-nearest-neighbor approach offers a more robust solution in practice than the Hausdorff and Procrustes metrics. Rather than simply considering the maximum nearest neighbor, it considers all nearest neighbors and is thus less sensitive to slight changes in any single nanoparticle position.21 The AHD is defined as
$AHD(r, r^{'}) = \frac{1}{| r |} \sum_{\forall r_{i} \in r} \left[ \min_{\forall r_{j}^{'} \in r^{'}} d(r_{i}, r_{j}^{'}) \right].$ (6)
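
A direct transcription of Eqs. (4) and (6) makes the difference in sensitivity concrete. This is a minimal sketch with invented coordinates; the function names are ours. Only one of the four particles moves between the two measurements.

```python
import math

def nearest_distances(r, r_prime):
    """For each point in r, the distance to its nearest neighbor in r_prime."""
    return [min(math.dist(p, q) for q in r_prime) for p in r]

def hausdorff(r, r_prime):
    """Directed Hausdorff distance of Eq. (4): the worst nearest-neighbor distance."""
    return max(nearest_distances(r, r_prime))

def average_hausdorff(r, r_prime):
    """Average Hausdorff distance of Eq. (6): the mean nearest-neighbor distance."""
    d = nearest_distances(r, r_prime)
    return sum(d) / len(d)

pre = [(0.1, 0.1), (0.5, 0.5), (0.9, 0.2), (0.3, 0.8)]
post = [(0.1, 0.1), (0.5, 0.5), (0.9, 0.2), (0.3, 0.6)]  # only the last particle moved

H = hausdorff(pre, post)
AHD = average_hausdorff(pre, post)
print(f"H = {H:.3f}, AHD = {AHD:.3f}")  # H is about 0.2, AHD about 0.05
```

The Hausdorff distance reports the full displacement of the single moved particle, whereas the AHD dilutes it over all four particles.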

Despite the previously reported 100% accuracy of distance matrix verification schemes involving a Hausdorff-inspired metric similar to AHD,20 we demonstrate in Sec. 4.2 that under more difficult adversarial tampering conditions, AHD eclipses both Hausdorff and Procrustes metrics, but is still beaten by RAPTOR.

3.4 RAPTOR

RAPTOR (Fig. 5) takes a more supervised approach to compute the authenticity of a distance matrix. For each nanoparticle $i$, we reweight the posttamper matrix $D^{'}$ by a soft-weight vector $S^{i}$ that indicates the probability that nanoparticle $i$ in the pretamper matrix $D$ is nanoparticle $j$ in the posttamper matrix $D^{'}$ [Fig. 5(a)]. Let $Γ_{i} = [\dots, | ρ_{i} - ρ_{j}^{'} |, \dots]$ be the query row tensor; then for each nanoparticle $i$, we compute the soft weights $S_{j}^{i} = softmax(Γ_{i} / τ_{i})$, where $τ_{i}$ is a variational parameter. Then, we multiply each row $μ$ of the value matrix $D^{'}$ by the soft weight $S_{μ}^{i}$, thereby creating a unique attention distance matrix $A^{i}$ for each nanoparticle $i$, i.e.,
$A_{μ ν}^{i} = S_{μ}^{i} D_{μ ν}^{'}.$ (7)
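
The reweighting in Eq. (7) can be sketched in a few lines. The example below is illustrative rather than the RAPTOR implementation: the radii, the $3 \times 3$ posttamper matrix, and the function names are invented, and $τ$ is fixed by hand instead of being learned. We assume a negative $τ$ here so that a small radius difference produces a large weight; the sign convention of the learned parameter is not specified in the text.

```python
import math

def softmax(values):
    """Softmax with the usual max-subtraction for numerical stability."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def attention_matrix(i, rho, rho_prime, D_prime, tau):
    """Return (S_i, A_i) for pretamper particle i, following Eq. (7)."""
    gamma_i = [abs(rho[i] - rp) for rp in rho_prime]  # query row: radius differences
    S_i = softmax([g / tau for g in gamma_i])         # soft weights over posttamper particles
    A_i = [[S_i[mu] * d for d in D_prime[mu]] for mu in range(len(D_prime))]
    return S_i, A_i

rho = [0.0100, 0.0060]                # pretamper radii (hypothetical)
rho_prime = [0.0061, 0.0099, 0.0030]  # posttamper radii (hypothetical)
D_prime = [                           # posttamper distance matrix (hypothetical)
    [0.00, 0.40, 0.70],
    [0.40, 0.00, 0.50],
    [0.70, 0.50, 0.00],
]

# Assumed sign: tau < 0 favors posttamper particles whose radius is close to rho[0].
S_0, A_0 = attention_matrix(0, rho, rho_prime, D_prime, tau=-0.001)
best_match = max(range(len(S_0)), key=lambda j: S_0[j])
print(f"soft weights: {[round(s, 3) for s in S_0]}, best match: {best_match}")
```

With these values, most of the weight falls on posttamper particle 1, whose radius is closest to that of pretamper particle 0, so row 1 of $D^{'}$ dominates $A^{0}$.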

Figure 5. RAPTOR uses an attention mechanism for prioritizing nanoparticle correlations across pretamper and posttamper samples before passing them into a residual, attention-based deep convolutional classifier. (a) RAPTOR takes the top 56 nanoparticles in descending order of radii to construct the distance matrices $D$ and $D^{'}$ and radii $ρ$ and $ρ^{'}$ from the pretamper and posttamper samples. (b) The radii and distance matrices form the query and value embeddings of an attention mechanism. The attention mechanism is then used alongside the raw distance matrices $D^{'}$ and $D$ , the soft weight matrix, and $L_{2}$ matrix generated from the radii vectors for the classifier. (c) The classifier uses GELU activation and attention layers before applying a kernel layer and max pool layer. Then, the output is flattened into a multilayer perceptron to compute the final classification $\hat{β}$ .

This mechanism suppresses rows of the posttamper matrix $D^{'}$ whose nanoparticles are unlikely to be the same before and after tampering, based on the difference in radii. Then, using the pretamper distance matrix $D$, we compute the probability that nanoparticle $i$ is the same as nanoparticle $j$, defining the matrix elements $B_{i j}$, by first encoding all pairwise rows between both matrices using a 3D ResNet encoder model $f_{θ}(A^{i}, D)$ to compute the element $B_{i j}$ in Fig. 5(b). The feature matrix $B$, along with $Γ$, $D$, $D^{'}$, and $S^{i}$, is concatenated along the channel dimension and fed into the residual attention-based classifier shown in Fig. 5(c). An algorithmic description of RAPTOR is included in Appendix B (Sec. 7.1).

4 Results and Discussion

4.1 Semantic Segmentation

To reliably extract the nanoparticle centers and radii, we employ semantic segmentation networks to separate the image into two classes: nanoparticle and dark-field background. First, we trained the unsupervised semantic segmentation network STEGO to provide ground-truth labels for a data set of 10,000 dark-field images.41 We chose STEGO due to its prominence in the literature in assigning meaningful and high-quality segmentation to unlabeled data. The training data set for STEGO is created by randomly selecting and positioning gold nanoparticles obtained from a data set of 2400 gold nanoparticles extracted from 40 dark-field images. Particle extraction is performed via brightness thresholding at 4% intensity followed by regional clustering, and the result is manually verified for each input image. A minimum pattern radius of $0.5 μ m$ is enforced to discern the particles from noise. From this data set, samples of transformed particles are generated to match the source distribution of 79 particles per image on average ($σ = 20$) and the source dimensions ($1280 \times 960$ pixels at $0.069 μ m$ per pixel), thus creating an augmented data set that is visually indistinguishable from the source images. We injected 4% intensity Gaussian noise to match realistic noise levels from the dark-field image data set. The particle density is uniform across samples, as discussed in Appendix A (Sec. 6.4). We list detailed explanations for choosing the parameter values mentioned in Appendix A (Sec. 6.5).

STEGO is very powerful but slow for simple semantic segmentation. Hence, we train both a lightweight ResNet-based attention convolutional neural network (CNN) and a Gaussian blurring filter to mimic STEGO. Overall, as demonstrated in Table 1, our CNN model and Gaussian filter achieve binary cross-entropy (BCE) losses of $10^{- 3}$ and 0.56 and process 1000 images in 27 and 33 ms, respectively, on a T4 GPU, as opposed to 24 min for 1000 images using STEGO. After computing the semantic segmentation labels, all images are fed into a labeled clustering algorithm that extracts the centers of mass and radii for 1000 images in 250 ms.
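
The labeled clustering step can be sketched as a connected-component search over the binary segmentation mask. The code below is a minimal stand-in under our own assumptions, not the algorithm used in this work: it groups 4-connected foreground pixels with a breadth-first search, takes the center of mass of each group, and assigns an equivalent radius $\sqrt{\text{area} / π}$ in pixel units. The function name and the toy mask are hypothetical.

```python
import math
from collections import deque

def label_clusters(mask):
    """Group 4-connected foreground pixels; return a list of (center_row, center_col, radius)."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    particles = []
    for r0 in range(rows):
        for c0 in range(cols):
            if not mask[r0][c0] or seen[r0][c0]:
                continue
            # Breadth-first search collects every pixel of this particle pattern.
            queue, pixels = deque([(r0, c0)]), []
            seen[r0][c0] = True
            while queue:
                r, c = queue.popleft()
                pixels.append((r, c))
                for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                    if 0 <= nr < rows and 0 <= nc < cols and mask[nr][nc] and not seen[nr][nc]:
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            area = len(pixels)
            center_r = sum(p[0] for p in pixels) / area
            center_c = sum(p[1] for p in pixels) / area
            # Radius of the disk with the same area as the pixel cluster.
            particles.append((center_r, center_c, math.sqrt(area / math.pi)))
    return particles

# Toy segmentation mask: 1 = nanoparticle pattern, 0 = dark-field background.
mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 1],
]
particles = label_clusters(mask)
for center_r, center_c, radius in particles:
    print(f"center = ({center_r:.1f}, {center_c:.1f}), radius = {radius:.2f} px")
```

Multiplying the pixel coordinates and radii by the image scale (for example, micrometers per pixel) would convert them to physical units before building the distance matrix.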

Table 1. Overall performance comparison of each method for distance matrix extraction and discrimination tasks. For all results in the table, a 1000-sample tensor was loaded onto an NVIDIA T4 GPU (except Procrustes, which used all CPU RAM) and batched at maximum capacity for the particular model. Accuracy is measured by the number of correct pixels or authentication classifications over the total. For semantic segmentation, we include the BCE loss to show a marginal advantage in using ResNet over Gaussian blur. The computation time is measured by preloading all data onto an NVIDIA T4 GPU or CPU RAM before recording the start time.

Task	Method	Average accuracy	Computation time
Distance matrix extraction	STEGO	100% (ground truth)	24 min for 1000 images
Distance matrix extraction	ResNet attention CNN	99% (BCE loss $10^{- 3}$)	27 ms for 1000 images
Distance matrix extraction	Gaussian blur	99% (BCE loss 0.56)	33 ms for 1000 images
Discrimination	RAPTOR	97.6%	80 ms for 1000 matrices
Discrimination	AHD	91.2%	13.5 ms for 1000 matrices
Discrimination	Hausdorff	54.9%	22.9 ms for 1000 matrices
Discrimination	Procrustes	58.2%	3.30 s for 1000 matrices

4.2 Tampering Discrimination

The tampering data set is generated synthetically at run time, separately from the semantic segmentation step. A substrate of size $2 \times 2$ is filled uniformly with a nanoparticle density of 100 per unit square, and the radii are sampled i.i.d. from a normal distribution, $ρ_{i} \sim N(μ_{ρ} = 0.006, σ_{ρ} = 0.004)$. Natural degradation is introduced through a simple displacement of nanoparticles by $0.05 \cdot r_{Δ}$ using the random variables $r_{Δ}^{x, i}, r_{Δ}^{y, i} \sim N(μ_{n} = 0, σ_{n} = 1)$. For adversarial tampering, a tampering configuration is chosen at random using the following scheme. For adversarial displacement noise, we multiply the noise random variable by a random coefficient, i.e., $c_{a} \cdot r_{Δ}$, where $c_{a} \in \{0.035, 0.04, 0.05, \dots, 0.1\}$ is chosen uniformly. The tear coefficient $w \in \{0.01, 0.03, 0.05\}$ is also chosen uniformly. Tampering data are generated under harsher conditions than the expected imaging conditions to show robustness. Note that the tampered data are produced in the same manner as training data, with an additional tampering step. Finally, to test the imaging robustness, we randomly decide whether to rotate all nanoparticles about the center by a uniformly chosen angle. We also apply a constant translation in a uniformly random direction with translation coefficients in $\{0, 0.01, \dots, 0.12\}$. After applying the randomly chosen tampering configuration, all nanoparticles within the center unit square are sorted in descending order of radii, and their associated distance matrix and radii are extracted for authentication. RAPTOR is trained to discriminate tampering under eight different noise levels, causing random particle movements of up to 10% of the image width, from a pessimistic 5% natural degradation level. The adversarial filling is performed under worst-case conditions in which filling precisely matches the perforation boundaries while matching the initial particle density.
RAPTOR is trained in batches of 100 images, using the 56 particle patterns with the largest radii in each image, with a learning rate of 0.01. During training, RAPTOR is compared to the analytical methods: Hausdorff, Procrustes, and AHD. For all analytical methods, the output distance metric is fit to a logistic regression model for determining authenticity.
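
The sampling procedure for a single naturally degraded pair can be sketched as follows. This is a simplified reconstruction from the description above rather than the training code: it covers only the fabrication prior, the natural degradation step, and the top-56 signature extraction, and it leaves out tearing, filling, rotation, and translation. The function names and the random seed are our own, and the sampled radii are used as drawn, without clipping.

```python
import math
import random

def make_sample(rng, side=2.0, density=100, mu_rho=0.006, sigma_rho=0.004):
    """Draw uniform positions on a side x side substrate and normally distributed radii."""
    n = int(density * side * side)
    positions = [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n)]
    radii = [rng.gauss(mu_rho, sigma_rho) for _ in range(n)]
    return positions, radii

def degrade(positions, coeff, rng):
    """Displace every particle by coeff * N(0, 1) in x and in y."""
    return [(x + coeff * rng.gauss(0, 1), y + coeff * rng.gauss(0, 1)) for x, y in positions]

def extract_signature(positions, radii, top_k=56, side=2.0):
    """Keep the top_k largest-radius particles in the center unit square; return (D, rho)."""
    lo, hi = side / 2 - 0.5, side / 2 + 0.5
    inside = [(rho, p) for rho, p in zip(radii, positions)
              if lo <= p[0] <= hi and lo <= p[1] <= hi]
    inside.sort(key=lambda item: item[0], reverse=True)  # descending order of radii
    kept = inside[:top_k]
    rho = [item[0] for item in kept]
    pts = [item[1] for item in kept]
    D = [[math.dist(p, q) for q in pts] for p in pts]
    return D, rho

rng = random.Random(42)
positions, radii = make_sample(rng)

# Pretamper signature, then a naturally degraded (beta = 0) posttamper signature.
D, rho = extract_signature(positions, radii)
D_post, rho_post = extract_signature(degrade(positions, coeff=0.05, rng=rng), radii)
print(len(positions), len(D), len(D_post))
```

Because particles near the boundary of the center square can drift in or out, the pretamper and posttamper signatures need not contain exactly the same particles, which is one reason the radii are used to match particles across measurements.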

Table 1 shows the average accuracy and computation times of RAPTOR alongside the analytical methods. RAPTOR has the highest average accuracy, correctly detecting tampering in 97.6% of distance matrices under worst-case-scenario tampering assumptions and exceeding the performance of the Hausdorff, Procrustes, and AHD methods by 40.6%, 37.3%, and 6.4%, respectively. The AHD has the fastest computation time in discrimination tasks and the highest accuracy among the three analytical methods.

5 Conclusion

In this work, we demonstrate the robustness of RAPTOR, a new method for the authentication of semiconductor devices that uses random pattern arrays of gold nanoparticles as distance-matrix-based optical PUFs. The arrays are imaged using dark-field microscopy, and the positions and radii of individual particle patterns are extracted using semantic segmentation and labeled clustering. We introduce difficult, yet realistic, adversarial tampering features through tearing and substrate refilling, or natural deviations through thermal noise with varying levels of substrate heating. We demonstrate that RAPTOR achieves a tampering detection accuracy of 97.6%, greatly outperforming the Hausdorff, Procrustes, and AHD distance metrics by 40.6%, 37.3%, and 6.4%, respectively. These results indicate that RAPTOR significantly outperforms known classical distance matrix metric methods in accuracy for authenticating PUFs built on random arrays of gold nanoparticles, while still processing 1000 distance matrices in 80 ms (Table 1).

The ease of fabrication of gold nanoparticles, along with rapid and robust tampering detection with RAPTOR, opens up a large opportunity for the adoption of machine-learning-based tampering detection schemes in the semiconductor industry. However, more work is required in material development to ensure that these methods are robust to unforeseen types of tampering and natural degradation. Furthermore, hyperparameter optimization and alternative deep networks may improve the speed or accuracy of RAPTOR. While our scheme greatly improves on the core bottlenecks found in these verification schemes, future work could consider the computation of the distance matrices directly without labeled clustering, or a full end-to-end network that does not use semantic segmentation as an intermediate step in the verification process.

6 Appendix A: PUFs and Data Set

6.1 Nanoparticle PUFs Fabrication

A diluted nanoparticle suspension ( $1 μ L$ ) of 75 nm Au ( $1 μ L$ ) (nanoComposix, Inc.) in deionized (DI) nanopure water (2 mL) is drop cast onto the precleaned silicon substrate, which is prepared by standard solvent cleaning [placed substrate within toluene, acetone, and iso-propyl alcohol (IPA) in three separate steps, with 5 min sonication at each step] and piranha cleaning [placed substrate in 3:1 volume ratio concentrated sulfuric acid ( $H_{2} {SO}_{4}$ ) and hydrogen peroxide ( $H_{2} O_{2}$ ) for 15 min] in a controlled cleanroom environment. Then, the sample is placed horizontally to let the liquid evaporate naturally to leave the gold nanoparticle pattern on the substrate.

6.2 Optical Imaging

The dark-field optical imaging system consists of a Keyence VHX-6000 digital microscope with a high-brightness LED light source, a 1/1.8-in. CMOS image sensor with virtual pixels 1600 ( $H$ ) × 1200 ( $V$ ) maximum, a ZS-200 RZ×200-×2000 objective lens with a fine adjustment for working distance, and a color LCD monitor with 16,770,000 colors and a 1000:1 contrast ratio. The dark-field images are taken at 1500× magnification to form the training data set for semantic segmentation and to verify the uniformity of the PUF fabrication prior.

6.3 Synthetic Dark-Field Image Dataset Generation and Segmentation

We built a data set of 10,000 images by augmenting 40 dark-field images. Over 2400 nanoparticle bounding boxes are extracted from 40 source images via connectivity-based clustering of thresholded image segments. Augmented images are generated by randomly placing nanoparticles from the set of bounding boxes in uniformly distributed positions. To ensure maximal variability in the augmented data set, we apply random rotation, shear, and additive noise transformations to each particle before placement. Due to the resolution of the dark-field microscope, we only consider nanoparticle scattering patterns with radii greater than $0.5 μ m$ , as any smaller patterns cannot be verified to be gold nanoparticles. Gaussian noise is injected into the background to further mimic the original images, effectively reintroducing nanoparticles with average radii less than $0.5 μ m$ to the augmented data set.

A ResNet-based convolutional neural network and a Gaussian filter are demonstrated to accurately segment 1000 dark-field images in only 27 and 33 ms, respectively. Each of these methods achieves 99% segmentation accuracy, greatly outperforming the classical methods and the ground truth unsupervised segmentation network STEGO in speed with negligible error in accuracy. (It takes 24 min for STEGO to segment 1000 images.) These segmented images are postprocessed for reliable position and radii extraction using labeled clustering.

6.4 Uniformity of PUFs

For points distributed uniformly over a normalized (unit) square, the expected distance between any two points is given exactly by42 $\frac{1}{15} [2 + \sqrt{2} + 5 \log (1 + \sqrt{2})] \approx 0.521405$ . To test the uniformity of the nanoparticle placements, we took 40 dark-field images of randomly embedded nanoparticles on the substrate and measured the mean distance between any two nanoparticles to be 0.521318, a relative error of 0.017%.
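As a quick numerical illustration of this uniformity check, the sketch below evaluates the closed-form value and compares it with the mean pairwise distance of synthetic points drawn uniformly in the unit square. The function names, the sample size of 1000 points, and the random seed are illustrative assumptions, not part of the original measurement.

```python
import math
import random


def closed_form_mean_distance() -> float:
    """Expected distance between two uniform random points in the unit square."""
    return (2 + math.sqrt(2) + 5 * math.log(1 + math.sqrt(2))) / 15


def mean_pairwise_distance(points) -> float:
    """Average Euclidean distance over all unordered pairs of points."""
    total, count = 0.0, 0
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            total += math.dist(points[i], points[j])
            count += 1
    return total / count


# Synthetic stand-in for extracted particle positions (assumption: 1000 points).
rng = random.Random(0)
points = [(rng.random(), rng.random()) for _ in range(1000)]

exact = closed_form_mean_distance()
estimate = mean_pairwise_distance(points)
relative_error = abs(estimate - exact) / exact

# Relative error (in percent) of the measured value quoted in the text.
reported_error_percent = 100 * abs(0.521318 - exact) / exact
```

A mean pairwise distance close to the closed-form value is consistent with uniform placement, although it does not by itself prove uniformity.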

6.5 Parameters Choices

Our study provides a research-oriented example to demonstrate a comprehensive feasibility study. Forming an optimal or adaptive threshold for the following parameters may require additional study with auxiliary training and analysis, especially for industry-level systems.

•2400 gold particles: The dark-field image data set must be augmented to contain maximally varied nanoparticles resembling a wide variety of real-life conditions. In addition, for a sufficiently large number of noninteracting scatterers, statistical or average properties can be applied reliably, as in statistical mechanics and condensed matter physics.43 To this end, we sample from 2400 nanoparticles that were extracted from an original data set of 20 dark-field images. Extracted nanoparticles were additionally transformed (rotations and shear transformations) to maximize the diversity of segmentation shapes. In our experiments, we found this level of variety sufficient to demonstrate the dexterity of the tested segmentation techniques.
•4% intensity brightness threshold: The original data set nanoparticle extraction was manually verified. A 4% brightness magnitude threshold was chosen for our specific imaging procedure. As stated above, an optimal or adaptive threshold may require additional study. For STEGO and attention CNN segmentation methods, brightness thresholding is not used. For Gaussian blur-based segmentation, a brightness threshold can be manually chosen to match imaging conditions or optimized to match the semantics of the former methods.
•Minimum pattern radius of $0.5 μ m$ : The $0.5 μ m$ minimal radius was enforced during the original data set creation to discern the particles from noise, since it reflects the typical distribution of gold nanoparticle scattering pattern radii observed during sample fabrication and optical characterization of the dark-field images. Here, we assume that particles are noninteracting. Otherwise, the scattering pattern may reach substantially larger radii. During verification, this minimal radius would be implicitly learned and optimized by the chosen segmentation method.
•79 particles per image ( $σ = 20$ ) and source dimensions ( $1280 \times 960$ pixels at $0.069 μ m$ /pixel): Particle density is a function of molecular interaction of gold nanoparticles as well as other fabrication parameters and is chosen to reflect densities seen in the original dark-field images (this density is uniform and consistent across samples, as described in Sec. 6.4). Image dimensions are arbitrary with respect to segmentation and are chosen simply to reflect typical imaging parameters.
• $2 \times 2$ size substrate filled with a nanoparticle density of 100 per unit square: A $2 \times 2$ frame was filled with nanoparticles so that a randomly placed $1 \times 1$ canvas of nanoparticles could be “imaged” out of a larger set. This approach simulated framing imprecision in real-world substrate imaging and allowed us to determine which methods were robust against that translational framing error. Nanoparticle density is relevant to tamper detection, since the number of nanoparticles within a unit frame determines the amount of information available to discrimination algorithms. We chose 100 to match dark-field image nanoparticle density upon sampling of a $960 \times 960$ pixels square subset from a $1280 \times 960$ pixels image.
•Natural degradation is introduced through a simple displacement of nanoparticles by a factor of 0.05: To mimic extreme physical tampering behavior, we chose to translate particles by up to 5% of the image width, reflecting a worse-than-expected scenario of PUF degradation. However, this number could be changed depending on the real-life packaging degradation measured for a particular packaging type.
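The last two parameter choices can be illustrated with a minimal sketch of the synthetic setup: a $2 \times 2$ substrate filled at 100 particles per unit square, a randomly placed $1 \times 1$ frame "imaged" out of it, and a displacement of up to 0.05 applied to each particle. The function names are hypothetical, and drawing each coordinate shift uniformly from $[-0.05, 0.05]$ is an assumption, since the exact displacement distribution is not specified here.

```python
import random


def make_substrate(rng, side=2.0, density=100):
    """Place density * side^2 particles uniformly on a side x side substrate."""
    n = int(density * side * side)
    return [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n)]


def crop_unit_frame(points, rng, side=2.0):
    """Image a randomly placed 1 x 1 frame, returning frame-relative positions."""
    x0 = rng.uniform(0, side - 1)
    y0 = rng.uniform(0, side - 1)
    return [
        (x - x0, y - y0)
        for x, y in points
        if x0 <= x < x0 + 1 and y0 <= y < y0 + 1
    ]


def degrade(points, rng, max_shift=0.05):
    """Assumed degradation model: shift each coordinate by at most max_shift."""
    return [
        (x + rng.uniform(-max_shift, max_shift),
         y + rng.uniform(-max_shift, max_shift))
        for x, y in points
    ]


rng = random.Random(42)
substrate = make_substrate(rng)        # 400 particles on the 2 x 2 substrate
frame = crop_unit_frame(substrate, rng)  # roughly 100 particles on average
degraded = degrade(frame, rng)         # same particles after displacement
```

Cropping a second frame at a different random offset from the same substrate would reproduce the translational framing error described above.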

7 Appendix B: Authentication Methods

7.1 RAPTOR Algorithmic Overview

Inputs:

•Pre-/posttamper nanoparticle distance matrices: $D$ , $D^{'}$ ( $k \times k$ tensors)
•Pre-/posttamper nanoparticle radii: $ρ$ , $ρ^{'}$ ( $k \times 1$ vectors)

RAPTOR:

• $L_{2}$ ← $L_{2}$ normalization of Euclidean distances between elements of particle radii vectors $ρ$ , $ρ^{'}$
•Soft weights ← Softmax of $L_{2}$ matrix divided by a trained parameter.
•Attention matrix: $A$ ← $k \times k$ attention matrices for all nanoparticles encoding predicted particle correspondence between pre-/posttamper systems
•ResNet encoded particle correspondence: $B$ ← trained ResNet( $A$ , $D$ )
•ResNet classifier: residual/attention blocks and a fully connected layer

Outputs:

•Likelihood of adversarial tampering during transit: $\hat{B}$
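The first steps of this overview (radius distances, soft weights, and attention matrix) can be sketched as follows. This is an illustration under stated assumptions and not the trained RAPTOR network: the Frobenius normalization, the negative sign inside the softmax (so that similar radii receive larger weights), the row-wise softmax, and the fixed `temperature` standing in for the trained parameter are all assumptions, and the ResNet stages are omitted.

```python
import math


def radius_distance_matrix(rho, rho_prime):
    """k x k matrix of |rho_i - rho'_j|, scaled to unit Frobenius (L2) norm."""
    dist = [[abs(a - b) for b in rho_prime] for a in rho]
    norm = math.sqrt(sum(v * v for row in dist for v in row)) or 1.0
    return [[v / norm for v in row] for row in dist]


def soft_attention(dist, temperature=0.05):
    """Row-wise softmax of -dist / temperature.

    `temperature` is a fixed placeholder for the trained parameter (assumption).
    """
    attention = []
    for row in dist:
        logits = [-v / temperature for v in row]
        peak = max(logits)  # subtract the maximum for numerical stability
        exps = [math.exp(l - peak) for l in logits]
        total = sum(exps)
        attention.append([e / total for e in exps])
    return attention


# Toy example: posttamper radii are a shuffled, slightly perturbed copy.
rho = [0.50, 0.80, 1.20]
rho_prime = [1.19, 0.52, 0.79]

A = soft_attention(radius_distance_matrix(rho, rho_prime))

# Most likely posttamper partner for each pretamper particle.
best_match = [row.index(max(row)) for row in A]
```

In this toy case each row of `A` peaks at the particle with the closest radius, which is the kind of predicted correspondence that the attention matrix is described as encoding.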

7.2 Analytical Methods

We introduce statistical authentication methods using Hausdorff, Procrustes, and AHD metrics and benchmark their performance in authenticating distance matrices extracted from dark-field images. All learning is performed in the same Jupyter environment on an NVIDIA T4 GPU with 16 GB of GPU RAM and an Intel(R) Xeon(R) CPU running at 2.30 GHz with 12.7 GB of system RAM. Each discrimination model is trained for 5000 epochs with a mini-batch of 100 random graph instances with random tampering, as discussed in Sec. 4.2. Training graphs are randomly generated at training time to prevent overfitting. Our validation step measures the average accuracy across the most recent 500 epochs. Reported accuracy is the maximum accuracy achieved by each discrimination method during the validation step.
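For reference, the Hausdorff and average-Hausdorff distances named above can be sketched on two small point sets as follows. This is a minimal illustration, not the benchmark code: it operates on point coordinates rather than on the extracted distance matrices, and the AHD is taken as the mean of the two directed average distances, which is one of several definitions in use and is an assumption here.

```python
import math


def directed_hausdorff(a, b):
    """Largest distance from a point in a to its nearest neighbor in b."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance between point sets a and b."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def average_hausdorff(a, b):
    """Mean of the two directed average nearest-neighbor distances (assumed form)."""
    a_to_b = sum(min(math.dist(p, q) for q in b) for p in a) / len(a)
    b_to_a = sum(min(math.dist(p, q) for p in a) for q in b) / len(b)
    return (a_to_b + b_to_a) / 2


# Toy point sets: one particle has moved from (0, 1) to (0, 3).
before = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
after = [(0.0, 0.0), (1.0, 0.0), (0.0, 3.0)]

hd = hausdorff(before, after)           # dominated by the single moved particle
ahd = average_hausdorff(before, after)  # averages over all particles
```

The example shows why the two metrics behave differently: the Hausdorff distance is set entirely by the worst-matched particle, whereas the AHD dilutes a single outlier across the whole set.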

7.3 Alternative Deep-Learning Networks

To compare against other deep-learning methods, we fed the same data used for RAPTOR into different networks. We tried deep feed-forward multilayer perceptron networks, Siamese graph encoder networks, and deep residual convolutional layers. However, these were not able to consistently outperform the AHD, achieving accuracies below 70%. We also attempted to use the AHD metric as a resource for these networks, but the networks relied too heavily on the metric and converged to the same performance, with only minimal improvements and still below RAPTOR.

Blake A. Wilson earned his PhD at Purdue University in Electrical and Computer Engineering. He is now a Research Scientist at Quantinuum, UK, working on generative AI, categorical machine learning, and quantum algorithms.

Yuheng Chen is a third-year PhD student at the Elmore Family School of Electrical and Computer Engineering, Purdue University. His research focuses on the meeting point of AI, physics, and nanodevices, including AI-driven inverse design in photonic/quantum devices, generative machine learning model application exploration, and photonic/quantum devices electromagnetic simulation.

Daksh Kumar Singh is an undergraduate research assistant pursuing an integrated bachelor's and master's degree in electrical and computer engineering at Purdue University. He is currently focused on enhancing nanofabrication, characterization, and data analysis techniques through quantum algorithms and machine learning.

Rohan Ojha is an undergraduate electrical engineering student at Purdue University, specializing in microelectronics/semiconductors and quantum technology. At Purdue’s Quantum Science and Engineering Institute, he researches machine learning applications in photonics. He interned at Sandia National Laboratories working in quantum error correction. He plans to pursue a PhD in quantum technology.

Jaxon Pottle: Biography is not available.

Michael Bezick is a rising junior undergraduate research assistant in computer science at Purdue University, with a passion for machine learning. He focuses on applications of generative models, such as variational autoencoders and diffusion models, to nanophotonic optimization problems. He plans to pursue a PhD in machine learning to contribute to the advancement of the field and further apply himself in industry post-graduation.

Alexandra Boltasseva received her PhD from the Technical University of Denmark and is currently the Ron and Dotty Garvin Tonjes Distinguished Professor of Electrical and Computer Engineering at Purdue University where she specializes in nanophotonics, optical metamaterials, and quantum photonics. As Purdue’s Discovery Park fellow, she leads the university-wide multidisciplinary Big Idea Challenge program in quantum information science and technology/security/health. She was editor-in-chief of the Optical Society of America’s Optical Materials Express journal.

Vladimir M. Shalaev, scientific director for nanophotonics at Birck Nanotechnology Center and distinguished professor of electrical and computer engineering at Purdue University, specializes in nanophotonics, plasmonics, optical metamaterials, and quantum photonics. He has received numerous awards, including the APS Frank Isakson Prize and the Max Born Award. He is recognized as a highly cited researcher in physics by the Web of Science 2017–2023. He is a fellow of the IEEE, APS, SPIE, MRS, and Optica.

Alexander V. Kildishev is renowned for his groundbreaking work in optical metamaterials and transformation optics that spans theoretical concepts, advanced numerical modeling, and experimental guidance. His research has enabled superlenses, hyperlenses, and optical black holes. His recent work focuses on advanced multiphysics modeling in nonlinear optics and AI-driven inverse design in photonics. Among other awards, he was listed as a highly cited researcher by the Web of Science in 2018, 2022, and 2023.

Category: Research Articles

Received: Apr. 1, 2024

Accepted: Jun. 13, 2024

Posted: Jun. 14, 2024

Published Online: Jul. 19, 2024

The Author Email: Alexander V. Kildishev (kildisha@purdue.edu)

DOI:10.1117/1.AP.6.5.056002

CSTR:32187.14.1.AP.6.5.056002