Content-Based Image Retrieval has a lot of applications in the industry, where large collections of data from manufacturing need to be automatically queried e.g. for quality inspection purposes. In this work we design an image retrieval solution over IMAGO, a dataset of Transmission Electron Microscopy (TEM) images of nano-sized silicon structures collected in the production site of STMicroelectronics, in Agrate Brianza, Italy. Image retrieval in imago is challenging because: i) only a limited portion of images are provided with labels, namely type of semiconductor structure, ii) most images refer to unseen classes that are not represented in the training set, and iii) images of the same class can be acquired at different magnification levels of the electronic microscope. Our main contribution is the design of a deep-learning based image retrieval system that leverages a training procedure that alternates between siamese loss, assessed on annotated samples, and reconstruction loss, assessed on unlabelled samples. Our solution exploits the whole information in the IMAGO dataset, and our experiments confirm we can successfully retrieve images of unseen classes that exhibit the same structure of the query ones. Our solution is currently deployed in STMicroelectronics production sites.

Image Retrieval in Semiconductor Manufacturing

Carrera D.;Boracchi G.
2023-01-01

Abstract

Content-Based Image Retrieval has a lot of applications in the industry, where large collections of data from manufacturing need to be automatically queried e.g. for quality inspection purposes. In this work we design an image retrieval solution over IMAGO, a dataset of Transmission Electron Microscopy (TEM) images of nano-sized silicon structures collected in the production site of STMicroelectronics, in Agrate Brianza, Italy. Image retrieval in imago is challenging because: i) only a limited portion of images are provided with labels, namely type of semiconductor structure, ii) most images refer to unseen classes that are not represented in the training set, and iii) images of the same class can be acquired at different magnification levels of the electronic microscope. Our main contribution is the design of a deep-learning based image retrieval system that leverages a training procedure that alternates between siamese loss, assessed on annotated samples, and reconstruction loss, assessed on unlabelled samples. Our solution exploits the whole information in the IMAGO dataset, and our experiments confirm we can successfully retrieve images of unseen classes that exhibit the same structure of the query ones. Our solution is currently deployed in STMicroelectronics production sites.
2023
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
9783031431470
9783031431487
Autoencoders
Content-Based Image Retrieval
Siamese Networks
File in questo prodotto:
File Dimensione Formato  
2023_01_ICIAP_IMAGO (2).pdf

Accesso riservato

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 7.03 MB
Formato Adobe PDF
7.03 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1261017
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact