CIC10: Master Project on Data Augmentation Techniques for Spectroscopic Data Using Physics-Aware and Generative Learning Approaches

 

The Nanoscience Cooperative Research Center, CIC nanoGUNE, located in Donostia / San Sebastian, Basque Country (Spain), is currently looking for a

 

Master Student

 

to work on

 

Data Augmentation Techniques for Spectroscopic Data Using Physics-Aware and Generative Learning Approaches 

 

NanoGUNE is a research center devoted to conducting world-class nanoscience research for a competitive growth of the Basque Country. NanoGUNE is a member of the Basque Research and Technology Alliance (BRTA) and is recognized by the Spanish Research Agency as a María de Maeztu Unit of Excellence.

 

The position is offered in the Nanoengineering group, led by Andreas Seifert. More information can be found at https://www.nanogune.eu/en/research/groups/nanoengineering

 

Project Description

Raman spectroscopy is widely used for chemical and material characterization across fields such as biomedical diagnostics, environmental monitoring, and material science. Machine learning methods are increasingly applied to automate and better understand the Raman spectra; however, the development of prediction models is often limited by the scarcity of available samples, especially in medical diagnostics, and variabilities introduced by noise, baseline drifts, and instrument-dependent distortions.
The focus of the proposed master's thesis is on the development and evaluation of advanced data augmentation strategies for Raman spectral data, but also data from other spectroscopic techniques, to improve the robustness and generalization capability of machine learning models. The
project will explore both physics-inspired augmentations (e.g., peak shifting, intensity scaling, baseline variations, and noise modeling) and modern generative learning approaches for synthesizing realistic spectral data.
Emphasis will be placed on novel or emerging techniques such as diffusion-based generative models, self-supervised augmentation learning, and physics-constrained generative frameworks that preserve chemically meaningful spectral features. The student will design and implement augmentation pipelines and evaluate their effectiveness in improving downstream tasks such as spectral classification, regression, or anomaly detection.
The project combines spectroscopy, machine learning, and signal processing, and aims to contribute to the development of robust data augmentation techniques for spectral datasets, which remain relatively underexplored compared to image and text domains.

 

 

Candidates should apply by following the instructions of the general call and by completing the form below and attaching the following documents:

  1. A complete CV 
  2. Academic record and cover letter grouped in a single PDF file

 

The deadline for applications is 06/04/2026.

 


 

 

NOTES:

(i) All applicants will receive an answer after the end of the selection process; but please note that due to the large number of submissions that are expected, we cannot provide individual feedback.

(ii) Additional information about nanoGUNE's commitment towards HR excellence in Research and Gender Equality are available on our website.

(iii) We encourage you to subscribe to our HR mailing list to receive information related to nanoGUNE's open positions and open calls for different training and talent attraction programs.

Apply for the position