Documenting ML Experiments in HELIPORT

Authors

  • David Pape Helmholtz-Zentrum Dresden - Rossendorf (HZDR) https://orcid.org/0000-0002-3145-9880
  • Oliver Knodel Helmholtz-Zentrum Dresden - Rossendorf (HZDR)
  • Sebastian Starke Helmholtz-Zentrum Dresden - Rossendorf (HZDR)

DOI:

https://doi.org/10.14279/eceasst.v83.2597

Keywords:

data management, research software engineering, machine learning, metadata, ontologies

Abstract

Machine learning practitioners use a variety of tools to track their experiments. These tools have in common that they are only concerned with the machine learning aspect of the experiment: They may track model parameters or performance metrics of the model, but provenance of the training data or scientific outcomes produced with the trained model are largely overlooked. This is a drawback especially when it comes to experiments where machine learning meets scientific experiments and traditional simulations. In this contribution we present an initial evaluation on improving documentation of such machine learning experiments using our data management guidance system HELIPORT. We also explore existing experiment tracking tools and metadata schemas for ML experiments in the process and discuss their suitability for integration with HELIPORT.

Downloads

Published

2025-02-21

How to Cite

[1]
D. Pape, O. Knodel, and S. Starke, “Documenting ML Experiments in HELIPORT”, ECEASST, vol. 83, Feb. 2025.