Matthew Feickert’s 2025 URSSI Early-Career Fellowship Report

1 Project Overview

As part of his fellowship research, inaugural US Research Software Sustainability Institute (URSSI) Early-Career Fellow Matthew Feickert developed open source educational material for a workshop series on creating reproducible software environments for scientific and artificial intelligence and machine learning (AI/ML) applications. The project, “Reproducible Machine Learning Workflows for Scientists”, focused on using the open source tool Pixi to create fully reproducible research software environments with specialized hardware accelerator support for the NVIDIA CUDA parallel computing platform. Pixi’s technologies allowed for digest-level reproduction of “locked” software environments, built from conda and Python packages, that were solved for multiple computing platforms and were portable to different host machines. In addition to performing research on creating robust, reproducible, hardware accelerated workflows with Pixi, Feickert organized and taught a pilot workshop at the University of Wisconsin–Madison, a tutorial at the 2025 SciPy conference, and a national-level workshop at the University of Wisconsin–Madison.
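To make the idea of “digest-level” reproduction concrete, the following is a minimal sketch of the general principle: a file is accepted only if its cryptographic digest matches a previously recorded value. It is not Pixi’s implementation, and the function names `sha256_digest` and `matches_pinned_digest` are hypothetical helpers introduced here for illustration.

```python
import hashlib
from pathlib import Path


def sha256_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large package archives are not
        # loaded into memory all at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_pinned_digest(path: Path, expected_sha256: str) -> bool:
    """Check a file against a digest recorded earlier (e.g. in a lock file)."""
    return sha256_digest(path) == expected_sha256.lower()
```

Under this assumption, two machines that install only files whose digests match the recorded values end up with byte-identical packages, which is the property that a locked environment relies on.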

2 Project Outcomes

The main deliverable of the project was to create permissively licensed open source educational material and executable examples for real scientific software applications with a focus on AI/ML. The educational material was also scoped to be contributed as a lesson module to The Carpentries Incubator. The workshop materials were proposed to The Carpentries Incubator as a contribution at the project onset, and all materials were developed under the carpentries-incubator GitHub organization at https://github.com/carpentries-incubator/reproducible-ml-workflows. The produced material was piloted at an initial workshop for University of Wisconsin–Madison students and staff in June, 2025. Feedback received on the pilot workshop’s most critical points was incorporated into a condensed four-hour tutorial taught at the SciPy 2025 conference with the lead developer of Pixi, Ruben Arts, and NVIDIA principal engineer John Kirkham, who led the technical work for the distribution of the CUDA software stack as conda packages. Both of these events informed the content for the national-level workshop in August, 2025, with 44 participants from 11 universities, national laboratories, organizations, and companies across the United States.

In all workshops, most participants had little to no experience with Pixi, and limited or no experience constructing software environments containing CUDA accelerated software packages. As a result, the workshop participants were effectively introduced to Pixi on the first day of the workshop. By the last day of the workshop, participants were able to successfully deploy CUDA accelerated PyTorch machine learning workflows to remote GPUs on high-throughput computing (HTC) facilities, using software environments that they had all individually constructed. This represents a rapid technological adoption and deployment process that would have historically been considered too technically difficult to achieve for beginners.
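As an illustration of the kind of check a participant might run after deploying an environment to a GPU worker node, the following is a minimal sketch that reports whether PyTorch can be imported and whether it can see a CUDA device. The function name `describe_accelerator` is a hypothetical helper and is not taken from the workshop materials.

```python
def describe_accelerator() -> dict:
    """Report whether PyTorch is importable and whether CUDA GPUs are visible."""
    report = {
        "torch_installed": False,
        "cuda_available": False,
        "device_names": [],
    }
    try:
        import torch  # only present if the environment provides PyTorch
    except ImportError:
        return report

    report["torch_installed"] = True
    report["cuda_available"] = bool(torch.cuda.is_available())
    if report["cuda_available"]:
        # List the name of every GPU visible to this process.
        report["device_names"] = [
            torch.cuda.get_device_name(index)
            for index in range(torch.cuda.device_count())
        ]
    return report


if __name__ == "__main__":
    print(describe_accelerator())
```

Running the same script on a laptop without a GPU and on a remote GPU node gives a quick indication of whether the accelerated packages in the environment are actually usable on the host.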

2.1 Workshops and Events

Reproducible Machine Learning Workflows for Scientists Workshop Pilot 2025, Matthew Feickert. June 16-17, 2025.
SciPy 2025 tutorial on Reproducible Machine Learning Workflows for Scientists with Pixi, Matthew Feickert, Ruben Arts, John Kirkham. July 7, 2025.
Reproducible Machine Learning Workflows for Scientists Workshop 2025, Matthew Feickert. August 12-14, 2025.

2.2 Research Products and Publications

Matthew Feickert, Reproducible Machine Learning Workflows for Scientists, 2025. DOI: 10.5281/zenodo.17537698

Matthew Feickert, Ruben Arts, John Kirkham, Reproducible Machine Learning Workflows for Scientists with Pixi, Proceedings of the 24th International SciPy Conference (SciPy 2025), July, 2025. DOI: 10.25080/nwuf8465

2.3 Contributions to Open Source Projects

Addition of Pixi workflow templates to the University of Wisconsin–Madison’s Center for High Throughput Computing^[1] GPU Job Templates GitHub repository.

2.4 Future Opportunities and Collaborations

The Brookhaven National Laboratory (BNL) National Synchrotron Light Source II (NSLS-II) Data Science and Systems Integration (DSSI) division sent staff to the August, 2025 national-level workshop. Following the workshop, the DSSI has invited Feickert to BNL in 2026 to give a guest workshop for NSLS-II scientists and staff, as they are looking to modernize their beamline software application deployments.

3 Project Review

3.1 Expected Impact

The expected impact on the scientific software community described in the project proposal centered on the long-term effect of the workshop’s training and education on the research communities of the workshop participants. The post-workshop survey and the three-month follow-up survey provide evidence that Pixi is easy to learn and is beneficial enough for researchers that it has changed their normal scientific software workflow habits, becoming a common tool in their regular work.

The workshop and material had a particular focus on CUDA accelerated workflows for applications on GPUs. While this was a popular topic in the workshop and was noted by participants as a reason for attending, few participants had existing software projects that were actively making use of CUDA or GPUs for scientific tasks where hardware acceleration would be beneficial, e.g. machine learning. The reasons for this are not well understood. One hypothesis is that, although the hardware and hardware acceleration technologies are important and useful, gaining access to them and using them effectively at traditional academic institutions requires multiple steps and levels of permissions, which can act as a deterrent to rapid experimentation.

The workshop materials provide instruction and examples at each step of this procedure, and the use of Pixi and CUDA conda packages significantly lowers the complexity of the software management. Even so, the computing platforms used may have a large impact on the adoption of the demonstrated workflows in normal research. As shown in Table 1, deploying fully reproducible software environments to a computing facility generally requires researchers to use at least one additional computing technology beyond Pixi, with each additional technology potentially requiring multiple supporting files or actions to use. Improvements in the levels of interfacing between computing systems and researchers may have a positive effect on the widespread adoption and impact of fully reproducible hardware accelerated software environments.

Table 1: Comparison of common computing facility management solutions for scientific research and the number of distinct technologies required to use fully reproducible software environments with them. The use of * indicates a potential or optional dependency. Note that HTCondor systems with shared file systems are uncommon.

Computing resource management system	Shared file system	Technologies required
High-throughput computing	No	Pixi, Linux containers, HTCondor
High-throughput computing	Yes	Pixi, HTCondor
High-performance computing	Yes	Pixi, Slurm
Commercial cloud services	Yes	Pixi, Slurm, cloud specific software
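
The rows of Table 1 can be encoded as data to make the count of technologies beyond Pixi explicit. The sketch below is illustrative only: the `Facility` class and `extra_technologies` helper are hypothetical names, and because no * markers appear in the rows as shown, every listed technology is treated as required.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Facility:
    system: str
    shared_file_system: bool
    technologies: tuple[str, ...]


# Rows transcribed from Table 1.
TABLE_1 = (
    Facility("High-throughput computing", False,
             ("Pixi", "Linux containers", "HTCondor")),
    Facility("High-throughput computing", True, ("Pixi", "HTCondor")),
    Facility("High-performance computing", True, ("Pixi", "Slurm")),
    Facility("Commercial cloud services", True,
             ("Pixi", "Slurm", "cloud specific software")),
)


def extra_technologies(facility: Facility) -> tuple[str, ...]:
    """Technologies a researcher must use in addition to Pixi."""
    return tuple(t for t in facility.technologies if t != "Pixi")


if __name__ == "__main__":
    for facility in TABLE_1:
        extras = extra_technologies(facility)
        print(f"{facility.system} (shared FS: {facility.shared_file_system}): "
              f"{len(extras)} beyond Pixi -> {', '.join(extras)}")
```

Every row yields at least one technology beyond Pixi, and the HTC case without a shared file system yields two.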

3.2 Evaluation Metrics and Deliverables

The project evaluation metrics for success are reported in Table 2. The total number of participants across all workshops is reported from the in-person attendance of the June, 2025 pilot workshop (29 participants), the SciPy 2025 tutorial (56 participants), and the August, 2025 national-level workshop (44 participants). Additionally, as not all national-level workshop participants brought their own research projects, the reported percentage (42/44 participants) includes those who executed example workflows. Under these conditions, all participants in the national-level workshop succeeded in executing CUDA accelerated ML workflows except for two, who had been delayed by technical difficulties earlier in the workshop. During the Fellowship project, The Carpentries Lab was reviewing lesson submissions by invitation only. This, along with additional criteria discussed in Section 3.3, made it impossible to submit to The Carpentries Lab review process. The workshop materials will instead be submitted to the Journal of Open Source Education (JOSE).

Table 2: Summary of project success evaluation metrics as defined in the Fellowship proposal.

Evaluation metric	Target	Delivered	Metric achieved
Total number of participants across all workshops	50 participants	129 participants	Yes
Percentage of participants who were able to successfully reproduce their own scientific and AI/ML research workflows using the information they learned by the end of the national-level workshop	90%	~95%	Yes
Acceptance of the workshop educational materials into The Carpentries Incubator and The Carpentries Lab curriculum	The Carpentries Lab	The Carpentries Incubator	No
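
The delivered values in the first two rows of Table 2 follow directly from the attendance counts reported in this section. The following is a minimal sketch of that arithmetic; the variable and function names are introduced here for illustration only.

```python
# Attendance counts as reported in this section.
ATTENDANCE = {
    "June 2025 pilot workshop": 29,
    "SciPy 2025 tutorial": 56,
    "August 2025 national-level workshop": 44,
}


def total_participants(attendance: dict[str, int]) -> int:
    """Sum attendance across all events."""
    return sum(attendance.values())


def success_percentage(succeeded: int, attended: int) -> float:
    """Percentage of attendees who ran a workflow successfully."""
    return 100.0 * succeeded / attended


if __name__ == "__main__":
    print(total_participants(ATTENDANCE))          # 129
    print(round(success_percentage(42, 44), 1))    # 95.5
```

Both values exceed the targets of 50 participants and 90% given in Table 2.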

Of the twelve milestones and deliverables established in the Fellowship proposal, all were met or delivered, with the exception of:

The deliverable of a second pilot workshop at the University of Wisconsin–Madison, which was not held because of development time constraints.
The deliverable of submission of the workshop material for peer review to The Carpentries Lab, which was not possible given the previously mentioned restrictions.

3.3 Potential Impact of Additional Funding

Had additional research funding been available to extend the Fellowship project, it would have supported additional, more targeted workshops. Registration for the August, 2025 national-level workshop reached room capacity less than one week after it opened, and there were requests for additional registration slots, indicating interest in and demand for the material. Only a limited number of lodging stipend awards were available for participants to apply for to offset the costs of travel to the workshop location in Madison, Wisconsin. Additional funding would have allowed for follow-up workshops and additional lodging stipend awards to make the in-person workshop training accessible to more scientific communities.

In addition, feedback in the post-workshop survey and discussion with The Carpentries Incubator community indicate that the scope of technologies covered in the existing workshop material may be too much for some researchers to fully understand and apply in a workshop that lasts less than one week. The Carpentries Incubator has recommended that the current material be split into three successive “lessons”^[2] focusing on: Pixi, deployment technologies with Linux containers, and machine learning workflows at HTC and HPC facilities. Additional funding support would allow for the redesign of the current material into individual lessons that could be more easily taught by additional instructors, which is a requirement for peer review of lesson submissions to The Carpentries Lab. This would also allow multiple, more targeted workshops to be taught that might attract different researchers, or more technical workshops that build on the succession of lessons to require higher levels of prerequisite skills.

Acknowledgments

This work was supported by the US Research Software Sustainability Institute (URSSI) via grant G-2022-19347 from the Sloan Foundation. This work benefitted from resources and services provided by the University of Wisconsin–Madison Center for High Throughput Computing [3], as well as services provided by the OSG Consortium [4, 5, 6, 7], which is supported by the National Science Foundation awards #2030508 and #2323298.

Footnotes

^[1] The development home of HTCondor.
^[2] The Carpentries Incubator lessons map roughly to individual workshops.

References

[1] Feickert, M. (2025). Reproducible Machine Learning Workflows for Scientists. Zenodo. DOI: 10.5281/zenodo.17537698
[2] Feickert, M., Arts, R., & Kirkham, J. (2025). Reproducible Machine Learning Workflows for Scientists with Pixi. Proceedings of the 24th Python in Science Conference, 232–244. DOI: 10.25080/nwuf8465
[3] Center for High Throughput Computing. (2006). Center for High Throughput Computing. Center for High Throughput Computing. DOI: 10.21231/GNT1-HW21
[4] Pordes, R., Petravick, D., Kramer, B., Olson, D., Livny, M., Roy, A., Avery, P., Blackburn, K., Wenaus, T., Würthwein, F., Foster, I., Gardner, R., Wilde, M., Blatecky, A., McGee, J., & Quick, R. (2007). The open science grid. J. Phys. Conf. Ser., 78, 012057. DOI: 10.1088/1742-6596/78/1/012057
[5] Sfiligoi, I., Bradley, D. C., Holzman, B., Mhashilkar, P., Padhi, S., & Wurthwein, F. (2009). The pilot way to grid resources using glideinWMS. 2009 WRI World Congress on Computer Science and Information Engineering, 2, 428–432. DOI: 10.1109/CSIE.2009.950
[6] OSG. (2006). OSPool. OSG. DOI: 10.21231/906P-4D78
[7] OSG. (2015). Open Science Data Federation. OSG. DOI: 10.21231/0KVZ-VE57