📣 BIG NEWS: Neptune is joining OpenAI! → Read the message from our CEO 📣

Case Study

How BGU Research Group Tracks Thousands of Models With Neptune

We all have limited resources, even large companies. Tools like Neptune help us train fewer models by finding better models faster, optimizing our resources.
Omri Azencot
Assistant Professor at BGU
Before
    Searching for a way to manage complex, multi-project workflows across a large team
After
    Enabled collaboration within a distributed team
    Implemented organized and efficient process of debugging and analyzing training results
    Optimized resource usage

Omri Azencot is an Assistant Professor in the Computer Science department at Ben-Gurion University of the Negev. He leads a research group of around 20 members, including PhD, master’s, and undergraduate students. His team focuses on developing machine learning models for sequential data, with an emphasis on representation learning and generative modeling.

The challenge

Omri’s group is currently managing about 15 projects, some of them more active, some less. Within each project, they train different models for various problems. Often, it’s thousands of experiments per project. 

Each project is a collaborative work. Whenever a model achieves good results, the entire group reviews the run to understand what worked and what didn’t, comparing it to other experiments.

Managing these complex, multi-project workflows across a large, distributed team presented several challenges:

  • Onboarding new team members efficiently
  • Tracking and comparing thousands of experiments
  • Collaborating effectively among team members, both locally and remotely
  • Optimizing limited computational resources
avatar lazyload
quote
At some point, one of my students tried doing the tracking process manually, and he was very frustrated after one project. Any manual change can mess up information organization and how you track it. And if you do not build it well, then you suffer, you need to recode, etc. I think it’s just a waste of time.
Omri Azencot Assistant Professor at BGU

Implementing experiment tracking solution in an academic research setting

To address these challenges, Omri’s group implemented Neptune.

How is their workflow reflected in Neptune? The team has one workspace divided into multiple projects. While each team member has access to all projects, they could manage access restrictions (e.g., for data privacy reasons), though they currently find it unnecessary.

Using Neptune, the team logs every model along with key metadata, such as parameters, loss functions, and system resource consumption. They can compare and analyze this data, and the training runs easily.

Neptune’s custom views of the run table have been instrumental for their work. Filtering runs based on different datasets or models, the group can track and compare results more effectively.

avatar lazyload
quote
In modern ML, we train many models for a problem. Comparing models in an easy and intuitive way is crucial—Neptune provides multiple views to make informed comparisons, which would be very difficult otherwise.
Omri Azencot Assistant Professor at BGU

Team members engaged in a project look at the table to identify interesting runs and then dive into those specific runs for detailed analysis. They appreciate that Neptune supports logging and displaying different metadata types as they need to look at many things to debug training effectively—images, metrics, plots, sometimes even text or audio.

Working as a distributed team, they frequently rely on Neptune during remote meetings to discuss project progress and even include links to specific Neptune runs in presentations.

avatar lazyload
quote
Neptune helps us collaborate easily because all our experiments are logged in one place, making it easy to share results and analyze outcomes.
Omri Azencot Assistant Professor at BGU

The results

With Neptune, Omri’s research group:

  • Enabled collaboration within a distributed team.
  • Implemented a more organized and efficient process of debugging and analyzing training results.
  • Reduced the number of models that need to be trained and, in consequence, optimized resource usage.
avatar lazyload
quote
We all have limited resources, even large companies. Tools like Neptune help us train fewer models by finding better models faster, optimizing our resources.
Omri Azencot Assistant Professor at BGU
avatar
quote
Neptune helps us collaborate easily because all our experiments are logged in one place, making it easy to share results and analyze outcomes. I also appreciate that as an academic group, we can use it for free.
Omri Azencot Assistant Professor at BGU

Looking for an experiment tracking solution for your research group?