Sharing machine learning models

11. Sharing machine learning models#

Up to now, we have been focus on sharing code and collaborating with others on GitHub. But what about sharing machine learning models?

For ML models, we can use the same principles of version control and collaboration that we have seen before. However, there are some additional considerations when sharing models, such as the size of the model, the data, and the dependencies. GitHub has some limitations when it comes to sharing large files, so we may want to use other tools and platforms to share large machine learning models.

For this kind of content, we can use the same tools that we have seen before, like GitHub, but we can also use other platforms like Hugging Face, Weights & Biases, and TensorFlow Hub.

In this section, we will see how to share machine learning models using Hugging Face.

11.1. Hugging Face#

Hugging Face’s platform and libraries, particularly the Transformers library and Hugging Face Hub, enable efficient and collaborative code sharing. By providing access to a vast array of models, datasets, and tools, Hugging Face empowers developers, researchers, and organizations to accelerate their projects.

We can use Hugging Face to share models, datasets, and training scripts.

11.2. GitHub and HuggingFace#

GitHub and Hugging Face can be used together to share machine learning models. You can use GitHub to store the code and the training scripts, and Hugging Face to store the models. This way, you can have a complete pipeline for sharing machine learning models. Also, in this way, you can use the version control features of GitHub to track changes in the code and the models.

In a similar way, you could share public dataset in Hugging Face Datasets and use it in your GitHub repository.

Sharing machine learning models

Contents

11. Sharing machine learning models#

11.1. Hugging Face#

11.1.1. Sharing models using the web interface#

11.1.2. Sharing models using the ModelHubMixin class#

11.2. GitHub and HuggingFace#