General information about ML-platform product
Selectel's ML platform is a prepared infrastructure for realizing ML development processes: training, deployment of ML models, etc. The infrastructure consists of software and hardware components that are configured and prepared for operation.
The selection of ML platform components utilizes all available cloud server configurations. Once the platform is connected, it can be extended with its own software components. Usage tested:
- ClearML;
- Kubeflow — more about installing Kubeflow.
There are no additional restrictions on Selectel's management of the ML platform cluster.
Platform components
By default, the ML platform consists of:
- hardware components:
- cloud platform is the base for Managed Kubernetes c GPUs (Tesla T4, A2, A30, A100, RTX A2000 and A5000);
- program components:
- Managed Kubernetes clusters with pre-configuration;
- domain to access the Managed Kubernetes cluster;
- SSO Keycloak — authorization in internal services of the platform;
- Prom Stack — monitoring of platform components;
- Forecastle is the home page of the platform;
- object storage — storage of datasets and experiment data;
- Container Registry — storage of container images.
In Managed Kubernetes clusters:
- drivers installed;
- annotation of nodes has been performed;
- added the necessary GPU resources for computation;
- network is configured, including Traefik Kubernetes Ingress.
When the ClearML platform is installed in a cluster, it is directly managed through the SDK, which is installed in the user's own IDE. ClearML uses cluster nodes to run ML experiments. The ClearML architecture allows for a variety of component configurations:
- A single Managed Kubernetes cluster for all ML tasks;
- several clusters of Managed Kubernetes — each for its own task (Inference and Training);
- connecting a dedicated server as a computational node for ML experiments.
Connect platform
To connect the platform, leave a request on the website. We can help you customize its configuration for your project and calculate the cost.
Cost
The cost of ML-platform is calculated after processing the application and selecting the configuration. Depends only on the platform components: Managed Kubernetes cluster, object storage, Container Registry.