GPU H100 Instances

Why choose NVIDIA H100 GPUs?

Powerful

Up to 4 times faster than A100 for training complex AI models and content generation.

High performance

By optimising computations, the FP8 Transformer Engine greatly enhances the performance and energy efficiency of LLMs and GenAI.

Compatible

Fully compatible with CUDA, PyTorch, TensorFlow, and JAX. Leverage your existing optimisations without any adjustments required.

Sovereign

Available in our Public Cloud, ensuring flexibility, transparency, and compliance with European standards.

Optimised for your AI and data workloads

LLM training and inference

Accelerate your models — up to 70B configurations (Llama 2, Mistral, Falcon, etc.) — with the power of Transformer Engine and the memory bandwidth of H100.

Multimodal generative AI

Seamlessly create, train, and deploy your large-scale image, audio, and video generation models.

Data science and high-performance computing

Get the best results from your heavy computations (simulation, scientific modelling, and massive parallel processing), all with consistent performance and low latency.

Specifications

Technical specifications

GPU

1–4 GPUs per instance

GPU memory

80 GB of ultra-fast HBM3 per GPU

High-performance storage

Local NVMe passthrough on most instances

Public and private network

Up to 25 Gbps included

Automation

Management via your Control Panel, API, OVHcloud CLI, etc.

Secure and private

ISO27001, SOC certifications, health data hosting.

Our Cloud GPU range

H200

Up to 1.4 times faster than the H100. Ideal for LLM 65B+.

A100

Best balance of performance, cost, and AI flexibility.

V100

Reliable GPU for machine learning and scientific computing.

V100S

An upgraded V100 with higher bandwidth and frequency.

L40S

Suitable for various uses in multimodal GenAI and advanced 3D rendering.

L4

Efficient and cost-effective for AI inference and video processing.

A10

AI + graphics versatility makes it ideal for inference and computer vision.

RTX 5000 Quadro

Designed for 3D rendering, visualisation, and professional design.

Ready to accelerate your AI projects?

Create an account and launch your services in minutes

Get ₹ 18 000 in free credit to launch your first Public Cloud project

Maximise your ROI with flexible GPU infrastructure

Pricing transparency

Pay only for the resources you use, with no hidden fees. Maintain control over your costs while enjoying optimal performance.

Instant scalability

Scale your GPU resources on demand, in just a few clicks. Easily adapt your capacity to AI and data workloads.

Sovereignty and compliance

Your data is hosted in a secure, transparent, and compliant European cloud, certified to meet regulations (GDPR, ISO, HDS).

Barrier-free accessibility

Open and accessible H100 GPUs — from proof of concept to production deployment, with no limitations on how many you need or the type of hardware used.

How do I choose my GPU for inference?

Compact models

The A100, with up to 7B of parameters, offers an excellent price/performance ratio.

Intermediate LLMs

With up to 30B, the H100 offers the best balance of speed, energy efficiency, and framework compatibility.

Large models

From 65B+ or extended context windows, the H200 provides the memory bandwidth needed for stable response times.

Configure your GPU instances

Choosing your GPU for LLM inference

Discover how to set up your GPU architecture to meet the requirements of AI models.

Explore the key differences between our AI Notebooks, AI Training, and AI Deploy solutions

Choose the one that best suits your needs using the comparison table.

Explore the key differences between our AI Notebooks, AI Training, and AI Deploy solutions

We will assist you in setting up your GPU on Managed Kubernetes via the OVHcloud Control Panel and Helm.

Your questions answered

What service level agreement is guaranteed by OVHcloud on a GPU instance?

The SLA guarantees 99.99% monthly availability on GPU instances. For further information, please refer to the General Terms of Service.

Which hypervisor is used for instance virtualisation?

Just like other instances, GPU instances are virtualised by the KVM hypervisor in the Linux kernel.

What is PCI Passthrough?

Cards with GPUs are served via the physical server’s PCI bus. PCI Passthrough is a hypervisor feature that allows you to dedicate hardware to a virtual machine by giving direct access to the PCI bus, without going through virtualisation.

Can I resize a Cloud GPU instance?

Yes, Cloud GPU instances can be upgraded to a higher model after a reboot. However, they cannot be downgraded to a lower model.

Do GPU instances have anti-DDoS protection?

Yes, our anti-DDoS protection is included with all OVHcloud solutions at no extra cost.

Can I switch to hourly billing from an instance that is billed monthly?

If you have monthly billing set up, you cannot switch to hourly billing. Before you launch an instance, please take care to select the billing method that is best suited to your project.

What is a Cloud GPU?

A Cloud GPU is a cloud computing service that provides graphic processing units (GPUs) for tasks that require high computing power. Examples of these are graphic rendering, machine learning, data analysis, and scientific simulations. Unlike on-premises GPUs, which require a significant investment in hardware, cloud GPUs are more flexible and easier to scale. Users can access high-performance computing resources on demand, and only pay for what they use.

What is an H100 and A100 server?

Servers that are equipped with NVIDIA H100 and A100 GPUs are purpose-built to offer exceptional performance in HPC, AI, and data analytics.

What is NGC?

NVIDIA GPU Cloud (NGC) is a cloud computing platform offered by NVIDIA. It provides a comprehensive selection of software that is optimised for GPU acceleration in artificial intelligence (AI), machine learning (ML), and high-performance computing (HPC). NGC simplifies and speeds up the deployment of AI and scientific computing applications. It does this by providing containers, pre-trained models, SDKs, and other tools that are optimised to leverage NVIDIA GPUs.

Why use a Cloud GPU?

There are several advantages to using a Cloud GPU, especially for companies, R&D, and development teams in demanding fields, such as artificial intelligence (AI), graphics rendering, machine learning (ML), and high-performance computing (HPC).

GPU H100 Instances

Boost your AI projects with H100 GPU instances

Why choose NVIDIA H100 GPUs?

Powerful

High performance

Compatible

Sovereign

Optimised for your AI and data workloads

LLM training and inference

Multimodal generative AI

Data science and high-performance computing

Specifications

Technical specifications

GPU

GPU memory

High-performance storage

Public and private network

Automation

Secure and private

Our Cloud GPU range

H200

A100

V100

V100S

L40S

L4

A10

RTX 5000 Quadro

Ready to accelerate your AI projects?

Create an account and launch your services in minutes

Maximise your ROI with flexible GPU infrastructure

Pricing transparency

Instant scalability

Sovereignty and compliance

Barrier-free accessibility

How do I choose my GPU for inference?

Compact models

Intermediate LLMs

Large models

Configure your GPU instances

Choosing your GPU for LLM inference

Explore the key differences between our AI Notebooks, AI Training, and AI Deploy solutions

Explore the key differences between our AI Notebooks, AI Training, and AI Deploy solutions

Your questions answered

What service level agreement is guaranteed by OVHcloud on a GPU instance?

Which hypervisor is used for instance virtualisation?

What is PCI Passthrough?

Can I resize a Cloud GPU instance?

Do GPU instances have anti-DDoS protection?

Can I switch to hourly billing from an instance that is billed monthly?

What is a Cloud GPU?

What is an H100 and A100 server?

What is NGC?

Why use a Cloud GPU?