AI Endpoints
AI Endpoints
Powerful, secure, and easy-to-integrate generative AI APIs to enhance your applications.
Why choose OVHcloud AI Endpoints?
AI Endpoints is the serverless inference API for a selection of models, such as Llama, Qwen, Deepseek, and over 40 models, designed to prioritise data privacy.
Data security and confidentiality
With our platform, you can be sure that your and your users’ data are safe and private.
Your data will never be used to train or improve our AI models; this is one of our many security guarantees.
Developers
AI Endpoints makes it easier to integrate the latest AI models into your applications.
You get access to comprehensive documentation and code examples to embed AI, with no expertise needed.
The best AI models
A collection of the most advanced and renowned AI models on the market. AI Endpoints provides powerful tools for LLMs, voice processing, document analysis, and image analysis, whatever your specific needs.
Reversibility
With AI Endpoints, models can be deployed on your infrastructure or integrated with other cloud services. This ensures complete freedom and eliminates any risk of vendor lock-in, allowing you to maintain control.
Usage examples
Conversational AI
Enhance your apps by adding conversational AI that interacts naturally with users in real time. These AI-powered chatbots boost customer engagement, automate customer service, and customise user experiences.
Voice transcription and interactions
Use AI-powered voice-to-text for transcription in customer service, meetings, and closed captioning. Providing accessible and customisable audio, text-to-speech improves the user experience by meeting individual needs and preferences.
Private code assistant
Integrate coding help plugins, such as Continue, into common development environments (IDEs). These tools suggest code, detect errors and automate tasks in real time, while protecting data confidentiality.
SPECIFICATIONS
Key features
Standard APIs
Popular APIs (like OpenAI) for easy integration
Token authentication
Easily manage and revoke your API tokens
Performance
Achieve high inference performance using OVHcloud’s GPU infrastructure
Security
Underlying platform with ISO 27000, SOC, and healthcare data certifications
Privacy
Your data is neither reused nor kept
More than 40 models
A constantly updated range of popular, open-weight models
Lifecycle management
Improve reproducibility with transparent model version management
Sandbox
Interactively test and explore models in a simplified environment
Our integrations
The AI Endpoints ecosystem is expanding as it adds new native integrations with popular platforms. Developers can easily use your high-performing AI models directly from their preferred environments and tools, eliminating infrastructure issues and ensuring both data sovereignty and security.
Related products

Explore the potential of the Public Cloud
Discover our comprehensive portfolio of Public Cloud solutions — Compute, Storage, Network, AI and much more — through an interactive and certification-focused learning experience.
Why choose OVHcloud?
Simplicity
Developers can effortlessly integrate AI into their projects on our platform, even without extensive AI knowledge. Easy APIs and clear templates make integration quick and seamless.
Made for businesses
OVHcloud’s infrastructure is designed for optimal security, performance, and scalability. Whether you are a startup or a large company, our platform meets your AI needs reliably and securely.
Scalability and flexibility
AI Endpoints seamlessly handles business growth, scaling to support hundreds to millions of requests, with zero trade-offs on performance and privacy.












