-
Products & documentation Red Hat AI
A platform of products and services for the development and deployment of AI across the hybrid cloud.
Red Hat AI Inference Server
Optimize model performance with vLLM for fast and cost-effective inference at scale.
Red Hat Enterprise Linux AI
Develop, test, and run generative AI models to power enterprise applications.
Red Hat OpenShift AI
Build and deploy AI-enabled applications and models at scale across hybrid environments.
-
Learn -
AI partners
Generative AI with Red Hat AI
Generative AI (gen AI) is a type of artificial intelligence that can produce new content, such as text and software code. In an enterprise setting, gen AI provides an opportunity to boost productivity, improve customer experience, and optimize workflow processes.
For gen AI to work effectively for your business, it must be customized to fit your needs and operate on your terms.
Red Hat® AI can help with that.
Boost productivity without overburdening your workforce
With the ability to produce new content shaped by your unique company data, gen AI can help you connect people with information more efficiently than ever before.
Why Red Hat AI?
Red Hat AI gives you access to a supported, enterprise version of open source tools and technologies for the AI lifecycle. This means you stay at the forefront of AI innovation with consistent access to the most transparent and optimized solutions.
Increased efficiency, reduced cost
Red Hat AI increases efficiency in AI deployment and operations by providing access to enterprise-grade, open source Granite models. These small language models can be tailored to fit your needs. They are less costly, more efficient, and optimized for inference. They’re also covered by our Open Source Assurance program.
Apply LLM compressor algorithms to further reduce hardware costs, and run them on hardware accelerators of your choosing.
Technologies like vLLM are available to optimize the power of GPUs. This lowers hardware costs and improves token generation speed, leading to quicker responses in real-time applications.
Simplified model customization
Designed for accessibility, Red Hat AI provides a consistent user experience for data scientists, data engineers, application developers, and DevOps teams. This means better collaboration, fewer errors, and faster time to market.
Red Hat AI includes access to InstructLab alignment tools, making it accessible for all your employees to contribute to a large language model (LLM).
You can also customize your LLMs with retrieval-augmented generation (RAG). Red Hat AI provides capabilities like data ingestion, embeddings, and model evaluations, so you can produce responses that align with your specific business needs.
Flexible platform
Red Hat AI provides users with the flexibility to choose where to train, tune, deploy, and run models and gen AI applications–on premise, in the public cloud, or at the edge. By managing your gen AI models within your environment of choice, you can control access, automate compliance monitoring, and enhance data security.
vLLM supports flexible deployment of your gen AI applications by breaking up the work of processing across multiple GPUs. This distributes services across nodes that receive, process, and transmit data and makes for more efficient use of computing resources.
Red Hat AI also supports disconnected and air-gapped environments, so you can safeguard your most sensitive data.
Red Hat Support
Our engineering team is dedicated to helping you navigate our AI platform. From the operating system to the individual tools, we can provide the help you need to move your AI strategy forward.
Red Hat AI
Tune small models with enterprise-relevant data, and develop and deploy AI solutions across hybrid cloud environments.
Customer stories
Turkcell
Red Hat helped Turkcell create an infrastructure to deliver gen AI services. As a result, provisioning times were shortened from months to seconds and AI development and operations costs were reduced by 70%.
Galicia
Working with Red Hat, Banco Galicia built an AI-based solution that reduced the data-processing time for verifying corporate clients from 20 days to minutes. It also achieved 90% accuracy.
Ejie
Red Hat helped the Basque Government Informatic Society (EJIE) create a high-speed translation tool to promote linguistic diversity. The tool helps citizens quickly translate Spanish, French, and English into Basque, and vice versa.
Your vendors are your choice
We work with software and hardware vendors and open source communities to offer a holistic AI solution.
Access partner products and services that are tested, supported, and certified to perform with our technologies.