OpenVINO™ toolkit: An open source AI toolkit that makes it easier to write once, deploy anywhere.
What's New in Version 2024.1
The OpenVINO™ toolkit version 2024.1 release enhances generative AI accessibility with improved large language model (LLM) performance and expanded model coverage. It also boosts portability and performance for deployment anywhere: at the edge, in the cloud, or locally.
Latest Features
Easier Model Access and Conversion
| Product | Details |
| --- | --- |
| New Model Support | Support for Falcon-7b-Instruct, a ready-to-use GenAI LLM chat/instruct model with superior performance metrics. |
Generative AI and LLM Enhancements
Expanded model support and accelerated inference.
| Feature | Details |
| --- | --- |
| Model Coverage | New Jupyter* Notebooks added: YOLOv9*, YOLOv8* Oriented Bounding Boxes Detection (OBB), Stable Diffusion* in Keras, MobileCLIP, RMBG-v1.4 Background Removal, Magika, TripoSR, AnimateAnyone, LLaVA-Next, and a retrieval augmented generation (RAG) system with the OpenVINO toolkit and LangChain. |
| Performance Improvements for LLMs | LLM compilation time reduced through additional optimizations with compressed embeddings. Improved first-token performance of LLMs on 4th and 5th generation Intel® Xeon® platforms with Intel® Advanced Matrix Extensions (Intel® AMX). Better LLM compression and improved performance with Intel® oneAPI Deep Neural Network Library (oneDNN); int4 and int8 support for Intel® Arc™ GPUs. |
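To give a feel for what int4 weight compression does, here is a minimal NumPy sketch of symmetric, per-group int4 quantization: weights are split into small groups, each group gets one scale, and values are stored as 4-bit integers (-8..7). This is an illustrative toy under simplified assumptions, not the toolkit's actual implementation, which lives in the NNCF library.

```python
# Toy sketch of symmetric int4 weight quantization with per-group scales.
# Illustrative only -- OpenVINO's real compression is implemented in NNCF.
import numpy as np

def quantize_int4_symmetric(weights: np.ndarray, group_size: int = 8):
    """Quantize a 1-D weight vector to int4 (-8..7) with one scale per group."""
    w = weights.reshape(-1, group_size)
    # One scale per group: map the largest magnitude onto the int4 limit 7.
    scales = np.abs(w).max(axis=1, keepdims=True) / 7.0
    scales[scales == 0] = 1.0  # avoid division by zero for all-zero groups
    q = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return q, scales

def dequantize(q: np.ndarray, scales: np.ndarray) -> np.ndarray:
    """Reconstruct approximate float weights from int4 codes and scales."""
    return (q.astype(np.float32) * scales).reshape(-1)

rng = np.random.default_rng(0)
w = rng.normal(size=64).astype(np.float32)
q, s = quantize_int4_symmetric(w)
w_hat = dequantize(q, s)
print("max abs reconstruction error:", np.abs(w - w_hat).max())
```

Storing 4-bit codes plus a scale per group cuts weight memory roughly 4x versus FP16, at the cost of a bounded per-weight rounding error (at most half a scale step per value), which is why int4 is attractive for large LLM weight matrices.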
More Portability and Performance
Develop once, deploy anywhere. OpenVINO toolkit enables developers to run AI at the edge, in the cloud, or locally.
| Product | Details |
| --- | --- |
| Arm* Processor Support Updates | FP16 inference is now enabled by default for convolutional neural networks (CNNs) on Arm processors. |
| Intel Hardware Support | Mixtral and URLNet models optimized for improved performance on Intel® Xeon® processors. Stable Diffusion* 1.5, ChatGLM3-6b, and Qwen-7B models optimized for improved inference speed on Intel® Core™ Ultra processors with an integrated GPU. The preview neural processing unit (NPU) plug-in for Intel Core Ultra processors is now available in the OpenVINO toolkit open source GitHub* repository, in addition to the main OpenVINO toolkit package on PyPI. Significant memory reduction for select smaller generative AI (GenAI) models on Intel Core Ultra processors with an integrated GPU. |
| JavaScript* API | The JavaScript API is now available through the npm repository, giving JavaScript* developers seamless access to the OpenVINO toolkit API. |
Sign Up for Exclusive News, Tips & Releases
Be among the first to learn about everything new with the Intel® Distribution of OpenVINO™ toolkit. By signing up, you get early access to product updates and releases, exclusive invitations to webinars and events, training and tutorial resources, contest announcements, and other breaking news.
Resources
Community and Support
Explore ways to get involved and stay up-to-date with the latest announcements.
Get Started
Optimize, fine-tune, and run comprehensive AI inference using the included model optimizer, runtime, and development tools.
The productive smart path to freedom from the economic and technical burdens of proprietary alternatives for accelerated computing.