Intel OneAPI 2025.0.0 | 17.4 Gb
The oneAPI Toolkits development team is pleased to announce the availability of Intel oneAPI Base & HPC Toolkit 2025.0.0.0 is a comprehensive suite of development tools that make it fast and easy to build modern code that gets every last ounce of performance out of the newest Intel processors in high-performance computing (HPC) platforms.
Intel oneAPI Base & HPC Toolkit 2025.0.0.0 Release notes
Toolkit Level Updates
- Get the most from the latest hardware with new Intel developer tools support for Intel® Xeon® 6 Processors with Performance-Cores (P-Cores), formerly codenamed Granite Rapids and Intel® Core™ Ultra processors (Series 2), formerly codenamed Lunar Lake.
- The Intel® oneAPI Base Toolkit (Base Kit) now offers two convenient subset bundles offering smaller downloads for specific developer use cases. Intel® C++ Essentials is for C++ developers is focused on compiling, debugging, and utilizing the most widely used Base Kit performance libraries for Intel CPUs and GPUs. Intel® Deep Learning Essentials provides advanced developers with the tools and libraries to develop, compile, test, and optimize deep learning frameworks and libraries, such as PyTorch and TensorFlow, for Intel CPUs and GPUs.
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler.
- Experience dynamic and flexible GPU programming with Intel® oneAPI DPC++/C++ Compiler's SYCL Bindless Textures support, utilizing textures at runtime without compile-time knowledge for improved performance and scalability in C++ with SYCL applications, alongside powerful new LLVM sanitizers to streamline development and ensure enhanced device code reliability.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- Leverage enhanced OpenMP standards support and performance enhancements with the Intel® oneAPI DPC++/C++ Compiler, including OpenMP 5.x and 6.0 features for increased efficiency and flexibility, complemented by upgraded compiler opt-report capabilities for in-depth performance insights and optimization feedback.
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices.
- Intel® Math Kernel Library (oneMKL)SYCL Discrete Fourier Transform API is easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, especially when targeting Intel GPUs.
- HPC workloads using oneMKL single precision 3D real in-place FFTs run faster on Intel® Data Center GPU Max Series.
- Multi-threads apps run faster with Intel® oneAPI Threading Building Blocks (oneTBB) task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
- Intel® Integrated Performance Primitives (Intel IPP) now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Use Intel® Cryptography Primitives Library to turbocharge RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Use Intel® Cryptography Primitives Library to dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instructions
- Intel® VTune™ Profiler now identifies GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Intel® VTune™ Profiler now identifies and optimizes device-side inefficiencies for Direct X APIs.
- Intel® Advisor introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
- Intel® oneAPI Deep Neural Network Library (oneDNN) dramatically boosts performance for Large Language Models and scaled dot-product subgraphs.
- Intel® oneAPI Communications Library (oneCCL) now includes optimizations that enable workloads to scale and perform even better than before. Important enhancements have been made to key collectives, and even more optimizations are now available on single-node CPU configurations.
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® oneAPI DPC++ Compiler 2025.0.0
- Unlock dynamic execution and flexible programming for Intel GPUs with the Intel® oneAPI DPC++/C++ Compiler's new SYCL Bindless Textures support, allowing developers to utilize textures at runtime without compile-time knowledge, and leverage texture objects for enhanced performance and scalability in C++ with SYCL applications.
- Streamline your development process with the Intel® oneAPI DPC++/C++ Compiler's new LLVM sanitizers providing powerful tools to detect and troubleshoot device code issues with ease, ensuring cleaner and more reliable applications.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- The Intel® oneAPI DPC++/C++ Compiler enhances OpenMP standards conformance and performance optimizations for OpenMP 5.x and the new OpenMP 6.0 standards, featuring new interop properties, sync target no wait, free-agent threads/tasks, extended OMP_PLACES syntax, and advanced thread limit controls, all designed to boost application performance with unparalleled efficiency and flexibility.
- Enhanced compiler insights with Intel® oneAPI DPC++/C++ Compiler's significant upgrades to the opt-report capabilities, now offering a more user-friendly optimization report that includes OpenMP offloading details, integrates with the open-source optimization remark framework, and adds optimization report functionality to the SYCL and OpenMP runtimes as well as AOT compilation paths for a more comprehensive understanding of application performance improvements.
- Streamlined FPGA development with Intel® oneAPI DPC++/C++ Compiler's latest enhancements, featuring usability and performance improvements, flexible local buffer size configurations, seamless SPIR-V translation, and streamlined command options where -fintelfpga implies -qactypes and -fp-model=fast triggers aoc -vpfp-relaxed, simplifying the workflow for faster, more efficient FPGA application development
Intel® oneAPI DPC++ Library 2022.7.0
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use production released oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler
Intel® DPC++ Compatibility Tool 2025.0.0
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
Intel® oneAPI Math Kernel Library 2025.0.0
- Developers targeting Intel® Xeon® 6 Processors will benefit from the performance optimizations available on Intel(R) oneMKL 2025.0 across multiple domains as BLAS, LAPACK and FFT.
- HPC workloads using single precision 3d real in-place FFT will get significant improvements to execute on Intel® Data Center GPU Max Series.
- New distribution models and data types available for Random Number Generation (RNG) using SYCL device API.
- SYCL Discrete Fourier Transform API got easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, in special when targeting Intel GPUs.
- Sparse domain on SYCL API now supports sparse matrices using Coordinate Format (COO). This format is widely used for fast sparse matrices construction and it can be easily converted to other popular formats such as Compressed Sparse Row (CSR) and Compressed Sparse Column) CSC.
Intel® Distribution for GDB* 2025.0.0
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® VTune™ Profiler 2025.0.0
- Adds support for Intel® Core™ Ultra 200V, codenamed Lunar Lake, Intel® Core™ Ultra 200 "Arrow Lake-S" series and 6th gen. Intel® Xeon® Scalable Processors, codenamed Granite Rapids.
- Identify GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Identify and optimize device-side inefficiencies for Direct X APIs.
- Adds profiling support for Python 3.11. Improved productivity with the ability to focus Python profiling to only areas of interest and control performance data collection with ITT APIs.
- The Platform Profiler capabilities in Intel® VTune™ Profiler has released its final version, available as stand-alone download. No further feature improvements or security fixes will be available after this final release. The capabilities are now transitioned to the EMON command line interface. For more information, see the Intel® VTune™ Profiler – Platform Profiler transition notice.
Intel® Advisor 2025.0.0
- Intel Advisor 2025.0 expands its hardware support to include GNR, Intel's next-generation platform. Developers can now leverage Advisor's powerful analysis and optimization capabilities on the latest hardware.
- Intel Advisor 2025.0 introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
Intel® oneAPI Threading Building Blocks 2022.0.0
- Multi-threads apps run faster with oneTBB task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
Intel® Integrated Performance Primitives 2022.0.0
- Security First: Intel IPP now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Speed Up with Optimized Functions: ippiNormRel_L2_8u_C*MR now optimized for Intel's latest Granite & Sapphire Rapids platforms.
- Thread-Safe Excellence: Bug fixes on ippsWinKaiser_32f_I() increased its reliability in multi-threaded environments.
- Intel® Integrated Performance Primitives Cryptography (Intel® IPP Cryptography) is now Intel® Cryptography Primitives Library!
- Experience the power of dispatching on SRF, turbocharging RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instruction.
- Now built with the secure and powerful ICX compiler, our library is optimized for peak performance.
Intel® Cryptography Primitives Library 2025.0.0
- Intel® Integrated Performance Primitives Cryptography (Intel® IPP Cryptography) is now Intel® Cryptography Primitives Library!
- Experience the power of dispatching on SRF, turbocharging RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instruction.
- Now built with the secure and powerful ICX compiler, our library is optimized for peak performance.
Intel® oneAPI Collective Communications Library 2021.14.0
Intel® oneAPI Communications Library 2021.14.0 brings further optimizations that allow workloads to scale and perform better. Some of the key features include:
- Enhancements to oneCCL's Key-Value store that improves communication among ranks, allowing workloads to scale up to even larger number of nodes
- Performance improvements to key collectives such as Allgather, Allreduce and Reduce-scatter
- Even further optimizations for workloads on single-node CPU configurations
- Other lower level optimizations that improve speed and resource utilization
Intel® oneAPI Data Analytics Library 2025.0.0
- Intel® oneAPI Data Analytics Library 2025.0.0 enabled calculation of SHAP values for binary classification models required for explainability RF algorithms
- Intel® oneAPI Data Analytics Library 2025.0.0 enabled performance improvements for Decision Forest
Intel® oneAPI Deep Neural Networks Library 2025.0.0
Intel® oneAPI Deep Neural Network Library (oneDNN) 2025.0 introduces:
- Accelerated AI workloads: Experience significantly faster performance for Large Language Models and scaled dot product subgraphs.
- Optimized for Intel hardware: Benefit from oneDNN's tailored optimizations for Intel CPUs and GPUs, ensuring maximum efficiency and performance.
- Enhanced productivity: Save time and resources by leveraging oneDNN's cutting-edge acceleration technology for hardware to accelerate your AI development and deployment.
Deprecation Notices
- None
- Get the most from the latest hardware with new Intel developer tools support for Intel® Xeon® 6 Processors with Performance-Cores (P-Cores), formerly codenamed Granite Rapids and Intel® Core™ Ultra processors (Series 2), formerly codenamed Lunar Lake.
- The Intel® oneAPI Base Toolkit (Base Kit) now offers two convenient subset bundles offering smaller downloads for specific developer use cases. Intel® C++ Essentials is for C++ developers is focused on compiling, debugging, and utilizing the most widely used Base Kit performance libraries for Intel CPUs and GPUs. Intel® Deep Learning Essentials provides advanced developers with the tools and libraries to develop, compile, test, and optimize deep learning frameworks and libraries, such as PyTorch and TensorFlow, for Intel CPUs and GPUs.
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler.
- Experience dynamic and flexible GPU programming with Intel® oneAPI DPC++/C++ Compiler's SYCL Bindless Textures support, utilizing textures at runtime without compile-time knowledge for improved performance and scalability in C++ with SYCL applications, alongside powerful new LLVM sanitizers to streamline development and ensure enhanced device code reliability.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- Leverage enhanced OpenMP standards support and performance enhancements with the Intel® oneAPI DPC++/C++ Compiler, including OpenMP 5.x and 6.0 features for increased efficiency and flexibility, complemented by upgraded compiler opt-report capabilities for in-depth performance insights and optimization feedback.
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices.
- Intel® Math Kernel Library (oneMKL)SYCL Discrete Fourier Transform API is easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, especially when targeting Intel GPUs.
- HPC workloads using oneMKL single precision 3D real in-place FFTs run faster on Intel® Data Center GPU Max Series.
- Multi-threads apps run faster with Intel® oneAPI Threading Building Blocks (oneTBB) task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
- Intel® Integrated Performance Primitives (Intel IPP) now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Use Intel® Cryptography Primitives Library to turbocharge RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Use Intel® Cryptography Primitives Library to dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instructions
- Intel® VTune™ Profiler now identifies GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Intel® VTune™ Profiler now identifies and optimizes device-side inefficiencies for Direct X APIs.
- Intel® Advisor introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
- Intel® oneAPI Deep Neural Network Library (oneDNN) dramatically boosts performance for Large Language Models and scaled dot-product subgraphs.
- Intel® oneAPI Communications Library (oneCCL) now includes optimizations that enable workloads to scale and perform even better than before. Important enhancements have been made to key collectives, and even more optimizations are now available on single-node CPU configurations.
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® oneAPI DPC++ Compiler 2025.0.0
- Unlock dynamic execution and flexible programming for Intel GPUs with the Intel® oneAPI DPC++/C++ Compiler's new SYCL Bindless Textures support, allowing developers to utilize textures at runtime without compile-time knowledge, and leverage texture objects for enhanced performance and scalability in C++ with SYCL applications.
- Streamline your development process with the Intel® oneAPI DPC++/C++ Compiler's new LLVM sanitizers providing powerful tools to detect and troubleshoot device code issues with ease, ensuring cleaner and more reliable applications.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- The Intel® oneAPI DPC++/C++ Compiler enhances OpenMP standards conformance and performance optimizations for OpenMP 5.x and the new OpenMP 6.0 standards, featuring new interop properties, sync target no wait, free-agent threads/tasks, extended OMP_PLACES syntax, and advanced thread limit controls, all designed to boost application performance with unparalleled efficiency and flexibility.
- Enhanced compiler insights with Intel® oneAPI DPC++/C++ Compiler's significant upgrades to the opt-report capabilities, now offering a more user-friendly optimization report that includes OpenMP offloading details, integrates with the open-source optimization remark framework, and adds optimization report functionality to the SYCL and OpenMP runtimes as well as AOT compilation paths for a more comprehensive understanding of application performance improvements.
- Streamlined FPGA development with Intel® oneAPI DPC++/C++ Compiler's latest enhancements, featuring usability and performance improvements, flexible local buffer size configurations, seamless SPIR-V translation, and streamlined command options where -fintelfpga implies -qactypes and -fp-model=fast triggers aoc -vpfp-relaxed, simplifying the workflow for faster, more efficient FPGA application development
Intel® oneAPI DPC++ Library 2022.7.0
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use production released oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler
Intel® DPC++ Compatibility Tool 2025.0.0
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
Intel® oneAPI Math Kernel Library 2025.0.0
- Developers targeting Intel® Xeon® 6 Processors will benefit from the performance optimizations available on Intel(R) oneMKL 2025.0 across multiple domains as BLAS, LAPACK and FFT.
- HPC workloads using single precision 3d real in-place FFT will get significant improvements to execute on Intel® Data Center GPU Max Series.
- New distribution models and data types available for Random Number Generation (RNG) using SYCL device API.
- SYCL Discrete Fourier Transform API got easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, in special when targeting Intel GPUs.
- Sparse domain on SYCL API now supports sparse matrices using Coordinate Format (COO). This format is widely used for fast sparse matrices construction and it can be easily converted to other popular formats such as Compressed Sparse Row (CSR) and Compressed Sparse Column) CSC.
Intel® Distribution for GDB* 2025.0.0
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® VTune™ Profiler 2025.0.0
- Adds support for Intel® Core™ Ultra 200V, codenamed Lunar Lake, Intel® Core™ Ultra 200 "Arrow Lake-S" series and 6th gen. Intel® Xeon® Scalable Processors, codenamed Granite Rapids.
- Identify GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Identify and optimize device-side inefficiencies for Direct X APIs.
- Adds profiling support for Python 3.11. Improved productivity with the ability to focus Python profiling to only areas of interest and control performance data collection with ITT APIs.
- The Platform Profiler capabilities in Intel® VTune™ Profiler has released its final version, available as stand-alone download. No further feature improvements or security fixes will be available after this final release. The capabilities are now transitioned to the EMON command line interface. For more information, see the Intel® VTune™ Profiler – Platform Profiler transition notice.
Intel® Advisor 2025.0.0
- Intel Advisor 2025.0 expands its hardware support to include GNR, Intel's next-generation platform. Developers can now leverage Advisor's powerful analysis and optimization capabilities on the latest hardware.
- Intel Advisor 2025.0 introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
Intel® oneAPI Threading Building Blocks 2022.0.0
- Multi-threads apps run faster with oneTBB task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
Intel® Integrated Performance Primitives 2022.0.0
- Security First: Intel IPP now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Speed Up with Optimized Functions: ippiNormRel_L2_8u_C*MR now optimized for Intel's latest Granite & Sapphire Rapids platforms.
- Thread-Safe Excellence: Bug fixes on ippsWinKaiser_32f_I() increased its reliability in multi-threaded environments.
- Intel® Integrated Performance Primitives Cryptography (Intel® IPP Cryptography) is now Intel® Cryptography Primitives Library!
- Experience the power of dispatching on SRF, turbocharging RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instruction.
- Now built with the secure and powerful ICX compiler, our library is optimized for peak performance.
Intel® Cryptography Primitives Library 2025.0.0
- Intel® Integrated Performance Primitives Cryptography (Intel® IPP Cryptography) is now Intel® Cryptography Primitives Library!
- Experience the power of dispatching on SRF, turbocharging RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instruction.
- Now built with the secure and powerful ICX compiler, our library is optimized for peak performance.
Intel® oneAPI Collective Communications Library 2021.14.0
Intel® oneAPI Communications Library 2021.14.0 brings further optimizations that allow workloads to scale and perform better. Some of the key features include:
- Enhancements to oneCCL's Key-Value store that improves communication among ranks, allowing workloads to scale up to even larger number of nodes
- Performance improvements to key collectives such as Allgather, Allreduce and Reduce-scatter
- Even further optimizations for workloads on single-node CPU configurations
- Other lower level optimizations that improve speed and resource utilization
Intel® oneAPI Data Analytics Library 2025.0.0
- Intel® oneAPI Data Analytics Library 2025.0.0 enabled calculation of SHAP values for binary classification models required for explainability RF algorithms
- Intel® oneAPI Data Analytics Library 2025.0.0 enabled performance improvements for Decision Forest
Intel® oneAPI Deep Neural Networks Library 2025.0.0
Intel® oneAPI Deep Neural Network Library (oneDNN) 2025.0 introduces:
- Accelerated AI workloads: Experience significantly faster performance for Large Language Models and scaled dot product subgraphs.
- Optimized for Intel hardware: Benefit from oneDNN's tailored optimizations for Intel CPUs and GPUs, ensuring maximum efficiency and performance.
- Enhanced productivity: Save time and resources by leveraging oneDNN's cutting-edge acceleration technology for hardware to accelerate your AI development and deployment.
Deprecation Notices
- None
Toolkit Level Updates
- Get the most from the latest hardware with new Intel developer tools support for Intel® Xeon® 6 Processors with Performance-Cores (P-Cores), formerly codenamed Granite Rapids and Intel® Core™ Ultra processors (Series 2), formerly codenamed Lunar Lake.
- The Intel® oneAPI HPC Toolkit (HPC Kit) now offers two convenient subset bundles offering smaller downloads for specific developer use cases. Intel® C++ Essentials is for C++ developers is focused on compiling, debugging, and utilizing the most widely used Base Kit performance libraries for Intel CPUs and GPUs. Intel® Fortran Essentials is for Fortran developers focused on compiling, debugging, and utilizing the most widely used HPC Kit performance libraries for Intel CPUs and GPUs.
- The Intel® Fortran Compiler adds new F2023 Standard features, including the AT Edit Descriptor for cleaner output, enhanced string parsing with SPLIT and TOKENIZE functions, and improved numerical precision with upgraded IEEE_ARITHMETIC capabilities, all designed to optimize your coding efficiency and application performance.
- Maximize your Fortran application's parallel processing capabilities with our latest Intel® Fortran Compiler release, featuring OpenMP 6.0 enhancements such as conditional TEAMS construct execution with the new IF clause, flexible TARGET constructs with DEVICE_TYPE clauses, and enhanced device targeting and task affinity control with OpenMP 5.1 updates, all designed to give developers greater control and efficiency in high-performance computing environments.
- Unlock advanced parallelism in your Fortran applications with Intel® Fortran compiler's latest enhancements, now supporting arrays of coarrays and the ability to create allocatable arrays with coarray components, offering developers dynamic, high-performance data structures for sophisticated coarray programming.
- Achieve high scale out and scale up performance on Intel® Xeon® 6 Processors with Intel® MPI Library including P-core pinning for optimized balancing of asymmetric CPU topologies.
- Intel MPI Library now offers a full MPI 4.0 implementation including Partitioned Communication, Improved Error handling, and Fortran 2008 support
- New Intel MPI Library optimizations for MPI_Allreduce improve scale up and scale out performance for Intel GPUs
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler.
- Experience dynamic and flexible GPU programming with Intel® oneAPI DPC++/C++ Compiler's SYCL Bindless Textures support, utilizing textures at runtime without compile-time knowledge for improved performance and scalability in C++ with SYCL applications, alongside powerful new LLVM sanitizers to streamline development and ensure enhanced device code reliability.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- Leverage enhanced OpenMP standards support and performance enhancements with the Intel® oneAPI DPC++/C++ Compiler, including OpenMP 5.x and 6.0 features for increased efficiency and flexibility, complemented by upgraded compiler opt-report capabilities for in-depth performance insights and optimization feedback.
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices.
- Intel® Math Kernel Library (oneMKL)SYCL Discrete Fourier Transform API is easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, especially when targeting Intel GPUs.
- HPC workloads using oneMKL single precision 3D real in-place FFTs run faster on Intel® Data Center GPU Max Series.
- Multi-threads apps run faster with Intel® oneAPI Threading Building Blocks (oneTBB) task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
- Intel® Integrated Performance Primitives (Intel IPP) now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Use Intel® Cryptography Primitives Library to turbocharge RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Use Intel® Cryptography Primitives Library to dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instructions
- Intel® VTune™ Profiler now identifies GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Intel® VTune™ Profiler now identifies and optimizes device-side inefficiencies for Direct X APIs.
- Intel® Advisor introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
- Intel® oneAPI Deep Neural Network Library (oneDNN) dramatically boosts performance for Large Language Models and scaled dot-product subgraphs.
- Intel® oneAPI Communications Library (oneCCL) now includes optimizations that enable workloads to scale and perform even better than before. Important enhancements have been made to key collectives, and even more optimizations are now available on single-node CPU configurations.
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® oneAPI DPC++/C++ Compiler 2025.0
- New Features for standard support such as OpenMP 6.0, Fortran 2023 and SYCL 2020.
- Hardware enabling such as SPR, EMR, GNR, BMG, LNL and FS1 (e.g. cache hints and new data types for AI)
- Performance tuning for AI framework and HPC applications.
- Improvements such opt-report enhancement for better user experiences
- Bindless Textures Support implemented for Intel GPUs (DG2, Arc).
- Device code now supports sanitizers to help identify issues.
- New SYCL Offload model introduced with –offload-new-driver.
- Performance enhancements and early support for OpenMP 6.0 features.
- OpenMP loop rotation issue fixed
- New OpenMP 6.0 DEVICE_TYPE clause for TARGET construct
- Mandatory OpenMP offload support enabled
- Improved user experience with additional optimization report information for OpenMP offloading.
- SYCL now offers functionality comparable to CUDA textures for Intel Client GPUs.
- Advanced support for optional kernel features in Ahead-Of-Time (AOT) compilation mode.
- SYCL ABI has undergone breaking changes.
Intel® Fortran Compiler 2025.0
- New F2023 Standard features for the runtime library including the AT Edit Descriptor to trim trailing blanks before output, and SPLIT and TOKENIZE intrinsic functions for parsing character strings. IEEE_ARITHMETIC has been enhanced with F2023 IEEE_MAX, IEEE_MAX_MAG, IEEE_MIN, IEEE_MIN_MAG, along with F2023 behavior changes for IEEE_MAX_NUM, IEEE_MAX_NUM_MAG, IEEE_MIN_NUM and IEEE_MIN_NUM_MAG
- New OpenMP features in this release:
. OpenMP 6.0 An IF clause allowed on TEAMS construct
. OpenMP 6.0 A DEVICE_TYPE clause may now appear on a TARGET construct.
. OpenMP 5.1 The DEVICE_TYPE clause may appear on a DECLARE TARGET directive.
. OpenMp 5.1 AFFINITY clause is now permitted on a TASK directive.
- New work on Coarrays that include arrays of coarrays, a data object with a coarray component may now be an array and may be allocatable.
- The standard-semantics option is now sensitive to a standards conformance level set by the stand option. If no stand option is specified, standard semantics sets options to conform to Fortran 2018 standard behavior.
- The Intel® Fortran Developer Guide and Reference received extensive updates to remove older material, refresh existing examples, and added supported Fortran 2018 and 2023 Fortran Language features
Intel® MPI Library 2021.14.0
- GNR /SRF Tuning and optimizations for both scale out and scale up.
. GNR - Improved CPU pinning library for optimized balancing for asymmetric CPU topologies. (IPL2 becomes PoR)
- MPI 4.0 compliance- Support for Partition communication, Improved Error handling, Fortran 2008 support
- MPI_Allreduce scale-up and scale-out optimizations for Intel GPUs
- Intel GPU aware reduce optimizations
- Windows IFX support for MPI
- OFI and provider update to latest open-source versions
Deprecation Notice
- Intel® Fortran Compiler Classic (ifort) is now discontinued in oneAPI 2025 release.
- Get the most from the latest hardware with new Intel developer tools support for Intel® Xeon® 6 Processors with Performance-Cores (P-Cores), formerly codenamed Granite Rapids and Intel® Core™ Ultra processors (Series 2), formerly codenamed Lunar Lake.
- The Intel® oneAPI HPC Toolkit (HPC Kit) now offers two convenient subset bundles offering smaller downloads for specific developer use cases. Intel® C++ Essentials is for C++ developers is focused on compiling, debugging, and utilizing the most widely used Base Kit performance libraries for Intel CPUs and GPUs. Intel® Fortran Essentials is for Fortran developers focused on compiling, debugging, and utilizing the most widely used HPC Kit performance libraries for Intel CPUs and GPUs.
- The Intel® Fortran Compiler adds new F2023 Standard features, including the AT Edit Descriptor for cleaner output, enhanced string parsing with SPLIT and TOKENIZE functions, and improved numerical precision with upgraded IEEE_ARITHMETIC capabilities, all designed to optimize your coding efficiency and application performance.
- Maximize your Fortran application's parallel processing capabilities with our latest Intel® Fortran Compiler release, featuring OpenMP 6.0 enhancements such as conditional TEAMS construct execution with the new IF clause, flexible TARGET constructs with DEVICE_TYPE clauses, and enhanced device targeting and task affinity control with OpenMP 5.1 updates, all designed to give developers greater control and efficiency in high-performance computing environments.
- Unlock advanced parallelism in your Fortran applications with Intel® Fortran compiler's latest enhancements, now supporting arrays of coarrays and the ability to create allocatable arrays with coarray components, offering developers dynamic, high-performance data structures for sophisticated coarray programming.
- Achieve high scale out and scale up performance on Intel® Xeon® 6 Processors with Intel® MPI Library including P-core pinning for optimized balancing of asymmetric CPU topologies.
- Intel MPI Library now offers a full MPI 4.0 implementation including Partitioned Communication, Improved Error handling, and Fortran 2008 support
- New Intel MPI Library optimizations for MPI_Allreduce improve scale up and scale out performance for Intel GPUs
- ISO C++ parallel STL code runs on CPU and offloads to GPU using Intel® oneAPI DPC++/C++ compiler.
- Experience dynamic and flexible GPU programming with Intel® oneAPI DPC++/C++ Compiler's SYCL Bindless Textures support, utilizing textures at runtime without compile-time knowledge for improved performance and scalability in C++ with SYCL applications, alongside powerful new LLVM sanitizers to streamline development and ensure enhanced device code reliability.
- Maximize your application's efficiency with Intel oneAPI DPC++/C++ Compiler's performance optimization features, tailored for the latest Intel platforms including Intel® Xeon® 6 Processors and Intel® Core™ Ultra processors (Series 2), to deliver peak performance and cutting-edge computing experiences.
- Leverage enhanced OpenMP standards support and performance enhancements with the Intel® oneAPI DPC++/C++ Compiler, including OpenMP 5.x and 6.0 features for increased efficiency and flexibility, complemented by upgraded compiler opt-report capabilities for in-depth performance insights and optimization feedback.
- GPU kernels run faster with Intel® oneAPI DPC++ Library (oneDPL) improved performance by up-to 4X for algorithms including reduce, scan and many other functions.
- Use oneDPL Range-based algorithms with over 20 new C++20 standard ranges and views to accelerate on multiarchitecture devices.
- Intel® Math Kernel Library (oneMKL)SYCL Discrete Fourier Transform API is easier to use and to debug with key compilation messages added for type safety, reducing time to develop your application, especially when targeting Intel GPUs.
- HPC workloads using oneMKL single precision 3D real in-place FFTs run faster on Intel® Data Center GPU Max Series.
- Multi-threads apps run faster with Intel® oneAPI Threading Building Blocks (oneTBB) task_group, flow_graph and parallel_for_each improved scalibility
- Get result faster using oneTBB flow graph to process overlapping messages on a shared graph, waiting for a specific message using the new try_put_and_wait experimental API
- Intel® Integrated Performance Primitives (Intel IPP) now boasts CET-enabled protection, safeguarding your software against control-flow attacks and mitigates exploitation risks. Safeguard your software with cutting-edge, hardware-enforced security measures.
- Use Intel® Cryptography Primitives Library to turbocharge RSA encryption (2K, 3K, 4K) with multi-buffer capabilities—achieving up to 4x the speed of OpenSSL.
- Use Intel® Cryptography Primitives Library to dive into the future of hashing with our enhanced SM3 algorithm, now 5x faster thanks to the SM3_NI instructions
- Intel® VTune™ Profiler now identifies GPU-bound bottlenecks, optimize rendering pipelines, and improve overall application responsiveness for media and content creation applications on Intel® Core™ Ultra 200V, codenamed Lunar Lake.
- Intel® VTune™ Profiler now identifies and optimizes device-side inefficiencies for Direct X APIs.
- Intel® Advisor introduces a more adaptable kernel matching mechanism, enabling developers to identify and analyze code regions relevant to their specific optimization goals. The integration with the XCG app streamlines the process of offloading computation to GPUs, enhancing performance on Intel's latest hardware.
- Intel® oneAPI Deep Neural Network Library (oneDNN) dramatically boosts performance for Large Language Models and scaled dot-product subgraphs.
- Intel® oneAPI Communications Library (oneCCL) now includes optimizations that enable workloads to scale and perform even better than before. Important enhancements have been made to key collectives, and even more optimizations are now available on single-node CPU configurations.
- Save time with Intel® DPC++ Compatibility Tool to easily migrate your CUDA code and CMake build script to SYCL as demonstrated by auto migration of more APIs used by popular AI, HPC and rendering apps. The migrated code is easy to comprehend with SYCLcompat, easy to debug using CodePin, and runs performantly on Nvidia GPUs
- Free your imaging apps from vendor lock-in using Intel® DPC++ Compatibility Tool to migrate bindless textures APIs to SYCL image extension
- Intel® Distribution for GDB* adds support for Intel® Core™ Ultra processors (Series 2) on Windows* allowing developers to efficiently debug application code on these new CPUs and GPUs.
- Intel® Distribution for GDB* rebases to GDB* 15 staying current and aligned with the latest enhancements supporting effective application debug.
- Intel® Distribution for GDB* enhances the developer experience, both on the command line and when using Microsoft* Visual Studio and Visual Studio Code* by boosting the debugger performance and refining the user interface.
Intel® oneAPI DPC++/C++ Compiler 2025.0
- New Features for standard support such as OpenMP 6.0, Fortran 2023 and SYCL 2020.
- Hardware enabling such as SPR, EMR, GNR, BMG, LNL and FS1 (e.g. cache hints and new data types for AI)
- Performance tuning for AI framework and HPC applications.
- Improvements such opt-report enhancement for better user experiences
- Bindless Textures Support implemented for Intel GPUs (DG2, Arc).
- Device code now supports sanitizers to help identify issues.
- New SYCL Offload model introduced with –offload-new-driver.
- Performance enhancements and early support for OpenMP 6.0 features.
- OpenMP loop rotation issue fixed
- New OpenMP 6.0 DEVICE_TYPE clause for TARGET construct
- Mandatory OpenMP offload support enabled
- Improved user experience with additional optimization report information for OpenMP offloading.
- SYCL now offers functionality comparable to CUDA textures for Intel Client GPUs.
- Advanced support for optional kernel features in Ahead-Of-Time (AOT) compilation mode.
- SYCL ABI has undergone breaking changes.
Intel® Fortran Compiler 2025.0
- New F2023 Standard features for the runtime library including the AT Edit Descriptor to trim trailing blanks before output, and SPLIT and TOKENIZE intrinsic functions for parsing character strings. IEEE_ARITHMETIC has been enhanced with F2023 IEEE_MAX, IEEE_MAX_MAG, IEEE_MIN, IEEE_MIN_MAG, along with F2023 behavior changes for IEEE_MAX_NUM, IEEE_MAX_NUM_MAG, IEEE_MIN_NUM and IEEE_MIN_NUM_MAG
- New OpenMP features in this release:
. OpenMP 6.0 An IF clause allowed on TEAMS construct
. OpenMP 6.0 A DEVICE_TYPE clause may now appear on a TARGET construct.
. OpenMP 5.1 The DEVICE_TYPE clause may appear on a DECLARE TARGET directive.
. OpenMp 5.1 AFFINITY clause is now permitted on a TASK directive.
- New work on Coarrays that include arrays of coarrays, a data object with a coarray component may now be an array and may be allocatable.
- The standard-semantics option is now sensitive to a standards conformance level set by the stand option. If no stand option is specified, standard semantics sets options to conform to Fortran 2018 standard behavior.
- The Intel® Fortran Developer Guide and Reference received extensive updates to remove older material, refresh existing examples, and added supported Fortran 2018 and 2023 Fortran Language features
Intel® MPI Library 2021.14.0
- GNR /SRF Tuning and optimizations for both scale out and scale up.
. GNR - Improved CPU pinning library for optimized balancing for asymmetric CPU topologies. (IPL2 becomes PoR)
- MPI 4.0 compliance- Support for Partition communication, Improved Error handling, Fortran 2008 support
- MPI_Allreduce scale-up and scale-out optimizations for Intel GPUs
- Intel GPU aware reduce optimizations
- Windows IFX support for MPI
- OFI and provider update to latest open-source versions
Deprecation Notice
- Intel® Fortran Compiler Classic (ifort) is now discontinued in oneAPI 2025 release.
The Intel oneAPI Base Toolkit is a core set of tools and libraries for developing high-performance, data-centric applications across diverse architectures. It features an industry-leading C++ compiler and the Data Parallel C++ (DPC++) language, an evolution of C++ for heterogeneous computing. Domain-specific libraries and the Intel Distribution for Python provide drop-in acceleration across relevant architectures. Enhanced profiling, design assistance, and debug tools complete the kit. High-performance computing (HPC) is at the core of artificial intelligence, machine learning, and deep learning applications. The Intel oneAPI HPC Toolkit delivers what developers need to build, analyze, optimize, and scale HPC applications with the latest techniques in vectorization, multithreading, multi-node parallelization, and memory optimization. Intel oneAPI HPC Toolkit is an add-on to the Intel oneAPI Base Toolkit, which is required for full functionality. It also includes access to the Intel Distribution for Python, the Intel oneAPI DPC++/C++ Compiler, powerful data-centric libraries, and advanced analysis tools.
Intel oneAPI Base Toolkit
What is the Intel oneAPI HPC Toolkit?
Intel is a world leader in computing innovation. The company designs and builds the essential technologies that serve as the foundation for the world's computing devices. As a leader in corporate responsibility and sustainability, Intel also manufactures the world's first commercially available "conflict-free" microprocessors.
Owner: Intel
Product Name: oneAPI Base & HPC Toolkit
Version: 2025.0.0.0
Supported Architectures: x64
Website Home Page : https://software.intel.com/
Languages Supported: english
System Requirements: Windows & Linux *
Size: 17.4 Gb
Please visit my blog
Added by 3% of the overall size of the archive of information for the restoration
No mirrors please
Added by 3% of the overall size of the archive of information for the restoration
No mirrors please