NVIDIA Announces TensorRT 8.2 and Integrations with PyTorch and TensorFlow

“Today NVIDIA released TensorRT 8.2, with optimizations for billion parameter NLU models. These include T5 and GPT-2, used for translation and text generation, making it possible to run NLU apps in real time.

TensorRT is a high-performance deep learning inference optimizer and runtime that delivers low latency, high-throughput inference for AI applications. TensorRT is used across several industries including healthcare, automotive, manufacturing, internet/telecom services, financial services, and energy.

PyTorch and TensorFlow are the most popular deep learning frameworks having millions of users. The new TensorRT framework integrations now provide a simple API in PyTorch and TensorFlow with powerful FP16 and INT8 optimizations to accelerate inference by up to 6x…”

Source: developer.nvidia.com/blog/nvidia-announces-tensorrt-8-2-and-integrations-with-pytorch-and-tensorflow/

December 6, 2021

0 Comments

Inline Feedbacks

View all comments

Request a Quote

Log In

NVIDIA Announces TensorRT 8.2 and Integrations with PyTorch and TensorFlow

NVIDIA Announces TensorRT 8.2 and Integrations with PyTorch and TensorFlow

NVIDIA Announces TensorRT 8.2 and Integrations with PyTorch and TensorFlow | NVIDIA Developer Blog