Introducing PyTorch Profiler – the new and improved performance tool
PyTorch
“Analyzing and improving large-scale deep learning model performance is an ongoing challenge that grows in importance as the model sizes increase. For a long time, PyTorch users had a hard time solving this challenge due to the lack of available tools. There were standard performance debugging tools that provide GPU hardware level information but missed PyTorch-specific context of operations. In order to recover missed information, users needed to combine multiple tools together or manually add minimum correlation information to make sense of the data. There was also the autograd profiler (torch.autograd.profiler
) which can capture information about PyTorch operations but does not capture detailed GPU hardware-level information and cannot provide support for visualization…”
Source: pytorch.org/blog/introducing-pytorch-profiler-the-new-and-improved-performance-tool/