Nsight Compute can't profile WaveGlow (PyTorch application)


I am trying to profile the application with the following command:

nv-nsight-cu-cli --export ./nsight_output ~/.virtualenvs/waveglow/bin/python3 inference.py -f <(ls mel_spectrograms/*.pt) -w waveglow_256channels.pt -o . --is_fp16 -s 0.6
OS: CentOS Linux 7, Nsight Compute: 2019.3.1 (build 26317742), GPU: Tesla V100-PCIE-32GB


How can I fix this?

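I don't think anything is wrong here; the tool is behaving as expected. It is not profiling just one kernel: according to the log output, it has already profiled 296 kernel launches (all apparently from the same kernel function).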

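You can control how many kernels, and which kernels, are profiled with the --launch-count and --kernel-regex options. You can also use --metrics and --section to control which metrics are collected for each kernel, since collecting fewer metrics reduces the tool's overhead.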
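As a rough sketch of how these options could be combined with the command from the question: the launch count of 5, the regex prefix (taken from the truncated kernel name in the log), and the SpeedOfLight section identifier are illustrative assumptions, not values from the original answer. The section names available on a given install should be checked before relying on this.

```shell
#!/usr/bin/env bash
# Profiler options. The values below are illustrative assumptions.
NCU_OPTS=(
  --export ./nsight_output
  --kernel-regex 'weight_norm_fwd.*'   # only profile kernels whose name matches
  --launch-count 5                     # stop after 5 profiled launches
  --section SpeedOfLight               # assumed section identifier; collects fewer metrics
)

# Application command line, as given in the question.
APP_CMD=(
  "$HOME/.virtualenvs/waveglow/bin/python3" inference.py
  -w waveglow_256channels.pt -o . --is_fp16 -s 0.6
)

# Run only if the profiler is installed; otherwise just show the options.
if command -v nv-nsight-cu-cli >/dev/null 2>&1; then
  nv-nsight-cu-cli "${NCU_OPTS[@]}" "${APP_CMD[@]}" -f <(ls mel_spectrograms/*.pt)
else
  echo "nv-nsight-cu-cli not found; options would be: ${NCU_OPTS[*]}"
fi
```

With a filter and a launch limit in place, the run should finish after a handful of profiled launches instead of replaying every one of the hundreds of launches, so there is less reason to interrupt it with Ctrl+C.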


For more of the available command-line options, see the Nsight Compute CLI documentation.


Profiler log output referred to above:
...
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 286: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 287: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 288: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 289: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 290: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 291: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 292: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 293: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 294: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 295: 0%....50%....100% - 48 passes
==PROF== Profiling "weight_norm_fwd_first_dim_ker..." - 296: 0%....50%...^C
==PROF== Received signal, trying to shutdown target application
 - 43 passes
==ERROR== Failed to profile kernel "weight_norm_fwd_first_dim_ker..." in process
==ERROR== An error occurred while trying to profile.
==ERROR== An error occurred while trying to profile
==PROF== Report: nsight_compute_result.nsight-cuprof-report