Exporting the operator stft to ONNX opset version 9 is not supported. #982

lawlict · 2020-10-23T13:36:54Z

Hi, I try exporting the process of feature extraction to onnx:

import torch
import torchaudio

model = torchaudio.transforms.MelSpectrogram()
x = torch.randn(1, 16000)
torch.onnx.export(model, x, 'tmp.onnx', input_names=['input'], output_names=['output'])

and get:

RuntimeError: Exporting the operator stft to ONNX opset version 9 is not supported. Please open a bug to request ONNX export support for the missing operator.

So will torchaudio add supports operators used in torchaudio.transforms module in the future? You see that exporting the process of feature extraction and the neural network together will be very convenient.

Thanks!

The text was updated successfully, but these errors were encountered:

vincentqb · 2020-10-24T00:18:01Z

Thanks for opening an issue for this :) This is an issue upstream with pytorch directly, see pytorch/pytorch#31317. @houseroad, do you know the status of STFT support with ONNX?

We can keep this issue open since this does affect torchaudio.

mthrok · 2020-10-26T17:56:35Z

Hi @lawlict

We are hearing the increased number of support requests for stft/istft in ONNX. The core of the problem is stft/istft not defined in ONNX. So we passed down the user voices to the proper channel to see what we can do. At this moment, we do not have a concrete action plan. Meanwhile one workaround you can do is to mock the stft function in a similar way as pytorch/pytorch#31317 (comment) and patch the call to torch.stft.

Let us know if you have a proposal for a way to resolve this.

lawlict · 2020-10-27T02:10:00Z

Hi @mthrok
Thank you for your nice help, and I will have a try on pytorch/pytorch#31317 (comment) later. So should I close the issue now?

mthrok · 2020-10-28T20:28:37Z

@lawlict

Let's keep the issue. This is a very unique issue and torchaudio does not have an official stance on this one, so until we hear the update and decide the stance we can keep this issue open and use it as a point of contact.

faroit · 2020-10-28T21:03:29Z

@lawlict just as a quick fix you can try nnaudio, torch-stft or asteroids stftfb

lawlict · 2020-11-07T15:26:37Z

Frustrating news is that the speed becomes much slower when replacing FFT with Fourier matrix...

mthrok · 2020-11-07T15:39:15Z

Sorry to hear that. I'll follow up with the team.

mthrok · 2020-11-12T01:06:43Z

So I was told that there is no plan to add stft/istft functions to ONNX at the moment.
The best way would be to write a proposal following their guide
This seems to require significant amount of efforts.
Due to my workload, I cannot take an initiative on this one though I think this is very valuable work.
We can do some research on fulfilling some of the requirements for the spec. (but it might be more appropriate to do so in their issue thread for better visibility.)

chenjiasheng · 2022-04-13T03:09:02Z

Any update fow now?

NiziL · 2022-05-18T12:54:45Z

@chenjiasheng There is actually some good news around : pytorch/pytorch#65666 and onnx/onnx#3741

averkij · 2022-05-24T07:57:50Z

PR is finally merged. onnx/onnx#3741

stonelazy · 2022-06-24T06:47:49Z

These signal processing operators (STFT/FFT/IFFT) are supported in opset 17.
I was trying to export the torch module to ONNX, but I was thrown an error. Does torch need to support this even after MR is being merged in ONNX ?

import torch
import torch.nn as nn

class FeaturizeTorchFFT(nn.Module):

    def __init__(self):
        super().__init__()

    def forward(self, x):
        return torch.stft(
            input=x,
            n_fft=320,
            hop_length=160,
            win_length=320,
            window=torch.ones(320),
            center=False,
            pad_mode="reflect",
            normalized=False,
            onesided=True,
        )

self.featurizer_model = FeaturizeTorchFFT()
torch.onnx.export(
            self.featurizer_model, 
            self.sample_input, 
            # 're_onnx_model.onnx',
            str(self.output_path),
            export_params=True,
            opset_version=17,
            do_constant_folding=True,
            input_names=["x"],
            output_names=["output"],
            dynamic_axes={
                "x": {0: "batch_size", 1: "audio_length"},
            },
        )

  File "/home/kp/miniconda3/envs/gamd6-kp2/lib/python3.8/site-packages/torch/onnx/symbolic_helper.py", line 853, in _set_opset_version
    raise ValueError("Unsupported ONNX opset version: " + str(opset_version))
ValueError: Unsupported ONNX opset version: 17

averkij · 2022-06-24T07:07:45Z

Operators are supported in opset 17. But opset 17 itself is not supported by PyTorch yet.

stonelazy · 2022-06-24T07:23:24Z

Operators are supported in opset 17. But opset 17 itself is not supported by PyTorch yet.

Thanks for the quick help, if possible could you actually show the direction on how I can actually test the working of these operators standalone (without torch) ? Am jst trying to ensure the output of torch.fft is the same as the one from onnx

mravanelli · 2022-09-10T01:52:24Z

Hi, is there any news on this issue? This feature is going to be important for the SpeechBrain project. Some of our contributors are working on exporting our speech processing pipelines into a microcontroller. This requires the model to be ONNX exportable/importable (@fpaissan). The main issue is that all the feature extraction relies on fft and stft.

mthrok · 2022-09-10T13:32:00Z

This is PyTorch (MSFT) issue rather than TorchAudio.
Please check and cast voice at pytorch/pytorch#81075
It seems the support for STFT is in motion.

njb · 2022-11-22T19:08:12Z

While we wait for native PyTorch integration, folks can checkout https://github.com/adobe-research/convmelspec which will implement (Mel)spectrograms via 1D conv layer to both ONNX and CoreML (and as a second option CoreML MIL ops).

Xavier-i · 2022-11-28T03:34:03Z

While we wait for native PyTorch integration, folks can checkout https://github.com/adobe-research/convmelspec which will implement (Mel)spectrograms via 1D conv layer to both ONNX and CoreML (and as a second option CoreML MIL ops).

very cool solution, you guys should publish it to pypi!

Closes pytorch#771. Fix vague description on batch size calculation in imagenet.

wrz1999 · 2024-01-06T09:08:18Z

May I ask if PSD and SoudenMVDR in torchaudio support conversion to onnx or libtorch?

lawlict closed this as completed Oct 27, 2020

lawlict reopened this Oct 27, 2020

mravanelli mentioned this issue Aug 1, 2022

Exporting the operator stft to ONNX opset version 9 is not supported speechbrain/speechbrain#1455

Closed

alecgunny mentioned this issue Sep 27, 2022

Highpass filtering preprocessing step during training ML4GW/aframe#158

Closed

mpc001 pushed a commit to mpc001/audio that referenced this issue Aug 4, 2023

Merge pull request pytorch#982 from yzs981130/fix_batch_size_comment

8016876

Closes pytorch#771. Fix vague description on batch size calculation in imagenet.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exporting the operator stft to ONNX opset version 9 is not supported. #982

Exporting the operator stft to ONNX opset version 9 is not supported. #982

lawlict commented Oct 23, 2020

vincentqb commented Oct 24, 2020 •

edited

Loading

mthrok commented Oct 26, 2020

lawlict commented Oct 27, 2020

mthrok commented Oct 28, 2020

faroit commented Oct 28, 2020 •

edited

Loading

lawlict commented Nov 7, 2020

mthrok commented Nov 7, 2020

mthrok commented Nov 12, 2020

chenjiasheng commented Apr 13, 2022

NiziL commented May 18, 2022

averkij commented May 24, 2022

stonelazy commented Jun 24, 2022

averkij commented Jun 24, 2022

stonelazy commented Jun 24, 2022

mravanelli commented Sep 10, 2022

mthrok commented Sep 10, 2022

njb commented Nov 22, 2022

Xavier-i commented Nov 28, 2022

wrz1999 commented Jan 6, 2024

Exporting the operator stft to ONNX opset version 9 is not supported. #982

Exporting the operator stft to ONNX opset version 9 is not supported. #982

Comments

lawlict commented Oct 23, 2020

vincentqb commented Oct 24, 2020 • edited Loading

mthrok commented Oct 26, 2020

lawlict commented Oct 27, 2020

mthrok commented Oct 28, 2020

faroit commented Oct 28, 2020 • edited Loading

lawlict commented Nov 7, 2020

mthrok commented Nov 7, 2020

mthrok commented Nov 12, 2020

chenjiasheng commented Apr 13, 2022

NiziL commented May 18, 2022

averkij commented May 24, 2022

stonelazy commented Jun 24, 2022

averkij commented Jun 24, 2022

stonelazy commented Jun 24, 2022

mravanelli commented Sep 10, 2022

mthrok commented Sep 10, 2022

njb commented Nov 22, 2022

Xavier-i commented Nov 28, 2022

wrz1999 commented Jan 6, 2024

vincentqb commented Oct 24, 2020 •

edited

Loading

faroit commented Oct 28, 2020 •

edited

Loading