STFT transformation and windowing issue

dlawler · September 19, 2023, 9:18pm

Hello!

I’m currently trying to get an STFT class working with shy_fft as its FFT backend. The class can be seen here

Currently with an fft_size of 512, and a buffer_size of 680 (fft size + hop size), everything works pretty well.

The issue comes when I try to window the signal (lines 109 and 133, respectively). Any window type or size combination I’ve applied to the signal seems to create feedback, choppiness, or static on the output signal.

Does anyone have any tips on correctly processing a window here?

Main code can also be found here

Any tips are greatly appreciated! Please note I am by no means an expert or student, just a hobbyist trying to learn stft!
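
One thing worth checking when windowing an overlap-add STFT is whether the overlapped windows sum to a constant at the chosen hop size. The sketch below is not taken from the gist. `MakeHann` and `OverlapRipple` are hypothetical helper names, and the hop of 168 is only inferred from the numbers above (680 - 512). It measures how much the summed window gain varies across one hop. This may not be the cause of the artifacts described here, but it is a cheap check to rule out.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Periodic Hann window: w[n] = 0.5 - 0.5 * cos(2*pi*n / N).
std::vector<double> MakeHann(std::size_t n_fft)
{
    const double two_pi = 2.0 * std::acos(-1.0);
    std::vector<double> w(n_fft);
    for (std::size_t n = 0; n < n_fft; ++n)
    {
        w[n] = 0.5 - 0.5 * std::cos(two_pi * static_cast<double>(n)
                                    / static_cast<double>(n_fft));
    }
    return w;
}

// Steady-state overlap-add gain ripple (max - min of the summed windows)
// for a given hop size. A result near zero means the overlapped windows
// add up to a constant, so windowing alone does not modulate the level.
double OverlapRipple(const std::vector<double>& w, std::size_t hop)
{
    double lo = 0.0;
    double hi = 0.0;
    for (std::size_t n = 0; n < hop; ++n)
    {
        // Sum every window sample that lands on output position n (mod hop).
        double sum = 0.0;
        for (std::size_t i = n; i < w.size(); i += hop)
        {
            sum += w[i];
        }
        if (n == 0 || sum < lo) lo = sum;
        if (n == 0 || sum > hi) hi = sum;
    }
    return hi - lo;
}
```

Calling `OverlapRipple(MakeHann(512), 256)` or using a hop of 128 should give a ripple of zero up to rounding, while a hop of 168 leaves a small nonzero ripple. If the window is applied both before the FFT and after the inverse FFT, the same check would need to be run on the squared window instead.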

tele_player · September 19, 2023, 9:32pm

The link for the class doesn’t work for me.

tunagenes · September 19, 2023, 9:43pm

This is just a shot in the dark, but it could be that the windowing code adds enough CPU cycles that you run out. You could try reducing the FFT or buffer size and see if that makes the problem go away.

I also could not see the class at first. The link
https://gist.github.com/dlawle/20791fac1d58208a25ed4615dee53b10]
gets a ‘404’. Removing the trailing square bracket works:

https://gist.github.com/dlawle/20791fac1d58208a25ed4615dee53b10

Updated-NaiveSTFT.h

#pragma once 
#ifndef NAIVE_STFT_H_
#define NAIVE_STFT_H_

#include "shy_fft.h"

template <size_t FFT_SIZE, size_t BUFFER_SIZE>
class naive_stft
{
public:

This file has been truncated.

dlawler · September 19, 2023, 10:09pm

Link should be fixed!

I haven’t tried FFT sizes much lower than 256-512, but that’s absolutely a place to start!

tele_player · September 19, 2023, 10:35pm

I think the underlying problem is that the Process() function can take enough time to cause buffer underrun on the output. It’s risky to put anything in the callback whose timing isn’t very well understood.
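
To make the timing concern concrete, here is a minimal sketch of how one might compare the cost of a processing step against the time available per audio block. The names `BlockBudgetUs` and `MeasureUs` are hypothetical, and the block size and sample rate in the usage note are assumptions, since the thread does not state them. `std::chrono` is used so the sketch runs on a desktop; on the embedded target a hardware timer would take its place.

```cpp
#include <chrono>
#include <cstddef>

// Time available to process one audio block, in microseconds.
// If the callback takes longer than this, the output underruns.
double BlockBudgetUs(std::size_t block_size, double sample_rate_hz)
{
    return 1e6 * static_cast<double>(block_size) / sample_rate_hz;
}

// Measures how long a single call to fn() takes, in microseconds.
template <typename Fn>
double MeasureUs(Fn&& fn)
{
    const auto start = std::chrono::steady_clock::now();
    fn();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::micro>(stop - start).count();
}
```

For example, with an assumed block of 48 samples at 48 kHz, `BlockBudgetUs(48, 48000.0)` gives a budget of 1000 microseconds. If `MeasureUs` on the STFT step, windowing included, comes close to or exceeds that figure, the step does not fit inside the callback.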

dlawler · September 20, 2023, 8:38am

Do you have any suggestions on the best way to handle this? I know we spoke about it previously on Discord, but when I moved it out of the callback, it actually stopped functioning. Most likely the way I implemented it was incorrect, though.

EDIT AGAIN:


    // Function to notify that STFT processing is ready
    void NotifySTFTReady()
    {
        stft_ready_ = true;
    }

    // Function to check if STFT processing is ready
    bool IsSTFTReady()
    {
        return stft_ready_;
    }

    // Function to reset the STFT ready flag
    void ResetSTFTReady()
    {
        stft_ready_ = false;
    }

Through some weird trickery with flipping bools, I now have the processing done in the while loop, outside of the audio callback. I haven’t tested anything too heavy or latency-sensitive yet, and have yet to apply or test windows, but I’ll look into that and report back!
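
The post does not show how `stft_ready_` is declared. A plain `bool` shared between the audio callback and the main loop can be cached or reordered by the compiler, so a variant using `std::atomic<bool>` may be safer. The sketch below is an assumption-laden illustration, not the author's code: `StftReadyFlag`, `Notify`, and `Consume` are hypothetical names.

```cpp
#include <atomic>

// Hypothetical stand-in for the three flag functions above,
// built on std::atomic<bool> instead of a plain bool.
class StftReadyFlag
{
  public:
    // Called from the audio callback once a full frame has been collected.
    void Notify() { ready_.store(true, std::memory_order_release); }

    // Called from the main loop. Returns true at most once per
    // notification, and clears the flag in the same atomic step.
    bool Consume() { return ready_.exchange(false, std::memory_order_acq_rel); }

  private:
    std::atomic<bool> ready_{false};
};
```

In the main loop this would be used as `if (flag.Consume()) { /* run the STFT */ }`. Combining the check and the reset into one call avoids losing a notification that arrives between a separate `IsSTFTReady()` and `ResetSTFTReady()`. The flag only signals that data is ready; the frame itself still needs its own buffer that the callback does not overwrite while the main loop is processing it.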