Hi All - I'm working on an app for detecting bird songs. The detection part itself is straightforward as there's another project that actually provides the analysis engine. I'm providing recording, storage, and UI.
Anyhow, I'm displaying spectrograms for the birdsongs with the create_spectrogram function found in here:
This function came almost entirely from Googling and asking ChatGPT.
For small images this works fine, but larger ones come out pretty low resolution. This may be due at least in part to the input audio being sampled at 16 kHz (blame my cheap security camera that's providing the audio).
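To put rough numbers on that, here's a minimal sketch of the arithmetic behind a spectrogram's grid size. It's plain Python and isn't taken from my create_spectrogram function. The helper name `spectrogram_grid` and the 3-second clip length are made up for illustration, and the parameter names just mirror the `nperseg` / `noverlap` / `nfft` arguments of `scipy.signal.spectrogram`, assuming no padding at the clip edges.

```python
def spectrogram_grid(n_samples, sample_rate, nperseg, noverlap, nfft=None):
    """Estimate how many time/frequency cells an STFT spectrogram contains.

    Hypothetical helper for illustration only; assumes no edge padding.
    """
    if nfft is None:
        nfft = nperseg
    hop = nperseg - noverlap  # samples between successive frames
    if hop <= 0:
        raise ValueError("noverlap must be smaller than nperseg")
    if n_samples < nperseg:
        n_frames = 0
    else:
        n_frames = 1 + (n_samples - nperseg) // hop
    return {
        "time_frames": n_frames,             # columns in the image
        "freq_bins": nfft // 2 + 1,          # rows in the image
        "max_freq_hz": sample_rate / 2,      # Nyquist limit
        "hz_per_bin": sample_rate / nfft,    # spacing between rows
        "seconds_per_frame": hop / sample_rate,
    }


# Assumed example: a 3-second clip at 16 kHz (48000 samples).
coarse = spectrogram_grid(48000, 16000, nperseg=256, noverlap=32)
fine = spectrogram_grid(48000, 16000, nperseg=512, noverlap=480, nfft=2048)

print(coarse["time_frames"], coarse["freq_bins"])  # 214 129
print(fine["time_frames"], fine["freq_bins"])      # 1485 1025
```

With the coarse settings there are only 214 x 129 cells to stretch over a large image, which would explain the blocky look. More overlap adds columns, and a larger `nfft` adds rows, though zero-padding (`nfft` larger than `nperseg`) only interpolates between frequencies rather than adding real detail. Either way, nothing above 8 kHz can show up with 16 kHz audio.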
I've tried torturing ChatGPT into telling me how to make it higher res, and it had some suggestions, but none seemed to make a huge difference.
Suggestions for improvement would be greatly appreciated. I've attached a sample input file and resulting PNG.