Interested in Idea "16 Support Audio IO module"


노성훈

Mar 3, 2020, 6:33:02 PM
to opencv-gsoc-2020
Hello everyone,

My name is Seonghun Noh. I'm a third-year Electronic Engineering student at Sogang University.

Currently, in my research internship, I'm working on a project that processes audio and video signals simultaneously.

In that project, I use OpenCV for video I/O and RTAudio for audio I/O.

I was wondering why audio I/O is not possible in OpenCV.

That's why I'm interested in the idea "Support Audio IO module".

I also have experience with deep learning inference on both audio and video,

so I can write some tutorials on audio-video multimodal approaches using the Audio IO module in OpenCV if needed.

By the way, I've always used OpenCV for my projects. Now I'm glad to have the chance not only to use it but also to contribute to it.

Looking forward to an answer from a mentor.


jzi0...@naver.com

Mar 4, 2020, 4:08:35 AM
to opencv-gsoc-2020

nes...@gmail.com

Mar 5, 2020, 10:21:58 AM
to opencv-gsoc-2020
Hello, Seonghun. My name is Alex Nesterov, and I will be mentoring the audio support task. If you have any questions, you can ask me.

Seonghun Noh

Mar 9, 2020, 6:26:58 AM
to opencv-gsoc-2020
Hello, Alex Nesterov!
Thanks for your answer.
I have some questions about the task.
  • Is it okay to use an RTAudio-based approach in this task?
  • Should I also add signal processing algorithms such as STFT, FFT, and window functions that are used in DNNs for audio?
Best regards,
Seonghun Noh
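
For readers unfamiliar with the algorithms named in the second question: a short-time Fourier transform (STFT) multiplies overlapping frames of a signal by a window function and takes a Fourier transform of each frame. The following is a minimal sketch in plain Python, assuming a periodic Hann window and a naive O(N^2) DFT in place of a real FFT. The names `hann`, `dft`, and `stft` are illustrative only and are not part of OpenCV or RTAudio.

```python
import cmath
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def dft(frame):
    """Naive O(N^2) discrete Fourier transform (stand-in for an FFT)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def stft(signal, frame_len, hop):
    """Return one magnitude spectrum (bins 0..frame_len/2) per frame."""
    window = hann(frame_len)
    spectra = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        # Apply the window to the current frame before transforming it.
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        spectrum = dft(frame)
        spectra.append([abs(c) for c in spectrum[: frame_len // 2 + 1]])
    return spectra


# A sine that completes exactly 4 cycles per 32-sample frame.
tone = [math.sin(2.0 * math.pi * 4 * t / 32) for t in range(128)]
frames = stft(tone, frame_len=32, hop=16)

# Index of the strongest frequency bin in each frame.
peak_bins = [max(range(len(f)), key=f.__getitem__) for f in frames]
```

Each entry of `peak_bins` is the dominant frequency bin of one frame. A production implementation would normally rely on an optimized FFT routine rather than the naive DFT used here.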

Alexander Nesterov

Mar 10, 2020, 6:50:12 AM
to opencv-gsoc-2020
We plan to add audio support by changing the videoio pipelines (for example, GStreamer). The next step is to try it with the DNN module. Have you worked with GStreamer, FFmpeg, MSMF, or other media pipelines?

도레미파솔라시노

Mar 12, 2020, 1:09:39 AM
to opencv-gsoc-2020
I have worked with RTAudio (a set of C++ classes that provide a common API for realtime audio input/output): https://github.com/thestk/rtaudio.
I used RTAudio in C++ code and applied it to my cross-platform project.
Sometimes, when RTAudio didn't work well, I consulted the low-level API references, such as WASAPI (Windows) and ALSA (Linux).
I've used RTAudio in C++ projects for
  • realtime resampling
  • reading headers
  • playing music on a device
  • recording from a device

I also used a compiled build of ffmpeg for
  • sending webcam data to a streaming server
  • encoding and decoding video
  • saving video from a realtime webcam
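
As an illustration of the "reading headers" item, here is a minimal sketch that assumes the headers in question are audio file headers such as WAV (the post does not say which format was used). It uses Python's standard `wave` module rather than RTAudio or ffmpeg, writes a tiny WAV file into memory, and reads its header fields back. The dictionary keys are arbitrary names chosen for this example.

```python
import io
import wave

# Build a tiny in-memory WAV file: mono, 16-bit, 16 kHz, 160 silent frames.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as writer:
    writer.setnchannels(1)
    writer.setsampwidth(2)       # bytes per sample
    writer.setframerate(16000)   # samples per second
    writer.writeframes(b"\x00\x00" * 160)

# Rewind and read the header fields back.
buffer.seek(0)
with wave.open(buffer, "rb") as reader:
    header = {
        "channels": reader.getnchannels(),
        "sample_width_bytes": reader.getsampwidth(),
        "sample_rate": reader.getframerate(),
        "frames": reader.getnframes(),
    }

# Duration in seconds follows from the frame count and the sample rate.
header["duration_s"] = header["frames"] / header["sample_rate"]
```

A capture backend needs the same fields (channel count, sample width, sample rate) to interpret raw audio buffers correctly.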

Alexander Nesterov

Mar 16, 2020, 7:47:39 AM
to opencv-gsoc-2020
Hello, you can explore the GStreamer pipeline in the videoio module, because we would like to change it for audio support. I advise writing your proposal based on these changes, or proposing your own idea for changing the existing videoio functionality.

Seonghun Noh

Mar 23, 2020, 3:45:17 AM
to opencv-gsoc-2020
Hello, I just submitted a proposal draft. Could you give me some feedback?

Alexander Nesterov

Mar 23, 2020, 3:31:51 PM
to opencv-gsoc-2020
Hello! I'm going to try to review it tomorrow, thank you.

Alexander Nesterov

Mar 24, 2020, 3:19:41 PM
to opencv-gsoc-2020
Hello, I reviewed your proposal and I have a few questions:
1) What did you mean by the time interval (June 19 - Jul 13)? I didn't quite understand it.
2) I think it is better to create a new data structure (e.g. MediaCapture) based on VideoCapture and not change the current implementation of VideoCapture (as shown in the scheme).
3) The time intervals may need to be explained in more detail.

Thank you)
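
The design suggested in point 2 can be sketched as follows: a new class built on top of the existing capture class adds audio, while the existing class stays unmodified. This is only an illustration under stated assumptions. `VideoCaptureStub` is a stand-in written for this example and is not `cv2.VideoCapture`, and the `MediaCapture` interface shown here (including `read_audio`) is hypothetical, since the thread does not describe the actual API.

```python
class VideoCaptureStub:
    """Stand-in for an existing video-only capture class (not the OpenCV one)."""

    def __init__(self, frames):
        self._frames = list(frames)
        self._pos = 0

    def read(self):
        """Return (ok, frame); ok is False once the frames are exhausted."""
        if self._pos >= len(self._frames):
            return False, None
        frame = self._frames[self._pos]
        self._pos += 1
        return True, frame


class MediaCapture(VideoCaptureStub):
    """Hypothetical extension that adds audio without touching the base class."""

    def __init__(self, frames, audio_chunks):
        super().__init__(frames)
        self._audio = list(audio_chunks)
        self._audio_pos = 0

    def read_audio(self):
        """Return (ok, chunk); ok is False once the audio is exhausted."""
        if self._audio_pos >= len(self._audio):
            return False, None
        chunk = self._audio[self._audio_pos]
        self._audio_pos += 1
        return True, chunk


# Existing video-only code keeps working through the inherited read().
cap = MediaCapture(frames=["frame0", "frame1"], audio_chunks=[[0.1, 0.2]])
ok_video, frame = cap.read()
ok_audio, chunk = cap.read_audio()
```

Because `MediaCapture` only adds methods, code written against the video-only interface is unaffected, which is the point of not changing the existing class.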

Seonghun Noh

Mar 25, 2020, 11:10:09 AM
to opencv-gsoc-2020

Hi, thanks for your advice!

1) That was just sample data. I've changed it in the new draft.
2) I see. I agree with your approach and have changed my proposal accordingly.
3) I've added more detail to the timeline.

I also changed the overview image.
I will wait for your further advice!

Best regards,

Alexander Nesterov

Mar 28, 2020, 6:48:55 PM
to opencv-gsoc-2020
Hello, thank you, it looks good. You can add the proposal in the tool and wait for the results.