Is it possible to use the convolution layers in a way that preserves the input dimensions? The component doesn't seem to be able to do zero-padding.It could work by doing that separately on the input before feeding it to the component, however I'm not quite sure how to do it.
--
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Frequency padding is easy to implement, but it would be tough to implement temporal padding given that we support computation of chunks of outputs and not single outputs. If you were planning to support zero-padding for each frame of output like IBM's CNN recipe you would need to do major changes in the component. At each time step there would be several different convolution outputs for each filter which correspond to each output in the output chunk. This would be very tough to implement.