Did some back-of-the-envelope math, assuming that the diaphragm was
1"x1" and that the plates were 1mm apart, and came up with a
capacitance of something like 5.4E-12 farads per mic. I think that
would require a rather high bias voltage to yield a workable signal.
Gain can be improved somewhat with thinner spacing. A business card
is approximately 0.25 mm thick, so that would give you 4 X the
capacitance per unit area. Doubling the width of the strips would
quadruple the capacitance of a single element (and quadruple the
overall area of the array).
Also, after doing some drawings, it became apparent that the size of
the sensor influences its directional response at any given frequency
-- so each element does have an effective, frequency-dependent comb
filter-ish "aperture". The net effect of all apertures at all
frequencies is a little obscure ...