I have a feature that allows each of the following in both LaTeX and html:
Some sample screenshots (LaTeX first, then html):
As I say, the feature allows Figures and Tables to be next to each other, together with subfigures, and subtables. 
The XML for the above tables is
<multobjects type="table" TeXFloatOptions="!hbt">
              <div width="50%">
                <table xml:id="exp-tab-salary">
                  <caption/>
                  <coltypes>
                    <col align="decimal" format="2.0"/>
                    <col align="decimal" format="10.0"/>
                  </coltypes>
                  <thead>
                    <row><entry><m>d</m> (days worked)</entry><entry><m>p</m> (cents)</entry></row>
                  </thead>
                  <tbody>
                    <row><entry>1</entry><entry>2</entry></row> 
                    <row><entry>2</entry><entry>4</entry></row>
                    <row><entry>3</entry><entry>8</entry></row>
                    <row><entry>4</entry><entry>16</entry></row>
                    <row><entry>5</entry><entry>32</entry></row>
                    <row><entry>10</entry><entry>1024</entry></row>
                    <row><entry>30</entry><entry>1073741824</entry></row>
                  </tbody>
                </table>
              </div>
              <div width="50%">
                <table xml:id="exp-tab-salaryalt">
                  <caption />
                  <coltypes>
                    <col align="decimal" format="2.0"/>
                    <col align="left"/>
                  </coltypes>
                  <thead>
                    <row><entry><m>d</m> (days worked)</entry> <entry><m>p</m> (cents)</entry></row> 
                  </thead>
                  <tbody>
                    <row><entry>1</entry> <entry><m>2</m>                  </entry></row>
                    <row><entry>2</entry> <entry><m>2^2</m>                </entry></row>
                    <row><entry>3</entry> <entry><m>2^3</m>                </entry></row>
                    <row><entry>4</entry> <entry><m>2^4</m>                </entry></row>
                    <row><entry>5</entry> <entry><m>2^5</m>                </entry></row>
                    <row><entry>10</entry> <entry><m>2^{10}</m>             </entry></row>
                    <row><entry>30</entry> <entry><m>2^{30}</m>             </entry></row>
                  </tbody>
                </table>
              </div>
            </multobjects>
For the moment, please ignore the coltypes as they are part of another discussion; instead, please concentrate on the multobjects tag, and the use of a div tag together with an associate width to specify how wide each 'sub' object should be.
The LaTeX code uses the minipage environment, and the subcaption package with associated subfigure and subtable environments (for Figures 1(a), (b), etc). The hmtl code uses div boxes, together with a check on the parent object, multobjects type to style the caption.
Rob said that it's generally a good idea to post these kinds of things here in the support forum so that the syntax can be hashed out and stewed over for a bit before submitting a pull request. So.... please do let me know if you have any questions about this; I'll be happy to post more sample code, or chat about any potential issues.
 
<sidebyside leftmargin="0">
   <paragraph width="31" valign="middle" rightmargin="29">
      <title>First Step</title>
      <p>Glue the top edge to the bottom to make a cylinder.</p>
      <p>Maybe my text has a second paragaph, so we use "paragraph" enclosure.</p>
      <p>Title is optional.</p>
   </paragraph>
   <image width="40" rightmargin="5">
      [image source, description, etc, here]
   </image>
</sidebyside>
It is reasonably common to have 2 or three things side-by-side in a
book. Both having and not having a caption is common (not having a
caption would be more common if people put more thought into the
presentation. See Tufte's "Beautiful Evidence".)
Here are three pages of a book of mine, in which there appears
2 images, or text and image, or three images side-by-side:
http://aimath.org/~farmer/print/side-by-side.pdf
Those examples show that you need a way to position the objects
vertically within their block, and also that you need to control
the width of the object, the spacing between objects, and
the margins.
The cases of two things or three things side-by-side are the most common.
Writing p for text, t for table, and i for image, I definitely have
used all of the following:
pp (same content in two languages, for example)
pt
pi
ip
ii
ipi
iii
<figure>
  <caption>A nice figure</caption>
  <displaygroup children="subfigure">
  ....
  </displaygroup>
</figure>
How would you do a 3 by 2 grid of figures, where each figure is
the graph of a hyperbolic trig function, each little graph has
a caption, and the whole group has a caption?
<figure>
    <caption>The Hyperbolic Functions</caption>
    <displaygroup rows="3" cols="2" children="subfigure">
        <figure>
            <caption><m>y=\cosh(x)</m></caption>
            <image source="cosh.svg" />
        </figure>        
        <figure>
            <caption><m>y=\sinh(x)</m></caption>
            <image source="sinh.svg" />
        </figure>        
        <figure>
            <caption><m>y=\tanh(x)</m></caption>
            <image source="tanh.svg" />
        </figure>        
        <figure>
            <caption><m>y=\sech(x)</m></caption>
            <image source="sech.svg" />
        </figure>        
        <figure>
            <caption><m>y=\csch(x)</m></caption>
            <image source="csch.svg" />
        </figure>        
        <figure>
            <caption><m>y=\coth(x)</m></caption>
            <image source="coth.svg" />
        </figure>
    </displaygroup>
</figure>