For example, imagine you enter a room. The room name carries a tileset code, such as a forest tileset. The room description contains grid information that defines the accessible areas. The "who is here" description provides the coordinates of the other players in the "room". Most of this is not easily human-readable, but if your web client or telnet program translates it into visual data, boom, you've got a graphical game.
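To make the idea concrete, here is a rough sketch of what the client-side translation might look like. Everything in it is an assumption for illustration: the `[tileset:...]` tag in the room name, the `.`/`#` grid in the description, the `name@x,y` list for who is here, and all the function names are made up. None of it is Evennia's API; it is plain Python parsing text that the server would have to send in some agreed format.

```python
import re


def parse_tileset(room_name):
    """Pull a hypothetical '[tileset:NAME]' tag out of the room name."""
    match = re.search(r"\[tileset:(\w+)\]", room_name)
    return match.group(1) if match else None


def parse_grid(description):
    """Turn rows of '.' (walkable) and '#' (blocked) into booleans."""
    return [[ch == "." for ch in row] for row in description.splitlines() if row]


def parse_occupants(who_line):
    """Parse a hypothetical 'Alice@1,0; Bob@2,1' list into {name: (x, y)}."""
    occupants = {}
    for entry in who_line.split(";"):
        entry = entry.strip()
        if not entry:
            continue
        name, coords = entry.split("@")
        x, y = (int(n) for n in coords.split(","))
        occupants[name] = (x, y)
    return occupants


def render(grid, occupants):
    """Draw the grid as text, marking each player with their first initial.

    A real client would draw tiles from the tileset here instead.
    """
    initials = {pos: name[0] for name, pos in occupants.items()}
    lines = []
    for y, row in enumerate(grid):
        lines.append("".join(
            initials.get((x, y), "." if walkable else "#")
            for x, walkable in enumerate(row)
        ))
    return "\n".join(lines)
```

A client would call `parse_tileset` on the room name, `parse_grid` on the description, and `parse_occupants` on the who-is-here text, then hand the results to whatever actually draws the screen. The text renderer above is only a stand-in for that drawing step.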
This all seems a little too easy, though, and not what Evennia was intended for. Am I missing something, or could this work?