[Discussion][GSoc] Advancing SymPy Gamma and SymPy live using Natural Language Processing ?

62 views
Skip to first unread message

Moses Paul

unread,
Mar 26, 2020, 10:24:13 AM3/26/20
to sympy

To introduce Natural Language Processing to these projects to help them better translate expressions and queries to mathematical symbols.
To extend the parsing support for SymPy Gamma by including support for syntaxes from languages such as Julia, Matlab, Lua etc, which would improve the way SymPy Gamma queries are interpreted.
To allow Tex to be interpreted by SymPy Gamma. (https://github.com/augustt198/latex2sympy).
And much more ...

Nikhil Maan

unread,
Mar 28, 2020, 5:39:10 PM3/28/20
to sympy
Hey,

I noticed you've only provided a fraction of your project plan and timeline in your proposal. Do you have a plan you'd like to discuss for what you're proposing?

And how many and which of the languages you're planning to implement the support for?

Also, looking at this, you're proposing to implement NLP support, and support for multiple languages including latex in the GSoC period. That's a huge amount of work you're proposing for essentially a 3 month period.

Regards,
Nikhil Maan

Moses Paul

unread,
Mar 29, 2020, 7:15:37 AM3/29/20
to sympy
Hey Sc0rpi0n101,

Yes, I've only provided a little bit of information regarding the implementation of my proposal.

The thing is most of the work I'd be doing would be integration and consolidation.
To elaborate, say for interpreting LaTex , there is already a package called latex2sympy that is already included in sympy's parsers.
and there are nlp libraries that parse to mathematical expressions, my work would be to integrate them into sympy.


Essentially

textual expressions --> NLP --> mathematical expressions --> sympy compatible expressions.

for eg projects like mathparse which parse sentences like `one hundred times fifty four` and so on.

the idea is not to implement an NLP package from scratch as that would be a huge undertaking, but make use of existing technologies such as BERT , ANTLR4 and so on to help expedite the process.

I plan to elaborate my proposal as soon I complete going through sympy's codebase and figure out the best approach. and for the timeline, I'm not sure of the exact situations I'd have to face but I do plan to propose an approximate one
Reply all
Reply to author
Forward
0 new messages