I think that it should be very doable to extract a reasonably accurate parts list from the HTML text of a US patent document, which is contained in the element html > body > coma (everything but the abstract).
Example regular expression to use : \b(\w+)\b\s+\b(\d+)\b. This is obviously very rudimentary - some refinements could be added to it to pick up multiple comma-separated reference numerals instead of just the first one, like "fasteners 20, 22", to exclude group 1 from being the word "and" as in "FIGS. 1 and 2", and to not capture claim references like "the device of claim 6", but none of that should be too hard. A slightly trickier coding challenge may be capturing multi-word parts, as in "upper platen 20" and "lower platen 22," but a couple of promising methodologies immediately come to mind - for instance, you could search backwards from a match for successive preceding words, and continue collecting the preceding words as long as they appear before each instance of the particular match. Or, you could search backwards from the first instance of each match until you hit the indefinite article "a"/"an," and everything between a/an and the number is your part. E.g., "a flexible, heat-resistant, non-stick release sheet 24 composed of PTFE or similar material."
Anyway, an image of the output from
regex101.com for matches of the simple regular expression \b(\w+)\b\s+\b(\d+)\b found in the HTML text of USP 6,293,874 is attached. You can see that it does a halfway decent job of finding parts, and it would be helpful even just to provide a list of these matches on Petapator, so that you don't have to manually transcribe part names onto the figures, or keep flipping back and forth to see what they are. The split screen view helps for that if you are viewing the patent on the web with Petapator enabled, so that at least you can keep the figure in view while you search for the part name, but a parts list would be better and useful for reviewing the PDF as well.