Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Text manipulation

109 views
Skip to first unread message

Sunil Agrawal

unread,
Aug 23, 2012, 2:32:17 PM8/23/12
to dev-p...@lists.mozilla.org
Hi,
I am thinking of experimenting with text manipulation in Pdf.Js, e.g. do
spell correction before display. Currently, the text extraction is done in
Canvas code right before rendering. Is there a better place where that
could be done?

I looked at Partial Evaluator's extractTextContent, which gets me all the
text on a page, however, there's no scope of updating text there which will
get picked up by Canvas later. Any pointers will be highly appreciated.

Or should I wait for the 'Text Search/Text Extraction' work that
Julian/Brendan/Artur were referring to in one of the threads before I
embark on my experiment. If yes, is there a timeframe when that rework will
be done?

Thanks, Sunil

su...@armor5.com

unread,
Aug 26, 2012, 1:57:46 PM8/26/12
to mozilla.d...@googlegroups.com, dev-p...@lists.mozilla.org
Sorry for pestering, but does someone familiar with Pdf.js code have recommendations for me on the best way to do text manipulation?

If some work is imminent that will make my job easier, I would like to take advantage of it.

Thanks, Sunil

su...@armor5.com

unread,
Aug 26, 2012, 1:57:46 PM8/26/12
to mozilla-d...@lists.mozilla.org, dev-p...@lists.mozilla.org
Sorry for pestering, but does someone familiar with Pdf.js code have recommendations for me on the best way to do text manipulation?

If some work is imminent that will make my job easier, I would like to take advantage of it.

Thanks, Sunil

On Thursday, August 23, 2012 11:32:17 AM UTC-7, Sunil Agrawal wrote:

leonl...@gmail.com

unread,
Nov 25, 2012, 10:53:54 PM11/25/12
to mozilla-d...@lists.mozilla.org, dev-p...@lists.mozilla.org
I am also interesting in maipulating text. I would like to add hyper links to other pdf or sections in the document based on the referencing text. Thinking of dynamically linking legislation.

Sunil Agrawal

unread,
Nov 26, 2012, 5:06:56 PM11/26/12
to leonl...@gmail.com, mozilla-d...@lists.mozilla.org, dev-p...@lists.mozilla.org
Unfortunately I was never able to get it to work. Maybe someone else
on this mailing list might have better ideas.

I believe hyperlinking can be done without 'real' text manipulation,
similar to way PDF's Link Annotations are supported?

Sunil

On Sun, Nov 25, 2012 at 7:53 PM, <leonl...@gmail.com> wrote:
> On Thursday, August 23, 2012 2:32:17 PM UTC-4, Sunil Agrawal wrote:
> I am also interesting in maipulating text. I would like to add hyper links to other pdf or sections in the document based on the referencing text. Thinking of dynamically linking legislation.
> _______________________________________________
> dev-pdf-js mailing list
> dev-p...@lists.mozilla.org
> https://lists.mozilla.org/listinfo/dev-pdf-js

Sunil Agrawal

unread,
Nov 26, 2012, 5:06:56 PM11/26/12
to leonl...@gmail.com, mozilla-d...@lists.mozilla.org, dev-p...@lists.mozilla.org
Unfortunately I was never able to get it to work. Maybe someone else
on this mailing list might have better ideas.

I believe hyperlinking can be done without 'real' text manipulation,
similar to way PDF's Link Annotations are supported?

Sunil

On Sun, Nov 25, 2012 at 7:53 PM, <leonl...@gmail.com> wrote:
> On Thursday, August 23, 2012 2:32:17 PM UTC-4, Sunil Agrawal wrote:

leonl...@gmail.com

unread,
Nov 25, 2012, 10:53:54 PM11/25/12
to mozilla.d...@googlegroups.com, dev-p...@lists.mozilla.org
On Thursday, August 23, 2012 2:32:17 PM UTC-4, Sunil Agrawal wrote:
0 new messages