Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

best trick for getting PDF URL from google search

2,343 views
Skip to first unread message

Josh L

unread,
Jun 6, 2011, 2:29:58 PM6/6/11
to
Has anyone worked out a really good trick for this? When the
URL is long, google doesn't provide it. The only things I've
been able to come up with over the years are to take the google
URL and somehow try to break it down manually, or to open the
PDF in my FF Browser instead of downloading/saving, which I
think gives me the URL in the FF address bar. However, opening
in the browser has been historically difficult for me to get to
work, and when I get it to work, there are freezing/slowness
issues.

So, are there are other tricks to this?

Josh

Christian Riechers

unread,
Jun 6, 2011, 2:47:01 PM6/6/11
to

Right click on the link - Copy Link Location.

--
Christian

Message has been deleted

Dave Pyles

unread,
Jun 6, 2011, 4:05:27 PM6/6/11
to
Sailfish wrote:
> My bloviated meandering follows what Josh L graced us with on 6/6/2011
> 11:29 AM:
> Some PDF URLs will not save the PDF file if you select the "Save page
> as..." options. I've take to simply loading the PDF in the browser and
> using the Save feature of the PDF file itself to save them.
>
I've installed the PDF Download extension which provides an option to
download the PDF or view it in the browser.
https://addons.mozilla.org/en-US/firefox/addon/pdf-download/
Dave Pyles

Josh L

unread,
Jun 6, 2011, 6:36:34 PM6/6/11
to

Hi Christian:

This doesn't address my question. If I do a google search and I
want to go to a google search result that is to a PDF, often the
URL is long enough such that the base URL does not display and
all I am left with is a jumbled google mess that I can't store
readily myself for efficient research or pass along to
colleagues. For example, here is the result of copying a link
location as to a document I am trying to go to and link for some
research.

http://www.google.com/url?sa=t&source=web&cd=8&ved=0CEgQFjAH&url=http%3A%2F%2Fwww.ngsa.org%2FAssets%2Fdocs%2Fanalyses%2520studies%2Ftop%252040%25202010%25202nd%2520quarter.pdf&rct=j&q=natural%20gas%20top%20producers&ei=_FPtTYylGqXb0QHZz8GKAQ&usg=AFQjCNFTBMZe7K0lxDH7N-KxGQqTj9SaLQ&sig2=N1Qf4woSeppkLpQqVVfA8A&cad=rja

Is this really that helpful? How do I get the actual link?

Setting my FF browser (Version 4.0.1) to open the document in
the browser (and going through the usual fickleness of whether
this will work and cost me minutes of work while it decides
whether to load or not or freeze my browser or not) I get to see
this link, which is all I was after:

http://www.ngsa.org/Assets/docs/analyses%20studies/top%2040%202010%202nd%20quarter.pdf

So, the goal is to overcome google's ruining of the original URL
so that I can have it more readily. Unfortunately,
right-clicking doesn't get me there. Yes, if we look at the
google URL we can possibly pare it down to the real one, but
what a pain.

jl


Josh L

unread,
Jun 6, 2011, 6:39:11 PM6/6/11
to
On Mon, 06 Jun 2011 12:40:02 -0700, Sailfish
<REMOVECAP...@REMOVECAPSunforgettable.com> wrote:

>My bloviated meandering follows what Josh L graced us with on 6/6/2011
>11:29 AM:

>Some PDF URLs will not save the PDF file if you select the "Save page
>as..." options. I've take to simply loading the PDF in the browser and
>using the Save feature of the PDF file itself to save them.

Thanks Sailfish. I am after a way to get at the URL from google
searches, not at the file itself. That I can already do.

Josh

Christian Riechers

unread,
Jun 6, 2011, 7:03:06 PM6/6/11
to

Not sure if that really helps you. If you're on a Unix type system, you
could use this code to extract the long url. It works out of the box on
my Linux system.

> php -r "print
urldecode('http://www.google.com/url?sa=t&source=web&cd=8&ved=0CEgQFjAH&url=http%3A%2F%2Fwww.ngsa.org%2FAssets%2Fdocs%2Fanalyses%2520studies%2Ftop%252040%25202010%25202nd%2520quarter.pdf&rct=j&q=natural%20gas%20top%20producers&ei=_FPtTYylGqXb0QHZz8GKAQ&usg=AFQjCNFTBMZe7K0lxDH7N-KxGQqTj9SaLQ&sig2=N1Qf4woSeppkLpQqVVfA8A&cad=rja');"
| awk -F'&' '/url=/{ print $5 }'

url=http://www.ngsa.org/Assets/docs/analyses%20studies/top%2040%202010%202nd%20quarter.pdf
>

The information was taken from here:
http://stackoverflow.com/questions/5780809/real-link-to-file-in-google-search-results

PHP can probably be installed for Windows as well, but I don't know what
else may be needed to get this working.

--
Christian

Message has been deleted

Dave Symes

unread,
Jun 7, 2011, 2:14:38 AM6/7/11
to
In article <1nlqu61e0che3nsv7...@4ax.com>,

> Josh

Isn't that a bit like looking at a menu, and wanting to eat the meal
before it has been served?

Anyway... If I understand you correctly.

Have an advanced text editor open (EditPad Lite would do).

Run your Google search page.
When the page is displayed.

Press and *hold down* the Shift key.
Mouse pointer over the item you want, *Click and hold*, then drag to the
text editor window.

Dave

*Click and hold*, is just press the mouse button down and hold it down
while drag n drop being done.

When you first start the drag in the Fx window, it will appear as though
you can't do it (Barred circle) but once you drag outside the Fx window it
works.

You are probably aware, but in case not.
In the EditPad Lite window, you can hilight a Url then double click it to
run it to Fx.

If it's a single entry, then just a double click will do.
D.

--

Dave Triffid

Ralph Fox

unread,
Jun 7, 2011, 4:14:01 AM6/7/11
to

Josh L

unread,
Jun 7, 2011, 8:26:04 PM6/7/11
to

Thanks, I'll take a look.

Josh

Josh L

unread,
Jun 7, 2011, 8:28:29 PM6/7/11
to
On Mon, 06 Jun 2011 17:09:05 -0700, Sailfish
<REMOVECAP...@REMOVECAPSunforgettable.com> wrote:
>Can you provide the search terms you used to produce such an URL. It
>appears like the actual PDF link in within the Google URL (along with
>other terms) and has been encoded with something like PHP
>urlencode(Urlstring) or Javascript encodeURIComponent(Urlstring).
>
>Very odd.

Hi Sailfish:

This has been going on for years with any PDF URL served up by
google search. No exceptions that I'm aware.

If you're after a link you can pass along readily to others
without a mess, the only time you luck out is if the original
URL is short enough such that when google gives it (beneath
their obstructive impossible-to-read URL), they don't shorten
it.

Josh

Josh L

unread,
Jun 7, 2011, 8:36:28 PM6/7/11
to
On Tue, 07 Jun 2011 20:14:01 +1200, Ralph Fox
<-rf...@xn--kba.invalid> wrote:

>On Mon, 06 Jun 2011 11:29:58 -0700, in message <v07qu69h4gj9nn9hg...@4ax.com>
>Josh L wrote:
>
>One of these looks like it should do the job.
>
>Redirect Cleaner
>https://addons.mozilla.org/en-US/firefox/addon/redirect-cleaner/

Thanks Ralph, this seems to address the matter. I imagine the
other one might also. Been battling this for many years, thanks
for helping me get this done.

jl

0 new messages