Strip URL

Robert King

Sep 6, 2015, 4:30:16 PM
to DataparkSearch Engine
Hi Maxim,

I have come across a site whose pages have URLs like http://sitename.com/page.html?7GhyHWgkS

How can I index those pages with the ?7GhyHWgkS (or any other ?string that follows .html) stripped, so that the indexed URL is just http://sitename.com/page.html

thanks

Robert

Maxim Zakharov

Sep 7, 2015, 6:03:49 PM
to datapar...@googlegroups.com
Hi Robert,

The ReverseAlias command does that job; see http://www.dataparksearch.org/devel-doc/dpsearch-aliases.en.html#ALIAS-REVERSE
You can find several sample ReverseAlias commands in the indexer.conf-dist file.

Maxim
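
For illustration only, here is a minimal Python sketch of the URL rewrite Robert describes. It is not DataparkSearch code and not indexer.conf syntax; the function names and the pattern are hypothetical. Whether and how a pattern like this can be used in a ReverseAlias rule should be checked against the linked documentation and indexer.conf-dist.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Hypothetical pattern: an http(s) URL whose path ends in .html,
# followed by "?" and an arbitrary query string.
HTML_WITH_QUERY = re.compile(r"^(https?://[^?#]+\.html)\?.*$")


def strip_html_query(url: str) -> str:
    """Drop the ?string from URLs ending in .html; leave other URLs alone."""
    return HTML_WITH_QUERY.sub(r"\1", url)


def strip_any_query(url: str) -> str:
    """Drop the query string and fragment from any URL."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


if __name__ == "__main__":
    print(strip_html_query("http://sitename.com/page.html?7GhyHWgkS"))
    print(strip_any_query("http://sitename.com/page.html?7GhyHWgkS"))
```

The first function only touches .html pages, which matches the question as asked; the second is a blunter variant that would also strip meaningful parameters such as ?id=3 from dynamic pages.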
