Google Groups Home
Help | Sign in
Search options
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  4 messages - Expand all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Ivan Sagalaev  
View profile  
 More options May 11 2007, 11:38 am
From: Ivan Sagalaev <Man...@SoftwareManiacs.Org>
Date: Fri, 11 May 2007 19:38:20 +0400
Local: Fri, May 11 2007 11:38 am
Subject: Search options
Hello!

I'd like to ask everyone's opinion on implementing a search
functionality in an app. The app is a forum that tends to be simple and
pluggable. Now I'm on a quest of picking a right solution for searching
and have stuck.

My current thoughts and decision:

- Searching using "like" db queries is too simplistic and tends to be
slower over time.
- Database-specific solutions (MySQL search, Postgres TSearch2) kill
portability.
- PyLucene is too large to work in-process (20 MB in memory). Also it
doesn't work with Python's threading (segfaulting the whole process on
import). A solution would be a dedicated PyLucene process.
- Xapian looks good but I didn't actually try it yet. I've heard though
that it doesn't implement locking of index database and this should be
done manually. Not a rocket science but complicates the solution a bit.
I've also seen recommendation to run it in a dedicated process.

So my questions are:

- Am I doomed to have a separate server? This complicates things a lot
and I very much inclined to use some in-process thing
- Are there any solutions on a scale between simplistic "likes" and
sophisticated indexers like Lucene?


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jeremy Dunck  
View profile  
 More options May 11 2007, 11:52 am
From: "Jeremy Dunck" <jdu...@gmail.com>
Date: Fri, 11 May 2007 10:52:54 -0500
Local: Fri, May 11 2007 11:52 am
Subject: Re: Search options
On 5/11/07, Ivan Sagalaev <Man...@softwaremaniacs.org> wrote:

> - Am I doomed to have a separate server? This complicates things a lot
> and I very much inclined to use some in-process thing

Probably. :)

> - Are there any solutions on a scale between simplistic "likes" and
> sophisticated indexers like Lucene?

http://www.osreviews.net/reviews/misc/hyperestraier
http://cheeseshop.python.org/pypi/estraiernative/0.2

http://swish-e.org/
http://cheeseshop.python.org/pypi/Swish-E/0.5


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Joseph Heck  
View profile  
 More options May 11 2007, 1:57 pm
From: "Joseph Heck" <joseph.h...@gmail.com>
Date: Fri, 11 May 2007 10:57:08 -0700
Local: Fri, May 11 2007 1:57 pm
Subject: Re: Search options
We determined that Postgres was portable enough to any platform we'd
host on, and went with TSearch2 and have been pretty happy. Having
done hard-core search work in a previous life (www.singingfish.com), I
know it isn't everything you can get in the search world, but it was
sufficient for our needs. I personally feel that TSearch2 falls very
nicely between better than the simplistic "like"+wildcard SQL
statements and a sophisticated indexing engine like Lucene.

If you're willing to go "search server", you might even consider SOLR
(lucene based search server with a web api). Especially if you scale
out your front end's (the django app servers) horizontally in a large
environment, it becomes appealing. How many front-end's you have
actually becomes something to seriously consider, because the likes of
PyLucene, Xapian, and others all have search related indices that then
need to be kept up to date and available to the searcher processes.

On 5/11/07, Jeremy Dunck <jdu...@gmail.com> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
olly@survex.com  
View profile  
 More options May 11 2007, 2:46 pm
From: "o...@survex.com" <o...@survex.com>
Date: Fri, 11 May 2007 18:46:50 -0000
Local: Fri, May 11 2007 2:46 pm
Subject: Re: Search options
On May 11, 4:38 pm, Ivan Sagalaev wrote:

> - Xapian looks good but I didn't actually try it yet. I've heard though
> that it doesn't implement locking of index database and this should be
> done manually.

You've heard incorrect information then, since Xapian most definitely
does implement database locking.

Cheers,
    Olly


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »

Create a group - Google Groups - Google Home - Terms of Service - Privacy Policy
©2009 Google