Date: Sat, 18 Jun 2011 13:47:37 -0700 (PDT)
From: Dave Jones
To: general@lucene.apache.org
Subject: Re: setup and use scenario

Ryan, thanks for the reply.

I am using Lucene with mostly default settings. At this point I am finding
that I need to boost certain terms; otherwise I get the wrong
results. For example, I would probably boost "Little" and "Book" to overcome
the scoring of the phrases with longer terms in them.

The guidance I am looking for is what is normally done in these
situations, as opposed to my continuing with trial-and-error experiments. For
example, is it better to boost in the index or in the query? What is
a good boost value? I started at 3.0 and then tried 5.0, which gave better
results but introduced some small errors. How does one find the minimum
cutoff threshold for the case when the book is not there? Is another type of
analyzer better suited to this case? Are there any other settings that I should
pay attention to?
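
To make the question concrete, here is a rough sketch of the kind of experiment I have been running by hand. It is a toy model only and does not use Lucene's scoring formula or API: the class BoostSweep, the methods score and best, the sample titles, the query, and the 0.5 cutoff are all made up for illustration. Each matching query term contributes its boost (1.0 by default), and the sum is divided by the square root of the title length as a crude stand-in for length normalization.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical toy model of query-time term boosting; NOT Lucene's scoring.
public class BoostSweep {

    // Sum the boosts of the query terms found in the title, then divide by
    // sqrt(number of title tokens) as a crude length normalization.
    public static double score(String title, String query, Map<String, Double> boosts) {
        String[] titleTokens = title.toLowerCase().trim().split("\\s+");
        Set<String> titleTerms = new HashSet<>(Arrays.asList(titleTokens));
        double sum = 0.0;
        for (String term : query.toLowerCase().trim().split("\\s+")) {
            if (titleTerms.contains(term)) {
                sum += boosts.getOrDefault(term, 1.0);
            }
        }
        return sum / Math.sqrt(titleTokens.length);
    }

    // Return the highest-scoring title, or null when even the best score is
    // below minScore (the "book is not there" case).
    public static String best(List<String> titles, String query,
                              Map<String, Double> boosts, double minScore) {
        String bestTitle = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String title : titles) {
            double s = score(title, query, boosts);
            if (s > bestScore) {
                bestScore = s;
                bestTitle = title;
            }
        }
        return bestScore >= minScore ? bestTitle : null;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        List<String> titles = Arrays.asList(
                "The Little Book",
                "The Complete Illustrated Encyclopedia");
        String query = "the little illustrated book";

        // Sweep the boost applied to "little" and "book" instead of guessing.
        for (double boost : new double[] {1.0, 3.0, 5.0}) {
            Map<String, Double> boosts = new HashMap<>();
            boosts.put("little", boost);
            boosts.put("book", boost);
            System.out.println("boost=" + boost + " -> "
                    + best(titles, query, boosts, 0.5));
        }
    }
}
```

With a small set of queries whose correct answers are known, the same loop could count how many come back right at each boost value and at each cutoff, which is what I am currently doing by trial and error.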
Again, thanks for the help.