Comments on: Xapian Search Architecture http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/ The Open Source Search Specialists Tue, 12 Feb 2019 14:44:32 +0000

By: Searching date ranges in Xapian and PHP | Geert Van Damme http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18513 Tue, 19 Apr 2011 17:13:06 +0000 http://www.flax.co.uk/blog/?p=66#comment-18513 […] of dates (e.g. date created from – to). After reading the very interesting article on the Xapian Search Architecture, I realised I needed to add a value to the document which contained the date field, opposed to the […]

By: yoa http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18512 Mon, 20 Apr 2009 19:53:53 +0000 http://www.flax.co.uk/blog/?p=66#comment-18512 Thanks for answering, I appreciate it. I’m working on my engineering thesis and I work with Xapian, but it is new to me.

Thank you again,
Yoa

By: Richard Boulton http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18511 Mon, 20 Apr 2009 19:35:29 +0000 http://www.flax.co.uk/blog/?p=66#comment-18511 yoa, if you want to discuss the weighting model used by Xapian, the Xapian mailing list or the Xapian IRC channel would probably be better places: see http://xapian.org/lists and #xapian on irc.freenode.net. But briefly, BM25 uses the frequency of terms both within each document and across the database as a whole, as well as information on the document length and the frequency of terms in the query. That is, it uses a superset of the statistics used by tf-idf, and combines them differently. The formula has performed a lot better than tf-idf in many retrieval experiments, and has some sound theoretical basis justifying its derivation.
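
To make the difference concrete, here is a minimal sketch in Python comparing a plain tf * idf score with a textbook form of BM25. It is an illustration only and is not Xapian's implementation: the function names, the sample documents, the parameter defaults (k1, b, k3) and the exact idf variant are all assumptions chosen for the example. Xapian's own formula and parameters are described at http://xapian.org/docs/bm25.html.

```python
import math
from collections import Counter


def tf_idf_score(query, doc, docs):
    """Plain tf * idf: raw term frequency times log(N / df). Ignores document length."""
    n_docs = len(docs)
    score = 0.0
    for term in set(query):
        tf = doc.count(term)
        df = sum(1 for d in docs if term in d)
        if tf == 0 or df == 0:
            continue
        score += tf * math.log(n_docs / df)
    return score


def bm25_score(query, doc, docs, k1=1.2, b=0.75, k3=1.0):
    """Textbook BM25 (assumed form): idf * saturated tf * query-term-frequency factor."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    doc_tf = Counter(doc)
    query_tf = Counter(query)
    score = 0.0
    for term, qf in query_tf.items():
        tf = doc_tf[term]
        if tf == 0:
            continue
        df = sum(1 for d in docs if term in d)
        # Collection statistic: rarer terms get a higher weight.
        idf = math.log((n_docs - df + 0.5) / (df + 0.5) + 1.0)
        # Within-document frequency, saturating and normalised by document length.
        norm_len = len(doc) / avg_len
        doc_part = tf * (k1 + 1) / (tf + k1 * (1 - b + b * norm_len))
        # Frequency of the term in the query itself.
        query_part = qf * (k3 + 1) / (qf + k3)
        score += idf * doc_part * query_part
    return score


# Hypothetical, already-tokenised sample documents.
docs = [
    ["xapian", "search"],
    ["xapian", "search", "engine", "index", "terms", "values"],
    ["python", "bindings"],
]
query = ["xapian"]

for d in docs:
    print(d, tf_idf_score(query, d, docs), bm25_score(query, d, docs))
```

In this sketch the first two documents each contain the query term once, so plain tf * idf cannot tell them apart, while BM25 ranks the shorter document higher because it also takes document length into account.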

By: yoa http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18510 Mon, 20 Apr 2009 18:27:15 +0000 http://www.flax.co.uk/blog/?p=66#comment-18510 Thanks for answering, but is the inverse frequency used to calculate the BM25 weight? I studied BM25 and I think it only works with the importance of the frequency of occurrence.
I hope you answer.

Yoa

By: Tom http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18509 Mon, 20 Apr 2009 18:06:38 +0000 http://www.flax.co.uk/blog/?p=66#comment-18509 Yoa, Xapian uses BM25, as documented on the Xapian website.

http://xapian.org/docs/bm25.html

Tom

By: yoa http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18508 Mon, 20 Apr 2009 17:58:44 +0000 http://www.flax.co.uk/blog/?p=66#comment-18508 I need to know if the calculated weight for each term (word) is the product tf * idf, that is, the importance of the frequency of occurrence of each word in the text multiplied by the inverse frequency of each word in the collection.
Please, I need help with this; I hope for a quick response.

Yoa

By: Xapian terms, values and data explained « Searching with Xapian http://www.flax.co.uk/blog/2009/04/02/xapian-search-architecture/#comment-18507 Thu, 02 Apr 2009 16:45:28 +0000 http://www.flax.co.uk/blog/?p=66#comment-18507 […] a comment » My colleague Tom Mortimer has made a useful post on the Flax blog to clear up a common set of confusions regarding Xapian; covering the difference between, and the […]
