<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Flax Blog</title>
	<atom:link href="http://www.flax.co.uk/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.flax.co.uk/blog</link>
	<description>Open source &#38; enterprise search</description>
	<lastBuildDate>Fri, 22 Jan 2010 12:32:21 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Search networking groups in London</title>
		<link>http://www.flax.co.uk/blog/2010/01/22/search-networking-groups-in-london/</link>
		<comments>http://www.flax.co.uk/blog/2010/01/22/search-networking-groups-in-london/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 12:30:45 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[events]]></category>
		<category><![CDATA[networking]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=262</guid>
		<description><![CDATA[Here are two relatively new networking groups &#8211; these are informal gatherings of those who work with enterprise search. I&#8217;ve been to the first one and it was very interesting.
London Open Source Social &#8211; for those working with open-source enterprise search
Enterprise Search London &#8211; more generally for those working in enterprise search
]]></description>
			<content:encoded><![CDATA[<p>Here are two relatively new networking groups &#8211; these are informal gatherings of those who work with enterprise search. I&#8217;ve been to the first one and it was very interesting.</p>
<p><a href=http://www.meetup.com/london-search-social/>London Open Source Social</a> &#8211; for those working with open-source enterprise search<br />
<a href=http://www.meetup.com/es-london/>Enterprise Search London</a> &#8211; more generally for those working in enterprise search</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2010/01/22/search-networking-groups-in-london/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Predictions</title>
		<link>http://www.flax.co.uk/blog/2010/01/20/predictions/</link>
		<comments>http://www.flax.co.uk/blog/2010/01/20/predictions/#comments</comments>
		<pubDate>Wed, 20 Jan 2010 11:17:23 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[autonomy]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[real time]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=257</guid>
		<description><![CDATA[A new year, and a chance to think about what might happen in the world of enterprise search over the next twelve months. I&#8217;ll make a stab at some predictions:

Price cuts &#8211; possibly driven by even harsher competition between Google and Microsoft FAST, I can see prices coming down for packaged enterprise search. Autonomy will [...]]]></description>
			<content:encoded><![CDATA[<p>A new year, and a chance to think about what might happen in the world of enterprise search over the next twelve months. I&#8217;ll make a stab at some predictions:</p>
<ol>
<li><strong>Price cuts</strong> &#8211; possibly driven by even harsher competition between Google and Microsoft FAST, I can see prices coming down for packaged enterprise search. Autonomy will probably raise theirs <img src='http://www.flax.co.uk/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </li>
<li><strong>Real time search matures</strong> &#8211; not just Twitter or Facebook, but real time data from many sources being part of enterprise search results</li>
<li><strong>More geolocation-aware search</strong> &#8211; in the U.K. at least, we&#8217;re seeing <a href="http://news.bbc.co.uk/1/hi/technology/8402327.stm">signs</a> that the source data is finally being freed up, which should make it a lot simpler and cheaper to build location-aware solutions</li>
<li><strong>A few less second-tier players in the market</strong> &#8211; it&#8217;s still difficult out there, I&#8217;m afraid not every company will survive the next year.</li>
</ol>
<p>You&#8217;re welcome to take any of these with a generous pinch of salt! </p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2010/01/20/predictions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Information 2009, day 3</title>
		<link>http://www.flax.co.uk/blog/2009/12/04/online-information-2009-day-3/</link>
		<comments>http://www.flax.co.uk/blog/2009/12/04/online-information-2009-day-3/#comments</comments>
		<pubDate>Fri, 04 Dec 2009 11:15:39 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[performance]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=254</guid>
		<description><![CDATA[Back at Online 2009 on Thursday, to take part in the closing panel: &#8220;Cloud Computing, Open Source and Semantics: Content and Search Predictions&#8221;, moderated by Stephen Arnold. We only touched on four of the ten controversial themes Stephen had prepared: we talked a lot about how &#8216;Google pressure&#8217; will affect the market, how XML isn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>Back at <a href="http://www.online-information.co.uk/">Online 2009</a> on Thursday, to take part in the closing panel: &#8220;Cloud Computing, Open Source and Semantics: Content and Search Predictions&#8221;, moderated by <a href="http://arnoldit.com">Stephen Arnold</a>. We only touched on four of the ten controversial themes Stephen had prepared: we talked a lot about how &#8216;Google pressure&#8217; will affect the market, how XML isn&#8217;t necessarily the universal panacea for representing data, on the growth of rich media and the challenges it presents and finally on security. Some great questions from the floor as well, thanks to all who came and the organisers and Stephen for inviting us. I wish we&#8217;d had more time!</p>
<p>I didn&#8217;t agree with Stephen&#8217;s main point that Google will crush us all &#8211; I think the battles between Google and Microsoft (and Google and everyone else) are a distraction. While they&#8217;re fighting it out the rest of us can get on with developing cutting-edge search technologies. Open source search technology gives us tremendous flexibility, allows us to develop solutions very fast, allows the customer to take ownership of the system that&#8217;s being developed and now has comparable performance, scalability and commercial support to the traditional closed source world.  </p>
<p>The real question is how this will affect the profitability of existing companies in the search space. I wonder who <strong>won&#8217;t</strong> be around at next year&#8217;s Online Information show&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/12/04/online-information-2009-day-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Flax Newsletters</title>
		<link>http://www.flax.co.uk/blog/2009/12/02/flax-newsletters/</link>
		<comments>http://www.flax.co.uk/blog/2009/12/02/flax-newsletters/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 12:10:20 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[flax]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=250</guid>
		<description><![CDATA[I&#8217;ve created a page with links to our Flax Newsletters &#8211; let us know if you would like to be added to the mailing list (or indeed, if you&#8217;d like to be removed from it).
]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve created a page with links to our <a href="http://www.flax.co.uk/blog/newsletters/">Flax Newsletters</a> &#8211; let us know if you would like to be added to the mailing list (or indeed, if you&#8217;d like to be removed from it).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/12/02/flax-newsletters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Information 2009, day 1</title>
		<link>http://www.flax.co.uk/blog/2009/12/02/online-information-2009-day-1/</link>
		<comments>http://www.flax.co.uk/blog/2009/12/02/online-information-2009-day-1/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 11:23:51 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[real time]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=232</guid>
		<description><![CDATA[I visited the Online Information exhibition yesterday at Olympia. My first impression was that the exhibition area was very quiet &#8211; and a few of the exhibitors agreed with me. The current financial situation would seem the obvious cause. At previous shows exhibitors have given away all kinds of freebies, from bags, to mini mice, [...]]]></description>
			<content:encoded><![CDATA[<p>I visited the <a href="http://www.online-information.co.uk/">Online Information exhibition</a> yesterday at Olympia. My first impression was that the exhibition area was very quiet &#8211; and a few of the exhibitors agreed with me. The current financial situation would seem the obvious cause. At previous shows exhibitors have given away all kinds of freebies, from bags, to mini mice, to branded juggling balls&#8230;.but this year you&#8217;d be lucky if you came away with a couple of free pens and a boiled sweet.</p>
<p>I dropped in on the associated <a href="http://www.online-information.co.uk/online09/conference.html">conference</a> later, and caught a presentation titled &#8220;The Real Time Web: Discovery vs. Search&#8221;. <strong>Antonio Gulli</strong> of Microsoft told us about their new European offices, including one in Soho, that were concentrating on bringing new features to <a href="http://www.bing.com/">Bing</a> &#8211; but the <a href="http://www.bing.com/search?q=flax&#038;go=&#038;form=QBLH&#038;filt=all&#038;qs=n">results look very familiar</a>, is Bing doomed to play catch-up? The only &#8216;real time&#8217; feature he discussed was indexing Twitter, although apparently they&#8217;ll soon be indexing Facebook as well. Surely real time encompasses more than these two platforms?</p>
<p><strong>Stephen Arnold</strong> <a href="http://arnoldit.com/wordpress/2009/12/02/some-thoughts-about-real-time-content-processing/">gave us his thoughts</a> on what we should mean by &#8216;real time&#8217;, sensibly talking about how the financial services world has been using real time systems for many years. He also injected some notes of caution about how difficult it is to trust information spread amongst peers on social networking sites &#8211; here&#8217;s a <a href="http://news.techworld.com/security/11333/uk-minister-counters-holocaust-hoax-spam/">recent case</a>, read further down the page for a great quote from Graham Cluley.</p>
<p>Someone from <a href="http://www.endeca.com/"><strong>Endeca</strong></a> (I didn&#8217;t catch the name, he was replacing the published speaker) showed us lots of slides of various applications of search, but his theme seemed more about how search can replace traditional databases than about &#8216;real time&#8217;, something I&#8217;ve <a href=http://www.flax.co.uk/blog/2009/08/27/replacing-relational-databases-with-search-engines-for-simple-lookups/>blogged about recently</a>.</p>
<p>We finished with <strong>Conrad Wolfram</strong>, demonstrating <a href="http://www.flax.co.uk/blog/2009/08/27/replacing-relational-databases-with-search-engines-for-simple-lookups/">Wolfram Alpha</a>, which isn&#8217;t really a search engine but rather a computation engine &#8211; it <a href="http://www.wolframalpha.com/input/?i=chalk+vs+cheese">tries</a> to give you a set of answers, rather than a list of possible resources where the answer might be found. Not a lot of &#8216;real time&#8217; here either.</p>
<p>I&#8217;m back on Thursday as part of the <a href="http://www.online-information.co.uk/online09/conference_2009.html">closing keynote panel</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/12/02/online-information-2009-day-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Finding French TV with Flax</title>
		<link>http://www.flax.co.uk/blog/2009/11/26/finding-french-tv-with-flax/</link>
		<comments>http://www.flax.co.uk/blog/2009/11/26/finding-french-tv-with-flax/#comments</comments>
		<pubDate>Thu, 26 Nov 2009 11:22:52 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[flax]]></category>
		<category><![CDATA[indexing]]></category>
		<category><![CDATA[media]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=225</guid>
		<description><![CDATA[We&#8217;ve recently been working with mySkreen, who like Hulu in the U.S. provide a service for finding and viewing television programs via your web browser. mySkreen is the brainchild of Frédéric Sitterlé, previously Head of New Media at the Le Figaro media group.
mySkreen works with French-language content, and is currently indexing over 1.6 million programmes [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve recently been working with <a href="http://www.myskreen.com">mySkreen</a>, who like <a href="http://www.hulu.com/">Hulu</a> in the U.S. provide a service for finding and viewing television programs via your web browser. mySkreen is the brainchild of Frédéric Sitterlé, previously Head of New Media at the <a href="http://www.lefigaro.fr/">Le Figaro</a> media group.</p>
<p>mySkreen works with French-language content, and is currently indexing over 1.6 million programmes (and counting). Using Flax, you can search using programme title, actors, genres or time periods. We also added some innovative query parsing to translate fuzzy queries such as &#8216;tomorrow evening&#8217; into more exact time periods, and some clever ranking so that &#8216;more easily available&#8217; programmes appear higher in the search results. We also added faceted search and automatic spelling correction.</p>
<p>This was a fast-moving project with a very quick turnaround: we first visited mySkreen in Paris in August and delivered customised code to them less than four weeks later; the flexibility of Flax and the open source model helped to make this possible.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/11/26/finding-french-tv-with-flax/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>When real-time search isn&#8217;t</title>
		<link>http://www.flax.co.uk/blog/2009/11/05/when-real-time-search-isnt/</link>
		<comments>http://www.flax.co.uk/blog/2009/11/05/when-real-time-search-isnt/#comments</comments>
		<pubDate>Thu, 05 Nov 2009 16:55:48 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Technical]]></category>
		<category><![CDATA[indexing]]></category>
		<category><![CDATA[real time]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=217</guid>
		<description><![CDATA[Avi Rappoport writes about &#8216;real-time&#8217; search, a popular subject at the moment. Twitter search is one  example of this kind of application, where a stream of new content is arriving very quickly.
From a search engine developer&#8217;s point of view there are various things to consider: how quickly new content must become searchable, how to [...]]]></description>
			<content:encoded><![CDATA[<p>Avi Rappoport <a href="http://searchtools.livejournal.com/88949.html">writes</a> about &#8216;real-time&#8217; search, a popular subject at the moment. <a href="http://search.twitter.com/search">Twitter search</a> is one  example of this kind of application, where a stream of new content is arriving very quickly.</p>
<p>From a search engine developer&#8217;s point of view there are various things to consider: how quickly new content must become searchable, how to balance this against performance demands and how to rank the results. </p>
<p>A lot of search engine architectures are built on the assumption that indexes won&#8217;t need to be updated very often, sacrificing index freshness for search speed, so constantly adding new content is expensive in terms of performance. One approach is to maintain several indexes: a small, fresh one and some older, static ones, with the fresh index periodically being merged into the older static set. Searches must be made across all these indexes of course, with care taken to maintain accurate statistics and thus relevancy ranking.</p>
<p>The question of ranking is also an interesting one: in a &#8216;real-time&#8217; situation, how should we present the results &#8211; does &#8216;more recent&#8217; always trump &#8216;more relevant&#8217;? As always, a combination of both is probably the best default approach, with an option available to the user to choose one or the other. </p>
<p>In any case there will always be <em>some</em> delay between content being published and being searchable &#8211; the trick is to keep this to the minimum, so it appears as &#8216;real-time&#8217; as possible.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/11/05/when-real-time-search-isnt/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Open Source Search Event</title>
		<link>http://www.flax.co.uk/blog/2009/10/06/open-source-search-event/</link>
		<comments>http://www.flax.co.uk/blog/2009/10/06/open-source-search-event/#comments</comments>
		<pubDate>Tue, 06 Oct 2009 16:01:38 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[events]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[xapian]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=210</guid>
		<description><![CDATA[We sponsored Open Source Search Cambridge last week, which went very well, with attendees from as far away as Tokyo and New Zealand, a great variety of talks, presentation and networking and some excellent food!
Shane Evans from mydeco gave a detailed talk on Creating a product search engine, with some interesting details on how query-independent [...]]]></description>
			<content:encoded><![CDATA[<p>We sponsored <a href="http://searchevent.org/">Open Source Search Cambridge</a> last week, which went very well, with attendees from as far away as Tokyo and New Zealand, a great variety of talks, presentation and networking and some excellent food!</p>
<p>Shane Evans from <a href="http://www.mydeco.com">mydeco</a> gave a detailed talk on <em>Creating a product search engine</em>, with some interesting details on how query-independent weights are calculate. He was followed by <a href="http://oligarchy.co.uk">Olly Betts</a> on <em>How Gmane is implemented using Xapian</em> &#8211; 72 million messages indexed on a single server! We also had talks from those involved with the <a href="http://www.cheshire3.org/">Cheshire3 </a>XML search engine, <a href="http://www.puppyir.eu/">PuppyIR</a>, project to develop search frameworks for children, and found out more about how <a href="http://www.glassesdirect.co.uk/">Glasses Direct</a> have implemented their search using <a href="http://lucene.apache.org/solr/">SOLR</a>.</p>
<p>The afternoon consisted of a number of well-attended seminars on search topics, such as comparisons of the various open source search engines available. The day ended with informal networking in a nearby pub.</p>
<p>Based on the feedback we got, there&#8217;s definitely interest in a similar event next year &#8211; watch this space.</p>
<p><strong>Update</strong>: sounds like <a href="http://www.iaplay.com/2009/10/05/search-solutions-2009/">Search Solutions 2009</a> was also a good day.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/10/06/open-source-search-event/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New season</title>
		<link>http://www.flax.co.uk/blog/2009/09/09/new-season/</link>
		<comments>http://www.flax.co.uk/blog/2009/09/09/new-season/#comments</comments>
		<pubDate>Wed, 09 Sep 2009 16:25:57 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[events]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=204</guid>
		<description><![CDATA[As September begins, there are various events coming up that may be of interest to some of our readers. We have a list of conferences we&#8217;re attending and/or presenting at. Gartner are running their Portals, Content and Collaboration Summit in mid September in London. Also in London is E Commerce Expo 2009 in late October, [...]]]></description>
			<content:encoded><![CDATA[<p>As September begins, there are various events coming up that may be of interest to some of our readers. We have a list of <a href="http://www.flax.co.uk/blog/events-conferences/">conferences we&#8217;re attending and/or presenting at</a>. Gartner are running their <a href="http://www.gartner.com/it/page.jsp?id=787313">Portals, Content and Collaboration Summit</a> in mid September in London. Also in London is <a href="http://www.ecommerceexpo.co.uk/">E Commerce Expo 2009 in late October</a>, which may be of interest as most e-commerce solutions will need some kind of search facility (although in our opinion many fall woefully short, failing to implement such features as spelling correction and synonyms). </p>
<p>For more Enterprise Search events, there&#8217;s a <a href="http://www.infotoday.com/calendar.shtml">calendar provided by Information Today</a> which is pretty exhaustive.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/09/09/new-season/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Replacing relational databases with search engines for simple lookups</title>
		<link>http://www.flax.co.uk/blog/2009/08/27/replacing-relational-databases-with-search-engines-for-simple-lookups/</link>
		<comments>http://www.flax.co.uk/blog/2009/08/27/replacing-relational-databases-with-search-engines-for-simple-lookups/#comments</comments>
		<pubDate>Thu, 27 Aug 2009 16:34:41 +0000</pubDate>
		<dc:creator>charlie</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[flax]]></category>

		<guid isPermaLink="false">http://www.flax.co.uk/blog/?p=199</guid>
		<description><![CDATA[One of the things we often notice about existing systems based on relational databases (RDB) is that as they scale to millions of items, simple lookup tasks become slow and inefficient. These tasks don&#8217;t usually require complicated database operations, so in most cases it is possible to relocate the data from the RDB into a [...]]]></description>
			<content:encoded><![CDATA[<p>One of the things we often notice about existing systems based on relational databases (RDB) is that as they scale to millions of items, simple lookup tasks become slow and inefficient. These tasks don&#8217;t usually require complicated database operations, so in most cases it is possible to relocate the data from the RDB into a search engine like Flax.</p>
<p>Consider a system where a search engine has already been implemented to search textual product information, but numerical data on each product, such as price, is still being stored in a RDB. Users will often need filters on search results such as <em>&#8217;show me items under £10&#8242;</em> and so a RDB operation similar to &#8216;<code>SELECT productID FROM products WHERE price&lt;£10</code>&#8216; will be needed, in addition to the search engine query. Modern search engines like Flax implement range search functions, so that numerical information can be added to documents, and it is thus possible to carry out this operation in the search engine as part of the full-text search for the product information.</p>
<p>We&#8217;ve noticed with several clients that it is now possible to move <strong>all</strong> their data from the original RDB into the search engine. This can obviously lead to cost savings, as only one system must be hosted, maintained and backed up, and scaling out can be far simpler.</p>
<p>Another way to look at this is to consider a search engine as an example of a <a href="http://en.wikipedia.org/wiki/Document-oriented_database">document-oriented database</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flax.co.uk/blog/2009/08/27/replacing-relational-databases-with-search-engines-for-simple-lookups/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
