Just the facts with Solr & Luwak

It won't have escaped your notice that factchecking has been very much in the news recently, due to last year's political upheavals in both the US and UK and the suspected influence of fake news on voters. Both traditional and social media organisations are making efforts in this area; examples include Channel 4 and Faceboo...

Meetup at Big Data London – One-click Solr & Factchecking with Solr

Last week I spoke at the Big Data London conference, a very busy event with several thousand people attending. My session was on using open source search to make sense of Big Data - you can get slides here. In the evening we ran another...

Boosts Considered Harmful – adventures with badly configured search

During a recent client visit we encountered a common problem in search - over-application of 'boosts', which can be used to weight the influence of matches in one particular field. For example, you might sensibly use this to make results that match a query on their title field come higher in search results. However, in this case we saw huge boost values used (numbers in the hundreds), which were probably swamping everything else - and it wasn't at all clear where the values had come from, be it ex...
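
To make the idea of a field boost concrete, here is a minimal sketch of how per-field boosts might be passed to Solr. It assumes the eDisMax query parser and its `qf` parameter; the excerpt does not say which parser the client used, and the field names `title` and `body` and the helper `edismax_params` are hypothetical, chosen only for illustration.

```python
from urllib.parse import urlencode


def edismax_params(query, field_boosts):
    """Build Solr eDisMax request parameters with per-field boosts in `qf`.

    `field_boosts` maps a field name to its boost; names here are hypothetical.
    """
    # qf is a space-separated list of "field^boost"; a boost of 1 is the default
    qf = " ".join(
        f"{field}^{boost}" if boost != 1 else field
        for field, boost in field_boosts.items()
    )
    return {"q": query, "defType": "edismax", "qf": qf}


# A modest boost nudges title matches upwards
modest = edismax_params("solr relevance", {"title": 2, "body": 1})

# A boost in the hundreds (as in the case described) lets title matches
# dominate the score almost regardless of what else matched
extreme = edismax_params("solr relevance", {"title": 500, "body": 1})

print(urlencode(modest))
print(urlencode(extreme))
```

The only difference between the two requests is the number after `^`, which is part of why oversized values are easy to introduce and hard to trace back to a reason.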

Setting up your first Quepid test case

Quepid is an innovative tool from our partners Open Source Connections, which allows you to bridge the gap between content owners (who really know what's in your search index and how people might search for it) and search developers (who can tweak the search engine to improve relevance, given some examples of 'good' and 'bad' results for a query). We're increasingly using it in...

Can we fix your Solr or Elasticsearch system in a single day?

Here at Flax, we're often called in to take a look at existing Apache Solr or Elasticsearch search applications, to suggest improvements, tune-ups or enhancements. It's impossible for us to know ahead of time what we might find - out-of-date versions of the software, slow performance on the indexing side, the search side or both sides of the application...