Case study – Flax http://www.flax.co.uk The Open Source Search Specialists Thu, 10 Oct 2019 09:03:26 +0000 en-GB hourly 1 https://wordpress.org/?v=4.9.8 Asking the right questions in a Search Audit http://www.flax.co.uk/blog/2018/02/09/asking-right-questions-search-audit/ http://www.flax.co.uk/blog/2018/02/09/asking-right-questions-search-audit/#respond Fri, 09 Feb 2018 14:07:48 +0000 http://www.flax.co.uk/?p=3698 We're often asked to review how our clients use search technology, both open and closed source (although we specialise in the former, we've also encountered most of the commercial search engines over the last 16 years). One common mistake is to assume that search is purely a technical problem, and that all issues can be resolved by writing software and changing configurations, or worse by throwing away the old engine and replacing it with a new one at huge expense. More

The post Asking the right questions in a Search Audit appeared first on Flax.

]]>
We’re often asked to review how our clients use search technology, both open and closed source (although we specialise in the former, we’ve also encountered most of the commercial search engines over the last 16 years). One common mistake is to assume that search is purely a technical problem, and that all issues can be resolved by writing software and changing configurations, or worse by throwing away the old engine and replacing it with a new one at huge expense.

As Martin White writes, search should be regarded as a ‘wicked problem’ – multi-dimensional and hard to resolve using traditional methods. So when we carry out a ‘search audit’, as we recently did for global charity Oxfam, we will consider both technical and non-technical factors – not just the software your organisation is using with its potential flaws and misconfigurations, but how it is being used and by whom.

We’ll consider how content is created for the search engine to index – who controls this process, how shortcuts might be taken or mistakes made, who manages the content quality. We’ll check how you are managing the search and associated software – is version control used correctly, are your team sufficiently trained and supported. We’ll look at your future plans and strategy and consider how these might be helped or hampered by what has gone before – the drag effect of legacy systems and content. We’ll consider the needs of your users, both internal and external, and how you test and maintain the quality of your search results. The human factors are as important as the technical details.

This process involves using our years of experience to ask the right questions – to discover how you represent your organisation in data – and how that data flows, under whose control. The output of the process is a detailed report listing what we discovered, what immediate improvements we can suggest and what we regard as more long-term recommendations. Even if we don’t find a lot wrong it will be reassuring to know you’re on the right path to a great search experience for your users. If we do discover major flaws, we hope to save you a great deal of trouble and expense.

If you’re interested in a search audit do ask us for more details.

The post Asking the right questions in a Search Audit appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2018/02/09/asking-right-questions-search-audit/feed/ 0
Elasticsearch for Westcoast – why search is never simple! http://www.flax.co.uk/blog/2015/11/27/elasticsearch-westcoast-search-never-simple/ http://www.flax.co.uk/blog/2015/11/27/elasticsearch-westcoast-search-never-simple/#respond Fri, 27 Nov 2015 15:48:29 +0000 http://www.flax.co.uk/?p=2817 Elasticsearch for Westcoast from Charlie Hull

The post Elasticsearch for Westcoast – why search is never simple! appeared first on Flax.

]]>

The post Elasticsearch for Westcoast – why search is never simple! appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2015/11/27/elasticsearch-westcoast-search-never-simple/feed/ 0
FIBEP WMIC 2015 – Open source search for media monitoring with Solr http://www.flax.co.uk/blog/2015/11/19/fibep-wmic-2015-infomedia-upgraded-closed-source-search-engine-fast-scalable-flexible-open-source-platform/ http://www.flax.co.uk/blog/2015/11/19/fibep-wmic-2015-infomedia-upgraded-closed-source-search-engine-fast-scalable-flexible-open-source-platform/#respond Thu, 19 Nov 2015 16:23:46 +0000 http://www.flax.co.uk/?p=2812 FIBEP WMIC 2015 – How Infomedia upgraded their closed-source search engine to a fast, scalable and flexible open-source platform from Charlie Hull

The post FIBEP WMIC 2015 – Open source search for media monitoring with Solr appeared first on Flax.

]]>

The post FIBEP WMIC 2015 – Open source search for media monitoring with Solr appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2015/11/19/fibep-wmic-2015-infomedia-upgraded-closed-source-search-engine-fast-scalable-flexible-open-source-platform/feed/ 0
Building high-end search features at low cost with Apache Solr http://www.flax.co.uk/blog/2013/03/01/building-high-end-search-features-at-low-cost-with-apache-solr/ http://www.flax.co.uk/blog/2013/03/01/building-high-end-search-features-at-low-cost-with-apache-solr/#respond Fri, 01 Mar 2013 10:59:13 +0000 http://www.flax.co.uk/blog/?p=950 One of the best things about the increased use of open source search technology is that features that were previously unattainable for clients with small budgets are now within reach. Our client Bride and Groom Direct, a UK-based business selling … More

The post Building high-end search features at low cost with Apache Solr appeared first on Flax.

]]>
One of the best things about the increased use of open source search technology is that features that were previously unattainable for clients with small budgets are now within reach. Our client Bride and Groom Direct, a UK-based business selling wedding gifts and stationery, asked us if we could help improve the search features on their website and in particular the auto-suggest – and they asked us to take a look at the website of US mega-retailer Sears.com for inspiration. They particularly liked the way that while you type, Sears’ website doesn’t just show you suggested words but also clickable picture previews of products you might be looking for.

Using Apache Solr and in under two days we built them a similar feature for their website: since we didn’t have direct access to their development servers we provided both Solr configuration files and a simple JQuery/Javascript demo of the features they needed (it’s about 170 lines of code). Their own developers then integrated these changes based on our notes. I think it’s safe to say that Bride and Groom Direct are a rather smaller business than Sears, but with open source they can have access to equally good search facilities. They’ve been kind enough to let us feature them on our Clients page and as you can see, they’re happy with the results.

The post Building high-end search features at low cost with Apache Solr appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2013/03/01/building-high-end-search-features-at-low-cost-with-apache-solr/feed/ 0
Tuning and improving elasticsearch for the Government Digital Service http://www.flax.co.uk/blog/2012/10/01/tuning-and-improving-elasticsearch-for-the-government-digital-service/ http://www.flax.co.uk/blog/2012/10/01/tuning-and-improving-elasticsearch-for-the-government-digital-service/#comments Mon, 01 Oct 2012 15:45:03 +0000 http://www.flax.co.uk/blog/?p=855 The exciting GOV.UK project is getting close to its first release date of October 17th and we were asked by them to help with some search tuning as they migrate from Apache Solr to elasticsearch. Although elasticsearch has some great … More

The post Tuning and improving elasticsearch for the Government Digital Service appeared first on Flax.

]]>
The exciting GOV.UK project is getting close to its first release date of October 17th and we were asked by them to help with some search tuning as they migrate from Apache Solr to elasticsearch. Although elasticsearch has some great features there are still some areas where it lags Solr, such as the lack of spelling suggestion and proximity boost features. Alan from Flax spent a couple of days working with the GDS team and has blogged about how proximity boosting in particular can be implemented – at least for terms that are relatively close to each other rather than being separated by a page or so.

If you’re interested in more details of how we fixed this and a few other elasticsearch issues, you may want to take a look at the code we worked on – one of the best things about working with the GOV.UK team is that it was already up as open source software within a day (yes, you read that right – code paid for by the taxpayer is open source, as it should be!). We’re looking forward to launch day!

Update: changed ‘proximity search’ to ‘proximity boost’ – thanks Alan!

The post Tuning and improving elasticsearch for the Government Digital Service appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2012/10/01/tuning-and-improving-elasticsearch-for-the-government-digital-service/feed/ 3
Media monitoring with open source search – 20 times faster than before! http://www.flax.co.uk/blog/2012/07/25/media-monitoring-with-open-source-search-20-times-faster-than-before/ http://www.flax.co.uk/blog/2012/07/25/media-monitoring-with-open-source-search-20-times-faster-than-before/#comments Wed, 25 Jul 2012 09:39:38 +0000 http://www.flax.co.uk/blog/?p=834 We’re happy to announce we’ve just finished a successful project for a division of the Australian Associated Press to replace a closed source search engine with a considerably more powerful open source solution. You can read the press release here. … More

The post Media monitoring with open source search – 20 times faster than before! appeared first on Flax.

]]>
We’re happy to announce we’ve just finished a successful project for a division of the Australian Associated Press to replace a closed source search engine with a considerably more powerful open source solution. You can read the press release here.

As our client had a large investment in stored searches (which represent a client’s interests) which were defined in the query language of their previous search engine, we first had to build a modified version of Apache Lucene that replicated exactly this syntax. I’ve previously blogged about how we did this. However this wasn’t the only challenge: search engines are designed to be good at applying a few queries to a very large document collection, not necessarily at applying tens of thousands of stored queries to every single new document. For media monitoring applications this kind of performance is essential as there may be hundreds of thousands of news articles to monitor every day. The system we’ve built is capable of applying tens of thousands of stored queries every second.

With the rapid increase in the volume of content that media monitoring companies have to check for their clients – today’s news isn’t just in print, but online, in social media and indeed multimedia – it may be that open source software is the only way to build monitoring systems that are economically scalable, while remaining accurate and flexible enough to deliver the right results to clients.

The post Media monitoring with open source search – 20 times faster than before! appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2012/07/25/media-monitoring-with-open-source-search-20-times-faster-than-before/feed/ 2
Lucene Eurocon Barcelona 2011: Just the Job – Employing Solr for Recruitment Search http://www.flax.co.uk/blog/2011/11/26/lucene-eurocon-barcelona-2011-just-the-job-employing-solr-for-recruitment-search/ Sat, 26 Nov 2011 16:08:57 +0000 http://www.flax.co.uk/?p=2316 The post Lucene Eurocon Barcelona 2011: Just the Job – Employing Solr for Recruitment Search appeared first on Flax.

]]>

The post Lucene Eurocon Barcelona 2011: Just the Job – Employing Solr for Recruitment Search appeared first on Flax.

]]>
Building bridges in the Cloud with open source search http://www.flax.co.uk/blog/2011/11/23/building-bridges-in-the-cloud-with-open-source-search/ http://www.flax.co.uk/blog/2011/11/23/building-bridges-in-the-cloud-with-open-source-search/#respond Wed, 23 Nov 2011 14:46:00 +0000 http://www.flax.co.uk/blog/?p=667 We’ve just published a case study on our work for C Spencer Ltd., a UK-based civil engineering company who take a pro-active approach to document management – instead of taking the default Sharepoint route or buying another product off the … More

The post Building bridges in the Cloud with open source search appeared first on Flax.

]]>
We’ve just published a case study on our work for C Spencer Ltd., a UK-based civil engineering company who take a pro-active approach to document management – instead of taking the default Sharepoint route or buying another product off the shelf, they decided to create their own in-house system based on open source components, hosted on the Amazon AWS Cloud. We’ve helped them integrate Apache Solr to provide full text search across the millions of items held in the document management system, with a sub-second response. Their staff can now find letters, contracts, emails and designs quickly via a web interface.

C Spencer are known for their innovative and modern approach – they’re even building their own green power station on a brownfield site in Hull. It’s thus not surprising that they chose cutting-edge open source technology for search: tracking and managing documents correctly is extremely important to their business.

The post Building bridges in the Cloud with open source search appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2011/11/23/building-bridges-in-the-cloud-with-open-source-search/feed/ 0
Just the job for a recruitment client http://www.flax.co.uk/blog/2011/10/18/just-the-job-for-a-recruitment-client/ http://www.flax.co.uk/blog/2011/10/18/just-the-job-for-a-recruitment-client/#respond Tue, 18 Oct 2011 11:05:16 +0000 http://www.flax.co.uk/blog/?p=645 We’re pleased to announce our work with Reed Specialist Recruitment, one of the UK’s largest recruitment companies, where we helped them implement an Apache Solr powered application to allow their 3000+ staff to search for and match candidates to jobs. … More

The post Just the job for a recruitment client appeared first on Flax.

]]>
We’re pleased to announce our work with Reed Specialist Recruitment, one of the UK’s largest recruitment companies, where we helped them implement an Apache Solr powered application to allow their 3000+ staff to search for and match candidates to jobs. We built an innovative indexing framework, a configuration tool and performance monitoring system for Reed and the system launched on time and under budget, a great testament to the flexibility and power of this open source software. The new system responds in under a second – a massive improvement on the previous response time of several minutes. You can read the press release here.

If you’d like to hear more I’ll be giving a presentation on the project at Lucene Eurocon in Barcelona tomorrow – Wednesday 19th October at 1.30 p.m. – slides and a video will be online after the event.

If you can’t make it to Barcelona I’ll also be talking in London, on the business benefits of open source search, at around 10am on Tuesday 25th October with our client Stephen Wicks, CTO of Gorkana Group as part of Enterprise Search Europe – there are still tickets available and you can even get a 20% discount if you join the Cambridge or London Enterprise Search Meetups, who are hosting a joint event on the Monday evening of the conference.

The post Just the job for a recruitment client appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2011/10/18/just-the-job-for-a-recruitment-client/feed/ 0
Another powerful API based on Solr launches, searching more patents than Google http://www.flax.co.uk/blog/2011/10/07/another-powerful-api-based-on-solr-launches-searching-more-patents-than-google/ http://www.flax.co.uk/blog/2011/10/07/another-powerful-api-based-on-solr-launches-searching-more-patents-than-google/#respond Fri, 07 Oct 2011 11:57:20 +0000 http://www.flax.co.uk/blog/?p=633 Our customer Cambridge Intellectual Property announced yesterday their new API for a collection of 55 million patents – 48 million more than Google Patents. It’s great to see a Cambridge company innovating in this space, especially as the service is … More

The post Another powerful API based on Solr launches, searching more patents than Google appeared first on Flax.

]]>
Our customer Cambridge Intellectual Property announced yesterday their new API for a collection of 55 million patents – 48 million more than Google Patents. It’s great to see a Cambridge company innovating in this space, especially as the service is powered by Apache Solr (we’ve given them some small assistance with configuring and tuning this software over the last few months).

The API, available on the Boliven website, offers a REST based service and returns patent data in JSON or XML – so users can easily integrate patent data with their own applications. It can also return PDFs or summaries of the selected patents. In addition, the API will allow users to search and query Boliven’s database of 45+ million science literature documents including journal publications and medical device trials. That’s around 100 million items in total.

Like the Guardian’s Open Platform which I wrote about previously, this is a great example of open source search technology as a platform for new delivery methods – showing how effective (and economical) it can be at this large scale.

It didn’t take me long to find my own small contribution to the patent landscape.

The post Another powerful API based on Solr launches, searching more patents than Google appeared first on Flax.

]]>
http://www.flax.co.uk/blog/2011/10/07/another-powerful-api-based-on-solr-launches-searching-more-patents-than-google/feed/ 0