Working with Hadoop, Kafka, Samza and the wider Big Data ecosystem

Posted on March 3, 2016 by Charlie Hull

We've been working on a number of projects recently involving open source software often quoted as 'Big Data' solutions - here's a quick overview of them. The grandfather of them all of course is Apache Hadoop, now not so much a single project as an ecosystem including storage and processing for potentially huge amounts of data, spread across clusters of machines. Interestingly Hadoop was originally created by D...Continue reading

As Hadoop gains, does Lucene benefit?

Posted on March 27, 2014 by Charlie Hull

The last few weeks have seen a rush of investment in companies that offer Hadoop-powered Big Data platforms - the most recent being Intel's investment in Cloudera, but Hortonworks has also snorted up $100m. Gartner Continue reading

Cambridge Search Meetup – six degrees of ontology and Elasticsearching products

Posted on March 7, 2014 by Charlie Hull

Last Wednesday evening the Cambridge Search Meetup was held with too very different talks - we started with Zoë Rose, an information architect who has lent her expertise to Proquest, the BBC and now the UK Government. She ga...Continue reading

Finding the elephant in the room: open source search & Hadoop grow closer together

Posted on September 18, 2013 by Charlie Hull

I've been lucky enough to attend two talks on Hadoop in the last few weeks which has made me take a closer look at this technology. In case you didn't know, Hadoop is an Apache top level open source project comprising a framework for distributed computing and storage, originally created by Doug Cutting (also the creator of Apache Lucene) while at Yahoo! in 2005. Distributed computing is carried out using Continue reading

Cambridge Search Meetup – Search for publication success and low-cost apps

Posted on October 18, 2012 by Charlie Hull

After a short break the Cambridge Search Meetup returned last night with our usual mix of presentations, questions, networking, beer and snacks. We had a few issues with the projector and cables (one of these is on the shopping list for next time) so thanks to both presenters and audience for their patience! First up was Li...Continue reading